Hello AI Enthusiasts!

Welcome to the Twenty-Eighth edition of "This Week in AI Engineering"!

This week, OpenAI launched the revolutionary ChatGPT Agent, Moonshot AI's Kimi K2 beats Opus4 being 90% cheaper, Mistral released worlds #1 speech recognition models, Perplexity unveiled their smartest AI browser, and Cursor;s CEO had to apologise publicly .

As always, we'll also explore some under-the-radar tools that can supercharge your development workflow.


ChatGPT Agent is FINALLY here

OpenAI has released ChatGPT Agent, a unified system that combines deep research capabilities with computer operation abilities. The agent can browse the web, use terminals, write code, analyze data, and create reports, spreadsheets, and presentations, all while achieving state-of-the-art performance across multiple benchmarks.

What's New

Benchmark Domination

ChatGPT Agent is crushing industry benchmarks across the board:

Use Cases & Practical Applications

ChatGPT Agent excels in several key areas that demonstrate its real-world utility:

Research & Analysis

Business Operations

Content Creation & Documentation

What Makes It Superior to Other Agents

Availability & Safety

Rolling out now to Pro, Plus, and Team users, with Pro users getting 400 messages per month and other paid users receiving 40 messages monthly. OpenAI has implemented extensive safeguards including explicit user confirmation for consequential actions and enhanced biological and chemical safety controls.


Kimi K2 Beats Claude Opus 4 being 90% cheaper

Moonshot AI's Kimi K2 has achieved the remarkable feat of becoming the #1 open model on the LMSys Chatbot Arena while delivering exceptional performance at a fraction of the cost of proprietary alternatives.

What's New

Technical Innovation

Benchmark Performance

Real-World Applications

Data Science & Analytics

Academic & Research Applications

Software Development

Business Intelligence

Content & Documentation


Mistral Releases World's Best Open Speech Recognition Models

Mistral AI has unveiled Voxtral, claiming to deliver the world's best open-source speech recognition models. Available in two sizes, Voxtral (24B) for production and Voxtral Mini (3B) for edge deployment, both are released under the Apache 2.0 license.

What's New

Enterprise-Ready Features

Availability

Available via API, Hugging Face downloads, and Le Chat voice interface, with enterprise options including private deployment and fine-tuning for specialized domains.


Perplexity's Latest AI web browser

Perplexity has officially launched Comet, an AI-powered browser that moves beyond traditional search to create an intelligent, conversational web experience. Now in early access for Perplexity Max users, Comet transforms passive browsing into active thinking.

From Navigation to Cognition

From Answers to Action

Key Advantages Over Traditional Browsers

How Comet Surpasses Chrome, Safari, and Arc

Chrome Comparison

Safari Comparison

Arc Comparison

Tasks Made Significantly Easier

Research & Analysis

Daily Productivity

Content Creation

Trust and Accuracy

Built on Perplexity's signature commitment to factual answers with trust, transparency, and truth, ideal for high-stakes decisions like comparing insurance plans or understanding investments.


Cursor Faces Backlash Over Pro Plan Pricing Shift

Cursor, the AI-powered coding platform by Anysphere, was under fire after an abrupt change to its $20/month Pro plan sparked user confusion, unexpected charges, and widespread frustration.

What Changed

User Frustration

Cursor's Response

The Rationale

Cursor cited growing API costs from model providers, explaining that request-based pricing couldn't reflect the real cost of longer, token-heavy prompts, while API-based pricing provides more accurate cost structure for advanced usage.


Tools & Releases YOU Should Know About

Leap AI is a no-code workflow automation platform for building and deploying AI-powered workflows. Connect AI services and tools to create sophisticated automation pipelines that automate repetitive work and streamline your processes. Perfect for teams looking to integrate AI capabilities without complex development overhead.

Windframe.dev is a powerful drag-and-drop UI builder built on top of Tailwind CSS. Think of it like Figma for front-end developers, but with live Tailwind code generation and component-level control. Design interfaces visually and export clean, production-ready code instantly, making it ideal for rapid prototyping and professional development.

Replicate is a leading cloud platform enabling software developers to run, fine-tune, and deploy machine learning models effortlessly with a simple API. Removing the barriers of complex AI infrastructure, Replicate offers access to thousands of open-source models as well as the ability to host custom solutions, making AI deployment accessible to developers at any scale.


And that wraps up this issue of "This Week in AI Engineering."

Thank you for tuning in! Be sure to share this newsletter with your fellow AI enthusiasts and follow for more weekly updates.

Until next time, happy building!