Hello AI Enthusiasts!

Welcome to the sixth edition of "This Week in AI Engineering"!

This week started with Mistral’s new AI assistant, Le Chat, making noise in the community, followed by major releases from Perplexity and GitHub.

We’ll also cover news from DeepSeek and Cline, along with some must-know tools that make developing AI agents and apps easier.


Le Chat: 10x Faster than ChatGPT

Mistral AI has introduced Le Chat, featuring Cerebras-powered Flash Answers for faster responses. The platform pairs Cerebras Inference with the 123B-parameter Mistral Large 2 model to speed up text generation.

Technical Architecture:

Performance Metrics:

The initial release has focused on text-based queries, with Cerebras and Mistral AI planning expanded model support throughout 2025.


Perplexity Sonar: New Search Model with Enhanced Speed and Accuracy

Perplexity Labs has introduced Sonar, a new search-optimized model built on the Llama 3.3 70B architecture. The model runs on Cerebras inference infrastructure and delivers response speeds of 1,200 tokens per second.

Technical Architecture:

Performance Metrics:

Comparative Testing:

The platform has enhanced its search capabilities through A/B testing.
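To put the 1,200 tokens per second figure in perspective, here is a rough back-of-the-envelope sketch. The response lengths are hypothetical, and the estimate covers decoding time only; it ignores search retrieval, network latency, and time to first token, so real end-to-end latency will be higher.

```python
def generation_seconds(num_tokens: int, tokens_per_second: float) -> float:
    """Estimate pure decoding time for a response of num_tokens tokens."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


# Throughput quoted for Sonar in this issue.
SONAR_TOKENS_PER_SECOND = 1200

# Hypothetical response lengths, chosen only for illustration.
for length in (100, 500, 2000):
    seconds = generation_seconds(length, SONAR_TOKENS_PER_SECOND)
    print(f"{length} tokens: about {seconds:.2f} s of decoding")
```

At this rate, even a long multi-paragraph answer finishes decoding in a couple of seconds, so retrieval and network time are likely to dominate what the user actually experiences.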


GitHub Copilot: Agent Mode Integration with Multi-Model Support

GitHub has introduced Agent Mode for Copilot, integrating advanced AI models including Gemini 2.0 Flash, GPT-4o, and Claude 3.5 Sonnet. The platform has enhanced its autonomous coding capabilities through VS Code Insiders, focusing on automated error resolution and task management.

Technical Architecture:

Core Features:

Deployment Options:

The platform has demonstrated significant improvements in code completion and error handling, with Project Padawan scheduled for expanded autonomous agent capabilities later in 2025.


DeepSeek VL2: Advanced Vision-Language Model with MoE Architecture

DeepSeek has released DeepSeek-VL2, a new series of Mixture-of-Experts (MoE) vision-language models designed for enhanced multimodal understanding. The family includes three variants with different parameter scales and efficiency optimizations.

Technical Architecture:

Performance Features:

Core Capabilities:

The model has focused on efficient parameter activation while maintaining competitive performance against larger dense models, with full commercial use support under the DeepSeek Model License.
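For readers new to MoE, the idea of "efficient parameter activation" can be sketched with a generic top-k router. This is a toy illustration of the general technique, not DeepSeek's actual routing code: the experts are scalar functions, the gate logits are passed in directly (a real model computes them from the input), and all names are hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_route(gate_logits, k):
    """Pick the k highest-scoring experts and renormalize their weights."""
    probs = softmax(gate_logits)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return {i: probs[i] / norm for i in top}


def moe_forward(x, experts, gate_logits, k=2):
    """Run only the selected experts and mix their outputs by gate weight."""
    weights = top_k_route(gate_logits, k)
    return sum(w * experts[i](x) for i, w in weights.items())


# Four toy "experts"; only two of them run per input when k=2.
experts = [lambda x: x + 1, lambda x: 2 * x, lambda x: x * x, lambda x: -x]
gate_logits = [0.1, 2.0, 1.0, -1.0]  # assumed gate scores for this input

print(top_k_route(gate_logits, k=2))
print(moe_forward(3.0, experts, gate_logits, k=2))
```

The total parameter count grows with the number of experts, but the compute per input depends only on k, which is how an MoE model can compete with a larger dense model at lower inference cost.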


Cline 3.3: AI Programming Assistant Enhances Security and API Integration

Cline, an AI-powered code assistant for VS Code that helps developers write, review, and explain code, has released version 3.3. The update introduces key security features and expanded API provider support. It focuses on file access control through a new .clineignore system while broadening model compatibility with additional providers.

Technical Updates:

Core Improvements:

The update has maintained backward compatibility while introducing significant security features and reliability improvements for enterprise development workflows.
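To give a feel for how ignore-file access control works in general, here is a simplified sketch. It assumes .clineignore uses .gitignore-style patterns, which this issue does not confirm, and the matcher below is a deliberately reduced version of that syntax (no negation, no anchoring). The file contents and function names are hypothetical and are not Cline's implementation.

```python
from fnmatch import fnmatchcase


def parse_ignore(text):
    """Return non-empty, non-comment pattern lines from an ignore file."""
    patterns = []
    for line in text.splitlines():
        line = line.strip()
        if line and not line.startswith("#"):
            patterns.append(line)
    return patterns


def is_blocked(path, patterns):
    """Check a forward-slash relative path against simplified ignore patterns."""
    parts = path.split("/")
    for pattern in patterns:
        if pattern.endswith("/"):
            # Directory pattern: block anything under a matching directory.
            if any(fnmatchcase(part, pattern[:-1]) for part in parts[:-1]):
                return True
        elif fnmatchcase(path, pattern) or fnmatchcase(parts[-1], pattern):
            return True
    return False


# Hypothetical ignore file contents, for illustration only.
EXAMPLE_IGNORE = """
# secrets the assistant should never read
.env
secrets/
*.pem
"""

patterns = parse_ignore(EXAMPLE_IGNORE)
for candidate in (".env", "secrets/api.txt", "keys/server.pem", "src/main.py"):
    print(candidate, "blocked" if is_blocked(candidate, patterns) else "allowed")
```

A tool enforcing such a file would run a check like this before reading or sending any file to a model provider. Check Cline's own documentation for the exact pattern rules it supports.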


Tools & Releases YOU Should Know About


And that wraps up this issue of "This Week in AI Engineering."

Thank you for tuning in! Be sure to share this with your fellow AI enthusiasts and follow for the latest weekly updates.

Until next time, happy building!