Frozen AI News archive

OpenAI adopts MCP

**OpenAI** announced support for **MCP**, a significant technical update. **Google's Gemini 2.5 Pro** leads benchmarks with top scores in **MMLU-Pro (86%)**, **GPQA Diamond (83%)**, and **AIME 2024 (88%)**, featuring a **1 million token context window** and multimodal inputs. **Alibaba's Qwen 2.5 Omni 7B** was released as a fully multimodal, interactive, open-source model with a novel "thinker-talker" architecture supporting voice and video chat. **DeepSeek V3-0324** outperforms its predecessor on multiple benchmarks. Research on reasoning features in large language models using sparse autoencoders was highlighted, alongside a study on scaling laws of synthetic data showing performance plateaus near **300B tokens**. Discussions also covered the fastest output speeds of Gemini models and concerns about over-reliance on benchmarks for intelligence measurement. *Swyx* will curate the Data Council AI Engineering Track in April.

Canonical issue URL

AI News for 3/25/2025-3/26/2025. We checked 7 subreddits, 433 Twitters and 29 Discords (228 channels, and 4998 messages) for you. Estimated reading time saved (at 200wpm): 467 minutes. You can now tag @smol_ai for AINews discussions!

Amid all the 4o Ghibli memes you could be forgiven for missing the technical update that OpenAI announced MCP support today:

image.png

We attempted to articulate Why MCP Won in a recent Latent Space article.


Special Shoutout: Swyx will be curating the Data Council AI Engineering Track in Oakland on Apr 22. You can use LATENTSPACE20 for a little discount.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

Language Models and Benchmarks

Model Quantization and Efficiency

Tools and Frameworks

Image Generation and Multimodality

Company and Product Announcements

China, DeepSeek, and Qwen

Other

Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. DeepSeek V3 Gains and Benchmarking

Theme 2. Google's TxGemma: Integrating Therapeutics and AI

Theme 3. Qwen 2.5 Omni Multimodal Capabilities

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding

Theme 1. DeepSeek V3 Gains and Benchmarking

Theme 2. Google's TxGemma: Integrating Therapeutics and AI

Theme 3. Qwen 2.5 Omni Multimodal Capabilities


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.0 Flash Thinking

Theme 1. Gemini 2.5 Pro: Performance Hype and Practicality Questions

Theme 2. DeepSeek V3: Coding Champ and Cost-Effective Contender

Theme 3. Model Context Protocol (MCP) Gains Momentum and Adoption

Theme 4. OpenRouter Landscape: Pricing, Limits, and New Features

Theme 5. OpenAI's 4o Image Generation: DALL-E's Demise?


PART 1: High level Discord summaries

LMArena Discord


Perplexity AI Discord


Cursor Community Discord


OpenAI Discord


Unsloth AI (Daniel Han) Discord


OpenRouter (Alex Atallah) Discord


Interconnects (Nathan Lambert) Discord


LM Studio Discord


Nous Research AI Discord


Notebook LM Discord


Yannick Kilcher Discord


Modular (Mojo 🔥) Discord


MCP (Glama) Discord


GPU MODE Discord


Latent Space Discord


Eleuther Discord


LlamaIndex Discord


Cohere Discord


DSPy Discord


Torchtune Discord


LLM Agents (Berkeley MOOC) Discord


Nomic.ai (GPT4All) Discord


tinygrad (George Hotz) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

LMArena ▷ #general (910 messages🔥🔥🔥):

Gemini 2.5 Pro bugs, Deepseek V3 0324 strengths, Model size estimations, Livebench benchmark viability, Gemini 2.5 pro overhyped?

Links mentioned:


Perplexity AI ▷ #announcements (1 messages):

Answer Modes, Vertical Search


Perplexity AI ▷ #general (622 messages🔥🔥🔥):

Image generation, Gemini 2.5 Pro, Proton VPN issues, Deep Research Limits

Links mentioned:


Perplexity AI ▷ #sharing (5 messages):

Perplexity AI, Mikrotik Router, AI Potential


Perplexity AI ▷ #pplx-api (2 messages):

Web Access Cost, r1-1776 Offline Model, Search Context Size


Cursor Community ▷ #general (608 messages🔥🔥🔥):

Thinking Tokens, Gemini 2.5, OpenRouter rate limited, RepoMix, DeepSeek

Links mentioned:


OpenAI ▷ #ai-discussions (257 messages🔥🔥):

Gemini 2.5 Pro, 4o Image Gen, Data collection, Em-dashes vs Semicolons, PDF editing with AI

Links mentioned:


OpenAI ▷ #gpt-4-discussions (21 messages🔥):

GPT remote computer control, Image generation limits for plus users, Reasoning and deepsearch in custom GPT, GPT-4o Image generation


OpenAI ▷ #prompt-engineering (85 messages🔥🔥):

Custom GPTs, ChatGPT memory, Git and GPL, AI Prompting for Git, Memory retention issues


OpenAI ▷ #api-discussions (85 messages🔥🔥):

Custom GPTs, Browser Cache, Long Context LLM, GPL_v3, Mermaid Diagrams


Unsloth AI (Daniel Han) ▷ #general (246 messages🔥🔥):

TRL v0.16.0 Support, GGUF Export Issues, Gemma3Config Error, Qwen 2.5 Training Time, Multi-GPU Setups

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (7 messages):

Instruct template ergonomics, LLMs with audio input, Qwen2.5-Omni, Future tech evolution (GPU VRAM, ASIC, NPU/CPU), YouTube feed filled with quintics after looking up Galois theory


Unsloth AI (Daniel Han) ▷ #help (73 messages🔥🔥):

Gemma3Config issue, Deepseek replacement models, Unsloth training failures, Cerebras model loading error, GRPO trainer OOM issues

Links mentioned:


Unsloth AI (Daniel Han) ▷ #research (7 messages):

Pivotal Token Search, ByteDance Training Policy, DAPO RL System

Link mentioned: GitHub - BytedTsinghua-SIA/DAPO: An Open-source RL System from ByteDance Seed and Tsinghua AIR: An Open-source RL System from ByteDance Seed and Tsinghua AIR - BytedTsinghua-SIA/DAPO


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

Model Comparison Feature, Side-by-Side Model Comparison

Link mentioned: Tweet from OpenRouter (@OpenRouterAI): New feature: compare models side-by-side.You can now compare any two models and providers. Clicking "Chat" takes you to a chatroom with both.


OpenRouter (Alex Atallah) ▷ #general (312 messages🔥🔥):

Gemini 2.5 Pro, GPT-4o Image Generation, DeepSeek V3, OpenRouter Pricing, Stripe Payment Issues

Links mentioned:


Interconnects (Nathan Lambert) ▷ #news (172 messages🔥🔥):

Gemini 2.5 Pro, Qwen2.5-Omni, Nvidia acquires Lepton AI, AI2 Paper Finder, OpenAI Revenue Projections

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-questions (4 messages):

OpenRouter, Hyperparams, Academic Evals vs Production, OpenAI Spending Controls


Interconnects (Nathan Lambert) ▷ #random (27 messages🔥):

MCP, Gemini 2.5, Ghibli images, OpenAI 4o

Links mentioned:


Interconnects (Nathan Lambert) ▷ #memes (14 messages🔥):

Gemini vs GPT4o Vision, Google Polymarket Stonks, Sama as Twink Ghibli

Links mentioned:


Interconnects (Nathan Lambert) ▷ #posts (11 messages🔥):

Gemini 2.5, ChatGPT, Claude, O1 Pro


LM Studio ▷ #general (103 messages🔥🔥):

Tokenizing on GPU vs CPU, Gemini 2.5 Pro experience, LM Studio Dockerization, Uncensored Models on LM Studio, Cursor vs Copilot

Links mentioned:


LM Studio ▷ #hardware-discussion (36 messages🔥):

ROCm support for gfx1200/gfx1201, Resizable Bar Performance Boost, Intel Arc GPU recognition issue, DeepSeek model size, Gemma3 performance on 9070XT vs 7800XT

Links mentioned:


Nous Research AI ▷ #general (52 messages🔥):

Q-LoRA finetuning 200B parameters, Deepseek hallucinations, GPT-4.5 image generation, Multi turn multi agent dataset, Gemini 2.5 Pro Experimental explanation of Transformers

Links mentioned:


Nous Research AI ▷ #ask-about-llms (60 messages🔥🔥):

Embedding Matrix Redundancy, Deeper MLP for Weight Savings, PCA for Embedding Alignment, Low Rank Projection Issues, Character-Level LLMs vs. Tokenized LLMs

Link mentioned: Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling: Tokenization is a fundamental component of large language models (LLMs), yet its influence on model scaling and performance is not fully explored. In this paper, we introduce Over-Tokenized Transforme...


Nous Research AI ▷ #interesting-links (7 messages):

Ling Lite MoE Model, Qwen 3 Release, GPU requirements for LLMs

Links mentioned:


Notebook LM ▷ #use-cases (4 messages):

Audio Overviews, Branding Instructions, Ignore previous branding instructions and title the production


Notebook LM ▷ #general (84 messages🔥🔥):

Multilingual Podcasts, Mind Map Access, Gemini 2.5 Pro, Audio Overviews, API for NotebookLM


Yannick Kilcher ▷ #general (54 messages🔥):

Highway Networks, Skip Connections, Attention Mechanisms, ResNets, LADDER Framework

Links mentioned:


Yannick Kilcher ▷ #paper-discussion (14 messages🔥):

LADDER paper, Gemini 2.5 Pro, NP-Completeness Clarification, DeepSeek Paper Review

Links mentioned:


Yannick Kilcher ▷ #ml-news (11 messages🔥):

Autoregressive Pixel Generation vs Diffusion, Image Quality Levels, Transformer vs Diffusion, Gemini Flash Image Generation, Recent Autoregressive Models


Modular (Mojo 🔥) ▷ #general (1 messages):

SIMD, SIMT, SMT, Andrew Glew, NVIDIA GPUs

Link mentioned: SIMD < SIMT < SMT: parallelism in NVIDIA GPUs: no description found


Modular (Mojo 🔥) ▷ #mojo (69 messages🔥🔥):

Rust uomlibrary limitations, Parameter Domain Shenanigans,@parameter match in Mojo, Parametric traits, Returning a value from a Dict based on index

Link mentioned: uom - Rust: no description found


Modular (Mojo 🔥) ▷ #max (2 messages):

CUDA, PTX, nvidia GPUs


MCP (Glama) ▷ #general (54 messages🔥):

Docker and SSE for AI Stack, Excel MCP, Multi-AI Advisor MCP, Vibe Check MCP Server, JSON-RPC Errors

Links mentioned:


MCP (Glama) ▷ #showcase (2 messages):

MCP Agent, CapCut Integration

Link mentioned: - YouTube: no description found


GPU MODE ▷ #general (3 messages):

FSDP Fine Tuning, TRL Library, Data Handling


GPU MODE ▷ #triton (4 messages):

prune configs, kernel porting


GPU MODE ▷ #cuda (9 messages🔥):

CuTe coordinate mapping, Serverless GPU kernel profiling, Barrier arrive & wait pattern

Link mentioned: cutlass/include/cute/atom/mma_atom.hpp at 62750a2b75c802660e4894434dc55e839f322277 · NVIDIA/cutlass: CUDA Templates for Linear Algebra Subroutines. Contribute to NVIDIA/cutlass development by creating an account on GitHub.


GPU MODE ▷ #torch (11 messages🔥):

torch.compile transpose error, Flash attention autograd stall, PyTorch documentation redesign

Link mentioned: PyTorch documentation: PyTorch is an optimized tensor library for deep learning using GPUs and CPUs. Features described in this documentation are classified by release status: Stable: These features will be maintained lo...


GPU MODE ▷ #jobs (9 messages🔥):

AMD GPU support in Triton, NA/Europe remote job positions for Triton, GitHub - TuckerBMorgan/poro: Toy NN LIB

Links mentioned:


GPU MODE ▷ #torchao (2 messages):

GPTFast Generation Benchmark, Cudagraphs skipping, TorchAO


GPU MODE ▷ #rocm (3 messages):

Workstation Cards, MI300 Access, hipSPARSE vs hipSPARSELt


GPU MODE ▷ #sparsity-pruning (2 messages):

Pruning Masks, L1 Unstructured Pruning


GPU MODE ▷ #liger-kernel (6 messages):

transformers backward compatibility, qwen2-vl and qwen2.5-vl implementations, LoRA with modules_to_save

Links mentioned:


GPU MODE ▷ #self-promotion (1 messages):

Discord Event


GPU MODE ▷ #gpu模式 (2 messages):

Academic Prowess, Graduate Studies, Imposter Syndrome


GPU MODE ▷ #submissions (2 messages):

Leaderboard Submissions, Modal Runners


Latent Space ▷ #ai-general-chat (44 messages🔥):

Dwarkesh's "The Scaling Era", Anthropic's AI Sabotage, Brampton Model Scam or Stunt, Databricks' TAO, Gemini 2.5 Pro Access

Links mentioned:


Latent Space ▷ #ai-announcements (4 messages):

Evo 2, Convolutional Multi-Hybrid Language Models, ARC Institute

Link mentioned: Evo 2: Systems and Algorithms for Convolutional Multi-Hybrid Language Models at Scale: ​RJ will cover https://arcinstitute.org/manuscripts/Evo2-ML​Here's the press release: https://arcinstitute.org/news/blog/evo2 and the companion bio paper: ht...


Eleuther ▷ #general (21 messages🔥):

Environmental impact of LLMs, Deepseek V3 on Mac studios, AI-generated piano music, ICLR 2025

Links mentioned:


Eleuther ▷ #research (11 messages🔥):

Transformers Generalization, Hypernetworks, Test-time compute

Links mentioned:


Eleuther ▷ #interpretability-general (3 messages):

Privileged Basis, Point-wise nonlinearities


Eleuther ▷ #gpt-neox-dev (2 messages):

GPT-NeoX Data Preprocessing, Chunking for Long Documents


LlamaIndex ▷ #general (20 messages🔥):

Open Source Automatic Evaluations, LlamaIndex Workflow for Agentic application, OpenAI's responses api, LlamaExtract Schema Inference, Postgres database analysis using LlamaIndex


Cohere ▷ #「💬」general (11 messages🔥):

Vector Database Options, AI Agents: Pricing and Monetization

Link mentioned: Integrating Embedding Models with Other Tools — Cohere: Learn how to integrate Cohere embeddings with open-source vector search engines for enhanced applications.


Cohere ▷ #「🔌」api-discussions (5 messages):

Chat Stream V2, Tool Call ID, direct-injected-document, command-a-03-2025


Cohere ▷ #「🤖」bot-cmd (2 messages):

``


DSPy ▷ #general (10 messages🔥):

Module sizing, Azure OpenAI Rate Limits, ColBERT v2 retriever endpoint

Link mentioned: [Bug] ColBERT v2 wiki17_abstracts is overloaded · Issue #7966 · stanfordnlp/dspy: What happened? I'm trying to retrieve some passages using a basic MultiHop program (3 passages per hop), This is how I setup the retriever endpoint: COLBERT_V2_ENDPOINT = "http://20.102.90.50...


Torchtune ▷ #dev (4 messages):

Gemini 2.5 Pro, AI Model Pricing, MMLU-Pro, GPQA Diamond, Humanity’s Last Exam

Link mentioned: Tweet from Artificial Analysis (@ArtificialAnlys): Google’s new Gemini 2.5 Pro Experimental takes the #1 position across a range of our evaluations that we have run independentlyGemini 2.5 Pro is a reasoning model, it ‘thinks’ before answering questio...


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (1 messages):

AgentX Competition, Registration Deadline, Entrepreneurship Track, Research Track, Prizes and Resources

Links mentioned:


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (2 messages):

Lecture Recording, MOOC sign up


Nomic.ai (GPT4All) ▷ #general (3 messages):

Verso Industries, AI-Powered Twin-Screw Extruder Model, OpenAI-API compatible

Link mentioned: Verso Industries - Elevating American Industries Through Unified Digital Transformation: no description found


tinygrad (George Hotz) ▷ #general (1 messages):

CleanRL, TinyGrad, RL trainer





{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}