Frozen AI News archive

Google's Agent2Agent Protocol (A2A)

**Google Cloud Next** announcements featured the launch of **Google and DeepMind's** full **MCP support** and a new **Agent to Agent protocol** designed for agent interoperability with multiple partners. The protocol includes components like the **Agent Card**, **Task communication channels**, **Enterprise Auth and Observability**, and **Streaming and Push Notification support**. On the model front, **Moonshot AI** released **Kimi-VL-A3B**, a multimodal model with **128K context** and strong vision and math benchmark performance, outperforming **gpt-4o**. **Meta AI** introduced smaller versions of **llama-4** family models: **llama-4-scout** and **llama-4-maverick**, with a larger **Behemoth** model still in training. **DeepCoder 14B** from **UC Berkeley** is an open-source coding model rivaling **openai's o3-mini** and **o1** models, trained with reinforcement learning on 24K coding problems. **Nvidia** released **llama-3.1-nemotron-ultra-253b** on Hugging Face, noted for beating **llama-4-behemoth** and **maverick** and competing with **deepseek-r1**.

AI News for 4/8/2025-4/9/2025. We checked 7 subreddits, 433 Twitters and 30 Discords (229 channels, and 5996 messages) for you. Estimated reading time saved (at 200wpm): 563 minutes. You can now tag @smol_ai for AINews discussions!

We are deep in Google Cloud Next announcements, and in a 1-2 punch, the CEOs of Google and DeepMind announced both their full MCP support and their new Agent to Agent protocol, which complements MCP and launches with a huge list of partners.

It is tempting to pit Google against Anthropic, but the two protocols were designed to work together, with A2A addressing perceived gaps in MCP.

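The spec includes the Agent Card, Task communication channels, Enterprise Auth and Observability, and Streaming and Push Notification support.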
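As a rough illustration of how the components named in the summary above fit together, the sketch below models an Agent Card as a small JSON document that advertises an agent's endpoint, its auth schemes, and whether it supports streaming and push notifications. Every field name here (`name`, `url`, `auth_schemes`, `capabilities`) and the `can_stream` helper are assumptions made for illustration, not the official A2A schema; the published spec is the authority on the real format.

```python
import json
from dataclasses import dataclass, field, asdict


@dataclass
class Capabilities:
    # Hypothetical flags mirroring "Streaming and Push Notification support".
    streaming: bool = False
    push_notifications: bool = False


@dataclass
class AgentCard:
    # Illustrative fields only; not the official A2A schema.
    name: str
    url: str
    auth_schemes: list[str] = field(default_factory=list)
    capabilities: Capabilities = field(default_factory=Capabilities)


def can_stream(card_json: str) -> bool:
    """Return True if a serialized card advertises streaming support."""
    card = json.loads(card_json)
    return bool(card.get("capabilities", {}).get("streaming", False))


# A client agent would fetch a card like this before opening a task channel.
card = AgentCard(
    name="example-agent",
    url="https://agent.example.com",
    auth_schemes=["bearer"],
    capabilities=Capabilities(streaming=True),
)
card_json = json.dumps(asdict(card))
print(can_stream(card_json))
```

The point of the design is discovery: a calling agent can read the card and decide how to talk to a remote agent (streaming or not, which auth scheme) without any prior knowledge of its implementation.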

Launch artifacts include:

image.png


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

Model Releases and Updates

Hardware and Infrastructure

Agent and Tooling Development

Education and Resources

Analysis and Benchmarking

Broader AI Discussion

Humor and Sarcasm


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. "Unleashing DeepCoder: The Future of Open-Source Coding"

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding

Theme 1. "Revolutionizing AI: Models, Hardware, and Customization"

Theme 2. Evolving Connections: From Romance to Daily AI Chats


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: Model Mania - New Releases, Capabilities, and Comparisons

Theme 2: Rise of the Agents - Protocols, Tools, and Collaboration

Theme 3: Under the Hood - Training, Optimization, and Inference Insights

Theme 4: Platforms, Tooling, and the Almighty API

Theme 5: Data, Evaluation, and Ensuring Models Aren't Just Copycats


PART 1: High level Discord summaries

LMArena Discord


Unsloth AI (Daniel Han) Discord


Manus.im Discord


Perplexity AI Discord


OpenRouter (Alex Atallah) Discord


OpenAI Discord


LM Studio Discord


aider (Paul Gauthier) Discord


Eleuther Discord


Cursor Community Discord


Yannic Kilcher Discord


Interconnects (Nathan Lambert) Discord


MCP (Glama) Discord


HuggingFace Discord


Notebook LM Discord


GPU MODE Discord


Latent Space Discord


Nous Research AI Discord


Nomic.ai (GPT4All) Discord


Modular (Mojo 🔥) Discord


LlamaIndex Discord


Cohere Discord


tinygrad (George Hotz) Discord


Torchtune Discord


LLM Agents (Berkeley MOOC) Discord


Codeium (Windsurf) Discord


The DSPy Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

LMArena ▷ #general (1004 messages🔥🔥🔥):

Gemini 2.5 Pro, DeepMind Ultra, NightWhisper Speculation, Gemini Coder Model, Deep Research Update


Unsloth AI (Daniel Han) ▷ #general (712 messages🔥🔥🔥):

GPU configuration with Unsloth, DDP (Distributed Data Parallel) vs Unsloth performance, VLLM integration with Unsloth, Llama 4 Scout Model Analysis, Model Quantization


Unsloth AI (Daniel Han) ▷ #off-topic (22 messages🔥):

Model Pruning, Model2Vec, Transformer-Based Models


Unsloth AI (Daniel Han) ▷ #help (156 messages🔥🔥):

GRPO Training Tips for Large Models, Multi-GPU GRPO, 4-bit Training, Orpheus TTS Locally, Gemma and Granite Training Errors


Unsloth AI (Daniel Han) ▷ #showcase (3 messages):

Geographic origins of users, Belgium proximity to Netherlands


Unsloth AI (Daniel Han) ▷ #research (8 messages🔥):

Together AI's DeepCoder, Apple Metal Quantization Kernels, Visual Guide to Quantization


Manus.im Discord ▷ #showcase (6 messages):

Website Building Code, Japan Cherry Blossom Trip Website, Galaxy Model, Impact of Tariffs on Consumers, Recommender System


Manus.im Discord ▷ #general (511 messages🔥🔥🔥):

AI for App Creativity, Gemini 2.5 and Claude 3.7 for Coding, Best Hosting for Social Media App, Manus Credit Usage, Improving Apps Post-Launch


Perplexity AI ▷ #announcements (2 messages):

Perplexity for Startups, Aravind AMA


Perplexity AI ▷ #general (501 messages🔥🔥🔥):

Gemini 2.5 Pro Reasoning Tokens, Perplexity Discover bias, Deepseek 10 dollar deepsearch, Perplexity NHL sports, Troubleshooting tasks and deep research


Perplexity AI ▷ #sharing (1 message):

Holo-live


Perplexity AI ▷ #pplx-api (6 messages):

Image in API call, Sonar and Make.com, Playground vs API


OpenRouter (Alex Atallah) ▷ #app-showcase (5 messages):

Olympia.chat for sale, OSS AI agent tooling with Quasar, Iterative code generation


OpenRouter (Alex Atallah) ▷ #general (444 messages🔥🔥🔥):

DeepSeek v3, OpenRouter Pricing, Google Cloud Next announcements, Gemini 2.5 Pro, API connectivity issues


OpenAI ▷ #ai-discussions (274 messages🔥🔥):

GPT Ad Distribution, GPT Recommendations, Deep Research Comparison, SuperGrok Performance, Gemini 2.5 Pro


OpenAI ▷ #gpt-4-discussions (2 messages):

Emoji support, Emoji test


OpenAI ▷ #prompt-engineering (61 messages🔥🔥):

linguistic program AI, recursion system, multiple choice questions generation, prompt engineering for relevant MCQ options, OpenAI image generation


OpenAI ▷ #api-discussions (61 messages🔥🔥):

Linguistic AI program, GPT capabilities discovery, Recursion system for AGI, Multiple choice question generation, Prompt engineering for MCQ relevance


LM Studio ▷ #general (57 messages🔥🔥):

Fast model recommendations for CPU usage, MoE model explanation, Jinja template issue with cogito-v1-preview-llama-3b, Cogito reasoning models in LM Studio, Llama 4 support and updates in LM Studio


LM Studio ▷ #hardware-discussion (331 messages🔥🔥):

NND's SuperComputer: cost-effective alternative to Nvidia DGX B300?, Classified Project on a Laptop?, Framework Desktop for LLMs and Gaming, Unified RAM performance for LLMs on laptops, Alternatives to Nvidia


aider (Paul Gauthier) ▷ #general (262 messages🔥🔥):

DeepSeek R1, Gemini 2.5 Pro HIGH, Gemini 2.5 Flash, OpenRouter Gemini Limits, Aider MCP Integration


aider (Paul Gauthier) ▷ #questions-and-tips (18 messages🔥):

Aider conventions vs Cursor rules, Adding gitignored files to Aider, Claude pricing plans, Aider PR review


aider (Paul Gauthier) ▷ #links (1 message):

.becquerel: https://yuxi-liu-wired.github.io/essays/posts/cyc/


Eleuther ▷ #general (221 messages🔥🔥):

Apache 2.0 vs MIT License, GFlowNets, Memory Bandwidth, Topological Model Semantics, Model Sycophancy


Eleuther ▷ #research (46 messages🔥):

Reward value representation, Batch sizes and convergence, Residual Modifications for Information Flow, Learning Rate Batchsize Scaling, Mollifiers for ML Research


Eleuther ▷ #interpretability-general (10 messages🔥):

AI2 tools, infingram's opensource, Influence functions, tokengrams


Cursor Community ▷ #general (218 messages🔥🔥):

Gemini Advanced, Firebase Studio, Cursor MDC files settings, Gemini vs Claude vs DeepSeek, Restore Checkpoint feature


Yannic Kilcher ▷ #general (190 messages🔥🔥):

DeepSeek vs ByteDance, Meta Reward Modeling Criticism, Memory Bandwidth Effects on Inference, AI Sentience and Legal Personhood, Definitions of Consciousness and Self-Awareness


Yannic Kilcher ▷ #paper-discussion (9 messages🔥):

Beautiful.ai Alternatives, Ultra-Scale Playbook, DeepSeek-MoE


Yannic Kilcher ▷ #agents (2 messages):

Google ADK, Agent2Agent Protocol (A2A)


Yannic Kilcher ▷ #ml-news (16 messages🔥):

Cogito V1, Triton vs Cutile, Claude Subscription, Google's Agent2Agent, Claude 3.7 vs o1 pro


Interconnects (Nathan Lambert) ▷ #news (178 messages🔥🔥):

Cogito LLMs, Gemini 2.5 Deep Research, Google's Gemini chaos, Ironwood TPUs, Kimi-VL


Interconnects (Nathan Lambert) ▷ #ml-drama (7 messages):

AI2 Fun Times, Google Quitters Paid, AIAI Opportunity


Interconnects (Nathan Lambert) ▷ #random (2 messages):

Wintermoat Post


Interconnects (Nathan Lambert) ▷ #memes (1 message):

xeophon.: https://x.com/rogutkuba/status/1909422087510671854


Interconnects (Nathan Lambert) ▷ #rl (3 messages):

RLVR, RAG reward model, Deep Research RLS


Interconnects (Nathan Lambert) ▷ #reads (3 messages):

Cyc project, Llama performance, Ghibli memes


MCP (Glama) ▷ #general (107 messages🔥🔥):

MCP for RAG use case with Neo4j, mcpomni-connect client, Google A2A vs Anthropic MCP, A2A agent discovery, parallel_tool_calls flag


MCP (Glama) ▷ #showcase (9 messages🔥):

Easymcp v0.4.0 release, mcp_ctl CLI tool, ToolHive MCP runner, Unleash MCP server, GitHub GraphQL MCP server


HuggingFace ▷ #general (41 messages🔥):

Best models under 55B for data processing, Qwen with LORA and Distributed Data-Parallel, Oblix tool for orchestrating AI, Anomaly detection models, System message for OpenGVLab/InternVL2_5-8B-MPO


HuggingFace ▷ #today-im-learning (3 messages):

NLP, structured LLM output


HuggingFace ▷ #i-made-this (4 messages):

Graph-based Academic Recommender System, Manus AI web application launch, Athena-3 LLM, Athena-R3 reasoning variant, Embedders and RAG


HuggingFace ▷ #computer-vision (2 messages):

Tools recognition task, Model adaptation for specific tools, Enhancing model feature extraction


HuggingFace ▷ #gradio-announcements (1 message):

Gradio, ImageEditor component


HuggingFace ▷ #agents-course (28 messages🔥):

Ollama models, Cogito:32b, Small models vs Large Models, Agentic Coding, RooCode


HuggingFace ▷ #open-r1 (13 messages🔥):

Deepseek, Active AI chats


Notebook LM ▷ #use-cases (13 messages🔥):

NotebookLM Privacy, NotebookLM Training, NotebookLM as a Notetaking App, Google Drive Integration, Microsoft OneNote


Notebook LM ▷ #general (75 messages🔥🔥):

PDF image processing in NLM, Discover sources feature in NLM, Interactive mode issues in NLM, Audio overviews in 2.5 Pro, Text formatting in NotebookLM chat


GPU MODE ▷ #general (15 messages🔥):

FP4 on 5090, CUTLASS for tensor cores, Flash Attention 3, torchao 0.10, MX dtypes


GPU MODE ▷ #cuda (6 messages):

Linux Distro, NVIDIA drivers, LDSM Instruction, Warp Shuffling


GPU MODE ▷ #torch (2 messages):

FSDP2, Model Parallelism, Accelerate Hack


GPU MODE ▷ #cool-links (2 messages):

CUDA vs other, SMERF, Berlin Demo


GPU MODE ▷ #beginner (19 messages🔥):

Graph Neural Networks (GNNs), CUDA C vs CUDA C++, Graph Attention Networks, GAN parallelism, Producers and Consumers architecture


GPU MODE ▷ #torchao (1 message):

torchao v0.10.0, MXFP8 Training, Nvidia B200, PARQ, Quantization API


GPU MODE ▷ #off-topic (2 messages):

Brooklyn Apartments, Apartment Hunting Tips


GPU MODE ▷ #self-promotion (1 message):

Mediant32, FP32, BF16, integer-only inference, Rationals


GPU MODE ▷ #reasoning-gym (2 messages):

DeepCoder, Llama 4 Scout


GPU MODE ▷ #gpu模式 (3 messages):

NVSHMEM and RDMA, Deepseek Library, RoCE or Infiniband Compatibility


GPU MODE ▷ #general (11 messages🔥):

CUDA Inline Submissions, Datamonsters AMD Developer Challenge


GPU MODE ▷ #submissions (14 messages🔥):

Grayscale Leaderboard Submissions, Matmul Leaderboard Submissions, Vectoradd Leaderboard Submissions, Modal Runners Success


GPU MODE ▷ #feature-requests-and-bugs (1 message):

leikowo: ah, sorry I didn't see your message in time, seems that you guys fixed it already


Latent Space ▷ #ai-general-chat (77 messages🔥🔥):

Together AI X-Ware.v0, Gemini Plays Pokemon, AI Excel Formulas, Microsoft Copilot for Indie Game Devs, Agent2Agent Protocol (A2A) by Google


Nous Research AI ▷ #general (66 messages🔥🔥):

Llama 4 Fine-tuning, DeepHermes Dataset, Selling 3090 turbo card, DeepCogito's LLMs, Iterated Distillation and Amplification


Nous Research AI ▷ #ask-about-llms (2 messages):

BPE Tokenizer, Hugging Face library, Non-English text encoding


Nous Research AI ▷ #interesting-links (1 message):

anka039847: https://mlss2025.mlinpl.org/


Nomic.ai (GPT4All) ▷ #general (57 messages🔥🔥):

Local Embedding Models, GPT4All Document Indexing, Local LLM Loading Issues, RAG Implementation, GPT4All Stop Button


Modular (Mojo 🔥) ▷ #general (3 messages):

Mojo Language, Mojo Documentation, Mojo Community


Modular (Mojo 🔥) ▷ #mojo (14 messages🔥):

Span Lifetimes in Mojo, Fearless Concurrency in Mojo, MLIR Type Construction with Compile-Time Parameters, Parametric Operations Dialect (POP)


LlamaIndex ▷ #blog (2 messages):

Auth0 Auth for GenAI, LlamaIndex support, agent workflows, FGA-authorized RAG, visual citations


LlamaIndex ▷ #general (8 messages🔥):

Reasoning LLMs, GraphRAG V2, Milvus DB, Blockchain Expertise


LlamaIndex ▷ #ai-discussion (3 messages):

LlamaIndex Deep Research, create-llama Tool


Cohere ▷ #「💬」general (9 messages🔥):

Cohere's documentation, Pydantic schema, cURL request, List of companies


Cohere ▷ #「🤖」bot-cmd (1 message):

competent: Currently not working!


Cohere ▷ #「🤝」introductions (2 messages):

Introductions, Machine Vision, Web/AI Projects, Cohere AI Exploration


Cohere ▷ #【🟢】status-updates (1 message):

competent: Should work!


tinygrad (George Hotz) ▷ #general (4 messages):

PMPP Book, Compiler Series, LLVM Tutorial, Tiny Box for Chinese Market


tinygrad (George Hotz) ▷ #learn-tinygrad (7 messages):

METAL virtual device sync issue, LLaMA 7B on 4 virtual GPUs, gradient accumulation in training routine, t.grad is None issue, zero_grad() before the step


Torchtune ▷ #general (4 messages):

Contributor Tag Request, Gus from Psych


Torchtune ▷ #dev (4 messages):

FSDP, DeepSpeed, Sharding Strategies


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (1 message):

aniket_19393: did anybody hear back from mentors in the research track of AgentX?


Codeium (Windsurf) ▷ #announcements (1 message):

Windsurf, JetBrains, AI agent, IDE ecosystems





{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AINews, please share with a friend! Thanks in advance!

{% endif %}