Frozen AI News archive

not much happened today

The AI news recap highlights independent evaluations showing **Grok-3** outperforming models like **GPT-4.5** and **Claude 3.7 Sonnet** on reasoning benchmarks, while **Grok-3 mini** excels in reasoning tasks. Research on **reinforcement learning (RL)** fine-tuning shows potential improvements for small reasoning models but also notes that reported gains are unstable. Benchmark results suggest **Quasar Alpha** and **Optimus Alpha** may be versions of **GPT-4.1**. Vision and multimodal models such as **Kaleidoscope**, which supports 18 languages, and **InternVL3**, built on **InternViT** and **Qwen2.5VL**, demonstrate advances in multilingual vision and reasoning. The hybrid model **TransMamba** combines transformer precision with the speed of **SSM** mechanisms. Alibaba's **FantasyTalking** generates realistic talking portraits. Announcements included agent-focused events at **CMU**, **FilmAgent AI** for virtual film production, and the **BrowseComp** benchmark for browsing agents. The coding assistant **Augment** supports multiple IDEs with code analysis and suggestions. Discussions also covered Google’s new agent-to-agent protocol concept.

AI News for 4/10/2025-4/11/2025. We checked 7 subreddits, 433 Twitters and 30 Discords (230 channels, and 4040 messages) for you. Estimated reading time saved (at 200wpm): 401 minutes. You can now tag @smol_ai for AINews discussions!

To close off a surprisingly quiet week compared to expectations, we recommend the great SF Compute/GPU Neocloud discussion released today on Latent.Space.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

Language Models and Benchmarks

Vision Language Models (VLMs) and Multimodal Models

Agents, Tooling, and Applications

AI Infrastructure and Hardware

ChatGPT's Memory Feature

Tariffs and Geopolitical Implications

Humor/Memes


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. "Evaluating AI Model Performance and Ethical Challenges"

Theme 2. "Debating the Future of Open Source AI"

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding

Theme 1. Unlocking AI's Memory: ChatGPT's Game-Changing Feature

Theme 2. "Mastering Realism: ChatGPT's Image Generation Secrets"

Theme 3. Celebrating AI Creativity: Nostalgia, Humor, and Art


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.0 Flash Thinking

Theme 1. New Models and Performance Face Off

Theme 2. Ecosystem Tooling and Open Source Initiatives Grow

Theme 3. Model Reliability and Infrastructure Challenges Persist

Theme 4. Agentic AI Architectures and Protocol Debates Heat Up

Theme 5. Community Dynamics and Industry Shifts


PART 1: High level Discord summaries

LMArena Discord


OpenRouter (Alex Atallah) Discord


Unsloth AI (Daniel Han) Discord


Manus.im Discord


aider (Paul Gauthier) Discord


Latent Space Discord


OpenAI Discord


LM Studio Discord


Interconnects (Nathan Lambert) Discord


Perplexity AI Discord


GPU MODE Discord


Cursor Community Discord


Yannic Kilcher Discord


MCP (Glama) Discord


Eleuther Discord


Modular (Mojo 🔥) Discord


Nomic.ai (GPT4All) Discord


HuggingFace Discord


Nous Research AI Discord


tinygrad (George Hotz) Discord


Torchtune Discord


LlamaIndex Discord


Notebook LM Discord


Cohere Discord


DSPy Discord


LLM Agents (Berkeley MOOC) Discord


MLOps @Chipro Discord


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

LMArena ▷ #general (721 messages🔥🔥🔥):

i_am_dom discord disable chat, 4.5 vs gem2.5p, OpenAI's naming scheme, private openai reasoning model, 2.5 flash and gpt4o mini


OpenRouter (Alex Atallah) ▷ #announcements (4 messages):

Quasar Alpha, Optimus Alpha, Gemini 2.5 Pro Preview, Chutes Provider Outage, Gemini Pricing Update


OpenRouter (Alex Atallah) ▷ #general (404 messages🔥🔥🔥):

Quasar Alpha, Gemini 2.5 Pro, OpenRouter API limits, Character AI Bypassing, Unsloth Finetuning


Unsloth AI (Daniel Han) ▷ #general (209 messages🔥🔥):

Hugging Face Shout-out, GPU Grant for Unsloth, Gemma Model Issues, Attention Output Visualization, Unsloth Accuracy


Unsloth AI (Daniel Han) ▷ #off-topic (30 messages🔥):

GRU comeback?, GGUF quantization, Vision finetuning Gemma, Unsloth exit strategy, Startup enshitification


Unsloth AI (Daniel Han) ▷ #help (104 messages🔥🔥):

Gemma3 finetuning with Unsloth, GRPO notebook errors on Colab Pro, VLM for invoice extraction, Llama3.2-1b-Instruct BOS token issue, Teaching facts to existing models


Unsloth AI (Daniel Han) ▷ #research (13 messages🔥):

Tensor Quantization, Metal Kernels, Pytorch Extension, Eval Repurposing


Manus.im Discord ▷ #showcase (1 message):

shirley778__69848: Let's see what is discussing on Reddit 🔥


Manus.im Discord ▷ #general (319 messages🔥🔥):

Claude Pro Max Value, Manus vs ChatGPT, Manus for Website Creation, Qwen MCP Integration, Manus Credit Structure


aider (Paul Gauthier) ▷ #general (237 messages🔥🔥):

Optimus Alpha review, Gemini 2.5 performance issues, Google's load shedding strategies, Code2prompt usage and documentation, Channel organization and moderation


aider (Paul Gauthier) ▷ #questions-and-tips (37 messages🔥):

Aider Loop with Deepseek, Security Team Fears about Aider, Aider and Nemotron Ultra, Gemini Pro Benchmarks, Restoring Chat History Intuitively


aider (Paul Gauthier) ▷ #links (3 messages):

Claude 3.5 Sonnet, o3-mini context windows, Gemini performance, Claude performance


Latent Space ▷ #ai-general-chat (23 messages🔥):

Google's agent2agent protocol, GPT4.5 alpha, exponent.run, arxiv ai feature, Portland AI Engineer's group


Latent Space ▷ #ai-announcements (1 message):

GPT 4.5 watch party, Alpha Leaks


Latent Space ▷ #llm-paper-club-west (249 messages🔥🔥):

GPT-4.5, Kagi Orion Browser, Data Efficiency, Model Compression, Ray Solomonoff


OpenAI ▷ #ai-discussions (145 messages🔥🔥):

ChatGPT Memory, Gemini Veo 2, Google AI Studio, Sora video, Mercury Coder


OpenAI ▷ #gpt-4-discussions (6 messages):

New Memory rollout, Context window in conversations, GPT-4o token limit, Memory storage, Free-tier availability


OpenAI ▷ #prompt-engineering (57 messages🔥🔥):

Prompt engineering resources, Model-specific quirks, MusicGPT creation help, Copyright and ToS risks


OpenAI ▷ #api-discussions (57 messages🔥🔥):

Prompt Engineering Resources, MusicGPT Customization, API Usage for MusicGPT, Policy Compliance for ChatGPT Use


LM Studio ▷ #general (119 messages🔥🔥):

Prompt Preprocessor in LM Studio, HuggingFace Login in LM Studio, Image Generation with Gemma 3, Quantization and QAT, Loading Models with Specified Context Limit


LM Studio ▷ #hardware-discussion (115 messages🔥🔥):

MLX distributor fix, M3 Ultra Value, Nvidia DGX Motherboard, Deepseek R1 Token Generation, M1 Ultra vs M4 Max


Interconnects (Nathan Lambert) ▷ #news (76 messages🔥🔥):

Memory in Context, Meta dodged AI week, OSS releases, AI Safety Community, New Image Model


Interconnects (Nathan Lambert) ▷ #ml-drama (2 messages):

Ex-OpenAI Staff Amicus Brief, Peter Wildeford post


Interconnects (Nathan Lambert) ▷ #random (117 messages🔥🔥):

Claude Credits Cost, High Taste LMSYS, Gemini App Usability, Tool Use Open Model, MCP Tool Calls


Interconnects (Nathan Lambert) ▷ #memes (1 message):

philpax: https://fixvx.com/typedfemale/status/1910599582226272457


Interconnects (Nathan Lambert) ▷ #rl (7 messages):

Gemini paywall, Cooking AI


Interconnects (Nathan Lambert) ▷ #reads (3 messages):

Amy Prbs Threads


Perplexity AI ▷ #announcements (2 messages):

Gemini 2.5 Pro, API Overview, Grok 3, Perplexity Pro


Perplexity AI ▷ #general (194 messages🔥🔥):

Gemini 2.5 Pro, Deep Research, Telegram Bot Official, Firebase Studio AI Builder, Perplexity Android App Security


Perplexity AI ▷ #sharing (1 message):

Republican voters, Perplexity AI Search


GPU MODE ▷ #general (5 messages):

CUDA in Python/PyTorch models, GTC talk on CUDA, Custom ops and load inline


GPU MODE ▷ #triton (4 messages):

Triton beginner resources, FP8 support on AMD GPUs, Austin Meetup


GPU MODE ▷ #torch (4 messages):

AOT Inductor, Libtorch C++, Torch.compile


GPU MODE ▷ #cool-links (2 messages):

AlexNet Source Code


GPU MODE ▷ #jobs (2 messages):

Thunder Compute, GPU virtualization, C++ distributed systems engineer


GPU MODE ▷ #beginner (55 messages🔥🔥):

A100 FP32 core limitations, NCU assembly view for warp stalls, FADD instruction latency, Citadel microarchitecture papers, Microbenchmarking


GPU MODE ▷ #rocm (41 messages🔥):

ROCm Profilers, MI300 vs H100, Runpod Clock Speeds, Runpod Profiling Issues, GPU Cloud Providers


GPU MODE ▷ #self-promotion (8 messages🔥):

MI300X support, vLLM, SGLang, GemLite, AMD


GPU MODE ▷ #general (1 message):

felix456: anyone know any cheap / free alternative solutions to using openai API websearch?


GPU MODE ▷ #submissions (11 messages🔥):

vectoradd, vectorsum, Modal runners, GPU Benchmarks, Leaderboard submissions


GPU MODE ▷ #amd-competition (22 messages🔥):

MI300 Profiling, Kernel Development details, Team formation, Github link


Cursor Community ▷ #general (154 messages🔥🔥):

MCP, Gemini API, Cursor bugs, Deepseek v3.1, usage based pricing


Yannic Kilcher ▷ #general (83 messages🔥🔥):

Schrödinger Bridges, DeepCoder 14B, KV Cache Distillation, AlphaProof, Math AIs


Yannic Kilcher ▷ #paper-discussion (4 messages):

AWS Site Visit, nanotron/ultrascale-playbook


Yannic Kilcher ▷ #ml-news (4 messages):

Awkward Youtuber, rQJmDWB9Zwk, 6nJZopACRuQ


MCP (Glama) ▷ #general (66 messages🔥🔥):

Enact Protocol, Semantic Tool Calling, A2A podcast, MCP sandboxing, MCP client integration


MCP (Glama) ▷ #showcase (5 messages):

MCP Protocol Validator Open Source, MCP Server Adoption Challenges, Cloud Hosted MCP Inspector, MatlabMCP - MATLAB Meets LLMs


Eleuther ▷ #general (31 messages🔥):

Discord referral, Dyslexia, KL vs CE, Model size


Eleuther ▷ #research (32 messages🔥):

Lambada Parity, RWKV vs Transformers, UTs and RWKV, Muon and Transformer Layers


Eleuther ▷ #interpretability-general (2 messages):

GPTs Agents, String Matching


Modular (Mojo 🔥) ▷ #general (55 messages🔥🔥):

SIMD store, bench functions incorrect use, @parameter needed, lock files, random integers list


Modular (Mojo 🔥) ▷ #mojo (4 messages):

Mojo project discrepancies, magic.lock file issues, Mojo version conflicts


Nomic.ai (GPT4All) ▷ #general (48 messages🔥):

L1-Qwen-1.5B-Max model, Nomic embed text v1.5, LLM query logging, System prompts for embedding models, Re-ranker models


HuggingFace ▷ #general (31 messages🔥):

Gradio GUI, Transformer Training Data Volume, Finding Python Expert, Reporting HF Course Errors, Fine Tuning Model


HuggingFace ▷ #today-im-learning (3 messages):

Life's Unexpected Surprises


HuggingFace ▷ #cool-finds (1 message):

not_lain: the app is offline


HuggingFace ▷ #reading-group (1 message):

Stanford CME 295 Transformers, LLM Book Discussions


HuggingFace ▷ #computer-vision (6 messages):

Object Tracking, ReID for Object Recognition, Owlv2 Model, Segment Anything Model (SAM), YOLO Model


HuggingFace ▷ #agents-course (4 messages):

LangGraph vs Google ADK, Google Agent Development Kit, Meta Llama access


Nous Research AI ▷ #general (14 messages🔥):

vgel's control-vectors, DisTrO details, Psyche's testnet run


Nous Research AI ▷ #ask-about-llms (4 messages):

Azure API, Reasoning Content, Token Limits


Nous Research AI ▷ #interesting-links (3 messages):

X Post, Teknium User Mentions


tinygrad (George Hotz) ▷ #general (6 messages):

Pathways Paper, TPU vs GPU, Tinygrad cloud, Tinygrad virtualization


tinygrad (George Hotz) ▷ #learn-tinygrad (14 messages🔥):

Position-Independent Code, ELF Loader, Compiler Linking, TinyGrad Architecture, Memory Map Generation


Torchtune ▷ #announcements (1 message):

Finetune Llama4, Scout Model, Maverick Model, MoE models


Torchtune ▷ #general (1 message):

jovial_lynx_74856: @here office hours in 43 mins!


Torchtune ▷ #dev (16 messages🔥):

running_loss.detach() fix, test tolerances, sampler seed, bitsandbytes Mac issues, FSDPModule import error


Torchtune ▷ #papers (1 message):

krammnic: I was speaking about something like this


LlamaIndex ▷ #general (18 messages🔥):

FunctionCallingAgent JSON Schema Response, Llama Cloud API 404 Error, FaissVectorStore Index from Weights, Intelligent Metadata Filtering in RAG Agent


Notebook LM ▷ #use-cases (7 messages):

Microphone recognition issues in NotebookLM, Upload source errors, Phishing attempts


Cohere ▷ #「💬」general (2 messages):

Vague questions, Specific Queries


Cohere ▷ #「🔌」api-discussions (2 messages):

Java API, Network error


DSPy ▷ #general (2 messages):

DSPy module as a persona, AI Agents & Reasoning, Large Language Models (LLMs), Machine Learning Frameworks, Infrastructure


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (2 messages):

Course Deadlines, Certificate Availability


MLOps @Chipro ▷ #events (1 message):

Event reminder




{% else %}

The full channel-by-channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AINews, please share it with a friend! Thanks in advance!

{% endif %}