Frozen AI News archive

Gemini 2.5 Pro Preview 05-06 (I/O edition) - the SOTA vision+coding model

**Gemini 2.5 Pro** has been updated with enhanced multimodal image-to-code capabilities and dominates the WebDev Arena Leaderboard, surpassing **Claude 3.7 Sonnet** in coding and other tasks. **Nvidia** released the **Llama-Nemotron** model family on Hugging Face, noted for efficient reasoning and inference. **Alibaba's Qwen3** models range from 0.6B to 235B parameters, including dense and MoE variants. **KerasRS** was released by **Fran\0ois Chollet** as a new recommender system library compatible with JAX, PyTorch, and TensorFlow, optimized for TPUs. These updates highlight advancements in coding, reasoning, and speech recognition models.

Canonical issue URL

Gemini is all you need.

AI News for 5/5/2025-5/6/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (214 channels, and 4980 messages) for you. Estimated reading time saved (at 200wpm): 468 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

3 weeks after 2.5 Flash captured the low end of the Pareto Frontier, it is time for Gemini to re-up the high end.

Google I/O is in two weeks, and there's an old adage that adding more coding in a model's dataset somehow helps it improve in all other respects, and today's Gemini 2.5 Pro update (which was only released 6 weeks ago) highlights its multimodal image-to-code capabilities that evoke the viral Tldraw moment of last year.

Making a clean sweep of #1 across LMArena leaderboards these days carries less weight than it used to, but beating Sonnet 3.7 at Coding still is noteworthy.

The finer details of the rollout across AIStudio and Gemini App are also to be appreciated.


AI Twitter Recap

Model Updates and Releases

Leaderboard and Benchmark Results

AI and Machine Learning Research

AI Tooling and Applications

Industry and Business Developments

Society

Humor


AI Reddit Recap

/r/LocalLlama Recap

1. Qwen Model Performance and VRAM Usage Discussions

2. New Open-Source SOTA Music Generation Model ACE-Step

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Gemini 2.5 Pro Model Updates and Benchmarks

2. OpenAI Acquisition of Windsurf Coverage

3. Latest AI Image and Video Generation Model Launches


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1. LLM Releases and Performance Showdowns

Theme 2. Tooling Up: AI Development Platforms & Frameworks Evolve

Theme 3. Under the Hood: Model Optimization, Fine-Tuning, and Interpretability

Theme 4. The Silicon Battleground: Hardware Pushing AI Frontiers

Theme 5. AI in Action: Applications, Prompting Quirks, and Ethical Considerations


Discord: High level Discord summaries

LMArena Discord


Cursor Community Discord


Unsloth AI (Daniel Han) Discord


Perplexity AI Discord


LM Studio Discord


OpenAI Discord


OpenRouter (Alex Atallah) Discord


Manus.im Discord Discord


GPU MODE Discord


aider (Paul Gauthier) Discord


MCP (Glama) Discord


HuggingFace Discord


Yannick Kilcher Discord


Notebook LM Discord


Latent Space Discord


Nous Research AI Discord


Eleuther Discord


Torchtune Discord


Modular (Mojo 🔥) Discord


LlamaIndex Discord


tinygrad (George Hotz) Discord


LLM Agents (Berkeley MOOC) Discord


Cohere Discord


Nomic.ai (GPT4All) Discord


MLOps @Chipro Discord


Codeium (Windsurf) Discord


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

LMArena ▷ #general (1179 messages🔥🔥🔥):

Claude Code, Gemini 2.5 Pro, Grok 3.5 release, o3 vs Gemini


Cursor Community ▷ #general (536 messages🔥🔥🔥):

Cursor Pro slow requests, Connection failed error, Cursor 4.1 groove, ASI Age internal logics, Open Router accounts


Unsloth AI (Daniel Han) ▷ #general (167 messages🔥🔥):

Gemma-3 Finetuning, Qwen3 Fine-tuning Issues, GLM notebook testing, Saving models as GGUF, Vision data format for Gemma3


Unsloth AI (Daniel Han) ▷ #off-topic (5 messages):

Gemma 3 12b, Qwen3 14b, Hallucinations in models, Tool calling fixes


Unsloth AI (Daniel Han) ▷ #help (239 messages🔥🔥):

Qwen3 model differences, LMStudio and Gemma3 Issues, SafetensorError on Windows, Granite model finetuning, Sample weights during training


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

GroqStreamChain, Real-time AI chat apps, WebSockets, LangChain integration


Unsloth AI (Daniel Han) ▷ #research (59 messages🔥🔥):

Transformer + BERT Model Combination, Pretraining Gemma3 with Medical Data, Multi-label multi-class classification, Muon paper by Google, Integrating Muon with Hugging Face


Perplexity AI ▷ #general (451 messages🔥🔥🔥):

O3 lazy, AI competition recipes, O3 pro 2x, Discord bot down, Perplexity image quality loss


Perplexity AI ▷ #pplx-api (3 messages):

workaround with Beautiful Soup, URL citations


LM Studio ▷ #general (275 messages🔥🔥):

Mistral 3.1 24b Image Recognition, LM Studio model updates, Self-hosted LLM vs API costing, LLM Training and User Data, Speculative Decoding with Qwen 3


LM Studio ▷ #hardware-discussion (87 messages🔥🔥):

Q8 XL Model Performance, 4080 vs 4060 GPU Setup, Memory Bandwidth Bottleneck, Random Token Generation Speeds, Apple M3 Memory Configurations


OpenAI ▷ #ai-discussions (198 messages🔥🔥):

Google AI Studio vs Gemini, Gemini 2.5 Pro Coding Prowess, GPT's Flattery Glaze, Lucid Dreaming Techniques, Grok 3.5 expectations


OpenAI ▷ #gpt-4-discussions (24 messages🔥):

GPT-4o issues, 4o Browser performance, Dragging chats into folders


OpenAI ▷ #prompt-engineering (54 messages🔥):

Prompt engineering definition, ChatGPT cost, Truth in AI, Atomic theory in AI, Custom GPT design


OpenAI ▷ #api-discussions (54 messages🔥):

Prompt Engineering, Truth in AI, Atomic Theory, Customizing ChatGPT, Eastern Philosophy in Chatbots


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

Gemini 2.5 Pro, Activity Page Enhanced, Reasoning Model Perf Metrics, Request Builder API, Prompt Category API


OpenRouter (Alex Atallah) ▷ #app-showcase (10 messages🔥):

Openrouter-powered Discord Bot, LMarena database, SimpleAIChat LLM chat client


OpenRouter (Alex Atallah) ▷ #general (252 messages🔥🔥):

OpenRouter 500 errors, Wayfarer-Large-70B-Llama-3.3, Google Gemini embedding model pricing and rate limits, CPU-only provider feasibility for OpenRouter, OpenAI API errors and debugging


Manus.im Discord ▷ #general (189 messages🔥🔥):

Subscription Issues, Selling Credits, Manus Invitation Codes, Manus Reading Links, Manus vs ChatGPT


GPU MODE ▷ #triton (2 messages):

Triton, torch.index_select, GPU kernel, row-indexing functionality


GPU MODE ▷ #cuda (66 messages🔥🔥):

RTX 6000 PRO, compute capability, A6000, cuda cores


GPU MODE ▷ #torch (1 messages):

YOLO Model Training, Multi-GPU utilization


GPU MODE ▷ #cool-links (3 messages):

Hugging Face Kernels Community, Leaderboard Kernels Publication


GPU MODE ▷ #beginner (10 messages🔥):

Running CUDA code without NVIDIA GPU, Quantization and dequantization for transformer engine, Generating roofline plot


GPU MODE ▷ #torchao (3 messages):

torchao quantization, LSTM model quantization, CPU vs GPU operators, torch.quantization vs torchao


GPU MODE ▷ #off-topic (1 messages):

s1r_o: https://www.cursor.com/students for students around, this could be of use


GPU MODE ▷ #webgpu (9 messages🔥):

WebGPU crashes, Zig and wgpu-native, Shader module creation, WGPUChainedStruct errors, GLSL vs WGSL


GPU MODE ▷ #self-promotion (1 messages):

NVIDIA L2 GPU Optimization, Custom Memory Allocator, Elementwise Kernel Builder


GPU MODE ▷ #submissions (40 messages🔥):

amd-fp8-mm leaderboard, amd-mixture-of-experts leaderboard, MI300 performance


GPU MODE ▷ #status (1 messages):

Security risk assessment, Competition code platforms


GPU MODE ▷ #amd-competition (11 messages🔥):

AITER benchmark data, ROCm private repo, Leaderboard submission for amd-mixture-of-experts, CLI resubmits and timeouts


GPU MODE ▷ #mojo (7 messages):

FP8 Support, Hardware Extensibility, ML Compilers, End-to-End ML Models


aider (Paul Gauthier) ▷ #general (120 messages🔥🔥):

Aider 0.82.3, udiff-simple, gemini 2.5, Data Privacy, Vertex API


aider (Paul Gauthier) ▷ #questions-and-tips (29 messages🔥):

Aider Subtree, Lint Command, HTML representations, OpenRouter, Authentication Error


MCP (Glama) ▷ #general (108 messages🔥🔥):

PM2 for MCP servers, OAuth with Keycloak for MCP, MPC Server initiating communication with Claude Desktop, Memory options for Claude, Controlling Claude's tool access


MCP (Glama) ▷ #showcase (4 messages):

Graphlit, MCP search engine, MCP servers


HuggingFace ▷ #general (45 messages🔥):

api-inference.huggingface.co, Object Tracking models, DOI deletion, Summarising caselaw, Data Parallelism vs Model Parallelism


HuggingFace ▷ #today-im-learning (9 messages🔥):

AI Study Group GitHub Repo, List of AI Papers Plan, Discord Usage


HuggingFace ▷ #i-made-this (8 messages🔥):

Huggingface Desktop App, Dank Leaks, Flux-Pro-Unlimited AI image generator, candle-holder


HuggingFace ▷ #core-announcements (1 messages):

HiDream LoRAs, Quantization Support, Memory Savings


HuggingFace ▷ #NLP (13 messages🔥):

GPU memory considerations, Emotion Classification Models, FullyShardedDataParallelPlugin Error


HuggingFace ▷ #smol-course (3 messages):

HF API Limits, GAIA files evaluation, Gemma3 vs Qwen


HuggingFace ▷ #agents-course (12 messages🔥):

GAIA Questions, Agent UI Timeouts, Final Agent Build, Frameworks, Final Challenge Solution


Yannick Kilcher ▷ #general (71 messages🔥🔥):

M4 Macbook Pro vs RTX 5070Ti for LLM inference, Diffusion Models as Evolutionary Algorithms, Search vs Optimization, AI-assisted Academic Articles and Patents, Claude Code with Gemini


Yannick Kilcher ▷ #paper-discussion (1 messages):

k_nearest_neighbor: I won't be able to do a paper today, but feel free if anyone wants to.


Yannick Kilcher ▷ #ml-news (13 messages🔥):

Em Dashes, OAI, US Gov, Chinese Models, Deepseek vs OAI, Sam Altman


Notebook LM ▷ #use-cases (8 messages🔥):

Podcast Length Discrepancies, Inserting Instructions, Audio Overview Experiences, Mind Map Generation


Notebook LM ▷ #general (70 messages🔥🔥):

NotebookLM Audio Transcription, Gemini Flash 2.5 Confirmation, Cantonese Language Support, NotebookLM's Gemini Version, Interactive Mode in NotebookLM


Latent Space ▷ #ai-general-chat (63 messages🔥🔥):

Dwarkesh Automated Firms Essay, Revenue Numbers, Exa Blogpost, OpenAI Acquires Windsurf, Gemini 2.5 Pro Elo Bump


Nous Research AI ▷ #general (38 messages🔥):

OpenAI Public Benefit Corp, Flights to the US, RL Environments Hackathon, Fine tuning a base model LLM, M4 Macbook Pro vs RTX 5070Ti Linux laptop


Nous Research AI ▷ #interesting-links (5 messages):

AnySphere, AGI Agents


Eleuther ▷ #general (27 messages🔥):

Page Faults, Disk Swapping, Torch, Transformers, vLLM


Eleuther ▷ #research (3 messages):

Data vs Model Parallelism, arXiv:2305.18153 Citations, Anthropic Work


Eleuther ▷ #interpretability-general (9 messages🔥):

Circuit Identification, Anthropic Transformer Circuits, Monosemanticity, Interpretability Tooling Challenges, TransformerLens Limitations


Eleuther ▷ #multimodal-general (1 messages):

Multimodal VLMs, Community Research Hub, Weekly Updates


Torchtune ▷ #dev (19 messages🔥):

Codegen for new models, Reducing engineering time for new models, Tokenizer Support, HF Transformers Adapter, Qwen3 Support


Modular (Mojo 🔥) ▷ #general (3 messages):

Modular Puzzles on Macbook, Apple Silicon GPUs, NVIDIA GPU architectures


Modular (Mojo 🔥) ▷ #mojo (12 messages🔥):

Blogging Platforms, Ownership Semantics in Mojo, Mojo Getting Started Guide Errors, Comptime Try-Except Handling


LlamaIndex ▷ #blog (2 messages):

MCP Hackathon, Agent Communication, Deep Research Agent


LlamaIndex ▷ #general (6 messages):

Property Graph Indexes, LlamaIndex GraphRAG, LangChain GraphRAG, Vector Database Storage


tinygrad (George Hotz) ▷ #general (2 messages):

M4 Macbook vs RTX 5070Ti for local LLM, tinygrad discord rules


tinygrad (George Hotz) ▷ #learn-tinygrad (3 messages):

Bounty Picking, Rockship Device


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (1 messages):

Auth0 Workshop, AI Agent Security, Entrepreneurship Track Prizes


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (2 messages):

HuggingFace Credits, Quiz Scores


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (1 messages):

LLMs and Conditional Statements, Formal Methods in LLMs, Representing Conditions for LLMs


LLM Agents (Berkeley MOOC) ▷ #mooc-readings-discussion (1 messages):

LLM Reasoning, LLM Formal Methods, LLM Knowledge Representation


Cohere ▷ #🔌-api-discussions (5 messages):

Cohere-AI npm, Aya vision


Nomic.ai (GPT4All) ▷ #general (3 messages):

Claude System Prompt, Chat Template Generation with Python, GPT4All Integration


MLOps @Chipro ▷ #events (1 messages):

MLOps, LLMOps, AI Project Lifecycle, Data Phoenix


MLOps @Chipro ▷ #general-ml (1 messages):

Experiment Setup vs. Model Application, Model Validation Strategies


Codeium (Windsurf) ▷ #announcements (1 messages):

Windsurf Wave 8, Windsurf Reviews, Knowledge Base, Conversation Sharing, Teams Deploys