Frozen AI News archive

Google I/O: new Gemini native voice, Flash, DeepThink, AI Mode (DeepSearch+Mariner+Astra)

**Google I/O 2024** showcased significant advancements with **Gemini 2.5 Pro** and **Deep Think** reasoning mode from **google-deepmind**, emphasizing AI-driven transformations and developer opportunities. **GeminiApp** aims to become a universal **AI assistant** on the path to **AGI**, with new features like **AI Mode** in Google Search expanding generative AI access. The event included multiple keynotes and updates on over a dozen models and 20+ AI products, highlighting **Google's** leadership in AI innovation. Influential voices like **demishassabis** and **philschmid** provided insights and recaps, while the launch of **Jules** as a competitor to Codex/Devin was noted.

Canonical issue URL

Gemini is all you need.

AI News for 5/19/2025-5/20/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (215 channels, and 7031 messages) for you. Estimated reading time saved (at 200wpm): 622 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Twelve months ago we covered Google I/O, but if we're being honest Gemini wasn't quiiite frontier yet and it was somewhat overshadowed by 4o's launch.

Six months ago we wrote that Google wakes up with Gemini 2.0, and that began an epic multi-month run of increasing Gemini dominance (even adopting the AINews chart):

gemini

and today confirmed by official numbers from Gemini (though much of this helped by having the most generous free tier in the world):

gemini

The AI Twitter recap below does a pretty good job of recapping the major launches so we won't really bother redoing it, but we'd definitely say it missed the launch of Jules (Gemini's Codex/Devin competitor) because Jules was somewhat pre-leaked.

As always the Verge does a great job of condensing the 3 hour keynote into 30 mins:

verge


AI Twitter Recap

Google I/O 2024 Event and Announcements

AI Model Releases, Evaluation, and Analysis

AI in Robotics, Agents, and Automation

Company Partnerships, Investments, and Business Applications

Techniques, Tools, and Tutorials

Political, Ethical, and Philosophical Musings

Humor and Miscellaneous


AI Reddit Recap

/r/LocalLlama Recap

1. Gemma 3n Model Announcements and Community Reactions

2. Gemma 3 Technical Updates and Optimizations in llama.cpp

3. OpenEvolve and AlphaEvolve System Open Source Implementation

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Google Gemini 2.5 Pro & Ultra Model Benchmarks and Features

2. Civitai Payment Ban and Community Responses

3. Cutting-Edge AI for Science, Creativity, and Automation


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: Google's AI Blitz and New Model Onslaught

Theme 2: Revolutionizing AI Tooling and Developer Platforms

Theme 3: Rise of the AI Agents: Coding, Research, and Beyond

Theme 4: Pushing Performance Frontiers: Model Optimization and Evaluation

Theme 5: AI's Societal Pulse: Ethics, Slop, and Community Dynamics


Discord: High level Discord summaries

LMArena Discord


Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


LM Studio Discord


OpenRouter (Alex Atallah) Discord


Eleuther Discord


Cursor Community Discord


Modular (Mojo 🔥) Discord


Notebook LM Discord


GPU MODE Discord


Nous Research AI Discord


HuggingFace Discord


Latent Space Discord


Yannick Kilcher Discord


aider (Paul Gauthier) Discord


Manus.im Discord Discord


Cohere Discord


MCP (Glama) Discord


tinygrad (George Hotz) Discord


LlamaIndex Discord


Torchtune Discord


DSPy Discord


LLM Agents (Berkeley MOOC) Discord


Nomic.ai (GPT4All) Discord


MLOps @Chipro Discord


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

LMArena ▷ #general (1432 messages🔥🔥🔥):

Special Tokens, Gemma 3, Google I/O, OpenAI versus Google


Perplexity AI ▷ #announcements (1 messages):

Perplexity Updates, F1 Standings, Sidebar Shortcuts


Perplexity AI ▷ #general (808 messages🔥🔥🔥):

Notebooklm releases on Android, GPTs Agents, OpenAI's sidebars, Perplexity AI Discord chatbot, Grok is sweet


Perplexity AI ▷ #sharing (4 messages):

Grok, Data, Github Copilot, India


Perplexity AI ▷ #pplx-api (14 messages🔥):

Playground vs API Output Quality, Deep Research API Issues, Timeout Issues with Perplexity API, JSON schema via OpenAI Python library


Unsloth AI (Daniel Han) ▷ #general (538 messages🔥🔥🔥):

128k context on 32gb, VRAM Calculator, KernelLLM GGUFs, Vision Quants, Training VITS


Unsloth AI (Daniel Han) ▷ #off-topic (9 messages🔥):

Mistral Small 3.1, Qwen2.5 VL benchmark, IBM Granite 4.0, Gemini Diffusion, Visual AI Learning App


Unsloth AI (Daniel Han) ▷ #help (235 messages🔥🔥):

Unsloth Model Merging, PPO Training, GRPO Training, Qwen 3 Models


Unsloth AI (Daniel Han) ▷ #research (33 messages🔥):

Entropix Pruning, VLM Gemma3 evaluation metrics, OpenEvolve released


LM Studio ▷ #general (158 messages🔥🔥):

LM Studio API, RoPE Frequency Scale, Qwen 3 Speculative Decoding, Model Unloading via API, Sliding Window Attention


LM Studio ▷ #hardware-discussion (449 messages🔥🔥🔥):

Intel Arc IPEX support in LM Studio, AMD GPU drivers issues, AVX2 support in LM Studio, Dual GPU setup, PCIE5 vs SATA SSD speeds


OpenRouter (Alex Atallah) ▷ #announcements (3 messages):

Provider slugs, Quantization slugs, Gemini Flash 2.5 release, Llama provider by Meta


OpenRouter (Alex Atallah) ▷ #general (235 messages🔥🔥):

Gemini 2.5 Pro DeepThink, Veo 3, Imagen 4, Gemma 3n, audio support


Eleuther ▷ #general (183 messages🔥🔥):

Discord bot for message deletion, Definition of 'slop' in AI, Gemini Diffusion, Mentorship in AI/ML, ARC-AGI performance improvements inspired by compression


Eleuther ▷ #research (50 messages🔥):

Yi Ma's talk on Intelligence, Autoencoders and Compression, SSL Methods like DINOv2, Paper Code Releases, OpenEvolve release


Eleuther ▷ #lm-thunderdome (3 messages):

VLM Evaluation, Text-Only Evaluations, Codebase Conditionals


Cursor Community ▷ #general (207 messages🔥🔥):

25 tools limit kills the chat, DeepSeek-R1T-Chimera model breaks loop, MCPs refresh frequently, Gemini's thinking process changed, O3 Pro coming soon


Modular (Mojo 🔥) ▷ #general (24 messages🔥):

Running models without CUDA, MAX and HF models, Robotics models and data streams, Cosmos 'world model', Porting from PyTorch to MAX


Modular (Mojo 🔥) ▷ #mojo (174 messages🔥🔥):

False positive warnings in 25.3, Unused variable warnings in Mojo, fn() -> raises syntax, IO API design with parametric traits, DMA-based APIs


Modular (Mojo 🔥) ▷ #max (7 messages):

Max vs Fireworks.ai/Together.ai/Groq.com, vLLM comparison, Optimize Max for lower latency and higher throughput, Max imports source code visibility, Enterprise solution with large scale disaggregated inference


Notebook LM ▷ #announcements (3 messages):

NotebookLM mobile app release, Audio Overviews customization, Google I/O Keynote summary, Video Overviews feature preview


Notebook LM ▷ #use-cases (23 messages🔥):

Pronunciation issues in podcasts, Exporting timelines to Google Calendar, Integrating NBLM into Discord, AI Protocols to prevent source alteration, NotebookLM mobile app


Notebook LM ▷ #general (153 messages🔥🔥):

NotebookLM Android app feedback, Podcast generation, File size limits, Output language options, Sharing notebooks


GPU MODE ▷ #general (12 messages🔥):

cutotune autotuner, FSDP1 vs FSDP2, Liger-Kernel, multihead GRU layers in cute-kernels


GPU MODE ▷ #triton (2 messages):

Triton CPU Support, TRITON_INTERPRET API, CPU Parallelism Limitations


GPU MODE ▷ #cuda (6 messages):

CUDA Usage, CGO Impact, GPU Utilization


GPU MODE ▷ #torch (1 messages):

CUDA graph model capture, Distributed operations in models


GPU MODE ▷ #cool-links (3 messages):

MAXSUN Arc Pro B60 Dual, SageAttention, Gemini Diffusion


GPU MODE ▷ #torchao (4 messages):

Axolotl QAT/PTQ Workflow, Llama3.2 Quantization, OpenAssistant/oasst1 Dataset Evaluation


GPU MODE ▷ #off-topic (6 messages):

Microsoft Build Conference, Network Connection Issues, LB broken


GPU MODE ▷ #irl-meetup (1 messages):

CUDA Developer Meet Up, NVIDIA, UCL, London, Python-native GPU programming


GPU MODE ▷ #self-promotion (1 messages):

OpenEvolve release, Evolutionary coding agents, LLMs for algorithm optimization


GPU MODE ▷ #🍿 (2 messages):

Reasoning Models, Pass @K


GPU MODE ▷ #thunderkittens (1 messages):

simran9493: https://www.youtube.com/watch?v=xcpEl0cGCC4


GPU MODE ▷ #reasoning-gym (1 messages):

rasdani: awesome! looking forward to the paper 🙂


GPU MODE ▷ #submissions (47 messages🔥):

MI300 Leaderboard Updates, AMD-FP8-MM performance, Histogram Leaderboard, MLA Decode Results, Mixture of Experts Leaderboard


GPU MODE ▷ #status (6 messages):

Leaderboard Explanations, Histogram Submission Error


GPU MODE ▷ #factorio-learning-env (14 messages🔥):

FLE Use-Cases and Evaluation, Colab Server for Agent Prototyping, Factorio TAS Generator, Gym Interface for FLE, Meeting Time Coordination


GPU MODE ▷ #amd-competition (32 messages🔥):

MLA decode kernel, File Size Submission Limit, FP8-GEMM issues, MoE Submission down


GPU MODE ▷ #cutlass (7 messages):

Cutlass DSL Python Windows support, CUTLASS thread tiling error, CUTLASS GTC slide outdated


GPU MODE ▷ #singularity-systems (2 messages):

Picograd, Rust implementation, Pedagogical Resource


Nous Research AI ▷ #general (95 messages🔥🔥):

Google's Code Agent, Google I/O Announcements, Gemma 3n Model, Gemini Diffusion Model, Decentralized AI


Nous Research AI ▷ #ask-about-llms (3 messages):

Restricting model domains, AI models in education, Gemini Flash, AI as a teaching assistant


Nous Research AI ▷ #research-papers (1 messages):

LLMs spontaneously generate social conventions, Collective biases in decentralized LLM populations, Adversarial LLM agents driving social change


Nous Research AI ▷ #interesting-links (3 messages):

OpenEvolve Release, Evolutionary Coding Agents, Google DeepMind's AlphaEvolve, LLMs for Algorithm Optimization, Matformer architecture


Nous Research AI ▷ #research-papers (1 messages):

LLM social conventions, Collective biases in LLMs, Adversarial LLM agents


HuggingFace ▷ #general (52 messages🔥):

Xet file size limits, HuggingFolks role, LLM recommendations, Training Data Errors, Hugging Face collaboration


HuggingFace ▷ #today-im-learning (1 messages):

Research Paper Reading Workflow, Summarization tools, YouTube Video Explanations, Paper Selection Criteria


HuggingFace ▷ #i-made-this (12 messages🔥):

Video dropping page, Browser AI tool calls, Data transformations with LLMs, MCP server support, Cyberdesk computer agent


HuggingFace ▷ #reading-group (1 messages):

arpitbansal.: By any chance recording available for the recent session??


HuggingFace ▷ #computer-vision (6 messages):

Stanford CS231n lectures, Estimating bathymetry (sea depth) from Sentinel 1 SAR images, Object Detection, Segmentation Model


HuggingFace ▷ #NLP (1 messages):

BERT-style model inference, Logit Differences, Candle vs PyTorch


HuggingFace ▷ #agents-course (30 messages🔥):

GAIA formatting issues, Ollama Setup Help, LiteLLMModel, InferenceClientModel, AI Agent Course Certificate Sharing on LinkedIn


Latent Space ▷ #ai-general-chat (92 messages🔥🔥):

Perplexity Free Tier Costs, AI Builder Survey, Jules: Asynchronous Coding Agent, Coding Agents Comparison, Anti-AI Sentiment on Forums


Yannick Kilcher ▷ #general (27 messages🔥):

MLOps Courses, Features trained on single image, GNN implementation with torch_geometric``


Yannick Kilcher ▷ #paper-discussion (16 messages🔥):

Physics of Language Models, Knowledge Storage, Knowledge Extraction, Knowledge Manipulation, Out-of-Distribution Buzzword


Yannick Kilcher ▷ #ml-news (28 messages🔥):

Alpha Evolve, Google Codex Competitor, Labor Saturation Theory, LatentSeek vs COCONUT, Gemini Diffusion


aider (Paul Gauthier) ▷ #general (54 messages🔥):

Qwen MoE 3, Qwen 2 35B Polyglot benchmark, Aider Notifications, Aider as Agent, Navigator PR


aider (Paul Gauthier) ▷ #questions-and-tips (15 messages🔥):

Aider Shell Command Execution, Aider YAML Configuration, Aider Prompt Context, Gemini 2.5 Flash Benchmark


Manus.im Discord ▷ #general (64 messages🔥🔥):

Manus AI Agent, Credit System & Invitation Code, Manus Website Creation, Network Connection Errors, Manus Tech Stack


Cohere ▷ #💬-general (28 messages🔥):

Category Theory and AI, Cohere Research Grants Program, Private Deployment Options at Cohere, Command A and Structured Responses Slowdown, JSON Output Hanging Issues with Command-R


Cohere ▷ #💡-projects (1 messages):

Vitalops datatune, Open source data transformation tool


Cohere ▷ #🎯-private-deployments (2 messages):

Private Deployment, Data Sovereignty, LLM Sovereignty, Cohere models on-prem


MCP (Glama) ▷ #general (20 messages🔥):

MCP best practices, MCP and Cursor, crawl4ai mcp server, A2A protocol Agents, Wallet MCP


MCP (Glama) ▷ #showcase (9 messages🔥):

MCP-GraphQL issues, Public SearXNG MCP server, AI-friendly Data API


tinygrad (George Hotz) ▷ #general (12 messages🔥):

AMD enum changes, 7900XTX vs 9070XT flash attention, RDNA4 wmma instructions, BERT training bounty, tinygrad gemm optimization


tinygrad (George Hotz) ▷ #learn-tinygrad (7 messages):

tinygrad control flow, jax.lax.cond equivalent, Tensor.where


LlamaIndex ▷ #blog (2 messages):

Financial Analysis Workshop, Multi-Agent Communication Protocol (MCP), AWS joins MCP steering committee


LlamaIndex ▷ #general (11 messages🔥):

Agent Handoff Examples, Llama Parse Service Issues, VectorStoreIndex vs Local FAISS


Torchtune ▷ #general (2 messages):

Recipe Tutorials, Automated CI, Llama2 Evaluation


Torchtune ▷ #dev (2 messages):

DistCp, Safetensors, Async Checkpointing


Torchtune ▷ #rl (3 messages):

async_grpo, async_rl, vllm dependencies, torch version compatibility


DSPy ▷ #general (3 messages):

DSPy X post, DSPy is all about, Getting what DSPy is all about?


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (1 messages):

AgentX Competition, Submission Forms, Judging Panel, Entrepreneurship Track, Research Track


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (2 messages):

OpenAI API Keys, Trailblazer Tier, Mastery Tiers


Nomic.ai (GPT4All) ▷ #general (2 messages):

PDF text extraction, GPT4All OpenAI API Key installation


MLOps @Chipro ▷ #general-ml (1 messages):

DataTune, Data Transformation, Open Source Tool, Natural Language Instructions, LLMs