Frozen AI News archive

OpenAI buys Jony Ive's io for $6.5b, LMArena lands $100m seed from a16z

**OpenAI** confirmed a partnership with **Jony Ive** to develop consumer hardware. **LMArena** secured a $100 million seed round from **a16z**. **Mistral** launched a new code model fine-tune. **Google DeepMind** announced multiple updates at **Google I/O 2024**, including over a dozen new models and 20 AI products. Key highlights include the release of **Gemini 2.5 Pro** and **Gemini Diffusion**, featuring advanced multimodal reasoning, coding, and math capabilities, and integration of Gemini in **Google Chrome** as an AI browsing assistant. **Deep Think** enhanced reasoning mode and **Project Astra** improvements were also introduced, focusing on voice output, memory, and computer control for a universal AI assistant.

Canonical issue URL

Jony Ive is all you need.

AI News for 5/20/2025-5/21/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (215 channels, and 6969 messages) for you. Estimated reading time saved (at 200wpm): 597 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

A day after Google's I/O, OpenAI consummated the long rumored Jony Ive partnership and confirmed plans to ship consumer hardware.

LMArena announced their $100m SEED (!?!?) from a16z.

Mistral launched a new code model finetune.

You could be forgiven for completely missing OpenAI's nice updates to the Responses API.


AI Twitter Recap

Here's a summary of the tweets you provided, organized by category:

Google I/O 2024 Announcements and Keynotes

Gemini Models and Capabilities

Agentic Web and AI Agents

Open Source Models and Tools

Model Architecture and Techniques

Google DeepMind's AI Filmmaking Tools

Other News

Humor/Memes


AI Reddit Recap

/r/LocalLlama Recap

1. Mistral Devstral Coding Model Announcements and Benchmarks

2. Major New Model and Architecture Releases (Gemini Diffusion, Bagel MOE, Falcon-H1)

3. Feedback and Application News for Next-Gen Local/Edge AI (Gemma3n, MedGemma, General LLM User Impressions)

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Google Veo 3 and AI-Generated Video Breakthroughs

2. Multimodal and Open-Source Model Releases (Bagel, TTS)

3. Anthropic Claude 4 Sonnet/Opus Launch and Expectations


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: The Model Gauntlet: New Releases, Performance Puzzles, and Shifting Tides

Theme 2: Powering the Prompts: Hardware Hustles and GPU Grandeur

Theme 3: Forging the Future: Frameworks, Fine-Tuning, and Agent Architectures

Theme 4: AI in Action: Multimodal Marvels to Model Misbehaviors

Theme 5: Ecosystem Evolution: Open Source Onslaughts, Funding Feats, and Platform Puzzles


Discord: High level Discord summaries

LM Studio Discord


Unsloth AI (Daniel Han) Discord


LMArena Discord


Perplexity AI Discord


OpenRouter (Alex Atallah) Discord


Cursor Community Discord


HuggingFace Discord


Eleuther Discord


Notebook LM Discord


Latent Space Discord


GPU MODE Discord


aider (Paul Gauthier) Discord


Manus.im Discord Discord


Nous Research AI Discord


Modular (Mojo 🔥) Discord


Yannick Kilcher Discord


MCP (Glama) Discord


Cohere Discord


LlamaIndex Discord


Torchtune Discord


tinygrad (George Hotz) Discord


DSPy Discord


LLM Agents (Berkeley MOOC) Discord


Nomic.ai (GPT4All) Discord


Gorilla LLM (Berkeley Function Calling) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

LM Studio ▷ #general (297 messages🔥🔥):

Context Filling, Gemma 3 architecture, Multimodal model performance, Qwen 3 integration, Falcon-H1 GGUF


LM Studio ▷ #hardware-discussion (885 messages🔥🔥🔥):

Strix Halo and 96GB of RAM, Optimal GPU setup, Running vs. Using LLMs, PCIE bus bottlenecks, NVLink vs SLI


Unsloth AI (Daniel Han) ▷ #general (911 messages🔥🔥🔥):

OpenAI vs OSS, Medgemma finetuning, VITS 2 and TTS models, MoE models


Unsloth AI (Daniel Han) ▷ #off-topic (16 messages🔥):

Diffusion vs Autoregression, Gemini Diffusion, New SOTA 1B Model, Daniel Han's Tweets as a Blog


Unsloth AI (Daniel Han) ▷ #help (226 messages🔥🔥):

Phi-4 model issues after merging LoRA adapter, CSM-1B voice training issues, Qwen3 function calling problems, DeepSeek V3 GGUF download, Unsloth notebook for retrieval augmented finetuning


Unsloth AI (Daniel Han) ▷ #research (3 messages):

KernelLLM, Mixture of Experts Models


LMArena ▷ #general (952 messages🔥🔥🔥):

Gemini 2.5 Pro, Claude 4 leak, LMArena raised 100M, Grok 3.5


LMArena ▷ #announcements (1 messages):

LMArena new website, LMArena $100M seed funding, LMArena staff AMA


Perplexity AI ▷ #general (827 messages🔥🔥🔥):

Grok PDF Export, Perplexity and Gemini 2.5 Flash, Image Generation on Mobile, Comet Browser, RoboForm


Perplexity AI ▷ #sharing (1 messages):

``


Perplexity AI ▷ #pplx-api (8 messages🔥):

Deep Research Model Confusion, WebUI vs API Access Differences, Perplexity Hackathon Rules, Community Forum Announcement


OpenRouter (Alex Atallah) ▷ #app-showcase (1 messages):

MCP Support, Gemini image gen, PDF/File support/viewing, PDF Generation, UI/UX improvements


OpenRouter (Alex Atallah) ▷ #general (326 messages🔥🔥):

Gemini Diffusion Speed, Meta Llama 3.3 Release, Gemma-3n-4B vs Claude 3.7, TTS Models for Yoga AI, Veo 3 Cost and Capabilities


Cursor Community ▷ #general (278 messages🔥🔥):

Service Unavailable error, Chatgpt pro is bargain, Rust for backend, Integrating memory banks, Gemini flash


HuggingFace ▷ #general (230 messages🔥🔥):

GPU rental, AMD vs NVIDIA for local models, Model Inference speed, HuggingFace SEO, Falcon H1


HuggingFace ▷ #today-im-learning (3 messages):

Smolagent Real World Agents, HFAPI Issues, LiteLLM Slowdown, Ollama based models and Reasoning, GGUF models


HuggingFace ▷ #cool-finds (1 messages):

Manus AI Agent, Referral Link


HuggingFace ▷ #i-made-this (17 messages🔥):

Lunaris Codex, DataTune, LLMChat GNOME Shell Extension, Optuna and Transformers Integration, Scaling Mixture of Experts Models


HuggingFace ▷ #computer-vision (2 messages):

LayoutLMv3 Workflow, Donut Model Integration, OCR Methods Comparison, GeoMeetup SF


HuggingFace ▷ #gradio-announcements (1 messages):

MCP, Gradio, AI Agent Development, Prizes


HuggingFace ▷ #agents-course (11 messages🔥):

Dummy agents lib on HF Interface, LinkedIn Certificate creation links, Agents Course Deadlines, Scores Dataset Submission, Certificate Issues


Eleuther ▷ #general (203 messages🔥🔥):

AI Slop definition, Computational Irreducibility and Novelty, Gemini Diffusion Noise Model, Discord Dataset Use Cases


Eleuther ▷ #research (15 messages🔥):

LinkedIn icon usage, BEAR Probe evaluation, RAG database leaks, Platonic Representations Hypothesis


Eleuther ▷ #lm-thunderdome (34 messages🔥):

Qwen 2.5 GSM8k Evaluation, lm-evaluation-harness PR issue, Qwen 2.5 Evaluation Prompt


Notebook LM ▷ #announcements (2 messages):

Audio Overviews, Video Overviews, Google I/O


Notebook LM ▷ #use-cases (30 messages🔥):

NotebookLM PDF Upload, AI Studio and Webpage Reading, Project Astra Improvements, Gemini App Features, Video Overviews Languages


Notebook LM ▷ #general (136 messages🔥🔥):

Political censorship, Longer Audios, NotebookLM updates, Gemini features, Video overviews in other languages


Latent Space ▷ #ai-general-chat (146 messages🔥🔥):

Google I/O 2025, Gemma 3n, Stitch by Google, OpenAI Structured Outputs, Sam Altman and Jony Ive


GPU MODE ▷ #general (12 messages🔥):

Multihead GRU Layers in Cute Kernels, Warp Specialization Algorithms, Loss Spikes in Softmax-Attention-1b, Training Time Indication, RNN vs Softmax Performance at Different Scales


GPU MODE ▷ #triton (4 messages):

Triton autotuner, extern_elementwise API, Blackwell support


GPU MODE ▷ #cuda (9 messages🔥):

__reduce_add_sync, asynchronous wgmma pipelines Hopper, complete::tx_bytes async TMA loads, wait_group<0> consumer


GPU MODE ▷ #algorithms (3 messages):

MCMC, Variational Inference


GPU MODE ▷ #cool-links (2 messages):

Google Gemini Diffusion, Block Diffusion, KV Cache


GPU MODE ▷ #beginner (1 messages):

Elementwise Kernel, Vectorized Loads/Stores


GPU MODE ▷ #torchao (1 messages):

OpenAssistant/oasst1 dataset, Default settings


GPU MODE ▷ #off-topic (2 messages):

Elon Musk buying GPUs, 1 million GPU facility, dotnet runtime PR


GPU MODE ▷ #irl-meetup (1 messages):

viranchee: Any in person events in SF in next 30 days


GPU MODE ▷ #self-promotion (1 messages):

Multi-Agent Hackathon, Tenstorrent hardware, Koyeb cloud platform


GPU MODE ▷ #🍿 (7 messages):

KernelBench, KernelBook, GRPO, NVCC logs, RL Baseline


GPU MODE ▷ #reasoning-gym (1 messages):

rasdani: awesome! looking forward to the paper 🙂


GPU MODE ▷ #submissions (55 messages🔥🔥):

AMD MI300 Performance, amd-mixture-of-experts, amd-mla-decode, amd-fp8-mm, Workflow Timeouts


GPU MODE ▷ #status (2 messages):

amd-mla-decode, MLA running on GPU


GPU MODE ▷ #factorio-learning-env (19 messages🔥):

Easier Integration Interface for External Agents, Championship Build (4M SPM), Project Sid, Multi-agent Minecraft simulator


GPU MODE ▷ #amd-competition (7 messages):

MLA-Decode Data Generator, Ranked sequence length in DRAM


GPU MODE ▷ #cutlass (10 messages🔥):

WSL2 performance with CUDA, PTX or SASS dumping with cute.compile, CuTe DSL Feedback


GPU MODE ▷ #singularity-systems (1 messages):

picograd, Rust, Python, FFN, RNN


aider (Paul Gauthier) ▷ #general (62 messages🔥🔥):

Gemini 2.5 Flash Preview, Aider and Jules as background Agents, Aider Polyglot Benchmark, Copilot getting open sourced


aider (Paul Gauthier) ▷ #questions-and-tips (20 messages🔥):

Gemini 2.5 Flash Benchmark, Aider and Pip Packages, Running IPYNB Notebooks, --read flag issues, Context from file


Manus.im Discord ▷ #general (82 messages🔥🔥):

RizzDial Marketing, Manus Credits, Manus vs cluely.ai, Manus Image Generation, Manus for Coding Projects


Nous Research AI ▷ #general (73 messages🔥🔥):

audio output parameters, Gemma access, Gemini diffusion model, WildChat-1M dataset, Devstral coding agent


Nous Research AI ▷ #ask-about-llms (5 messages):

Restricting Models, AI in education, Hermes 3


Nous Research AI ▷ #interesting-links (2 messages):

Gemma 3n models, Matformer arch


Modular (Mojo 🔥) ▷ #general (19 messages🔥):

Claude Code with Mojo, Mojo Code Generation, AI coding assistance for Mojo, Cursor vs Claude


Modular (Mojo 🔥) ▷ #mojo (42 messages🔥):

Float16 exp implementation, Mojo compile times, String null termination changes


Modular (Mojo 🔥) ▷ #max (6 messages):

Modular max imports, Torch CustomOpLibrary, MLIR context errors, Modular Forum for issues


Yannick Kilcher ▷ #general (29 messages🔥):

Pytorch Geometric, GATConv AssertionError, SGD vs Genetic Breeding, Concept entanglement vs fracture, Picbreeder


Yannick Kilcher ▷ #paper-discussion (14 messages🔥):

Physics of Language Models: Part 3.2, Knowledge Manipulation, Out-of-distribution abuse, Data contamination


Yannick Kilcher ▷ #ml-news (17 messages🔥):

Gemma 3n, Google AI Edge, Anthropic AI, Humane AI Pin Failure, Rabbit R1 Failure


MCP (Glama) ▷ #general (33 messages🔥):

Streaming transport adoption, Decoupling transport and wire protocols, MCP memory bank, OpenAI rolling out MCP support, Tool name constraints


MCP (Glama) ▷ #showcase (11 messages🔥):

MCP SDK, Auth server, Typescript, MCP resource server, Frontend CLI


Cohere ▷ #💬-general (23 messages🔥):

Private Deployment Options with Cohere, Command A Slow Response Times, Entity Extraction and JSON Output Issues, Embed v4 and Bedrock Availability, Self-Hosting Command Models


Cohere ▷ #🔌-api-discussions (15 messages🔥):

Embed v4, vector DB, rate limiting, open models, AWS


Cohere ▷ #🟢-status-updates (1 messages):

embed-v4.0, Cohere Status Page


Cohere ▷ #🎯-private-deployments (1 messages):

Cohere Sales, Cohere Support


LlamaIndex ▷ #blog (2 messages):

Monorepo management, uv package management, LlamaDev build tool, Discord Office Hours


LlamaIndex ▷ #general (35 messages🔥):

Llama Parse Issues with Layout Agent, VectorStoreIndex vs FAISS, Model llamaindex/vdr-2b-multi-v1 issues, Azure AI Search Integration, LlamaIndex Office Hours


Torchtune ▷ #general (15 messages🔥):

Torchtune generate Qwen2_5_0_5b, Tokenizer bug in inference mode, Custom tokenizer patch, LORA finetuning gibberish, Resizing Token Embeddings


Torchtune ▷ #dev (2 messages):

DistCp, Safetensors, Async Checkpointing, DCP Team


Torchtune ▷ #rl (4 messages):

Async RL Recipe, Microsoft's Verl framework


tinygrad (George Hotz) ▷ #general (6 messages):

Job Opportunities, tinygrad bounties, distributed training, mmapeak work, RDNA4 instructions


tinygrad (George Hotz) ▷ #learn-tinygrad (6 messages):

JAX control flow, Tensor.where, jax.lax.cond


DSPy ▷ #general (8 messages🔥):

DSPy Framework, Bias Training, Case Study


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (3 messages):

Course deadlines, Certificate requirements


Nomic.ai (GPT4All) ▷ #general (3 messages):

GPT4All OpenAI API Key, Extending GPT4All interface for more than text LLMs


Gorilla LLM (Berkeley Function Calling) ▷ #discussion (1 messages):

Manus AI Referral, Powerful Agents