Frozen AI News archive

Gemini 2.5 Pro (06-05) launched at AI Engineer World's Fair

At the second day of **AIE**, **Google's Gemini 2.5 Pro** reclaimed the top spot on the LMArena leaderboard with a score of **1470** and a +24 Elo increase, showing improvements in coding, reasoning, and math. **Qwen3** released state-of-the-art embedding and reranking models, with **Qwen3-Embedding-8B** topping the MTEB multilingual leaderboard. **OpenThinker3-7B** emerged as the top open reasoning model trained on the **OpenThoughts3-1.2M dataset**, outperforming previous models by 33%. **LightOn** introduced **FastPlaid**, achieving up to a 554% speedup for late-interaction models. **Morph Labs** hired **Christian Szegedy** as Chief Scientist to lead Verified Superintelligence development. The **AI Engineer World's Fair** featured a fireside chat with **Greg Brockman** and **NVIDIA CEO Jensen Huang**, highlighting the return of basic research and engineering best practices.

Canonical issue URL

AIE is all you need.

AI News for 6/5/2025-6/6/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (218 channels, and 7848 messages) for you. Estimated reading time saved (at 200wpm): 636 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

In a busy day of nonAI events, we still managed to pull out the AI news. At the second day of AIE, Logan Kilpatrick knew how to please the crowd: Release a new Gemini:

The Twitter recap below does a pretty good job, and you can watch the full stream here, including a well reviewed live demo from Solomon Hykes and an insane trade deal from Christian Szegedy.


AI Twitter Recap

New AI Models & Benchmark Results

AI Engineer World's Fair

New Tools, Libraries, and Features

Industry & Platform Dynamics

Technical Concepts & Research

Humor & Memes


AI Reddit Recap

/r/LocalLlama Recap

1. 8B Parameter Model Head-to-Head: DeepSeek R1-0528-Qwen3-8B and Qwen3 8B

2. Recent Open LLM and Embedding Model Releases (OpenThinker3, Qwen3-Embedding, BAIDU on HuggingFace)

3. Benchmarks and Experiments: Sparse Transformers and LLM Town of Salem Tournament

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Gemini 2.5 Pro Release, Benchmarks, and Comparisons

2. ElevenLabs v3 Expressive Text-to-Speech Breakthroughs

3. Anthropic Claude Code and Project Features: User Experiences and Updates


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: Gemini Model Mayhem: Performance, Problems, and Predictions

Theme 2: New Models on the Block: Notable Releases and Rising Stars

Theme 3: Developer Tooling Turmoil: IDEs, Frameworks, and Open Source Offerings

Theme 4: GPU Gauntlets and Benchmarking Battles: Pushing AI to its Limits

Theme 5: Navigating the AI Maze: Privacy, Prompts, and Practical Problems


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


Eleuther Discord


Cursor Community Discord


Unsloth AI (Daniel Han) Discord


OpenRouter (Alex Atallah) Discord


HuggingFace Discord


OpenAI Discord


aider (Paul Gauthier) Discord


GPU MODE Discord


LM Studio Discord


Latent Space Discord


Manus.im Discord Discord


Modular (Mojo 🔥) Discord


Nous Research AI Discord


LlamaIndex Discord


MCP (Glama) Discord


Notebook LM Discord


tinygrad (George Hotz) Discord


Yannick Kilcher Discord


Torchtune Discord


DSPy Discord


Nomic.ai (GPT4All) Discord


Cohere Discord


LLM Agents (Berkeley MOOC) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1223 messages🔥🔥🔥):

Image generation in Perplexity, Billing issues, Loss of threads, Gemini 2.5 Pro, OpenAI vs Gemini vs Claude


Perplexity AI ▷ #sharing (6 messages):

Tetrix App, Gemini Share, Michael Tait, Pakistan Deception, Dark Tetrad


Perplexity AI ▷ #pplx-api (24 messages🔥):

API vs Online Results, Reasoning Effort + Async Mode for Sonar, Academic Mode, Richer Citations, Camera integration


LMArena ▷ #general (1447 messages🔥🔥🔥):

Gemini 2.5, Kingfall model, O3 Pro, Model Selector, AI Benchmarks


Eleuther ▷ #general (688 messages🔥🔥🔥):

LLMs Sycophancy, AI Verification, LLMs learning and data, Hallucinations, LLMs and unfalsifiable ideas


Eleuther ▷ #research (24 messages🔥):

Quantum Field Neural Networks (QFNN), Attention mechanisms, Potential data leak


Eleuther ▷ #scaling-laws (7 messages):

AI ROI Realization, AI Startup Bubble, Future of AI and Job Market, Non-LLM AI Ventures


Eleuther ▷ #interpretability-general (21 messages🔥):

Interpretability without teacher-forcing, RL with Teacher Forcing, Chat templates for Instruction Tuned Models, Token Embeddings


Eleuther ▷ #lm-thunderdome (2 messages):

Reasoning Model Evaluations, Answer Extractions, LLM as Judge, MMLU Flan Few-Shot


Cursor Community ▷ #general (661 messages🔥🔥🔥):

Claude Code vs Cursor, MCP as backend replacement, Cursor 1.0, Gemini models, Cursor documentation


Cursor Community ▷ #background-agents (42 messages🔥):

Background Agents, Cursor Github, Background Agent PRs, Mono Repo setup, Background Agents Slack Bot


Cursor Community ▷ #announcements (1 messages):

Cursor 1.0 release, code review features, background tasks


Unsloth AI (Daniel Han) ▷ #general (226 messages🔥🔥):

Unsloth Llama.cpp vision inference, GRPO Model Checkpoints, Learning rate and SFT, Synthetic Dataset Generation with Gemini 2.5, Deepseek R1 Qwen mergekit


Unsloth AI (Daniel Han) ▷ #off-topic (3 messages):

QLoRA Instruction Tuning Experiences, Pretraining Gemma for Varied Language, Triton Learning for LLM Inference


Unsloth AI (Daniel Han) ▷ #help (102 messages🔥🔥):

Orpheus-3b model new language training issues, Deepthink R2 Model, DeepSeek-R1-0528 memory usage during finetuning, VLM instruction tuning, unsloth library import errors


Unsloth AI (Daniel Han) ▷ #showcase (3 messages):

Qwen 2.5 SFT, Image Analysis


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

OpenRouter RSS Feed, Model Announcements


OpenRouter (Alex Atallah) ▷ #app-showcase (5 messages):

iOS App with OpenRouter LLM backend, Spreadsheets for Business World, Personality.gg


OpenRouter (Alex Atallah) ▷ #general (253 messages🔥🔥):

OpenRouter API request limits, Character cards for roleplay, Gemini 2.5, OpenAI logging, Gemini Pro vs Flash


HuggingFace ▷ #general (150 messages🔥🔥):

Responsible Prompting API by IBM, Consensus Validation for LLM Outputs, Hugging Face Transformers Installation on Windows, French AI Company's Lack of Moat, Choosing an LLM Model


HuggingFace ▷ #today-im-learning (2 messages):

Fraud Detection Resources, Financial Transactions Security


HuggingFace ▷ #i-made-this (3 messages):

Claude Desktop MCP Playground, Keras parallel training utility


HuggingFace ▷ #reading-group (1 messages):

shadow_lilac: Reading group seems amazing tbh, i'd gladly participate once new schedule


HuggingFace ▷ #NLP (51 messages🔥):

NLP on Windows, Semantic Similarity, BERT embeddings


HuggingFace ▷ #smol-course (2 messages):

Agents course, Certificate deadline


HuggingFace ▷ #agents-course (10 messages🔥):

Penguin Youtube Question, Passing audio to agents, Smolagens usage with Gemini, Course sign-up deadlines


OpenAI ▷ #ai-discussions (174 messages🔥🔥):

Codex CLI API, Adobe Firefly, O3 Pro Mode, Claude Max vs ChatGPT Pro, ChatGPT Connectors


OpenAI ▷ #gpt-4-discussions (3 messages):

AI in housing projects, Software and Business Developer with AI competence


OpenAI ▷ #prompt-engineering (19 messages🔥):

D&D Adventure Generation, LLM Prompt Pipeline, Consistent Tone in AI Content, AI as a Sponsor for NA Meetings, Prompt Version Tracking and Evaluation


OpenAI ▷ #api-discussions (19 messages🔥):

D&D Adventure Module Generation, Consistent Tone in Prompt Pipelines, Prompt Version Tracking and Evaluation, AI Sponsor for NA Meetings, Veo3 Results


aider (Paul Gauthier) ▷ #general (171 messages🔥🔥):

Qwen 2 35B vs DS R1.5, Gemini 2.5 Pro release date predictions, Gemini 2.5 Pro in chat mode downgrades, Aider webapp IDE, Gemini 2.5 Pro Aider Polyglot benchmark


aider (Paul Gauthier) ▷ #questions-and-tips (14 messages🔥):

Figma MCP on Claude Code, Aider vs Cursor, Gemini STT and Speech-to-Text Workflows, Superwhisper and Wispr Flow, Aider development style


GPU MODE ▷ #general (4 messages):

Hardware-aware Optimizations, ML Compilers, PyTorch CUDA, CUTLASS/CUB/cuDNN, Triton


GPU MODE ▷ #triton (1 messages):

Megakernel, Full-Model Kernel, KernelLLM, KernelLLM to the Rescue!


GPU MODE ▷ #cuda (7 messages):

CTA Data Transfer, Blackwell TF32 Block Scaling, GMEM Coalescing


GPU MODE ▷ #torch (8 messages🔥):

Torch Compile, AITemplate status, Tensor Constant Wrapping


GPU MODE ▷ #cool-links (3 messages):

Parallel Keras Models, Experience Replay Pool, Scaling Intelligence


GPU MODE ▷ #beginner (2 messages):

GPU access costs, ECE408 lectures


GPU MODE ▷ #jax (1 messages):

blueredblue: How does ffi_call work with pmap, will one kernel get launched per device?


GPU MODE ▷ #torchao (1 messages):

drisspg: I’ll do a review today


GPU MODE ▷ #off-topic (1 messages):

TiKZ, GIF


GPU MODE ▷ #irl-meetup (1 messages):

GTC Paris, CUDA C++ Workshop, Connect With the Experts


GPU MODE ▷ #rocm (10 messages🔥):

Root user, Ubuntu, Sudo


GPU MODE ▷ #liger-kernel (3 messages):

Triton, \alpha-entmax kernels, sparsemax


GPU MODE ▷ #self-promotion (5 messages):

Keras parallelization tool, Reinforcement learning pool, GPU Workload Security


GPU MODE ▷ #🍿 (3 messages):

Triton Kernels, GPU uops information


GPU MODE ▷ #thunderkittens (6 messages):

Thunderkittens Flexibility, Producer/Consumer Model in Thunderkittens, TMA and HBM usage with Thunderkittens, AMD Porting


GPU MODE ▷ #submissions (11 messages🔥):

Grayscale leaderboard submissions, AMD Mixture of Experts leaderboard submissions, Prefixsum leaderboard submissions, AMD FP8 MM leaderboard submissions


GPU MODE ▷ #ppc (1 messages):

Open 2025 course statistics


GPU MODE ▷ #tpu (2 messages):

SparseCores in TPUs, Transformer training/inference, Nvidia Tensorcore sparsity


GPU MODE ▷ #factorio-learning-env (28 messages🔥):

FLE decoupling, FLE roadmap, FLE vision, Factorio AI API, LLM agents


GPU MODE ▷ #amd-competition (49 messages🔥):

AMD FP8 GEMM, MI300 Cache Line Optimization, DPP Transpose, H100 Competition, Backward Pass Optimization


GPU MODE ▷ #cutlass (7 messages):

CuTe Layout, Blackwell Performance, cuDNN Performance, Cutlass Error


LM Studio ▷ #general (67 messages🔥🔥):

AgenticSeek, OpenManus, Embedding Models, Gemma vs DeepSeek, ROCm llama.cpp vision module


LM Studio ▷ #hardware-discussion (67 messages🔥🔥):

NAND Cell Refreshing, AVX512 Support in Llama.cpp, Adrenalin 25.6.1 and Flash Attention, ReBAR and Shared Memory Issues, Model Recommendations for LM Studio


Latent Space ▷ #ai-general-chat (109 messages🔥🔥):

Langfuse OSS, Shisa v2 405B model, ChatGPT integrates with Internal Tools & Adds Record Mode, Veris AI seed funding, Anthropic cuts Claude capacity


Manus.im Discord ▷ #general (101 messages🔥🔥):

Manus context vs alternatives, Dev tools replit vs cursor, Invitation spam, AI podcast looking for manus dev guest, Manus credits vs alternatives


Modular (Mojo 🔥) ▷ #general (3 messages):

Discord's @everyone Tag, Notification Preferences, Announcements Channel


Modular (Mojo 🔥) ▷ #mojo (63 messages🔥🔥):

StringLiteral autopromotion, Slablist performance, JSON Parser in Mojo, Mojo memory safety vs Rust, Mojo origin tutorial


Nous Research AI ▷ #announcements (2 messages):

DeepHermes 24B API Outage, Model Stability Restored


Nous Research AI ▷ #general (57 messages🔥🔥):

Claude vs other Models, Privacy Nightmare with ChatGPT Logs, Arcee AI Homunculus-12B model, Psyche Network Forum for Nous, Training LLMs on Real World Datasets


Nous Research AI ▷ #ask-about-llms (3 messages):

Voigt-Kampff test, XQuartz Docker


Nous Research AI ▷ #research-papers (1 messages):

Evolving LLMs Through Text-Based Self-Play


Nous Research AI ▷ #research-papers (1 messages):

LLMs Self-Play, Emergent Performance


LlamaIndex ▷ #blog (5 messages):

LlamaIndex Agents, Agent Design Patterns, LlamaExtract, SEC Form 4, Spreadsheet Agent


LlamaIndex ▷ #general (55 messages🔥🔥):

Ollama Model Integration with Code Interpreter Tool, LlamaIndex Documentation Downtime, Offline Local RAG Framework Selection, AgentWorkflow Memory Block Error, Agent Team Orchestration in AgentWorkflow


LlamaIndex ▷ #ai-discussion (1 messages):

SchemaLLMPathExtractor, Graph database


MCP (Glama) ▷ #general (45 messages🔥):

pydantic-ai-slimpow, MCP Server issues, MCP Sampling, Sage OAuth implementation, MCPs for hardware engineers


MCP (Glama) ▷ #showcase (14 messages🔥):

MCP servers, Google's A2A protocol, MCP virality, MCP implementation difficulties, MCP production challenges


Notebook LM ▷ #use-cases (4 messages):

MP3 Audio Files, NotebookLM Limits, RAG Types


Notebook LM ▷ #general (42 messages🔥):

Multi Language Update, Interactive Mode, Viewing Space on NotebookLM, Public Notebooks, NotebookLM API


tinygrad (George Hotz) ▷ #general (1 messages):

Speeding up CAT, LLVM loop splitting, InductiveRangeCheckElimination


tinygrad (George Hotz) ▷ #learn-tinygrad (34 messages🔥):

Debugging Tinygrad, CUDA kernel examples, Slow Dataset Shuffling, BEAM optimizations


Yannick Kilcher ▷ #general (14 messages🔥):

Recursive Hyper Dimensional Emergence (RHDE), Marius (symbolic AI), Training LLMs with Real-World Datasets, RedPajama dataset, The Pile dataset


Yannick Kilcher ▷ #paper-discussion (11 messages🔥):

Muon Optimizer, vec2vec code review


Yannick Kilcher ▷ #ml-news (9 messages🔥):

OpenAI Chat Logs Privacy, Baidu Model, World economic position


Torchtune ▷ #dev (14 messages🔥):

Iterable dataset refactoring, Optimizer testing in torchtune, SGD and Adafactor issues in distributed SFT


DSPy ▷ #show-and-tell (1 messages):

Anthropic, Claude 3.7, Claude 4.0


DSPy ▷ #general (5 messages):

DSPy Evangelism, DSPy Office Hours, DSPy Hackathon Success, DSPy Agent Code Golfing, DSPy Funding & Professorships


Nomic.ai (GPT4All) ▷ #general (6 messages):

vLLM Engine in GPT4ALL, Nikola Tesla, China Buying Airbus, Windows vLLM Fork, Quantization Types


Cohere ▷ #💬-general (2 messages):

``


Cohere ▷ #🤝-introductions (3 messages):

Introductions, AI Engineer Introduces Self


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (3 messages):

Completion Certificates, Assignment Deadlines