Frozen AI News archive

AI Engineer World's Fair: Second Run, Twice The Fun

**The 2025 AI Engineer World's Fair** is expanding with **18 tracks** covering topics like **Retrieval + Search**, **GraphRAG**, **RecSys**, **SWE-Agents**, **Agent Reliability**, **Reasoning + RL**, **Voice AI**, **Generative Media**, **Infrastructure**, **Security**, and **Evals**. New focuses include **MCP**, **Tiny Teams**, **Product Management**, **Design Engineering**, and **Robotics and Autonomy** featuring foundation models from **Waymo**, **Tesla**, and **Google**. The event highlights the growing importance of **AI Architects** and enterprise AI leadership. Additionally, **Demis Hassabis** announced the **Gemini 2.5 Pro Preview 'I/O edition'**, which leads coding and web development benchmarks on **LMArena**.


AIE is all you need.

AI News for 5/6/2025-5/7/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (214 channels, and 4624 messages) for you. Estimated reading time saved (at 200wpm): 485 minutes. Our new website is now up with full metadata search and a beautiful vibe-coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

It's a quiet day, but as we did almost exactly a year ago, we'll spend an issue talking about the new speakers announced for the second large-scale AI Engineer World's Fair next month:

TLDR: for the second year running, we're offering a one-time discount to AI News readers. CLICK HERE and enter AINEWS before EOD Friday :)

The first World's Fair was a big experiment: is AI Engineering big enough to warrant its own large multitrack conference? We were fortunate to be the first to preview capabilities that every AI Engineer can now build with and take for granted. Between then and now, the NYC Summit blew past all expectations, with 4 Madison Square Gardens' worth of people tuning in on the livestream and a viral MCP workshop.

The 2025 AI Engineer World's Fair (Jun 3-5 in SF)

AIEWF 2025 is now 2x as big again as last year, with expo booths, talks, and workshops across 18 tracks. You can browse the llms.txt or llms-full.txt to get up to speed on the evolving meta of AI Engineering.

To celebrate the launch, we're offering a one-time discount to AI News readers: CLICK HERE and enter AINEWS before EOD Friday to lock in the Early Bird tickets before prices go up this weekend.

If the curation here/on Latent Space has the most cosine similarity with your interests, this conference was made for you. See you in SF June 3-5!


AI Twitter Recap

Gemini 2.5 Pro Model Improvements and Performance

AI Models and Frameworks

Tools and Platforms

AI Education, Learning Resources, and Community

Broader AI Industry Trends and Discussions


AI Reddit Recap

/r/LocalLlama Recap

1. New SOTA AI Models, Benchmarks, and Training Innovations

2. Open Source AI Video and Image Generation Tools & Compression

3. Major Industry Shifts: Google, Reddit, and OpenAI AI Impact

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Gemini 2.5 Pro Model Updates and Benchmarks

2. OpenAI Acquisition of Windsurf Coverage

3. Latest AI Image and Video Generation Model Launches


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Flash Preview

Theme 1. New Models Hit the Scene, Bringing Power and Problems

Theme 2. Pushing Silicon to the Limit: Hardware, Speed, and Optimization

Theme 3. Dev Tools and Ecosystems Evolve with AI

Theme 4. Training, Tuning, and Trusting the Data

Theme 5. AI Agents Gain Skills, Face Hurdles


Discord: High level Discord summaries

LMArena Discord


Perplexity AI Discord


OpenAI Discord


Cursor Community Discord


OpenRouter (Alex Atallah) Discord


Unsloth AI (Daniel Han) Discord


Manus.im Discord Discord


aider (Paul Gauthier) Discord


LM Studio Discord


GPU MODE Discord


MCP (Glama) Discord


Yannick Kilcher Discord


Latent Space Discord


HuggingFace Discord


Nous Research AI Discord


Eleuther Discord


DSPy Discord


Modular (Mojo 🔥) Discord


LlamaIndex Discord


tinygrad (George Hotz) Discord


Torchtune Discord


Nomic.ai (GPT4All) Discord


LLM Agents (Berkeley MOOC) Discord


Cohere Discord


Codeium (Windsurf) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.




Discord: Detailed by-Channel summaries and links

LMArena ▷ #general (914 messages🔥🔥🔥):

GPT-4o vs O1 capabilities, Grok 3.5 release rumors, Gemini 2.5 Pro (0506) reviews, O3 Pro release speculation, Dense vs MoE model architectures


LMArena ▷ #announcements (1 messages):

Discord improvements, LMArena community growth, AI community space


Perplexity AI ▷ #general (618 messages🔥🔥🔥):

Discord Bot, Perplexity Image Generation, AI ads, Traffic Acquisition Costs, Deepseek R2


Perplexity AI ▷ #sharing (1 messages):

Deep Research Reports, Long Reports


Perplexity AI ▷ #pplx-api (3 messages):

Office Hours, Credits


OpenAI ▷ #ai-discussions (353 messages🔥🔥):

XenArcAI Introduction, Lucid Dreaming Techniques, AI's Em Dash Obsession, DeepSeek vs. OpenAI, Veo 2 vs. Sora video generation


OpenAI ▷ #gpt-4-discussions (11 messages🔥):

GPT-4o degradation, Brief GPT-4o responses, Placebo upvote buttons


OpenAI ▷ #prompt-engineering (68 messages🔥🔥):

Atoms in Visible Light, Hypertree Planning Prompting, Building a ChatGPT website, Project-Based Prompt Management, Prompt engineering and experimental sciences


OpenAI ▷ #api-discussions (68 messages🔥🔥):

Atoms in visible light, Prompt engineering techniques, Hypertree planning prompting, Project-based custom prompts, Creating a Chat GPT website


Cursor Community ▷ #general (484 messages🔥🔥🔥):

Cursor student discounts, Gemini 2.5 Pro performance, MCP server security concerns, Channel Organization, PowerShell default version


OpenRouter (Alex Atallah) ▷ #announcements (5 messages):

OpenRouter activity page, Cerebras new provider


OpenRouter (Alex Atallah) ▷ #app-showcase (2 messages):

Clippy as VSCode Extension, Bring back Clippy with VS Code Extension


OpenRouter (Alex Atallah) ▷ #general (299 messages🔥🔥):

Gemini 2.5 Pro Upgrade, OpenRouter API Issues, Cerebras vs Groq, DeepSeek Models, Mistral New Release


Unsloth AI (Daniel Han) ▷ #general (231 messages🔥🔥):

Qwen3 models, gguf conversion issues, vLLM issues with LoRA adapters, Datasets for QandA training


Unsloth AI (Daniel Han) ▷ #off-topic (9 messages🔥):

Mistral Medium, Vast.ai, AI Project Recruitment, Model Weights


Unsloth AI (Daniel Han) ▷ #help (39 messages🔥):

Phi-3.5 Finetuning, Phi-4 Confusion, GGUF Finetuning, Qwen3 Training, Gradient Accumulation


Unsloth AI (Daniel Han) ▷ #research (18 messages🔥):

Absolute Zero paper, Self-play Reasoning, Unsupervised Training, Code validator, Memory Layer Hooking


Manus.im Discord ▷ #general (202 messages🔥🔥):

Manus vs ChatGPT, International Channel Request, o3 as an agent, AI learning Advice, GPT-4.5 language and writing


aider (Paul Gauthier) ▷ #general (159 messages🔥🔥):

Gemini 2.5 Pro Exp, Mistral Medium 3 Performance, LLM Benchmarks, Aider with Cursor


aider (Paul Gauthier) ▷ #questions-and-tips (36 messages🔥):

Golang Authentication Error, Gemini 2.5 'Thinking Mode', Aider RAG functionality, Claude CLI vs. Aider cost, Perplexity API for Web Search


LM Studio ▷ #general (142 messages🔥🔥):

Gemini issues, Qwen 3 model, LM Studio fine-tuning, Speculative decoding, Local model capabilities


LM Studio ▷ #hardware-discussion (37 messages🔥):

Mac Studio RAM, HP Z2 Mini Workstation, Strix Halo PC, Model Quality vs Speed, DDR5 Memory


GPU MODE ▷ #general (3 messages):

CI environment modifications, Python packages in CI


GPU MODE ▷ #triton (17 messages🔥):

Triton compiler passes, atomic ops, non-deterministic results, floating point arithmetic


GPU MODE ▷ #cuda (29 messages🔥):

A6000 Ada, L40s, 4090, ECC Memory, Vast.ai Quality


GPU MODE ▷ #torch (1 messages):

CUDAGraphs, Warmup Stream, Graph Capture Isolation


GPU MODE ▷ #cool-links (7 messages):

Devin, 32B model, GPU kernel, KernelBench


GPU MODE ▷ #beginner (4 messages):

Roofline Plot Generation, Nsight Compute for Roofline Analysis, Memory Allocation Strategies for Roofline Plots, Tensor Core Programming Pattern


GPU MODE ▷ #torchao (6 messages):

torchao scaled_mm op usage, quantized Phi-4 Mini Instruct models, INT8 dynamic activation & INT4 weight quant for ExecuTorch


GPU MODE ▷ #off-topic (2 messages):

Cursor.com student offer, IDE for coding


GPU MODE ▷ #irl-meetup (1 messages):

random.oof: Anyone at the vllm meet up in nyc?


GPU MODE ▷ #rocm (2 messages):

install .whl file manually, python script import pip module


GPU MODE ▷ #webgpu (1 messages):

WGPU Sampling Rate


GPU MODE ▷ #liger-kernel (3 messages):

Qwen 3, Liger-Kernel, Qwen 3 MoE


GPU MODE ▷ #self-promotion (1 messages):

ML efficiency, Linear Layer Optimization, Quantization, Low-bit Matmul Kernels


GPU MODE ▷ #🍿 (1 messages):

CognitionAI, Kevin 32B, Multi-Turn RL, CUDA Kernels


GPU MODE ▷ #reasoning-gym (1 messages):

RL in LoRA, Base Model Quality


GPU MODE ▷ #submissions (52 messages🔥):

amd-fp8-mm leaderboard, MI300 optimization, amd-mixture-of-experts leaderboard


GPU MODE ▷ #status (2 messages):

popcorn-cli, github releases, timeout fix


GPU MODE ▷ #hardware (3 messages):

DGX Spark, Blackwell ISA, New SASS Instructions, FP8 Operations


GPU MODE ▷ #factorio-learning-env (24 messages🔥):

FLE Docker server connectivity issues, LangGraph agent integration with FLE, Factorio client version, Steam update, harvest_resource/server.lua is broken


GPU MODE ▷ #amd-competition (7 messages):

AMD Mixture-of-Experts Leaderboard, popcorn-cli timeout patch, aiter/test_moe


GPU MODE ▷ #mojo (11 messages🔥):

Mojo GPU Kernel, PyTorch Mojo, Qualcomm GPU support, Modular GPU Kernel Hackathon


MCP (Glama) ▷ #general (145 messages🔥🔥):

Cursor MCP, A2A discussion, Debugging MCP Servers, Cloudflare Deployment issues


MCP (Glama) ▷ #showcase (3 messages):

MCP Client, OpenLink AI Layer, Model Context Protocol


Yannick Kilcher ▷ #general (81 messages🔥🔥):

LLM output spam, AI-generated content, Em dashes in LLM output, AI article/patent writing agents, Nerf field with Gemini


Yannick Kilcher ▷ #paper-discussion (1 messages):

Time off, Volunteers needed


Yannick Kilcher ▷ #ml-news (26 messages🔥):

Winner-Takes-All Economics, Mistral Medium, Zed AI Code Editor, Cerebras Inference, Windows Compilation


Latent Space ▷ #ai-general-chat (80 messages🔥🔥):

Windsurf AI, Cursor vs Windsurf, OpenAI internal models, Gemini 2.5, Product Market Fit


Latent Space ▷ #ai-announcements (2 messages):

New Claude code pod, AI Engineer conference


HuggingFace ▷ #general (38 messages🔥):

Hugging Face billing inquiries, LLM User info approach, Text to 3D diffusion on Mac, Dolphin model to be more human, Reinforcement Learning at scale for agents


HuggingFace ▷ #today-im-learning (9 messages🔥):

Cache-Augmented Generation, Distributed RLHF, Converting .tensorflow to .bin, Offline Model


HuggingFace ▷ #i-made-this (11 messages🔥):

RADLADS, Alpha-Root Dataset, CommonCrawl Data Extraction, Embedder Collections, ACE-STEP Music Generation


HuggingFace ▷ #computer-vision (9 messages🔥):

Flash Attention 2, FP16 and BF16 support, local file formats


HuggingFace ▷ #NLP (2 messages):

DaoDeCode, Maximilian-Winter, github.com


HuggingFace ▷ #smol-course (1 messages):

Smolagents Transcriber, Speech-to-text pipeline, Whisper-Turbo


HuggingFace ▷ #agents-course (12 messages🔥):

404 Client Error, Running models locally, Including Image in AgentWorkflow, Gated Repo Access, RAG over a CSV


Nous Research AI ▷ #general (37 messages🔥):

Open Codex forks for Gemini, M4 Max MacBook Pro for LLMs, Dolphin Logger for Naughty Chats, Zed AI Code Editor, Gemini model tps


Nous Research AI ▷ #ask-about-llms (5 messages):

DeepHermes-3-Llama-3 Sizes, 1b model size limitations


Nous Research AI ▷ #research-papers (3 messages):

Arxiv Paper, Learn Mandarin


Nous Research AI ▷ #interesting-links (1 messages):

kotykd: https://cognition.ai/blog/kevin-32b


Nous Research AI ▷ #research-papers (3 messages):

Arxiv Paper, Learning Mandarin


Eleuther ▷ #general (27 messages🔥):

Cursor free for students, Scale Maximalism, Advertising saturation point, SLURM memory allocation


Eleuther ▷ #research (2 messages):

MTurk, Prolific, Human Evals


Eleuther ▷ #lm-thunderdome (10 messages🔥):

lm-eval-harness implementation, HuggingFace vs vLLM, lm-eval-harness BOS token, lm-eval-harness sampling


DSPy ▷ #show-and-tell (1 messages):

Unsloth, Claude Sonnet Finetuning, Qwen3-4b comparison, GRPO


DSPy ▷ #general (30 messages🔥):

Efficient Domain Knowledge Injection in DSPy, DSPy Signature Docstrings, ReAct Module Signature without direct output, Accessing full LLM history


DSPy ▷ #examples (3 messages):

GitHub Notebook Rendering Issues, Colab vs GitHub for Notebooks, Missing "State" Key Error


Modular (Mojo 🔥) ▷ #general (6 messages):

Modular Puzzles on Macbook, Fields in traits, Modular Hackathon


Modular (Mojo 🔥) ▷ #mojo (28 messages🔥):

Public/Private Syntax, Enum Recommendations, Open-Source Contributions, Compile-Time Abort, Testing `constrained`


LlamaIndex ▷ #blog (3 messages):

Deep Research Agent, LlamaExtract, Anthropic API support


LlamaIndex ▷ #general (27 messages🔥):

Memgraph using Neo4j client, Multimodal LLMs with GPT-4o-mini, ChatGPT System Prompt Memory, Agentic RAG App Structure, Medical LLM Bot Building


tinygrad (George Hotz) ▷ #general (2 messages):

Mojo Kernels, Chris Lattner


tinygrad (George Hotz) ▷ #learn-tinygrad (8 messages🔥):

tinygrad Color meanings, beam search cache location


Torchtune ▷ #dev (9 messages🔥):

Torchtune PR, Tokenizer arguments, Uniform tokenizer interface


Nomic.ai (GPT4All) ▷ #general (9 messages🔥):

GPT4All on AMD ROCm, GPT4All on iOS, GGUF token limits, Uncensored models


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (3 messages):

Auth0 Workshop, Lambda Workshop


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (3 messages):

Hugging Face, Email Notification Issues


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (1 messages):

LLMs, Statistical Pattern Recognition, Conditional Statements in LLMs, Neural Attention


Cohere ▷ #💬-general (4 messages):

AWS x Cohere Workshop, Coral Status


Cohere ▷ #🤝-introductions (1 messages):

xvarunx: Welcome everyone! 🥳 🎉 Thanks for joining!


Codeium (Windsurf) ▷ #announcements (2 messages):

Windsurf 1.8.2 Fixes, Windsurf Regional Channels, Cascade Customization, File-Based Rules, Simultaneous Cascades