Frozen AI News archive

AI Engineer World's Fair Talks Day 1

**Mistral** launched a new **Code** project, and **Cursor** released version **1.0**. **Anthropic** improved **Claude Code** plans, while **ChatGPT** announced expanded connections. The day was dominated by **AIE** keynotes and tracks including **GraphRAG**, **RecSys**, and **Tiny Teams**. On Reddit, **Google** open-sourced the **DeepSearch** stack for building AI agents with **Gemini 2.5** and **LangGraph**, enabling flexible agent architectures and integration with local LLMs like **Gemma**. A new **Meta** paper analyzed language model memorization, showing GPT-style transformers store about **3.5–4 bits/parameter** and exploring the transition from memorization to generalization, with implications for **Mixture-of-Experts** models and quantization effects.


A happy day.

AI News for 6/3/2025-6/4/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (218 channels, and 6571 messages) for you. Estimated reading time saved (at 200wpm): 503 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Mistral launched a Code project, Cursor went 1.0, Anthropic improved Claude Code plans, and ChatGPT announced more connections. But in terms of the news cycle, the day probably and rightfully belonged to AIE: an incredible set of keynotes bookended the MCP track on the main stream, and notable GraphRAG, RecSys, and Tiny Teams tracks were streamed as well.


AI Twitter Recap

Pipeline down today, sorry.


AI Reddit Recap

/r/LocalLlama Recap

1. Recent Open-Source and Research Releases (Google DeepSearch, Meta Model Paper)

2. LLM and Vision Multimodal Model Announcements and Benchmarks

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. AI Model and Feature Releases (VEO 3, Sora, Chroma, Codex, ChatGPT Memory/Research)

2. Concerns About AI-Driven Economic Inequality and Job Loss

3. Personal Experiences Using AI for Real World Tasks


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: The Model Frontier: Launches, Leaks, and Lingering Questions

Theme 2: Agentic AI Ascends: Frameworks, Features, and Frustrations

Theme 3: Under the Hood: GPU Optimizations, Hardware Quirks, and Performance Puzzles

Theme 4: Bleeding Edge Research: Finetuning Breakthroughs, Semantic Threats, and Novel Architectures

Theme 5: Ecosystem Evolution: API Shakeups, Community Tools, and Developer Resources


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


Cursor Community Discord


OpenAI Discord


Unsloth AI (Daniel Han) Discord


OpenRouter (Alex Atallah) Discord


GPU MODE Discord


HuggingFace Discord


Manus.im Discord


Nous Research AI Discord


Latent Space Discord


Notebook LM Discord


Yannic Kilcher Discord


Eleuther Discord


LM Studio Discord


MCP (Glama) Discord


LlamaIndex Discord


tinygrad (George Hotz) Discord


Torchtune Discord


LLM Agents (Berkeley MOOC) Discord


DSPy Discord


Cohere Discord


Nomic.ai (GPT4All) Discord


MLOps @Chipro Discord


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.




Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #announcements (1 message):

Reddit AMA, Labs, Aravind, Denis, Tyler Tate


Perplexity AI ▷ #general (1289 messages🔥🔥🔥):

Deep Research High, O3-pro, GPT-5 Release


Perplexity AI ▷ #sharing (2 messages):

working app, smuggled north korean smartphone


Perplexity AI ▷ #pplx-api (19 messages🔥):

Academic Filter Feedback, Sonar Reasoning Pro API with PMC, NCBI Rate Limiting, Firecrawl proxies


LMArena ▷ #general (1468 messages🔥🔥🔥):

Gemini 2.5 Pro Release, Google's Kingfall Model, OpenAI's o3 Pro Release, Model Performance Comparisons (Gemini, Claude, Grok, OpenAI), AI Hardware and Compute Considerations


Cursor Community ▷ #general (547 messages🔥🔥🔥):

Cursor Pro 'unauthorized' error, Claude 4 Sonnet limitations, CursorRIPER framework, Claude Code vs Cursor, Manual updates vs auto-updates


Cursor Community ▷ #background-agents (16 messages🔥):

Background Agents Hangs, Cursor Version Upgrade, Background Agent Research Projects, Slackbot Installation, Repo Connection Issues


Cursor Community ▷ #announcements (1 message):

Cursor 1.0 Release, Code Review Improvements, Background Task Management


OpenAI ▷ #ai-discussions (391 messages🔥🔥):

O3 Pro, GPT-5 Release, ChatGPT hallucination, Sora for everyone, ChatGPT Connectors


OpenAI ▷ #gpt-4-discussions (11 messages🔥):

Hallucination rates, Bitbucket and Plastic Svn support, OpenAI TTS Pricing Discrepancies, GlazeGPT's Return


OpenAI ▷ #prompt-engineering (8 messages🔥):

Agent design for Elasticsearch queries, Model finetuning vs prompt engineering, Mermaid sequence diagrams in prompts, Elasticsearch sorting issues


OpenAI ▷ #api-discussions (8 messages🔥):

Elasticsearch DSL Queries, RAG Implementation, OpenAI model discussion etiquette


Unsloth AI (Daniel Han) ▷ #general (113 messages🔥🔥):

DeepSeek R1 0528 speed, Qwen 4B vs Gemma 4B, Vision support for Mistral-Small-3.1-24B-Instruct-2503-GGUF, Multi-GPU support, Fastest lib for production inference


Unsloth AI (Daniel Han) ▷ #off-topic (11 messages🔥):

GRPO Training on Qwen3-32B, AI Engineer Costs, Basic Fine Tuning Datasets, HuggingFace Navigation, QLORA Instruction Tuning


Unsloth AI (Daniel Han) ▷ #help (139 messages🔥🔥):

GRPO trainer inference, Sequence length max length, Gemma 3 model unsloth, Unsloth info logging, Deepthink R2 model


Unsloth AI (Daniel Han) ▷ #research (30 messages🔥):

Weightwatcher AI, LLM Analysis, VLM Visualization


OpenRouter (Alex Atallah) ▷ #announcements (8 messages🔥):

GIF Support, Omni-Search, Tool Call Caching, BYOK Flag


OpenRouter (Alex Atallah) ▷ #app-showcase (3 messages):

iOS App, TestFlight, OpenRouter, LLM Backend


OpenRouter (Alex Atallah) ▷ #general (258 messages🔥🔥):

Opus Rate Limits, Chutes Business Model, Nous Training, OpenRouter Batch Inference API, Chutes R1 Quality


GPU MODE ▷ #general (12 messages🔥):

GPU Mode Merchandising, GPU Mascot Creation, AI-generated Mascot Design, Copyright safe mascot


GPU MODE ▷ #cuda (8 messages🔥):

__syncthreads vs bar.sync, mbarrier details, cuda::pipeline usage, Producer/consumer pipeline synchronization


GPU MODE ▷ #torch (6 messages):

CUPTI Profiling Overhead, Torch Dynamo Recompiles, CUDA Command Buffer Bottleneck


GPU MODE ▷ #beginner (2 messages):

PMPP Lectures, ECE408 Lectures


GPU MODE ▷ #torchao (3 messages):

MPS Kernels, vLLM, VL Models


GPU MODE ▷ #off-topic (2 messages):

TiKZ, JAX ML animations


GPU MODE ▷ #rocm (29 messages🔥):

MI300X memory access cycles, rocprof and L2CacheHit on MI300X, rocprof-compute and omniprof locale errors, MFMA utilization in kernel profiling, Root user sudo errors


GPU MODE ▷ #self-promotion (1 message):

Hopper GPUs, TMA, CUDA, Mojo, NVPTX


GPU MODE ▷ #🍿 (7 messages):

Code Completion Benchmark for GLSL Fragment Shaders, Multi-Device Kernel Codegen, Architectural Feature Evolution, Profiling Nvidia ISA


GPU MODE ▷ #thunderkittens (2 messages):

ThunderKittens, LayerNorm kernel, dimensional handling, sequence length divisibility, producer/consumer model


GPU MODE ▷ #general (1 message):

jacklee0897: Where is the hackathon?


GPU MODE ▷ #submissions (1 message):

H100 Speed, Leaderboard submissions


GPU MODE ▷ #ppc (1 message):

Open 2025 Course, Course Statistics


GPU MODE ▷ #factorio-learning-env (9 messages🔥):

Factorio Learning Environment (FLE) Configuration, Decoupling FLE from Python, FLE Project Structure and Roadmap, Dockerizing Factorio with FLE Mod


GPU MODE ▷ #amd-competition (29 messages🔥):

Double Buffering, FP8 Solution Writeup, Cache Line Optimization, MI300 coalescing, GPU Mode solutions


GPU MODE ▷ #cutlass (5 messages):

sdpa and cutlass, CuTe Layout, Blackwell Cutlass Samples, MXFP8 performance on Blackwell, NVFP4 vs BF16 on Blackwell


GPU MODE ▷ #singularity-systems (2 messages):

Zero To Hero, nanoGPT, nanoR1


HuggingFace ▷ #general (73 messages🔥🔥):

CUDA on HF, ASR Leaderboards, MCP Course Progress, Responsible Prompting API by IBM, Blockchain-Inspired Models for AI Reliability


HuggingFace ▷ #today-im-learning (1 message):

AI Safety Benchmark, LLM Agents, Ethical scenarios, AI Security


HuggingFace ▷ #cool-finds (2 messages):

CUA MCP Server, trycua


HuggingFace ▷ #i-made-this (4 messages):

Prisma toolkit, GitHub Chat, Claude Desktop MCP Playground, Market research basics


HuggingFace ▷ #reading-group (1 message):

Session Schedule, Summer Break


HuggingFace ▷ #NLP (1 message):

Generative AI, LLMs, Substack, Online Education, LangChain


HuggingFace ▷ #gradio-announcements (1 message):

Gradio Agents, MCP Hackathon, Mistral AI Agentic Support, LlamaIndex framework


HuggingFace ▷ #smol-course (2 messages):

Meta-Llama model access, Agents course deadlines


HuggingFace ▷ #agents-course (21 messages🔥):

OpenAI Free Tier Eligibility, Unit 4 Assignment Difficulties, Local LLM Performance, Audio and YouTube Processing, Whisper Model Usage


Manus.im Discord ▷ #general (89 messages🔥🔥):

Manus task context limit, Manus AI Competitor H Runner, Manus AI credits, Interactive experiences: website or app, Cursor and Replit IDE


Nous Research AI ▷ #announcements (1 message):

DeepHermes 24B Outage, API Issues


Nous Research AI ▷ #general (68 messages🔥🔥):

Server Tags, Parameter-Efficient Finetuning, Shisa-v2 405B Model, Drowning in AI Releases, Claude's Agentic Behavior


Nous Research AI ▷ #ask-about-llms (4 messages):

Loom Tool, Hermes 70b


Nous Research AI ▷ #research-papers (1 message):

Evolving LLMs Through Text-Based Self-Play, AI Paper Feedback


Nous Research AI ▷ #interesting-links (1 message):

Merlin app, bird identification, sound analysis


Nous Research AI ▷ #research-papers (1 message):

Evolving LLMs, Self-Play, Emergent Performance


Latent Space ▷ #ai-general-chat (72 messages🔥🔥):

LLM Engineer's Almanac by Modal Labs, PDF ingestion pipeline in AWS, Anthropic's capacity cuts, Codex with internet access, OpenAI Agent Development


Notebook LM ▷ #use-cases (5 messages):

Notebook LM with Microsoft Learn, Notebook for city and county, MP3 vs M4A


Notebook LM ▷ #general (67 messages🔥🔥):

Gemini 2.5 Pro vs Flash, Audio Generation length, NotebookLM and Google Docs Syncing, Public Notebook Sharing, NotebookLM Mobile App


Yannic Kilcher ▷ #general (29 messages🔥):

Parameter-Efficient Finetuning, Knowledge Extension for LLMs, MCP Server for Isomorphism Testing, Prototype Theory in Graph Neural Networks


Yannic Kilcher ▷ #paper-discussion (25 messages🔥):

vec2vec code review, Muon Optimizer details, Paper Reading Techniques


Yannic Kilcher ▷ #ml-news (15 messages🔥):

Mistral Code Release, OpenAI ChatGPT Logs Privacy Concerns, Elon's stance on AI


Eleuther ▷ #general (46 messages🔥):

Parameter-efficient finetuning, Twitter scraper, Imitation Learning, Scalable web scraping with AI agents


Eleuther ▷ #research (4 messages):

UDAIR.md document on AI Rights, Universal Algorithm POC for NLP, Options Trading, and Electrochemical Reactions, Quantum Field Based Architecture with Sinusoidal Sparsity, AI-generated Research


Eleuther ▷ #scaling-laws (7 messages):

AI Compute Investment, AI ROI, AI Startups, AI Job Market, PhD Earnings


Eleuther ▷ #interpretability-general (9 messages🔥):

General agents and world models, Semantic Virus exploits LLM vulnerabilities, NCF, Semantic Viruses, and the CupCake framework study, Interpretability intact without teacher-forcing, AI training without teacher-forcing


Eleuther ▷ #gpt-neox-dev (2 messages):

Pythia Remake, Percy plans


LM Studio ▷ #general (17 messages🔥):

Llama 4 Image Support, ROCm drivers on Ubuntu, agenticSeek vs OpenManus, Embedding model choice, ROCm vision module slowdown


LM Studio ▷ #hardware-discussion (48 messages🔥):

Server boot times, SSD vs HDD, NAND cell refreshing


MCP (Glama) ▷ #general (49 messages🔥):

MCP API key monetization, MCP Context Management, A2A Framework vs MCP, Pydantic-AI, Hosting MCP servers


MCP (Glama) ▷ #showcase (4 messages):

MCP value, Block adoption of MCP, Goose and A2A protocol, deeplinks


LlamaIndex ▷ #blog (6 messages):

Agentic AI, Financial report chatbot, LlamaIndex questions, Agent Design Patterns


LlamaIndex ▷ #general (20 messages🔥):

Gradio MCP Hackathon, Property Graph Index, Code Interpreter Agent, Ollama, readthedocs website


tinygrad (George Hotz) ▷ #learn-tinygrad (23 messages🔥):

Numpy removal challenges in random_crop/cutmix, Performance intuition in tinygrad, Windows backend issues with tinygrad, LSTM performance bottleneck in tinygrad, Understanding DEBUG=2 output


Torchtune ▷ #dev (15 messages🔥):

Python 3.9 Support, Asynchronous Reward Functions, Iterable Dataset Refactoring RFC, Optimizer Compatibility Beyond AdamW, DTensor DeviceMesh Errors


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (14 messages🔥):

Assignment Deadlines, Assignment Feedback, MOOC Next Steps


DSPy ▷ #show-and-tell (1 message):

Claude 3.7 vs 4.0, Anthropic's dev cycle, Anthropic's priorities


DSPy ▷ #general (12 messages🔥):

oneformer game theorist, agenspy vs frameworks, claude_sdk execution engine, HTNs and LLM agents, Fine-tuning LLMs in ReACT format


Cohere ▷ #💬-general (3 messages):

Cohere Sponsorship


Cohere ▷ #🤝-introductions (3 messages):

Introductions to Cohere's Discord Server


Nomic.ai (GPT4All) ▷ #general (2 messages):

GPT4All updates, MOE models and VRAM, Mac M3 Max VRAM advantage, vLLM engine for GPT4All, Nikola Tesla


MLOps @Chipro ▷ #events (1 message):

AI Programming, SVCAI, Liang Guo