Frozen AI News archive

Mary Meeker is so back: BOND Capital AI Trends report

**Mary Meeker** returns with a comprehensive **340-slide report** on the state of AI, highlighting accelerating tech cycles, compute growth, and comparisons of **ChatGPT** to early Google and other iconic tech products. The report also covers enterprise traction and valuation of major AI companies. On Twitter, **@tri_dao** discusses an "ideal" inference architecture featuring attention variants like **GTA**, **GLA**, and **DeepSeek MLA** with high arithmetic intensity (~256), improving efficiency and model quality. Other highlights include the release of **4-bit DWQ of DSR1 Qwen3 8B** on Hugging Face, **AnthropicAI**'s open-source interpretability tools for LLMs, and discussions on transformer training and abstractions by various researchers.

Canonical issue URL

340 slides are all you need

AI News for 5/29/2025-5/30/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (217 channels, and 5932 messages) for you. Estimated reading time saved (at 200wpm): 508 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Those old enough to remember the rise of the internet will be very familiar with the annual Mary Meeker reports which have the rare distinction of being an industry event when they come out. It seems she retired for a few years but is now back with a vengeance - 340 slides on the state of AI.

she has a fun chart on how the 2000s tech wave compares to today:

Tech cycles are accelerating:

with a marked kink in the compute curve

comparisons of chatgpt to early Google

and other hall of fame tech products

some traction in the enterprise

AWS Traininum being half the size of Google's TPU business was surprising

and where the valuation of the AI majors stand today.


AI Twitter Recap

Here's the breakdown of tweets, categorized and summarized as requested:

Language Models, Architectures, and Implementations

Benchmark Evaluations and Performance Analysis

AI Agents and Autonomous Systems

Perplexity Labs and Applications

Tooling and Development

Humor and Miscellaneous


AI Reddit Recap

/r/LocalLlama Recap

1. Ollama DeepSeek-R1 Model Naming and Community Reactions

2. DeepSeek-R1-0528 Model Releases, Quantization, and Benchmarks

3. Recent Model and Benchmark Launches: Xiaomi MiMo 7B, Gemma 3 27B, DeepSeek Shift

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Anthropic Claude Opus 4 Safety Concerns and AI Risks

2. Google Veo3 vs OpenAI Sora and Multimedia AI Model Race

3. Recent Large Model and AI System Launches & Benchmarks


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: Model Mania - New Releases, Capabilities, and Community Chatter

Theme 2: Tooling Up - Frameworks and Utilities Accelerate AI Development

Theme 3: Silicon Surges - GPU Advances and Optimization Efforts

Theme 4: Research Frontiers - From Reinforcement Learning to Interpretability

Theme 5: Platform Power-Ups and User Ponderings


Discord: High level Discord summaries

Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


LMArena Discord


OpenAI Discord


Cursor Community Discord


Latent Space Discord


HuggingFace Discord


OpenRouter (Alex Atallah) Discord


aider (Paul Gauthier) Discord


MCP (Glama) Discord


Yannick Kilcher Discord


Nous Research AI Discord


Notebook LM Discord


GPU MODE Discord


Manus.im Discord Discord


LlamaIndex Discord


Eleuther Discord


Torchtune Discord


Nomic.ai (GPT4All) Discord


DSPy Discord


LLM Agents (Berkeley MOOC) Discord


Cohere Discord


tinygrad (George Hotz) Discord


AI21 Labs (Jamba) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #announcements (1 messages):

Perplexity Labs, Shopping & Travel in Deep Research, Personal Search & Memory, Crypto Leaderboard, F1 on Android


Perplexity AI ▷ #general (1183 messages🔥🔥🔥):

Perplexity AI tricks, Opus, Smartwatches, Perplexity Labs' Limits, AI models APIs


Perplexity AI ▷ #sharing (5 messages):

Perplexity Labs Release, Research Presentation with Perplexity, Shareable Threads on Discord, Agentic AI Presentation


Perplexity AI ▷ #pplx-api (28 messages🔥):

Sonar Deep Research Async API, Limit bot responses, Scaling Perplexity infrastructure, API Announcement Role misconfiguration


Unsloth AI (Daniel Han) ▷ #general (899 messages🔥🔥🔥):

Gemma Finetuning Costs, AMD Max+ 365, ROCm Support, GraLoRA, 0.5bit


Unsloth AI (Daniel Han) ▷ #off-topic (18 messages🔥):

PiSSA method, Sesame, Orpheus, Chinese Language Models, Model Output Filtering


Unsloth AI (Daniel Han) ▷ #help (165 messages🔥🔥):

DeepSeek R1 0528, GGUF models, Quantization and VLLM, Mistral 7B, Gemma3 Vision


Unsloth AI (Daniel Han) ▷ #research (3 messages):

HF team implementation, bits and bytes team


LMArena ▷ #general (526 messages🔥🔥🔥):

LMArena Discord Filters, O3 Pro Speculation, Redsword's Removal, Gemini 2.5 Pro vs Flash, Gemini's Raw Thoughts


LMArena ▷ #announcements (1 messages):

AI Generation Contest, LMArena Battle Mode, Cozy Desk Image Contest


OpenAI ▷ #ai-discussions (451 messages🔥🔥🔥):

Sustainable AI, DeepSeek R1 0528 vs Gemini 2.5 Pro, Claude vs ChatGPT for Coding, AI for Creative Writing, Veo 3 pricing and limitations


OpenAI ▷ #gpt-4-discussions (6 messages):

Deep Research feature in ChatGPT Pro, Custom GPTs in project chats, AI model diagnostic signatures and recursive adaption


OpenAI ▷ #prompt-engineering (31 messages🔥):

Jailbreaking, Safety Layers, Prompt Engineering, Symbolic Ecology


OpenAI ▷ #api-discussions (31 messages🔥):

Guardrails circumvention, Semantic Orphans & Lonely Words, Symbolic ecology, Quantify the co-evolution


Cursor Community ▷ #general (416 messages🔥🔥🔥):

Claude 4 Speed vs Cost, ESLint, Cursor Slow Pool, VerbalCodeAI, Gemini vs Claude


Cursor Community ▷ #background-agents (5 messages):

Background Agent Janky UI, Background Agent Full Stack Web Dev


Latent Space ▷ #ai-general-chat (45 messages🔥):

Black Forest Labs, Osmosis-Structure-0.6B Model, Claude's Chain of Thought, Hashbrown AI Framework, Vibe Coding Hype Cycle


Latent Space ▷ #ai-in-action-club (247 messages🔥🔥):

Discord audio/video issues, LLMs for document processing, Data ingestion pipeline improvements, GPT-4o fine-tuning, Embedding model benchmarking


HuggingFace ▷ #announcements (1 messages):

Gradio MCP hackathon, Filtering spaces for MCP compatibility, LightEval v0.10.0 release, HF space as an MCP server, nanoVLM for training VLMs


HuggingFace ▷ #general (141 messages🔥🔥):

Chatterbox-tts install errors, Genie 2 alternatives, Deepseek-r1 performance, Gradio MCP argument, HF Inference API usage


HuggingFace ▷ #today-im-learning (1 messages):

roldanx: Anybody deployed "my_first_agent"? Gradio is giving me error 😦


HuggingFace ▷ #cool-finds (1 messages):

mikus____: https://github.com/safety-research/circuit-tracer/tree/main


HuggingFace ▷ #i-made-this (20 messages🔥):

VerbalCodeAI, Nix for AI, Lunaris Development, XTRUST Dataset, Handwritten Datasets


HuggingFace ▷ #agents-course (22 messages🔥):

Course Onboarding, Compute resources requirements, Inference credits exhausted, Final Assignment Submissions, Certificate Credibility


OpenRouter (Alex Atallah) ▷ #general (127 messages🔥🔥):

OpenRouter Support, Anthropic models, DeepSeek models, Meta LLaMA, GPTs and OpenAI data sharing


aider (Paul Gauthier) ▷ #announcements (1 messages):

Aider v0.84.0 release, New Claude models, Vertex AI Gemini, GitHub Copilot tokens, Automatic commit messages


aider (Paul Gauthier) ▷ #general (108 messages🔥🔥):

Deepseek R1 with Aider, Aider with file snapshots for concurrent edits, Gemini vs Deepseek for massive context, MCP Recommendations, LLM Benchmarks discussion


aider (Paul Gauthier) ▷ #questions-and-tips (1 messages):

aider with conda, pytest and conda


MCP (Glama) ▷ #general (69 messages🔥🔥):

MCP Server Authentication, Roots and Workspaces, MCP Client Usage, Elicitation in MCP, MCP Spec Extension Proposal


MCP (Glama) ▷ #showcase (10 messages🔥):

Debugging Improvements, Financial Analysis Agent, VerbalCodeAI Tool, arrs MCP Servers, Kroger MCP


Yannick Kilcher ▷ #general (56 messages🔥🔥):

GFlow Networks, LLM Thinking, Pass@K Training, RL for LLMs, Anthropic's mechinterp code


Yannick Kilcher ▷ #paper-discussion (10 messages🔥):

Two Minute Papers, Overoptimism in research, rigorous experimental setup


Yannick Kilcher ▷ #ml-news (1 messages):

nelfar5459: https://youtu.be/cP8xpkvs_UI


Nous Research AI ▷ #general (56 messages🔥🔥):

R1 and System Prompts, Multilingual Reasoning with R1, DeepHermes3 Language Reasoning, Gooner Investigations in AI, China's AI and Robotics Advancements


Nous Research AI ▷ #ask-about-llms (7 messages):

RL bot release, Linux terminal simulator prompts


Nous Research AI ▷ #research-papers (1 messages):

promptsiren: https://arxiv.org/abs/2505.22954 code: https://github.com/jennyzzt/dgm


Nous Research AI ▷ #interesting-links (1 messages):

BFL image editing model, playground.bfl.ai


Nous Research AI ▷ #research-papers (1 messages):

promptsiren: https://arxiv.org/abs/2505.22954 code: https://github.com/jennyzzt/dgm


Notebook LM ▷ #use-cases (13 messages🔥):

Gemini Pro, Gemini custom instructions, Gemini Apps, LLMNotebook customization


Notebook LM ▷ #general (53 messages🔥):

Notebook API, Audio Summary Language, NotebookLM Free Tier, Gemini usage, podcast


GPU MODE ▷ #triton (9 messages🔥):

GPU programming in Triton class, Triton gather kernel failing, Triton community meetings


GPU MODE ▷ #cuda (2 messages):

ldmatrix operation, Bank conflicts in memory access, Simplifying thread indexing


GPU MODE ▷ #torch (1 messages):

Autotuning kernels, IndexSelect Backwards Custom Implementations, Input Shape Based Kernel Selection


GPU MODE ▷ #cool-links (2 messages):

VLMs for video games, DeepSeek R1, NVIDIA Blackwell GPUs


GPU MODE ▷ #jobs (3 messages):

ML Engineer, LLM training, GPU


GPU MODE ▷ #beginner (4 messages):

Blackwell, Hadamard product, Tensor cores, CUDA cores


GPU MODE ▷ #irl-meetup (1 messages):

alxcspr: Anyone going to GTC Paris?


GPU MODE ▷ #liger-kernel (4 messages):

Liger-Kernel checkstyle, commit formatting


GPU MODE ▷ #🍿 (4 messages):

Kernelbook Unit Tests, Kernel Verification, PyTorch Code Verification


GPU MODE ▷ #thunderkittens (1 messages):

AMD, ROCm, HIPify, TK


GPU MODE ▷ #edge (1 messages):

DINOv2, C++ Inference Engine, Real-time Robotics Perception, GGUF Format, Quantized Model Implementations


GPU MODE ▷ #reasoning-gym (2 messages):

Osmosis-Structure-0.6B, Skywork Open Reasoner 1 Technical Report


GPU MODE ▷ #submissions (15 messages🔥):

MI300, amd-fp8-mm, amd-mla-decode


GPU MODE ▷ #factorio-learning-env (1 messages):

2kian: nice going to try it out today


GPU MODE ▷ #amd-competition (8 messages🔥):

Submission Tool, Crypting String, Torch Issue, Mixture of Experts AMD Problem


GPU MODE ▷ #cutlass (2 messages):

CMake build process, Kernel building speed


GPU MODE ▷ #mojo (1 messages):

alxcspr: Anyone interested in a London mojo meetup?


Manus.im Discord ▷ #general (48 messages🔥):

Manus Credits, Earn Manus Credits, Manus and mgx.dev, Manus Claude 4, Manus's API calls


LlamaIndex ▷ #announcements (1 messages):

Discord Impostors, seldo_v impostor, Blockchain Scams


LlamaIndex ▷ #blog (1 messages):

Gradio Agents & MCP Hackathon 2025, AI agent development


LlamaIndex ▷ #general (36 messages🔥):

LlamaParse Support, Docling PDF issues, MCP server for LlamaIndex, Ollama streaming issues, llama-index-llms-openai dependency issue


Eleuther ▷ #general (12 messages🔥):

Vector Database Research, GPU Cluster Management


Eleuther ▷ #research (12 messages🔥):

rsLoRA alpha parameter, arxiv2prompt tool, Speedrun Tweet


Eleuther ▷ #interpretability-general (5 messages):

Graph demo, baukit hooks


Eleuther ▷ #gpt-neox-dev (5 messages):

GPT-NeoX, Isambard cluster, ARM CPUs


Torchtune ▷ #dev (14 messages🔥):

async GRPO ray.exceptions.ActorDiedError, TP and CP fix, H200 nodes long-term access, Llama4 perf improvements, FSDP memory implications


Nomic.ai (GPT4All) ▷ #general (13 messages🔥):

Germany AI, Nomic Cloud, Saving Chat Data


DSPy ▷ #general (12 messages🔥):

Structured Outputs Comparison, RL for Fixing Outputs, o3 Limitations, Two-Step Extraction


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (4 messages):

Adding certificate to LinkedIn, Using images from lectures in article


Cohere ▷ #🤝-introductions (2 messages):

N8n Specialist, Make.com Expert, AI Agent Developer


tinygrad (George Hotz) ▷ #learn-tinygrad (1 messages):

TinyJit Compilation Detection