Frozen AI News archive

Air Street's State of AI 2025 Report

**Reflection** raised **$2B** to build frontier open-weight models with a focus on safety and evaluation, led by a team with backgrounds from **AlphaGo**, **PaLM**, and **Gemini**. **Figure** launched its next-gen humanoid robot, **Figure 03**, emphasizing non-teleoperated capabilities for home and large-scale use. **Radical Numerics** released **RND1**, a **30B-parameter sparse MoE diffusion language model** with open weights and code to advance diffusion LM research. **Zhipu** posted strong results with **GLM-4.6** on the Design Arena benchmark, while **AI21 Labs**' **Jamba Reasoning 3B** leads tiny reasoning models. **Anthropic** introduced a plugin system for **Claude Code** to enhance developer tools and agent stacks. The report also highlights SoftBank's acquisition of ABB's robotics unit for **$5.4B** and the growing ecosystem around open frontier modeling and small-model reasoning.

Canonical issue URL

300 slides are all you need.

AI News for 10/8/2025-10/9/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (197 channels, and 7870 messages) for you. Estimated reading time saved (at 200wpm): 583 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Congrats to Reflection, Mastra, Datacurve, Spellbook, and Kernel on their fundraises.

The AI-native equivalent of the annual Mary Meeker report has been Nathan Benaich's State of AI report. You can catch the highlight in tweet thread or youtube, but we'll offer some notes here:

The labs with the mandate of heaven (where does Anthropic show up?):

and their valuations:

AI-first "tiny teams":

The 2026 cluster buildouts:


AI Twitter Recap

Humanoid Robotics: Figure 03 launch, capabilities, and industry moves

Open frontier modeling and releases: Reflection’s $2B, Diffusion LMs, GLM-4.6, and small-model reasoning

Developer tools and agent stacks: Claude Code plugins, VS Code AI, Gemini ecosystem

Benchmarks and evaluations: ARC-AGI, METR time-horizons, FrontierMath, and domain leaderboards

Systems, performance, and infra: GPU kernels, inference benchmarking, and MLX speed

Multimodal/video: Sora 2 momentum, Genie 3 recognition, and WAN 2.2

Safety, bias, and security

Top tweets (by engagement)

Notes


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Microsoft UserLM-8B 'User' Role-Simulation Model Announcement

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. Qwen Image Edit 2509: Next Scene LoRA and named-entity editing tips

2. AI progress retrospectives: 'Will Smith spaghetti' and '2.5 years' revisited

3. Figure 03 launch and AI policy/workforce debates (Anthropic limits, EO compliance, Altman)


AI Discord Recap

A summary of Summaries of Summaries by gpt-5

1. Kernel and Attention Performance Engineering

2. Agents: Protocols, Tooling, and Guardrails

3. New Models and Memory Architectures

4. Efficient Generation and Multimodal Benchmarks


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


OpenRouter Discord


OpenAI Discord


Unsloth AI (Daniel Han) Discord


Cursor Community Discord


HuggingFace Discord


GPU MODE Discord


LM Studio Discord


Latent Space Discord


Nous Research AI Discord


Yannick Kilcher Discord


aider (Paul Gauthier) Discord


tinygrad (George Hotz) Discord


Eleuther Discord


MCP Contributors (Official) Discord


Moonshot AI (Kimi K-2) Discord


Modular (Mojo 🔥) Discord


Manus.im Discord Discord


DSPy Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1137 messages🔥🔥🔥):

Perplexity slow, GPTs Agents, OpenAI's sidebars, Sora codes, Comet Browser


Perplexity AI ▷ #sharing (4 messages):

Hack for Social Impact, Shareable Threads, Budget Robot Vacuums


Perplexity AI ▷ #pplx-api (6 messages):

Search API access, Search API query length restrictions, Search API Key


LMArena ▷ #general (1267 messages🔥🔥🔥):

Comet Browser, Gemini 3 Release Speculation, Model Purges, LMArena Video Generation, Maverick Controversy


LMArena ▷ #announcements (1 messages):

LMArena survey, Arena Champions Program


OpenRouter ▷ #app-showcase (2 messages):

Perplexity comparison, Browser automation interest, Funding sources, Legal rights, Robots.txt and LinkedIn lawsuits


OpenRouter ▷ #general (1027 messages🔥🔥🔥):

Free Deepseek vs Paid Deepseek, Chutes BYOK, AI Chatbot Censorship, Troubleshooting Codex, Cursor AI vs OpenRouter


OpenRouter ▷ #new-models (1 messages):

Readybot.io: OpenRouter - New Models


OpenRouter ▷ #discussion (17 messages🔥):

OpenInference Relation, AI Generated Image, Token Usage on OpenRouter, NSFW Filter on OpenAI, Model releases on OpenRouter


OpenAI ▷ #annnouncements (1 messages):

Reddit AMA, AgentKit, Apps SDK, Sora 2 in the API, GPT-5 Pro in the API


OpenAI ▷ #ai-discussions (486 messages🔥🔥🔥):

AI and Mental Health, AI Tagging Law, AI Browser Analysis, Multi-User LLMs, Sora 2


OpenAI ▷ #gpt-4-discussions (3 messages):

OpenAI Liability, Parental Responsibility, Dedicated Tools


OpenAI ▷ #prompt-engineering (4 messages):

Product ad prompts


OpenAI ▷ #api-discussions (4 messages):

Prompt for Ad Creation, Seeking Assistance for Ad Prompts


Unsloth AI (Daniel Han) ▷ #general (132 messages🔥🔥):

LoRA Merging, Nix GPU Drivers, Ling 1T llama.cpp, GLM-4.6 Capabilities, Imbalanced data


Unsloth AI (Daniel Han) ▷ #off-topic (269 messages🔥🔥):

Learning Rate Wiping, Retirement Savings, AI-Generated Speech, ASI Prerequisites, Ling-1T Model


Unsloth AI (Daniel Han) ▷ #help (25 messages🔥):

Ollama memory usage with context size, Using the 'input' field in datasets, LM Studio and LFM2-8B-A1B-GGUF model, Improving answer accuracy with prompt engineering, Unsloth on Amazon ml.g4dn.xlarge


Unsloth AI (Daniel Han) ▷ #research (3 messages):

Compact Reads, New Fields, Arxiv PDFs


Cursor Community ▷ #general (371 messages🔥🔥):

Firebase integration, Wrangler CLI, Cloudflare Ecosystem, Next.js, Dioxus


Cursor Community ▷ #background-agents (24 messages🔥):

Cursor Background Agents, 500 Errors on Cursor API, GitHub Outage Impact, Background Agents Access to Snapshots


HuggingFace ▷ #general (335 messages🔥🔥):

Final year project proposal, Dyslexia friendly notes, Samsung tiny recursive model, ImageFolder load, Sentiment analysis model on product reviews


HuggingFace ▷ #today-im-learning (1 messages):

WebRTC Client in Python, FastRTC FastAPI server, aiortc struggles, WebRTC Documentation Issues


HuggingFace ▷ #cool-finds (1 messages):

Hyperparameters are all you need, Diffusion breakthrough


HuggingFace ▷ #i-made-this (6 messages):

HyDRA RAG Agent, WSL Pytorch vLLM venv bootstrap, AI features aggregator tool, OpenlabX for AI Research, Prompt Engineering Contest


HuggingFace ▷ #reading-group (1 messages):

avanee5h: hello


HuggingFace ▷ #NLP (2 messages):

Cross-Posting Reminders, Discord Etiquette


HuggingFace ▷ #agents-course (6 messages):

Agent guardrails, Agent agency, Tool limits


GPU MODE ▷ #general (18 messages🔥):

Hackathon terms and conditions, LLMs in performance engineering, FlashInfer blog post, Data engineering book, Developing on Trainium


GPU MODE ▷ #triton (5 messages):

Registers-Address Mapping, ld.shared Layout Compatibility, ldmatrix Tiling Calculation, Triton Lowerings Implementation


GPU MODE ▷ #cuda (26 messages🔥):

CUDA reduction PDF and code, Thread block cluster reductions, mbarriers in shared smem, Execution order of thread blocks, Blackwell CLC


GPU MODE ▷ #cool-links (3 messages):

Non-Determinism Work, Full-Text Search Engine in Go, Array-Based Library in C++ on GPUs


GPU MODE ▷ #jobs (1 messages):

Aurora, Deep Learning Acceleration, Software Engineer


GPU MODE ▷ #beginner (8 messages🔥):

VSCode for remote GPU programming, CUDA debugging in Visual Studio, CUDA Graphs with dynamic tensor allocations, Distributed training: TorchTitan vs NVIDIA NeMo, CUDA kernels


GPU MODE ▷ #youtube-recordings (6 messages):

Distributed Tensors, JAX Scaling


GPU MODE ▷ #off-topic (11 messages🔥):

GPU programming jobs for new grads, AI labs hiring for GPU programming, Sneaking in GPU work into unrelated jobs


GPU MODE ▷ #irl-meetup (1 messages):

garrett.garrett: Your workplace sounds awesome


GPU MODE ▷ #self-promotion (2 messages):

Model Serving Communities, vLLM, KServe, llm-d, Red Hat AI


GPU MODE ▷ #submissions (12 messages🔥):

MI300x8 Performance, amd-all2all leaderboard, amd-ag-gemm leaderboard


GPU MODE ▷ #status (6 messages):

BioML leaderboard, github actions down


GPU MODE ▷ #amd-competition (10 messages🔥):

GitHub Actions Outage, A2A Timeouts, Runpod MI300


GPU MODE ▷ #cutlass (5 messages):

DLPack Interop, Grouped GEMM performance for MoEs, PTX Docs K-contig and Swizzling


GPU MODE ▷ #singularity-systems (1 messages):

SITP Table of Contents, Picograd Repo Wipe


GPU MODE ▷ #general (1 messages):

Discord Roles, Competition Winners


GPU MODE ▷ #multi-gpu (5 messages):

Cost comparison for running large models at home, GPU recommendations for model parallelism, RTX 3090, RTX 5080, RTX 5070 ti super 24gb


GPU MODE ▷ #irl-accel-hackathon (1 messages):

Inference Project Teams, Project Team Application, Project Joining Process


GPU MODE ▷ #helion (13 messages🔥):

Helion vs Triton, FLA ops performance, Nvidia/AMD partnership, TileLang benchmarks, Gated DeltaNet


LM Studio ▷ #general (99 messages🔥🔥):

Chat Degradation, LM Studio performance boost, LM Studio issues, Text to speech LLM


LM Studio ▷ #hardware-discussion (14 messages🔥):

CPU Graphics Support, RAM and VRAM allocation, GPU bricked, integrated graphics fixed in v1.52.1, Sparkle Arc Pro B60 Dual Server


Latent Space ▷ #ai-general-chat (103 messages🔥🔥):

Magic Dev hate, VC Slop Bubble Meltdown, OpenAI Tokens, AlphaGo AI Moment, Techno-Capital Singularity


Nous Research AI ▷ #general (58 messages🔥🔥):

NousCon in Ohio, BDH Pathway adoption, VLMs blogpost, Arcee AI MoE model, Atropos usage


Nous Research AI ▷ #ask-about-llms (15 messages🔥):

Hermes4 image understanding, Hermes vision model, Grafting Llama 3.2, Vision tool calling with Qwen VL 3, Gemini 2.5 Flash as a vision tool


Nous Research AI ▷ #research-papers (4 messages):

Recursive Reasoning, Tiny Networks, HRM performance on ARC-AGI


Nous Research AI ▷ #research-papers (4 messages):

Tiny Networks, Recursive Reasoning, HRM Model Performance


Yannick Kilcher ▷ #general (39 messages🔥):

RL debate & information bottlenecks, Thinking Machines blog & Shannon entropy, Sutton Interview and transferring bits from RL by SFT, Fringe ML ideas (non-DL): ART, SNN, RNN, weightless NN, GDL, Evolutionary Search (ES) vs backprop


Yannick Kilcher ▷ #paper-discussion (14 messages🔥):

Ovi fails at edge detection, Rights, Freedoms, and Technology, Tiny Recursive Model Discussion, Cat studies


Yannick Kilcher ▷ #ml-news (4 messages):

Artificial Hippocampus Networks (AHNs), ByteDance AHN Model


aider (Paul Gauthier) ▷ #general (36 messages🔥):

Gemini API integration in aider, GLM-4.6 vs Sonnet 4, OpenCode with GLM models, Local Models vs API Models, GPT-5-Codex with aider


aider (Paul Gauthier) ▷ #questions-and-tips (2 messages):

Aider Project, Project Updates


tinygrad (George Hotz) ▷ #general (17 messages🔥):

Code review, AI slop, Algebraic Upat


tinygrad (George Hotz) ▷ #learn-tinygrad (11 messages🔥):

tinygrad vector operations, Loop splitting resources, CUDA kernel reverse engineering with IOCTL


Eleuther ▷ #general (1 messages):

inarikami: I meant the proposed Go and game benchmark plans


Eleuther ▷ #research (25 messages🔥):

World Models vs Language Models, nGPT Failure Analysis, Harvard CMSA Seminars, VLM Image Resolution Optimization


Eleuther ▷ #interpretability-general (1 messages):

Interpretable AI, Advice for college students


Eleuther ▷ #multimodal-general (1 messages):

Moxin-VLM, VLM-R1


MCP Contributors (Official) ▷ #general (9 messages🔥):

MCP integration with ChatGPT, Refresh button issues, .well-known/ endpoint for MCP server metadata, Minimal SEP proposal


MCP Contributors (Official) ▷ #general-wg (4 messages):

Sub-registry syncing, Registry app


Moonshot AI (Kimi K-2) ▷ #general-chat (13 messages🔥):

organic models, sora 2 invite codes, kimi coding


Modular (Mojo 🔥) ▷ #general (2 messages):

New member introductions, Community Welcome


Modular (Mojo 🔥) ▷ #mojo (10 messages🔥):

Mojo multithreading, Jetson Thor support in Mojo, Using Python threads with Mojo


Manus.im Discord ▷ #general (7 messages):

Sora 2 invite code request, Credit refund request due to agent failure, Threat to chargeback and cancel membership, Manus support availability, Manus help page


DSPy ▷ #general (6 messages):

DSPy Community Projects, MCP Tool Authentication with DSPy, Official vs Community Repositories