Frozen AI News archive

not much happened today

**vLLM** announced support for **NVIDIA Nemotron Nano 2**, featuring a hybrid Transformer–Mamba design and tunable "thinking budget" enabling up to 6× faster token generation. **Mistral AI Studio** launched a production platform for agents with deep observability. **Baseten** reported high throughput (650 TPS) for **GPT-OSS 120B** on NVIDIA hardware. **Hugging Face InspectAI** added inference provider integration for cross-provider evaluation. **Thinking Machines Tinker** abstracts distributed fine-tuning for open-weight LLMs like **Qwen3** and **Llama 3**. In China, **MiniMax M2** shows competitive performance with top models and is optimized for agents and coding, while **Zhipu GLM-4.6-Air** focuses on reliability and scaling for coding tasks. Rumors suggest **Gemini 2.5 Flash** may be a >500B parameter MoE model, and a possible **GPT-5.1 mini** reference appeared. Outside LLMs, **Tahoe-x1 (3B)** foundation model achieved SOTA in cancer cell biology benchmarks. Research from Stanford introduces a method to detect model provenance via training-order "palimpsest" with strong statistical guarantees.

Canonical issue URL

a quiet day.

AI News for 10/23/2025-10/24/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (198 channels, and 6241 messages) for you. Estimated reading time saved (at 200wpm): 457 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Members of the AIE CODE Expo were announced today.


AI Twitter Recap

Serving and Production Platforms: vLLM x NVIDIA, Mistral AI Studio, Baseten performance, InspectAI evals

China model race: MiniMax M2 surge; Zhipu GLM-4.6-Air update

Research and Safety: model provenance, reward hacking, continual learning, RL post-training

Agents, Memory, and Dev Tooling

Open-source end-to-end: Karpathy’s nanochat

Multimodal and OCR wave

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. AI Model and Workflow Releases

2. ChatGPT in Personal and Educational Contexts

3. Pop Culture AI Imaginations


AI Discord Recap

A summary of Summaries of Summaries by X.ai Grok-4

Theme 1. AI Models Spark Hype and Skepticism

Theme 2. Coding Tools Clash in Cost Wars

Theme 3. Hardware Hacks Heat Up

Theme 4. Research Papers Probe AI Limits

Theme 5. Scam Alerts and User Gripes


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


Cursor Community Discord


OpenAI Discord


LM Studio Discord


OpenRouter Discord


Modular (Mojo 🔥) Discord


DSPy Discord


GPU MODE Discord


HuggingFace Discord


Eleuther Discord


Latent Space Discord


Yannick Kilcher Discord


Moonshot AI (Kimi K-2) Discord


Manus.im Discord Discord


aider (Paul Gauthier) Discord


Nous Research AI Discord


tinygrad (George Hotz) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MCP Contributors (Official) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1185 messages🔥🔥🔥):

Referral Bounties, Comet Browser Issues, Image Generation Limits, Chat Functionality, Steam Scams


Perplexity AI ▷ #sharing (3 messages):

Computational Evidence, Claude for Life Sciences, Abstract Image Generation


LMArena ▷ #general (952 messages🔥🔥🔥):

Gemini 3, Lithiumflow's removal, NimbleBean Kling 2.5 Turbo, Tamazight Language LLM support, Code Arena Usability


LMArena ▷ #announcements (1 messages):

LMArena, minimax-m2-preview


Cursor Community ▷ #general (496 messages🔥🔥🔥):

Cursor Ultra Budgeting, Claude 4.5 Sonnet vs Thinking, Cursor Terminal Issues on Windows, Cursor Refund


Cursor Community ▷ #background-agents (2 messages):

API key source for BG agent status reports, Background agent ratings


OpenAI ▷ #annnouncements (2 messages):

ChatGPT Atlas, Shared Projects Expansion


OpenAI ▷ #ai-discussions (366 messages🔥🔥):

Claude Sonnet 4.5 vs Gemini 2.5 Pro, Sora Code, MultiModal AI, GPT-OSS-120B, AgentML Open Sourced


OpenAI ▷ #gpt-4-discussions (13 messages🔥):

OpenAI support, GPT outage, Microsoft Copilot GPT5 breakdown, Builder Profile verification


OpenAI ▷ #prompt-engineering (52 messages🔥):

Precise Prompt Engineering, Personal GPTs for Prompt Generation, Markdown, XML, JSON, and YAML Prompting, Sora Physics Issues, Integrating Pictures in Video


OpenAI ▷ #api-discussions (52 messages🔥):

Sora Physics Issues, Prompt Engineering for Image Generation, GPTs for Prompt Refinement, Markdown, XML, JSON, and YAML Prompting, GPT-5-Codex Instruction Files


LM Studio ▷ #general (127 messages🔥🔥):

LM Studio platform differences, Qwen 3 VL models, MCP server reliability, CPU usage anomalies, LLM tool usage


LM Studio ▷ #hardware-discussion (51 messages🔥):

5950x as a server processor, Mixed GPUs, Modded MI50s, eGPU docks, PCIE impact on inference


OpenRouter ▷ #general (139 messages🔥🔥):

Rate-limited error responses, Sora 2 code, Purchasing points, Deepseek OCR model, GPT-5 emotional intelligence


OpenRouter ▷ #new-models (1 messages):

Readybot.io: OpenRouter - New Models


OpenRouter ▷ #discussion (25 messages🔥):

OpenRouter's native /v1/completions request support, MoonshotAI's kimi-cli, Sloppy Creative writing prevention


Modular (Mojo 🔥) ▷ #mojo (132 messages🔥🔥):

Julia autovectorization vs Mojo, SIMD Operations in Mojo, Ark.jl Benchmark, Mojo Iterator Interface, Property Testing Framework


DSPy ▷ #show-and-tell (1 messages):

Instagram Analyzer, Automated Instagram analysis


DSPy ▷ #papers (1 messages):

lidar36: They just added the code


DSPy ▷ #general (86 messages🔥🔥):

ReAct Module Granularity, Framework Frustrations, DSPy vs Langchain, Google Vista & DSPy, Monkey Patching


GPU MODE ▷ #general (8 messages🔥):

Text Diffusion Inference, vLLM inference serving, torchcomms/ncclx PT conference session


GPU MODE ▷ #torch (1 messages):

GIL, Priority Inversion


GPU MODE ▷ #cool-links (1 messages):

vipul_todo_18: https://www.stephendiehl.com/posts/mlir_gpu/

talks about MLIR to PTX lowering


GPU MODE ▷ #torchao (2 messages):

HQQ+ blog post, mobiusml github, dropbox github


GPU MODE ▷ #off-topic (6 messages):

Mobius Labs, Personal News, Acquisition, Electric Grill


GPU MODE ▷ #irl-meetup (2 messages):

Netherlands Meetup, European Meetup


GPU MODE ▷ #intel (1 messages):

vk_cooperative_matrix_perf, roofline.png


GPU MODE ▷ #submissions (3 messages):

Grayscale B200, Grayscale H100, Grayscale A100, Grayscale L4, Prefixsum A100


GPU MODE ▷ #factorio-learning-env (1 messages):

Factorial Learning Environment, Reinforcement Learning Projects


GPU MODE ▷ #cutlass (5 messages):

Nsight Python, CUTLASS Python stack, CuTE talk slides


GPU MODE ▷ #singularity-systems (6 messages):

SITP, picograd, lazy semantics, torchdynamo, EagerTensor vs LazyTensor


GPU MODE ▷ #irl-accel-hackathon (43 messages🔥):

H100 availability, Hackathon Waitlist, Dynamic SASS Kernel Instrumentation with nvbit, Memory Allocators on GPU, PyTorch Distributed Hacking


GPU MODE ▷ #opencl-vulkan (1 messages):

erichallahan: New spec update https://www.phoronix.com/news/Vulkan-1.4.330-Released


GPU MODE ▷ #llmq (1 messages):

NPU, CPU Offloading


GPU MODE ▷ #helion (6 messages):

Helion vs Triton, Cudagraph support, Kernel hyperparams


HuggingFace ▷ #general (63 messages🔥🔥):

zero3 config, Text-SAL, AI infrastructure collaboration, ROMA (Reasoning Over Multiple Agents), synthetic data gen


HuggingFace ▷ #today-im-learning (1 messages):

waffles1: Ah yes this is totally legit


HuggingFace ▷ #i-made-this (4 messages):

Pacific-Prime model, 6GB VRAM check, Zero Amnesia AI, Night Learn Engine, RAG Pipeline


HuggingFace ▷ #NLP (1 messages):

yusarseph: hello, is hugface inference endpoints servless ? do we pay for what we dont use ?


HuggingFace ▷ #smol-course (2 messages):

Karpathy Server, HF, nanochat-students, MLX Porting, MLX Stability


HuggingFace ▷ #agents-course (5 messages):

Agents course unit 4, 404 Error


Eleuther ▷ #general (17 messages🔥):

Server Acceptance Process, Distributed Inference, AI Ownership, AI Accelerator Chips, Petals Project


Eleuther ▷ #research (54 messages🔥):

50M model Loss, 1B model Validation, lm-eval, activation steering


Eleuther ▷ #interpretability-general (1 messages):

stellaathena: Okay what the hell is this nonsense: https://www.arxiv.org/abs/2510.15511


Latent Space ▷ #ai-general-chat (31 messages🔥):

gpt-4o-transcribe-diarize, GPT-5, Cursor Enterprise, Kimi Code CLI, Cohere's AI Win


Latent Space ▷ #private-agents (5 messages):

Local AI Apps, QA on Scanned PDFs, OpenWebUI, Qwen3-vl-4b


Latent Space ▷ #genmedia-creative-ai (7 messages):

Video Models, MJ, Kling, LTX-2, a16z


Yannick Kilcher ▷ #general (28 messages🔥):

Mythworx AI, ARC-AGI 1, Elastic Weight Consolidation, Activation-aware Weight Quantization (AWQ), Cherry-picked verifications


Yannick Kilcher ▷ #paper-discussion (7 messages):

Transformer Circuits Linebreaks Paper, Neuronpedia Attribution Graphs, Gemma-2-2b Line Break Attribution, Qwen3-4b Line Break Attribution


Yannick Kilcher ▷ #ml-news (3 messages):

Genie 3 World Model, Google, David Sacks, Donald Trump


Moonshot AI (Kimi K-2) ▷ #general-chat (37 messages🔥):

Chutes vs Moonshot AI, Kimi K2, Data Policy, Uptime, Tool call accuracy


Manus.im Discord ▷ #general (35 messages🔥):

Manus Network connection error, Manus credits usage, Claude Code vs Manus, Manus Room database, Manus deprecated code


aider (Paul Gauthier) ▷ #general (15 messages🔥):

Gemini Pricing, Aider vs. Codex, aider-ce Community Fork, RAG with GitHub Copilot


aider (Paul Gauthier) ▷ #questions-and-tips (1 messages):

Aider's future and development, aider-ce feature set


Nous Research AI ▷ #general (4 messages):

ChatGPT emoji bug, Unreal Engine competitor


Nous Research AI ▷ #ask-about-llms (4 messages):

Nous Research Models, YARN Context Scaling, Western Ideological Views in GPT


Nous Research AI ▷ #research-papers (2 messages):

Mech Interp Surgeon's Bag, RL Desirability Questioned


Nous Research AI ▷ #interesting-links (1 messages):

MARL Consensus, Hamiltonian Path Problem, BFT consensus algorithm


Nous Research AI ▷ #research-papers (2 messages):

Mech Interp Surgeon's Bag, RL Desirability, Limits of Scale


tinygrad (George Hotz) ▷ #general (8 messages🔥):

Becoming a Tinygrad Dev, Mojo and AI Compilers, AI Box Recommendations