Frozen AI News archive

not much happened today

**FrontierMath Tier 4** results show **GPT-5 Pro** narrowly outperforming **Gemini 2.5 Deep Think** in reasoning accuracy, with concerns about problem leakage clarified by **Epoch AI Research**. **Mila** and **Microsoft** propose **Markovian Thinking** to improve reasoning efficiency, enabling models to reason over 24K tokens with less compute. New research suggests base models inherently contain reasoning mechanisms, with "thinking models" learning to invoke them effectively. In systems, **NVIDIA Blackwell** combined with **vLLM** wins InferenceMAX with significant throughput gains, while **Together AI's ATLAS** adaptive speculative decoding achieves 4× speed improvements and reduces RL training time by over 60%. **SparseServe** introduces dynamic sparse attention with KV tiering, drastically improving throughput and latency in GPU memory management.

Canonical issue URL

a quiet day

AI News for 10/9/2025-10/10/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (197 channels, and 7403 messages) for you. Estimated reading time saved (at 200wpm): 586 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Second round applications for AIE CODE close in 5 days!


AI Twitter Recap

Reasoning: FrontierMath shootout, Markovian Thinking, and what “reasoning training” actually teaches

Systems and inference: Blackwell + vLLM, adaptive speculators, and sparse-attention KV tiering

Model and tooling releases

Scale, compute, and training estimates

Robotics and embodied AI

Evals, benchmarking, and governance

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

nothing met our bar

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. NVIDIA GB300 NVL72 + ComfyUI GDS Performance Updates

2. AniSora V3.2 (Wan2.2) 360° I2V and Sora-2 Demos

3. Delivery Fails: DoorDash Porch Collapse and Amazon Drops


AI Discord Recap

A summary of Summaries of Summaries by gpt-5

1. Inference Acceleration & Kernel Optimizations

2. Tiny Models, Mighty Benchmarks

3. AI Funding & M&A Roundup

4. Protocol Standards & Structured Tools


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


OpenAI Discord


OpenRouter Discord


Unsloth AI (Daniel Han) Discord


LM Studio Discord


Cursor Community Discord


GPU MODE Discord


HuggingFace Discord


Latent Space Discord


Eleuther Discord


Nous Research AI Discord


MCP Contributors (Official) Discord


Modular (Mojo 🔥) Discord


Yannick Kilcher Discord


aider (Paul Gauthier) Discord


Moonshot AI (Kimi K-2) Discord


tinygrad (George Hotz) Discord


DSPy Discord


Manus.im Discord Discord


MLOps @Chipro Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1204 messages🔥🔥🔥):

Perplexity AI Models, Comet Browser, AI code debugging, AI programming language, Coding on phones


Perplexity AI ▷ #sharing (1 messages):

GPTs Agents, OpenAI's sidebars


Perplexity AI ▷ #pplx-api (2 messages):

Search API issues, Permission Denied Error, Cloudflare deny


LMArena ▷ #general (1202 messages🔥🔥🔥):

LM Arena Pricing, Sonnet 4.5 Error, Video Arena Features, Gemini 3.0 Launch, AI Video Generation


LMArena ▷ #announcements (1 messages):

LMArena Survey, Arena Champions Program


OpenAI ▷ #ai-discussions (493 messages🔥🔥🔥):

LinuxComet Privacy Issues, Browser Recommendations, DuckDuckGo Privacy, AI Agency Business Model, Sora2 Availability in EU


OpenAI ▷ #gpt-4-discussions (17 messages🔥):

OpenAI liability waiver, Responsibility for OpenAI's part, ChatGPT Business vs Enterprise, GPT-5 Thinking Mini, MCP dev channel


OpenAI ▷ #prompt-engineering (13 messages🔥):

Sora 2 prompting, Visual learning, Prompt engineering by example


OpenAI ▷ #api-discussions (13 messages🔥):

Sora 2 Prompting, Bypassing Guidelines


OpenRouter ▷ #app-showcase (2 messages):

VST Popularity, YouTube channel subscribers and views


OpenRouter ▷ #general (401 messages🔥🔥):

BYOK payment issues, Gemini 3 release, Constraining PDF pages with API, Roleplayers and free models, Usage data issues with SSE


OpenRouter ▷ #new-models (2 messages):

``


OpenRouter ▷ #discussion (12 messages🔥):

Sambanova Deepseek R1/V3, BYOK Azure Keys Routing, ChatQwen Models


Unsloth AI (Daniel Han) ▷ #general (115 messages🔥🔥):

Qwen3-VL Support, DGX Spark performance, trl library changes, Pretraining datasets, Fine-tuning vulnerabilities


Unsloth AI (Daniel Han) ▷ #introduce-yourself (2 messages):

AI Development, Full-Stack Engineering, Blockchain Building, AI + Web3 Projects


Unsloth AI (Daniel Han) ▷ #off-topic (173 messages🔥🔥):

Datacenter cooling for municipal heating, One More Parameter, RAM Usage and Browsers, Zen Browser, Batch Size vs. Forward Pass Time


Unsloth AI (Daniel Han) ▷ #help (84 messages🔥🔥):

Unsloth installation on Amazon ml.g4dn.xlarge, GGUF model creation from Lora, Distributed FSDP runs support, WSL2 package compatibility issues, AI code agent for generating test cases


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

Qwen3-8B fine-tuning, Novel chapter training, Data Cleaning


Unsloth AI (Daniel Han) ▷ #research (27 messages🔥):

Recursive Model, HRM, ARC-AGI, Data Augmentation, GNN + Reasoning Model


LM Studio ▷ #general (290 messages🔥🔥):

Model choice recommendations, Context length impact on performance, Tool call failures, Uncensored Models


LM Studio ▷ #hardware-discussion (26 messages🔥):

Sparkle Arc Pro B60 Dual Server, Nvidia 4080 Ti Super vs Apple M3, GPU Benchmarks for LLM Inference, Server GPUs vs Multi-GPU Setups, Galax Single Slot RTX 5060 Ti GPU


Cursor Community ▷ #general (270 messages🔥🔥):

API Keys in Models Page, AutoHotkey and Cursor Integration, Cursor Plan Pricing, GPT-5 Cost, Apply button in ASK mode


Cursor Community ▷ #background-agents (6 messages):

Agents responding with Hello, Linear integration issues, Using Background Agents

  1. Create a new BA to code a new feature.
  2. Allow the BA to implement the code suggestions.
  3. Interact with the BA to review code, fix bugs, remove hallucinations.
  4. Merge the new code changes into the main branch.

GPU MODE ▷ #general (2 messages):

CUDA kernels, Trainium Platform, High-Level Books


GPU MODE ▷ #triton (12 messages🔥):

ldmatrix tiling calculation, optimal swizzling in linear layout, deterministic atomic_add reduction in Triton, Triton community meetup


GPU MODE ▷ #cuda (24 messages🔥):

Thread block execution order, Blackwell CLC, CUB library, cuda::ptx::mbarrier_try_wait, Model cudagraph capture


GPU MODE ▷ #cool-links (4 messages):

parrot lib, InferenceMAX, ATLAS, Billy Dally hardware


GPU MODE ▷ #jobs (1 messages):

GPU Performance Engineer Hiring, NVIDIA GPU Architecture, Kernel Optimization, Software-Hardware Co-design


GPU MODE ▷ #beginner (6 messages):

Distributed Training Libraries: TorchTitan vs NVIDIA-Nemo, CUDA Kernel Debugging in Visual Studio, 4D Parallelism, Megatron Core, TorchTitan's Adaptability


GPU MODE ▷ #off-topic (2 messages):

GPU Programming with CUDA, Resources for learning CUDA, CUDA Projects for Students


GPU MODE ▷ #triton-puzzles (3 messages):

Triton Projects, Picograd


GPU MODE ▷ #intel (6 messages):

Panther Lake, Xe3 Details, Celestial Architecture Speculation, Memory Bandwidth Bottleneck


GPU MODE ▷ #self-promotion (1 messages):

vipul_todo_18: Other people's promotion:

https://x.com/jyo_pari/status/1976324891545829876


GPU MODE ▷ #thunderkittens (1 messages):

ThunderKittens compilation issues, Nvidia GH200, CUDA 12.3, fp8e8m0 undefined type


GPU MODE ▷ #submissions (26 messages🔥):

amd-gemm-rs Leaderboard Updates, amd-all2all Leaderboard Submissions, MI300x8 Performance, amd-ag-gemm Submissions


GPU MODE ▷ #status (1 messages):

Runner Timeouts


GPU MODE ▷ #factorio-learning-env (4 messages):

COVID, Factorio Crime Scene


GPU MODE ▷ #amd-competition (16 messages🔥):

AMD Cluster Timeout Issues, Memory Access Fault Debugging, Jot Runner Timeout Extension


GPU MODE ▷ #cutlass (28 messages🔥):

Grouped GEMM performance for MoEs, PTX docs for K-contig and swizzling errors, Torch tensor support in CUTLASS, Compiler aborts with ldmatrix copy atoms, Debugging pipeline stalls


GPU MODE ▷ #general (1 messages):

Discord Roles, Competition Winners, AMD Competition


GPU MODE ▷ #multi-gpu (5 messages):

5070 ti super 24gb vs 5080, Distributed training libraries: torchtitan vs nvidia-nemo, Tesla P40 24GB performance


GPU MODE ▷ #irl-accel-hackathon (2 messages):

Project Teams, GPU Inference, Novel GPU Work


GPU MODE ▷ #llmq (7 messages):

Attention-Residual Recalculation, Contributor Guide, Weird Quantizations


GPU MODE ▷ #helion (2 messages):

FLA Benchmark, GDN, Mamba2, PTC Talk


HuggingFace ▷ #general (108 messages🔥🔥):

Colab Cost, Continual Learning, GPU Time on A100, Fine-tuning Llama3, HF Fellowship


HuggingFace ▷ #cool-finds (1 messages):

Hyperparameter Diffusion, Faster Image Generation, Compute Reduction


HuggingFace ▷ #i-made-this (2 messages):

BERT model for Polish tweets, MedScan AI medical tool, NLP in healthcare


HuggingFace ▷ #computer-vision (1 messages):

Custom model vs finetune, Text encoder model


HuggingFace ▷ #NLP (1 messages):

cakiki: <@892799950470144060> no cross-posting please


Latent Space ▷ #ai-general-chat (101 messages🔥🔥):

Datacurve Funding, Spellbook Funding, OpenAI AMA Cancellation, Kernel Funding & Agent Auth, Elastic Acquires Jina AI


Latent Space ▷ #genmedia-creative-ai (8 messages🔥):

Gemini Flash 2.5 Nano Banana, JSON prompting, nano-banana AI outputs, replicating nano-banana AI outputs, removing nano-banana AI outputs


Eleuther ▷ #general (15 messages🔥):

ROPE Implementation in llama.cpp, Precision Sensitivity in RoPE Calculations, Neural Theorem Proving Channels


Eleuther ▷ #research (71 messages🔥🔥):

OOD Generalization and Training Loss, Vision Language Model optimization, Cosine Decay vs Infinite LR for Scaling Laws, Warmup Stable No-Decay Training, Scalar RMSProp Adaptivity


Eleuther ▷ #interpretability-general (4 messages):

Mechanistic Interpretability, Attribution Graphs, Circuit Tracing, Model Biology


Eleuther ▷ #multimodal-general (1 messages):

Moxin-VLM, VLM-R1


Nous Research AI ▷ #general (46 messages🔥):

Atropos tutorial video, Training Qwen3-Next, Predicted Outputs in vllm, r/LocalLLaMA post removal


Nous Research AI ▷ #ask-about-llms (4 messages):

FP8 LLM Finetuning, QLoRA effectiveness, Test Time RL, LoRA Precision


Nous Research AI ▷ #research-papers (18 messages🔥):

Constant Params * Samples / Data Ratio, ThinkingMachine's LoRA Framework, LoRA vs FFT, Information Bottleneck in RL, Robust Fine Tuning Strategies


Nous Research AI ▷ #research-papers (18 messages🔥):

LoRA, FFT, 8bit, Information Bottleneck, Thinking Machines Blog


MCP Contributors (Official) ▷ #general (85 messages🔥🔥):

MCP .well-known Endpoint, Representing Image Content in structuredContent, skybridge tool and outputSchema


Modular (Mojo 🔥) ▷ #general (5 messages):

Mojo language, GPU compatibility


Modular (Mojo 🔥) ▷ #mojo (64 messages🔥🔥):

Apple Silicon M3 GPU support, Jetson Orin Nano for Robotics, Mojo Native FFT Implementation, SIMD vector width loading tricks, Complex number memory interleaving


Yannick Kilcher ▷ #general (46 messages🔥):

Forgetting in Models, IID Sampling Issues, Prompt Injection Defense, Separate Token Embeddings, Graph Neural Network Training


Yannick Kilcher ▷ #paper-discussion (6 messages):

Cat Studies, Follow up paper


Yannick Kilcher ▷ #agents (2 messages):

Camel AI updates, Berkeley LLM Agents Course


Yannick Kilcher ▷ #ml-news (2 messages):

YouTube Links, oN0nViY4gn4, 1gO2bC5xLlo


aider (Paul Gauthier) ▷ #general (18 messages🔥):

GPT-5-Codex with Aider, Deepseek v3.1 with roo code, GLM 4.6 in Aider, Default Prompt Function in Aider, Aider Chat Modes


aider (Paul Gauthier) ▷ #questions-and-tips (9 messages🔥):

Haiku for Git Commits, Custom Profiles in Openrouter, Model Specification Syntax, Weak vs. Main Model, Persona Definitions


Moonshot AI (Kimi K-2) ▷ #general-chat (23 messages🔥):

Sora invite codes, Kimi coding abilities, Hack Club & Moonshot AI, Making videos with Kimi


tinygrad (George Hotz) ▷ #general (10 messages🔥):

AI Slop in PRs, Algebraic Upat Tests, Tinygrad vs PyTorch, Reduce Group Color Change


tinygrad (George Hotz) ▷ #learn-tinygrad (11 messages🔥):

tinygrad vector operations, loop splitting, cuda_ioctl_sniffer in Rust, Winograd test failures


DSPy ▷ #show-and-tell (9 messages🔥):

Prompt Injection Tasks, DSPy Community Repo, AgentFlow


DSPy ▷ #papers (1 messages):

batmanosama: https://arxiv.org/abs/2510.05592


DSPy ▷ #general (9 messages🔥):

dspy.Tool from MCP Tool with authentication, shadcn inspiration for DSPy, TTD-DR module download, Platform/marketplace for DSPy modules, AgentFlow


Manus.im Discord ▷ #general (18 messages🔥):

Godhand AI-assisted previz creation, Manus Support, Prompt Engineering, Cloudflare integration, Claude API