Frozen AI News archive

The Karpathy-Dwarkesh Interview delays AGI timelines

This issue highlights the **Karpathy interview** as the major event, alongside discussion of reasoning improvements without reinforcement learning, with **test-time sampling** reported to reach GRPO-level performance. Critiques of context window marketing put effective limits near **64K tokens**, while **Claude Haiku 4.5** shows competitive reasoning speed. **GPT-5** struggles with advanced math benchmarks, and data quality issues termed "Brain Rot" affect model reasoning and safety. In agent frameworks, **Anthropic Skills** enable modular coding workflows, **OpenAI Codex IDE** extensions enhance developer productivity, and **HuggingChat Omni** introduces meta-routing across 100+ open models using **Arch-Router-1.5B**. LangChain and LlamaIndex advance graph-first agent infrastructure, while **Google Gemini** integrates with Google Maps for real-world grounding.

Hard work is all you need

AI News for 10/16/2025-10/17/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (197 channels and 4036 messages) for you. Estimated reading time saved (at 200wpm): 321 minutes. Our new website is now up with full metadata search and a beautiful vibe-coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

The much-anticipated Karpathy interview dropped this week and was instantly the talk of the town.

Just go watch:

https://youtu.be/lXUZvyajciY


AI Twitter Recap

Reasoning without RL: sampling-based gains, long-context reality checks, and eval trends

Agent frameworks and tooling: Skills, IDEs, routing, and real-world grounding

Vision and document intelligence surge

Research highlights: science, RL, and decoding efficiency

Infra and performance: serving, TFLOPs, and Apple ML

Open-source momentum and geopolitics

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Qwen3-0.6B Instruction Following Test

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. AI Model and Benchmark Announcements

2. AI's Impact on Society and Emotions

3. Energy Consumption and AI Infrastructure


AI Discord Recap

A summary of Summaries of Summaries by gpt-5

1. New Multimodal and On-Device Models

2. Agentic Search and Retrieval Systems

3. GPU Kernels and Multi-GPU Frameworks

4. Infra and Funding Moves

5. Open-Source Hardware/Software and RAI Tooling

gpt-5-mini

1. Agentic retrieval & SWE-grep

2. Multimodal & video-generation push

3. Low-bit, quantization & hardware tooling

4. Orchestration, memory systems & OpenRouter tooling


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


OpenAI Discord


Cursor Community Discord


OpenRouter Discord


LM Studio Discord


Unsloth AI (Daniel Han) Discord


HuggingFace Discord


Latent Space Discord


GPU MODE Discord


DSPy Discord


Eleuther Discord


Nous Research AI Discord


Manus.im Discord


Moonshot AI (Kimi K-2) Discord


Yannick Kilcher Discord


Modular (Mojo 🔥) Discord


tinygrad (George Hotz) Discord


aider (Paul Gauthier) Discord


MLOps @Chipro Discord


MCP Contributors (Official) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.




Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1040 messages🔥🔥🔥):

Claude Censorship, Comet Browser, Perplexity Pro, AI Models, referral program


Perplexity AI ▷ #sharing (4 messages):

Perplexity AI App, Shareable threads


Perplexity AI ▷ #pplx-api (4 messages):

Spaces new chat issues, API credit request


LMArena ▷ #general (709 messages🔥🔥🔥):

Sora 2 Pro Access, GPT-5 Pro vs Codex, Ocean AI and XAI Model Vail, Gemini 3 Release, Flash Lite Preview


LMArena ▷ #announcements (1 message):

Claude-Haiku-4-5, Text Arena Leaderboard


OpenAI ▷ #ai-discussions (445 messages🔥🔥🔥):

Consistent Outputs with AI, GPT-5 Coding Apps, Gemini 2.5 Pro vs Claude Sonnet, AI Text Detection, Sora 2 Video Generation


OpenAI ▷ #gpt-4-discussions (11 messages🔥):

AI Voice Assistant, Sora Global VPN, Tech Discord Security


OpenAI ▷ #prompt-engineering (23 messages🔥):

futuristic robot prompt, Sora 2 AI, viral video prompt, jujutsu kaisen vs goku prompt, Sora's image recognition


OpenAI ▷ #api-discussions (23 messages🔥):

Text-to-image prompts, Copyrighted Image Generation, Sora AI capabilities, Extended fight scenes without word limit


Cursor Community ▷ #general (383 messages🔥🔥):

Repo Mapping to Cursor Account, Perplexity Comet Invite & ChatGPT Promo, Games Inventory UI Overhaul, Cursor's Blip, Platform UI Changes


OpenRouter ▷ #app-showcase (124 messages🔥🔥):

True Remembering AI, deterministic model agnostic Framework, objective metrics, nochain orchestrator


OpenRouter ▷ #general (126 messages🔥🔥):

Combining reasoning with web search, Audio processing models, Image inputs in Responses API, Cloud for ComfyUI, Security vulnerability


OpenRouter ▷ #new-models (2 messages):


OpenRouter ▷ #discussion (28 messages🔥):

OR stance on country requirements, GPT erotica, Dipsy V3.2, ChatGPT Active Users, Fake AI products/papers


LM Studio ▷ #general (81 messages🔥🔥):

Scammer Alert, Great Uncensored Finetuners, LM Studio and JavaScript Animations, LM Studio MCP and OpenHands Integration, System Prompts Parsing


LM Studio ▷ #hardware-discussion (167 messages🔥🔥):

DDR5-8000 Speed, GPU airflow, Mixing 1060 with 3070, LLMs for Medical Use, GPU Hardware Modification


Unsloth AI (Daniel Han) ▷ #general (87 messages🔥🔥):

Docker Image Update Frequency, Merging LoRA Adapters, SmolVLM2 Fine-tuning, Gemma 3-4B Loading Options, Kokoro TTS Finetune Notebook


Unsloth AI (Daniel Han) ▷ #introduce-yourself (8 messages🔥):

Freelancer introductions, LLM integration and blockchain, RAG pipelines


Unsloth AI (Daniel Han) ▷ #off-topic (50 messages🔥):

Qwen 2 VL 2B, Apple FastVLM-1.5B, Liquid FM2 VL 450M, Gemma 3 12B Instruct VL, LFM2-VL models


Unsloth AI (Daniel Han) ▷ #help (54 messages🔥):

GGUF model file naming conventions, Unsloth Dynamic Quantization, PIL import error, vLLM integration issues, Qwen2.5 7B OOM issues


Unsloth AI (Daniel Han) ▷ #showcase (3 messages):

Legal move attempts, Move hallucination


Unsloth AI (Daniel Han) ▷ #research (13 messages🔥):

BitNet performance, Microsoft BitNet GitHub, 1.58-bit equivalence


HuggingFace ▷ #general (172 messages🔥🔥):

Access Token Permissions, HuggingChat Limits, Model Context Length, Prompt Injection Mitigation, AI Infrastructure


HuggingFace ▷ #today-im-learning (2 messages):

Influence Functions, Research Collaboration


HuggingFace ▷ #cool-finds (2 messages):

Qwen3 Vision model, NexaAI, GGUF


HuggingFace ▷ #i-made-this (5 messages):

FRAI, Responsible AI, YouTube Content, Agent Tutorial


HuggingFace ▷ #core-announcements (1 message):

Custom Blocks in Diffusers, Modular Diffusers, Pipeline Blocks


HuggingFace ▷ #computer-vision (1 message):

text conditioned image generation, dynamic action shots, pixelated art style images, serene atmospheres in images


HuggingFace ▷ #NLP (1 message):

Chat Template Conversion, Tokenizer Usage, Fine-Tuning Script Execution


HuggingFace ▷ #smol-course (5 messages):

LoRA/PEFT training with HF jobs, Hyperparameter Optimization, Lighteval's compatibility with LoRA adapters, Pushing models to Hugging Face Hub without HF Jobs


HuggingFace ▷ #agents-course (5 messages):

agents-course intro, New students joining


Latent Space ▷ #ai-general-chat (92 messages🔥🔥):

Cognition SWE-grep, MobileLLM-Pro, Anthropic/Google TPU Partnership, HeyGen ARR, OpenAI Physics Initiative


Latent Space ▷ #private-agents (5 messages):

M4 Max, Ollama, LM Studio, Local LLM Performance, Qwen Next 80B


Latent Space ▷ #genmedia-creative-ai (9 messages🔥):

AI Granny, OpenAI Sora MLK Likeness


GPU MODE ▷ #general (16 messages🔥):

Maxwell Disassembler & Jetson Nano, Hopper GPUs for AI/Quantum, US GPU Restrictions & China, GPU Mode Distributed GPU Talks


GPU MODE ▷ #triton (2 messages):

Distributed Triton, Non-ML Kernels with Triton DSL


GPU MODE ▷ #cuda (10 messages🔥):

TMA Multicast Bandwidth, cuTensor L2 Promotion, cp.reduce.async.bulk Memory Ordering, Thread Block vs CTA, Perl modules for patching CUBIN files


GPU MODE ▷ #torch (7 messages):

PyTorch Free-Threading, Accessing Backward Functions, GELU Backward API


GPU MODE ▷ #jobs (1 message):

SF Startup, GPU performance, PyTorch, CUDA kernels, Pac Heights


GPU MODE ▷ #beginner (1 message):

zlu86: You should be good to go, it's general enough


GPU MODE ▷ #torchao (2 messages):

SGLang, vLLM, torchao, Quantization


GPU MODE ▷ #off-topic (1 message):

geohot, Image Analysis


GPU MODE ▷ #irl-meetup (1 message):

arseniivanov: I am at Lund University, but the HPC scene is kind of non-existent here tbh :/


GPU MODE ▷ #self-promotion (4 messages):

Iris multi-GPU programming framework, Gluon backend, NVIDIA backend, Scale-out and RDMA support, Metal backend


GPU MODE ▷ #thunderkittens (7 messages):

H100 attention kernels, ThunderKittens ROCm release, Fixing broken kernels, warp operations


GPU MODE ▷ #submissions (12 messages🔥):

VectorAdd Leaderboard Updates, B200 Performance, L4 Performance, A100 Performance, H100 Performance


GPU MODE ▷ #factorio-learning-env (7 messages):

Sphinx Docs, Factorio Learning Environment


GPU MODE ▷ #amd-competition (2 messages):

Discord user anuragj0803, Discord user meem, Amazing event, Dev day


GPU MODE ▷ #cutlass (2 messages):

PTX Documentation, CUDA Threads as SIMD Lanes, CuTe Layout Plotting


GPU MODE ▷ #singularity-systems (5 messages):

tinygrad compiler design, picograd architecture, SITP goals, Karpathy's influence on tinygrad, Eureka Starfleet academy


GPU MODE ▷ #low-bit-training (2 messages):

BitNet distillation, RL


GPU MODE ▷ #irl-accel-hackathon (2 messages):

Kernel Optimization, Distributed Frameworks, Consumer Devices, Distributed Inference, Distributed Training


GPU MODE ▷ #cluster-management (2 messages):

Fault Tolerant Llama Training, Node Failure Prediction


GPU MODE ▷ #helion (1 message):

jongsokchoi: GPU mode talk starting now! https://www.youtube.com/watch?v=1zKvCLuvUYc


DSPy ▷ #general (32 messages🔥):

Anthropic agentic search, LangGraph's verbose boilerplate, Agentic Search vs Semantic Search, Groq not working in OpenRouter


Eleuther ▷ #general (20 messages🔥):

PersonaLLM Workshop, Custom Logit Processors, AI for offensive purposes


Eleuther ▷ #research (12 messages🔥):

Midtraining survey, MaskDiT, Attribution graphs from MLPs to attention, LLMs and TREAD


Nous Research AI ▷ #general (27 messages🔥):

Libtorch conversion, PersonaLLM Workshop, UK pricing, Prompt logging policies, GLM 4.6 vs Claude coding


Nous Research AI ▷ #research-papers (2 messages):

New arXiv Paper


Nous Research AI ▷ #research-papers (2 messages):

arXiv papers


Manus.im Discord ▷ #general (29 messages🔥):

Loading Errors and Agent Mode Issues, Prohibition of Selling Credits, Manus Workshop Promotion, Refund Request, Coffee Shop Tool


Moonshot AI (Kimi K-2) ▷ #general-chat (24 messages🔥):

Kimi K2 finetuning, Kimi vs Deepseek, Moonshot vs Deepseek


Yannick Kilcher ▷ #general (14 messages🔥):

Illegalize Linux, Children Operating Systems, AGI Definition, Emulate Ground Truth Data Distribution, Weekly Tautological Counter


Yannick Kilcher ▷ #ml-news (3 messages):

Qwen3 Vision Model, Open Sourcing Older Models, Protecting Best Tricks


Modular (Mojo 🔥) ▷ #general (2 messages):

Google Coral NPU, Apache 2 Licensing, RV32 Cores, Mojo Portability Testing


Modular (Mojo 🔥) ▷ #mojo (13 messages🔥):

TUI Frameworks for Mojo, Audio and MIDI 2.0, Jack Bindings, Mojo Origins vs Rust Lifetimes


Modular (Mojo 🔥) ▷ #max (1 message):

MAX Python API Open Source


tinygrad (George Hotz) ▷ #general (5 messages):

IMAGE, NOLOCALS, and GRAPH_ONE_KERNEL Confusion, DEV= Default Device Setting, Speed Regressions Despite Tests, Generic Compilation Tests, Distributed Systems and GPU Memory


aider (Paul Gauthier) ▷ #general (4 messages):

aider fork with MCP support, GPT-5-nano removed from commit messages in aider-ce


aider (Paul Gauthier) ▷ #questions-and-tips (1 message):

Filename Typing, Feature Requests, Aider Performance


MLOps @Chipro ▷ #events (2 messages):

MLOps Workshop, LWP Labs, ML Model Deployment