Frozen AI News archive

not much happened today

**Anthropic** announces a new CTO. Frontier coding agents see updates, with **Claude Sonnet 4.5** showing strong cybersecurity and polished UX but trailing **GPT-5 Codex** in coding capability. **xAI Grok Code Fast** claims higher edit success at lower cost. **Google's Jules** coding agent launches a programmable API with CI/CD integration. **Qwen** clarifies its model taxonomy and API tiers. Vision/LM Arena rankings show tight competition among **Claude Sonnet 4.5**, **Claude Opus 4.1**, **Gemini 2.5 Pro**, and OpenAI's latest models. In video generation, **Sora 2 Pro** leads App Store rankings with rapid iteration and a new creator ecosystem; early tests show it answers GPQA-style questions at 55% accuracy versus GPT-5's 72%. Video Arena adds new models like **Luma's Ray 3** and **Kling 2.5** for benchmarking. Multi-modal video+audio generation model **Ovi** (Veo-3-like) is released. Retrieval models include **ModernVBERT** from MIT with efficient image-text retrieval capabilities. Notable quotes: *"Claude Sonnet 4.5 is basically the same as Opus 4.1 for coding"* and *"Jules is a programmable team member"*.

The calm before DevDay.

AI News for 10/2/2025-10/3/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (196 channels and 10,895 messages) for you. Estimated reading time saved (at 200wpm): 758 minutes. Our new website is now up with full metadata search and a beautiful vibe-coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

gm, Anthropic has a new CTO.


AI Twitter Recap

Frontier coding agents and model standings (Claude 4.5, Grok Code Fast, Google’s Jules, Qwen naming, Arena leaderboard)

Video generation surge: Sora 2 Pro momentum, evaluation, and a broader model stack

Retrieval, VLMs, and perception models (ModernVBERT, Jina v3, RF-DETR, π0.5 robotics)

Reasoning, RL, and verifiers (PPO/GRPO, RESTRAIN, ExGRPO, RLAD, TUMIX, CLUE, RoT)

Efficiency, quantization, and infra (FP8, SINQ, MLX, CPU MoE, QAT, sampling, training control)

Industry and research signals (Sakana x Daiwa, Terence Tao + GPT-5, xLSTM scaling laws, Comet)

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. LLM Efficiency and Benchmarks: Huawei SINQ Quantization + GLM 4.6 Tool-Calling Performance

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Sora 2 and Latest Text-to-Video Demo Reels

2. GPT-5 Thinking Wikipedia Audits and Research Assistance

3. AI in Education: Teacher Adoption and Student Legal Cases


AI Discord Recap

A summary of Summaries of Summaries by GPT-5

1. Agentic Dev Tools: Comet, Solveit, Chrome DevTools MCP

2. GPU Performance & Quantization Engineering

3. Datasets, Leaderboards, and Model Lineup Moves

4. Agent Protocols, Formats, and Access-as-Code

5. Local Inference Performance: vLLM, Memory Bandwidth, Qwen3 TPS


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


OpenAI Discord


Unsloth AI (Daniel Han) Discord


Cursor Community Discord


LM Studio Discord


GPU MODE Discord


OpenRouter Discord


Nous Research AI Discord


Latent Space Discord


HuggingFace Discord


Modular (Mojo 🔥) Discord


Eleuther Discord


MCP Contributors (Official) Discord


DSPy Discord


Yannick Kilcher Discord


Manus.im Discord


aider (Paul Gauthier) Discord


Moonshot AI (Kimi K-2) Discord


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.



Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #announcements (2 messages):

o3 Deprecation, Comet Browser Release, Background Assistants, Slack Connector, Claude Sonnet 4.5


Perplexity AI ▷ #general (1108 messages🔥🔥🔥):

Rocket.new, DeepSeek, Copilot, Grok, Comet Browser


Perplexity AI ▷ #sharing (5 messages):

Perplexity AI, bootstrap paradox, versatile ai, tesla cybertruck lawsuit, Microsoft detects linux


Perplexity AI ▷ #pplx-api (3 messages):

Sonar-pro 403 errors, Firebase function servers, IP address blocking, Perplexity API issues


LMArena ▷ #general (1092 messages🔥🔥🔥):

GPT-5 vs GPT-4o, Sora 2 access, Gemini 3 Pro, Jailbreaking AI models


LMArena ▷ #announcements (3 messages):

Claude Sonnet 4.5, LMArena Text Leaderboard, IBM Granite, Ray-3 Video Model


OpenAI ▷ #annnouncements (1 message):

GPT-5 Instant, distress support, model updates


OpenAI ▷ #ai-discussions (1020 messages🔥🔥🔥):

Sora Downgrade and Censorship, AI and Human Creativity, PhilosoBots, Gemini Hack, Ethical Concerns AI


OpenAI ▷ #gpt-4-discussions (4 messages):

Sora as Social Media, Sora + ChatGPT, GPT Date Time Stamp across chats


OpenAI ▷ #prompt-engineering (9 messages🔥):

Portrait vs Landscape image generation, Saving image/art prompts


OpenAI ▷ #api-discussions (9 messages🔥):

Portrait vs Landscape Image Generation, Saving image/art prompts, Consistent JSON context profile for infographics with Sora


Unsloth AI (Daniel Han) ▷ #general (416 messages🔥🔥🔥):

Granite 4.0 Quants, Qwen3 30B Performance, PyTorch 2.8.0 Issues, Ring and Ling Series LLMs, GLM 4.5 Air Availability


Unsloth AI (Daniel Han) ▷ #introduce-yourself (1 message):

bridgelessalex: su p


Unsloth AI (Daniel Han) ▷ #off-topic (334 messages🔥🔥):

Pizza toppings, Qwen finetuning challenges, Overtrained vs undertrained models, AI music generation


Unsloth AI (Daniel Han) ▷ #help (227 messages🔥🔥):

GGUF/Ollama conversion guide, Disable multiprocessing problem, Seq2Seq Task for Gemma 3, Loss Masking With Unsloth


Unsloth AI (Daniel Han) ▷ #showcase (1 message):

GRPO, SFT, AI Safety, Structured Outputs


Unsloth AI (Daniel Han) ▷ #research (9 messages🔥):

Metis quantization-aware training, Qwen model quantization efficiency, Sparsity in MoEs, Training on detailed verbal feedback


Cursor Community ▷ #general (435 messages🔥🔥🔥):

Cursor Cost Analysis, Cursor Ambassador, GPT-5 vs Claude, Better Auth Theme, Agent Terminal


LM Studio ▷ #general (166 messages🔥🔥):

Qwen3 performance, vLLM, LMS steam log, glm-4.5-air vs qwen3-coder, context length 131000


LM Studio ▷ #hardware-discussion (145 messages🔥🔥):

DDR3 vs DDR4, LM Studio GPU Split, Qwen3 Coder, GPT OSS 120b


GPU MODE ▷ #general (49 messages🔥):

GPU performance engineering career, DeepSeek sparse attention CUDA implementation, Partial RoPE, GPU compute resources, Hopper and Blackwell SM quadrants


GPU MODE ▷ #cuda (34 messages🔥):

NVIDIA Blackwell Architecture, AMD Matrix Cores, NVIDIA Hopper GPU Architecture, CUDA mbarriers vs regular barriers, Citadel's microbenchmarking papers


GPU MODE ▷ #torch (2 messages):

Dynamo Sparse Tensors, ONNX Runtime Build


GPU MODE ▷ #cool-links (4 messages):

LLMs optimize GPU performance, KernelBench project, AI workloads advance


GPU MODE ▷ #jobs (1 message):

schizik12: <@325883680419610631> spam


GPU MODE ▷ #beginner (1 message):

GPU Experience, GEMM, cuBLAS, Kernel Optimizations, FlashAttentionX


GPU MODE ▷ #pmpp-book (1 message):

C++ for the 5090, Vibe Coding Frustrations


GPU MODE ▷ #youtube-recordings (1 message):

Kernel Benchmarking, GPU Kernel Performance


GPU MODE ▷ #torchao (3 messages):

INT4 Quantization, TorchAO, TensorCore, TinyGemm library, Efficient Kernels


GPU MODE ▷ #irl-meetup (1 message):

josephtracyvoltagepark_53706: I am going to the PyTorch conference! would love to meet up


GPU MODE ▷ #rocm (24 messages🔥):

Making a profiler, SQTT registers, Instruction level profiling in a GUI, Radeon GUI, Stochastic sampling


GPU MODE ▷ #lecture-qa (1 message):

marksaroufim: <@1173619488730665011> I'm down if you are


GPU MODE ▷ #self-promotion (6 messages):

Speculative Decoding, Benchmarking Kernels, AWQ Quantization, CuTe Layouts


GPU MODE ▷ #submissions (2 messages):

MI300x8 Performance, amd-ag-gemm, amd-gemm-rs


GPU MODE ▷ #hardware (2 messages):

Homelab Builds, Livestream Rig Build


GPU MODE ▷ #tpu (1 message):

Kernel issues, Jupyter Notebook, Run all cells


GPU MODE ▷ #factorio-learning-env (10 messages🔥):

FLE 0.3.0 on Mac M1/M2, Private Discord Meeting


GPU MODE ▷ #amd-competition (2 messages):

pyrocshmem, GitHub Repo Inquiry


GPU MODE ▷ #cutlass (51 messages🔥):

GMEMI pattern recognition, CuteDSL roadmap, Uniform Registers, CooperativeGroup alignment


GPU MODE ▷ #multi-gpu (31 messages🔥):

Tensor Transfer Time, Profiling Tools, CUDA Events, GPU Clocks, MoE Training


OpenRouter ▷ #app-showcase (1 message):

eofr: Yayy, thank you :D


OpenRouter ▷ #general (174 messages🔥🔥):

NSFW roleplay, Gemini Pro issues, BYOK setup on OpenRouter, Sonnet 4.5 arguing, Free multimodal models


OpenRouter ▷ #discussion (14 messages🔥):

Cerebras Llama Removal, K2-THINK on Cerebras, ByteDance Seed LLM on OpenRouter, OpenAI's ZDR


Nous Research AI ▷ #general (157 messages🔥🔥):

Qwen 3 30B A3B, DGX Spark, Sora 2 Limitations, ComfyUI workflow exposed


Nous Research AI ▷ #research-papers (1 message):

Sparse Autoencoders, Goodfire AI, LLM Deception, Autolabel Gap


Nous Research AI ▷ #interesting-links (2 messages):

AI Consciousness, Emergent AI


Nous Research AI ▷ #research-papers (1 message):

Sparse Autoencoders, LLM Deception, Goodfire AI


Latent Space ▷ #ai-general-chat (114 messages🔥🔥):

Prime Intellect AMA, Jules Tools CLI, AI Capex Bubble, Nitter HTTP 429, Comet Browser Global Launch


Latent Space ▷ #genmedia-creative-ai (12 messages🔥):

Sora-TikTok Automation, Sora 2 Puppet Explainer Videos, Pika's Swift Takeover


HuggingFace ▷ #general (94 messages🔥🔥):

Gemini Vision, Leaving AI for Blacksmithing, Ollama tool support, ArXiv dataset, HF billing support


HuggingFace ▷ #i-made-this (3 messages):

Debate Site Using LLMs, Operating System for AI Behavior, RAG Chatbot for Neovim


HuggingFace ▷ #smol-course (11 messages🔥):

LocalLlama, TRL docs, DPO Section Quiz


HuggingFace ▷ #agents-course (4 messages):

SmolAgent documentation discrepancies, ToolCallingAgents paradigm, GAIA exercise errors


Modular (Mojo 🔥) ▷ #general (10 messages🔥):

Qualcomm, Mojo Manual, Modular contact


Modular (Mojo 🔥) ▷ #mojo (43 messages🔥):

Mojo with Dask or PySpark, Mojo custom framework, MLIR-level optimizations, MAX API return in Mojo, Mojo Networking Options


Eleuther ▷ #general (6 messages):

Underrated LLM Pretraining Papers, Diffusion Model Evaluation, Sora 2 Manual Human Eval, Gemma's Architecture vs Qwen's


Eleuther ▷ #research (19 messages🔥):

Masked Autoencoders, Mixup Augmentation, Neural Radiance Fields, Deepseek Context Length Increase


Eleuther ▷ #scaling-laws (5 messages):

MLP Design, Linear Attention Variants, AUNNs


Eleuther ▷ #interpretability-general (1 message):

Interp Agents, Goodfire AI


MCP Contributors (Official) ▷ #mcp-dev-summit (2 messages):

Profiles, MCP Conf


MCP Contributors (Official) ▷ #general (11 messages🔥):

GitHub team management, infrastructure-as-code migration, access control, repository permissions, team memberships


MCP Contributors (Official) ▷ #general-wg (12 messages🔥):

Feature Support Matrix, Server Capabilities, Typescript SDK, Icons Metadata


DSPy ▷ #general (24 messages🔥):

chat adapter default, XML format promotion, Tool use models formats, DSPy roadmap, ReAct trajectories


Yannick Kilcher ▷ #general (8 messages🔥):

GPTs vs. Gemini, Meta AI Research Shift, AI Sex Robots


Yannick Kilcher ▷ #paper-discussion (3 messages):

bacteriophages, genome language models, computational biology


Yannick Kilcher ▷ #ml-news (11 messages🔥):

Oracle OpenAI datacenters, LLM RL generalization, Reasoning tokens in LLMs, IRL exploration


Manus.im Discord ▷ #general (17 messages🔥):

Global USD Pricing Model, Memory Key, AI interaction, Manus's memory architecture


aider (Paul Gauthier) ▷ #general (10 messages🔥):

aider-desk, SST/OpenCode, chrome mcp, GLM4.6, Deepseek


aider (Paul Gauthier) ▷ #questions-and-tips (4 messages):

Polyglot LLM Evaluation, Aider Scala Code Generation Depth, Openrouter Caching Issues