Frozen AI News archive

not much happened today

**Samsung's 7M Tiny Recursive Model (TRM)** achieves superior reasoning on ARC-AGI and Sudoku with fewer layers and MLP replacing self-attention. **LeCun's team** introduces **JEPA-SCORE**, enabling density estimation from encoders without retraining. **AI21 Labs** releases **Jamba Reasoning 3B**, a fast hybrid SSM-Transformer model supporting up to 64K context tokens. **Alibaba's Qwen3 Omni/Omni Realtime** offers a unified audio-video-text model with extensive language and speech support, outperforming Gemini 2.0 Flash on BigBench Audio. **Alibaba** also debuts **Qwen Image Edit 2509**, a top open-weight multi-image editing model. **ColBERT Nano** models demonstrate effective retrieval at micro-scale parameter sizes. In reinforcement learning, **CoreWeave**, **Weights & Biases**, and **OpenPipe** launch serverless RL infrastructure reducing costs and speeding training. **Stanford's AgentFlow** presents an in-the-flow RL system with a 7B backbone outperforming larger models on agentic tasks. This update highlights advances in **recursive reasoning**, **density estimation**, **multimodal architectures**, **long-context modeling**, **retrieval**, and **serverless reinforcement learning**.

Canonical issue URL

a quiet day.

AI News for 10/7/2025-10/8/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (197 channels, and 9439 messages) for you. Estimated reading time saved (at 200wpm): 722 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

If you have questions about any of the DevDay launches, the OpenAI team is actively soliciting good questions for the Reddit AMA tomorrow, specifically from you AI engineers. Post them here.


AI Twitter Recap

Tiny reasoning models, JEPA density estimation, and new multimodal LLMs

RL and agentic systems: serverless, in-the-flow optimization, and code eval

Tooling and infra: no‑GIL Python lands, “voice‑prompt” dev, and Sora integrations

Funding, talent, and leaderboards

Data, evaluation, and retrieval practices

Top tweets (by engagement)

Notes and opinions that resonated:


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. AI21 Jamba 3B Launch Benchmarks and Anthropic Researcher Exit News

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. Robotics product news: Figure 03, Walmart service bot, Neuralink arm control

2. New vision model release and demo: Qwen-Image LoRa + wan 2.2 360 video

3. AI viral memes + ChatGPT humor/complaints: Olympic dishes, Bowie vs Mercury, parkour


AI Discord Recap

A summary of Summaries of Summaries by gpt-5

1. GPU Kernel DSLs and Performance Tuning

2. Agentic Tooling and APIs for LLM Apps

3. Notable Model and Platform Launches

4. Memory and Context Compression Architectures

5. Research and Benchmark Highlights


Discord: High level Discord summaries

OpenRouter Discord


Perplexity AI Discord


LMArena Discord


Cursor Community Discord


HuggingFace Discord


GPU MODE Discord


LM Studio Discord


Modular (Mojo 🔥) Discord


Latent Space Discord


Nous Research AI Discord


Yannick Kilcher Discord


Eleuther Discord


aider (Paul Gauthier) Discord


tinygrad (George Hotz) Discord


DSPy Discord


Moonshot AI (Kimi K-2) Discord


MCP Contributors (Official) Discord


Windsurf Discord


Manus.im Discord Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

OpenRouter ▷ #announcements (1 messages):

DeepSeek v3.1, DeepInfra endpoint, Traffic Impact, Free vs Paid Traffic


OpenRouter ▷ #app-showcase (3 messages):

Interfaze Launch, LLM for developers, OpenRouter Integration


OpenRouter ▷ #general (1047 messages🔥🔥🔥):

Chub vs Jan, NSFW Ban Wave, DeepSeek and censorship, Gemini for roleplay, OpenRouter's Free Models


OpenRouter ▷ #new-models (1 messages):

Readybot.io: OpenRouter - New Models


OpenRouter ▷ #discussion (42 messages🔥):

OpenAI AMD Chip Negotiations, Gemini Computer Model, OpenAI's Top Customers, OpenAI Azure ZDR endpoints, OpenInference Relation to OpenRouter


Perplexity AI ▷ #general (1175 messages🔥🔥🔥):

Comet browser, GPT-5 Thinking, Sora 2 invites, Referral program limits, Agentic Deep Research


Perplexity AI ▷ #sharing (3 messages):

Hack for Social Impact, Prompt Engineering, Fundraising, Biodiversity Datasets


Perplexity AI ▷ #pplx-api (6 messages):

OpenAI Proxy, Perplexity Search API access, New Search API release


LMArena ▷ #general (1111 messages🔥🔥🔥):

WebDev Direct & Side by Side, Sora 2 Access, LM Arena Extension, Gemini 3 Release, Perplexity Pro


LMArena ▷ #announcements (2 messages):

New Models in LMArena, Codenames Channel


Cursor Community ▷ #general (564 messages🔥🔥🔥):

Cursor Plan Mode Token Usage, Cheetah Model Performance, Cursor Built-in Browser, GPT-5 Pro Pricing, Oracle Free Tier


Cursor Community ▷ #background-agents (5 messages):

Background Agents, Linear and Github Projects, API Background Agents


HuggingFace ▷ #general (305 messages🔥🔥):

Japanese konbini experience, Vibrant Horizons model, HF server tag, boosts requirement, proprietary AI behavior control system


HuggingFace ▷ #today-im-learning (1 messages):

Python WebRTC Client, fastrtc, aiortc, WebRTC Documentation


HuggingFace ▷ #cool-finds (1 messages):

AI program Istanbul, Scopus paper publication, PhD students, young researchers


HuggingFace ▷ #i-made-this (6 messages):

NeuralGrid, ORCA, HyDRA, RL vs Imitation Argument, WSL Pytorch vLLM venv bootstrap


HuggingFace ▷ #NLP (1 messages):

cakiki: <@864381649201266698> please don't cross-post


HuggingFace ▷ #smol-course (2 messages):

HuggingFace Jobs Authentication, DPO-aligned Model Evaluation


HuggingFace ▷ #agents-course (7 messages):

Course Repo Submission, Pro Account Requirement, Agent Behavior & Guardrails, System Directive Override


GPU MODE ▷ #general (31 messages🔥):

Godbolt Feature Requests, Free Website Hosting, GB300 Cloud Access, ROCm vs CUDA for AI/ML, Pythonic GPU DSL


GPU MODE ▷ #triton (21 messages🔥):

FP8 GEMM Kernel Performance, TMA/Warp Specialization, Triton Linear Layouts using F_2, H100 GPU Failure


GPU MODE ▷ #cuda (20 messages🔥):

CUDA thread block cluster APIs, 2CTA matmul, ThunderKittens attn kernel, cuteDSL and CUDA, Parallel Reduction in CUDA


GPU MODE ▷ #torch (12 messages🔥):

Parallel Layers in Torch, CUDA Streams for Parallel Compute, ScMoE Paper Replication, torch.compile Limitations


GPU MODE ▷ #jobs (1 messages):

Aurora, Autonomous Trucking, Deep Learning Acceleration, CUDA Kernels, PyTorch


GPU MODE ▷ #beginner (6 messages):

CUDA coding on Macbook, VSCode Remote Desktop, clangd, neovim


GPU MODE ▷ #off-topic (9 messages🔥):

GPU Programming Jobs, Internships in GPU programming, New grad GPU positions, Machine Learning Engineering


GPU MODE ▷ #irl-meetup (1 messages):

garrett.garrett: Your workplace sounds awesome


GPU MODE ▷ #triton-puzzles (1 messages):

Triton Puzzles, GPU mode videos, Original Triton Paper, Triton Tutorials


GPU MODE ▷ #rocm (5 messages):

ROCm vs CUDA, AMD GPU for AI/ML, ROCm support in AI/ML libraries


GPU MODE ▷ #self-promotion (1 messages):

Mutual Information, Context Compression


GPU MODE ▷ #submissions (9 messages🔥):

MI300x8 Performance, amd-ag-gemm Leaderboard, amd-gemm-rs Leaderboard


GPU MODE ▷ #amd-competition (2 messages):

ROCm version, Submission Reminder


GPU MODE ▷ #general (1 messages):

Rust-based IDE, wgpu support, Godbolt-like compilation output


GPU MODE ▷ #low-bit-training (1 messages):

kitsu5116: http://arxiv.org/pdf/2502.17055


GPU MODE ▷ #llmq (9 messages🔥):

clang CI integration, rmsnorm_backward optimization, rope_backward optimization


GPU MODE ▷ #helion (51 messages🔥):

Helion DSL for Kernel Authoring, Helion vs TLX, Torch to Triton conversion, Helion limitations, Helion autotuning


LM Studio ▷ #general (141 messages🔥🔥):

AMD Instinct MI50 Shroud, Nvidia VRAM Pressure, Vulkan Performance Degradation, Older LM Studio Versions, Context Memory Use


LM Studio ▷ #hardware-discussion (17 messages🔥):

AMD MI350, Intel Core Ultra CPUs, External Graphics Card Dock, LM Studio Vulkan Runtime, MOE Models


Modular (Mojo 🔥) ▷ #general (89 messages🔥🔥):

Python imports in Mojo, Mojo vs Rust on GPU, Graphics integration in Mojo, Mojo compilation model, Python to Mojo code converter


Modular (Mojo 🔥) ▷ #mojo (38 messages🔥):

Laptop Hardware for Robotics, NVIDIA vs AMD GPUs, Apple Silicon & Strix Halo, Mixed Runtime & Compile-Time Layouts


Modular (Mojo 🔥) ▷ #max (4 messages):

GPU Compatibility, MI60 testing, Hardware Test Suite


Latent Space ▷ #ai-general-chat (60 messages🔥🔥):

OpenAI's 30 ‘1-Trillion Token’ Super-Users, Introducing the Gemini 2.5 Computer Use, Bob Ross AI “Vibe Coding” Video Goes Viral, Techno-Capital Singularity


Latent Space ▷ #ai-announcements (6 messages):

Apps SDK, AgentKit, OpenAI API Deep-Dive, Prompt optimization, MCP


Latent Space ▷ #genmedia-creative-ai (5 messages):

xAI, Imagine v0.9, video generator


Nous Research AI ▷ #announcements (1 messages):

NousCon 2024, San Francisco AI Event


Nous Research AI ▷ #general (19 messages🔥):

Self-MCP prompting tool for Claude, Hermes-MoE release, Nous con, Teknium questions, BDH data streaming framework


Nous Research AI ▷ #ask-about-llms (21 messages🔥):

Test Time Reinforcement Learning, Hermes Vision, Character per token ratio, LLM tool calling


Nous Research AI ▷ #research-papers (1 messages):

Recursive Reasoning with Tiny networks, HRM Model Performance, ARC-AGI benchmarks


Nous Research AI ▷ #interesting-links (1 messages):

RL vs Imitation Learning, Information bits in RL


Nous Research AI ▷ #research-papers (1 messages):

Recursive Reasoning, Tiny Networks, HRM Model


Yannick Kilcher ▷ #general (16 messages🔥):

RTX PRO 6000 Max-Q variant, Image/Video Generator Model Summaries, Attention in RNNs and Self-Attention Write-ups, RL vs Imitation Argument, Transferring RL Bits via SFT and LoRA Merging


Yannick Kilcher ▷ #paper-discussion (19 messages🔥):

Daily discussion times, Engineering insights from a sleeper paper, Emotional intelligence research, Ovi video+audio model, Rights and responsibilities in technology


Yannick Kilcher ▷ #ml-news (6 messages):

Qualcomm stock performance, Artificial Hippocampus Networks (AHNs), ByteDance-Seed releases AHN


Eleuther ▷ #general (5 messages):

RNN Attention (Bahdanau), Self Attention, Kaggle Arena


Eleuther ▷ #research (25 messages🔥):

ARC-AGI performance, babyLM origin, Weight Decay, SWA equivalence, evolutionary algorithm


Eleuther ▷ #lm-thunderdome (1 messages):

Task Management in AI Runs, Convenience Flags in AI Runs


aider (Paul Gauthier) ▷ #general (18 messages🔥):

Opencode vs Aider, Coding Models, Gemini Integration, GLM-4.6 and Claude Code 2, Cost Control


aider (Paul Gauthier) ▷ #questions-and-tips (4 messages):

Model Quality, aider and Openrouter & Gemini


tinygrad (George Hotz) ▷ #general (12 messages🔥):

Tinygrad SF Bay Area Meetup, Bounty Locking Process, Intel GPU Backend, RANGEIFY Merged


tinygrad (George Hotz) ▷ #learn-tinygrad (1 messages):

RMSProp in Tinygrad, Karpathy's RL blogpost


DSPy ▷ #general (10 messages🔥):

Pyodide/Wasm support, Community Plugins, BALM improvements, Composio integration, dspy.context() override


DSPy ▷ #examples (1 messages):

GRPO, RL, Prompt Optimization, Effectiveness of Finetuning


Moonshot AI (Kimi K-2) ▷ #general-chat (5 messages):

Mid Autumn Festival


MCP Contributors (Official) ▷ #general (2 messages):

Discord Self-Promotion Rules, ChatGPT Integration with MCP


MCP Contributors (Official) ▷ #general-wg (2 messages):

Discord Events for community calls, UX value add in agent/application chat