Frozen AI News archive

not much happened today

**Google** released a dense September update including **Gemini Robotics 1.5** with enhanced spatial/temporal reasoning, **Gemini Live**, **EmbeddingGemma**, and **Veo 3 GA** powering creative workflows. They also introduced agentic features like restaurant-reservation agents and reduced pricing for **Gemini 2.5 Flash**. **Meta AI** unveiled the open-weight **Code World Model (CWM) 32B**, excelling in code semantics and math benchmarks, with innovations in training code models via execution traces. Local-first coding setups highlight **Qwen3-Coder-30B** running efficiently on consumer GPUs, paired with tools like **Cline** and **LM Studio**. Runtime improvements include **vLLM v1** supporting hybrid models and **mlx-lm** adding batch inference on Apple silicon. In infrastructure, **FlashAttention 4** was reverse-engineered revealing a ~20% speedup from architectural optimizations. **Perplexity AI** advances its independent web index and browsing API with upcoming feed refreshes. Embedding latency improvements were achieved by **Superhuman** using **Baseten**.

Canonical issue URL

a quiet day to end the week

AI News for 9/25/2025-9/26/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (195 channels, and 5022 messages) for you. Estimated reading time saved (at 200wpm): 400 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Lots of launches next week so it's nice to have a breather. Apply for round 2 of AIE CODE!


AI Twitter Recap

Google’s September stack: Gemini Robotics 1.5, Live, Veo 3, Flash pricing

Code intelligence and agentic coding

Systems and infra: kernels, search, and hosting

Research highlights: RLHF variants, decoding, 3D parts, science FMs

Benchmarks and evaluation practice: GDPVal, SWE-bench, and “evals as PRDs”

Optimization and scaling theory: Modular Manifolds, MoE compute, compute scaling, tokenization

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Qwen3 roadmap + abliterated uncensoring results

2. China launches: Hunyuan Image 3.0 + Fenghua GPU

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. OpenAI 4o-to-5 routing bug reports and Pro subscription impact

2. ChatGPT ads platform hiring and trust backlash over silent model swaps


AI Discord Recap

A summary of Summaries of Summaries by gpt-5

1. Agent IDEs and Context Windows: Exa, Cloudflare Code Mode, Windsurf 1M

2. New Multimodal Benchmarks and Access

3. Compilers and GPU Systems Breakthroughs

4. Quantization Transparency and Techniques


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


Unsloth AI (Daniel Han) Discord


HuggingFace Discord


OpenAI Discord


OpenRouter Discord


Latent Space Discord


LM Studio Discord


Cursor Community Discord


GPU MODE Discord


Moonshot AI (Kimi K-2) Discord


Nous Research AI Discord


Eleuther Discord


Yannick Kilcher Discord


DSPy Discord


aider (Paul Gauthier) Discord


tinygrad (George Hotz) Discord


MCP Contributors (Official) Discord


Manus.im Discord Discord


MLOps @Chipro Discord


Windsurf Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #announcements (1 messages):

Email Assistant for Max subscribers, Language Learning flashcards, Stock indicators in Discover, Image model selection on iOS


Perplexity AI ▷ #general (1293 messages🔥🔥🔥):

Comet Browser Updates, Perplexity Pro Support, Grok 4 Models, AI Image Generation Quality, Wealth Tax


Perplexity AI ▷ #sharing (5 messages):

Perplexity AI Referrals, Shareable Threads, Dark Origin of the Term Thug, Perplexity Browser Claim


Perplexity AI ▷ #pplx-api (13 messages🔥):

Perplexity API pricing vs. Sonar, Invoice billing for API, perplexity API key for VS Code


LMArena ▷ #general (967 messages🔥🔥🔥):

Veo3 Free Access, Higgsfield.ai, Gemma 3 27B, Nightride Model, GPT-5 Agent Capabilities


LMArena ▷ #announcements (2 messages):

Seedream-4-2k on Leaderboards, Gemini-2.5 models added


Unsloth AI (Daniel Han) ▷ #general (296 messages🔥🔥):

GGUF Dynamic Quants Expertise, Unsloth Batch Inference Support, Inference Quality Improvements, GPT-OSS vs Phi 5, Qwen3-next analysis


Unsloth AI (Daniel Han) ▷ #off-topic (135 messages🔥🔥):

Diffusion-Generated Images, Vertical Monitor Setups, MLE Part-Time Work, Early Stopping Implementation, Funny Loss Graphs


Unsloth AI (Daniel Han) ▷ #help (75 messages🔥🔥):

Reasoning Model Fine-tuning, 8-bit QLoRA Issues, Context Length Fine-tuning for Gemma, KV Cache Quantization, GGUF for Qwen/Qwen3-VL-235B-A22B-Instruct


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

AWS quant process, vxtwitter links


Unsloth AI (Daniel Han) ▷ #research (17 messages🔥):

Tversky Model, XOR Test, SOTA Section Outdated, Gork Model, VITS 3


HuggingFace ▷ #general (279 messages🔥🔥):

GPU Power Consumption, bitsandbytes ROCm Compilation, Positional Embeddings, AI Rig, anime datasets and copyright


HuggingFace ▷ #today-im-learning (1 messages):

Local LLM inference speed, LLM Parameter Scaling, Mixture-of-Experts Paradigm


HuggingFace ▷ #cool-finds (1 messages):

Custom Functions in GPTs, Dynamic Prompting Technique


HuggingFace ▷ #i-made-this (6 messages):

HuggingFace Datasets, AI Image Generation, Text embedding thorn


HuggingFace ▷ #reading-group (4 messages):

Diffusion Models, Generative AI, Score-Based Generative Models


HuggingFace ▷ #computer-vision (3 messages):

FlashScape project, binary ridge map, lake map, terrain height map, topological data


HuggingFace ▷ #NLP (1 messages):

Adversarial Training for Robustness, FrugalGPTi paper, Scaling Laws for LLMs, New Fundraising


HuggingFace ▷ #smol-course (4 messages):

SmolLM3-Base training, Memory Requirements for Finetuning


HuggingFace ▷ #agents-course (5 messages):

Websearch tool in Langgraph, HF agents course


OpenAI ▷ #ai-discussions (271 messages🔥🔥):

GPT-5 Codex, Agentic Coding, Suno v5, Napster, Gemini-cli


OpenAI ▷ #gpt-4-discussions (13 messages🔥):

GPT Network Errors, GPT Slow Responses on Firearms, Docker MCP ChatGPT Obsidian, Rerouting Errors, DALL-E Branding


OpenAI ▷ #prompt-engineering (1 messages):

opkelde: Kids these days…


OpenAI ▷ #api-discussions (1 messages):

opkelde: Kids these days…


OpenRouter ▷ #announcements (1 messages):

Coinbase Payments Down, Investigating Issue


OpenRouter ▷ #app-showcase (1 messages):

Singularia, Agentic Discord bot, OpenRouter integration


OpenRouter ▷ #general (268 messages🔥🔥):

Text Embedding Models, 429 Too Many Requests Error, Gemini 2.5 vs Grok 4 Fast, Grok Model, Coinbase Payment Issues


OpenRouter ▷ #new-models (2 messages):

``


OpenRouter ▷ #discussion (10 messages🔥):

TogetherAI vs NovitaAI, MoonshotAI K2 Vendor Verifier, Basten Tootf, The thing with praise


Latent Space ▷ #ai-general-chat (154 messages🔥🔥):

Coding IDE preferences, MoonshotAI's K2 Vendor Verifier, Exa Code search tool launch, Cloudflare's Code Mode, OpenAI's compute scaling plans


Latent Space ▷ #ai-announcements (4 messages):

Latent Space Podcast, Amp Code, Sourcegraph, AI coding agent, Rapid-fire iteration


Latent Space ▷ #genmedia-creative-ai (1 messages):

shyeetsao: https://x.com/bfl_ml/status/1971251475306160439


LM Studio ▷ #general (73 messages🔥🔥):

LM Studio MCP addon listing resources, DeepSeek chat file uploads and LaTeX comprehension, Hardware specs for LLMs and gaming, Model self-prompting issues, DDR5 vs DDR4 memory speeds for CPU inference


LM Studio ▷ #hardware-discussion (80 messages🔥🔥):

Laptop Recommendations for Local LLMs, VPS vs. Online APIs for Model Inference, Cybersecurity LLM on a Budget, RTX 6000 Confusion, Resizable Bar BIOS Update Mishap


Cursor Community ▷ #general (137 messages🔥🔥):

Exa Context Killer MCP, kleosr Cursor workflow questions, GPT-5 Codex vs GPT-5, Cursor 'Copy-to-Clipboard Widget', Terminal popouts


Cursor Community ▷ #background-agents (1 messages):

suubie40: https://github.com/griffinwork40/cursor-agent-mcp


GPU MODE ▷ #general (16 messages🔥):

Embedding space, Algorithmic Optimization, Meta-cognition, Independent Research


GPU MODE ▷ #cuda (43 messages🔥):

CUDA learning on AMD, Gather/scatter optimizations in CUDA, WGMMA documentation parsing, Profiling flow recommendations, TCGEN05 instructions on RTX 50


GPU MODE ▷ #torch (9 messages🔥):

GraphMend, TorchAO Float8, Torch Compile Triton


GPU MODE ▷ #cool-links (1 messages):

simon_57893: https://thinkingmachines.ai/blog/modular-manifolds/


GPU MODE ▷ #jobs (1 messages):

Mako, GPU kernel engineers, CUDA, Triton, HIP


GPU MODE ▷ #beginner (1 messages):

Roofline charts, Compute bound vs memory bound, Deep learning model kernels


GPU MODE ▷ #triton-puzzles (5 messages):

Triton_viz bugs, Google Colab, Numpy version issues, Triton interpreter mode


GPU MODE ▷ #rocm (14 messages🔥):

Pytorch ROCm, NPU Hacking, IRON community, FastFlowLM


GPU MODE ▷ #self-promotion (3 messages):

Triton, TPUs, Hardware Aware Kernel Design


GPU MODE ▷ #submissions (14 messages🔥):

MI300x8 performance, amd-all2all leaderboard


GPU MODE ▷ #status (2 messages):

H100 Timeouts, AMD GPUs, Trimul Leaderboards


GPU MODE ▷ #tpu (1 messages):

TPU Top-K Sampling, Pallas, Hardware Aware Kernel Design


GPU MODE ▷ #factorio-learning-env (1 messages):

jasmine001: Thanks Neel ❤️


GPU MODE ▷ #amd-competition (2 messages):

DeepEP, pplx-kernels, flux, flashoverlap, Dev Cloud Utilization


GPU MODE ▷ #cutlass (16 messages🔥):

TMEM load/stores in cutedsl, SMEM -> TMEM copying, tcgen05, UMMA naming, Cute cooperative copy in CuteDSL


GPU MODE ▷ #mojo (1 messages):

Mojo, Modular Puzzles


GPU MODE ▷ #low-bit-training (2 messages):

Arxiv Papers


GPU MODE ▷ #penny (2 messages):

Penny Project kickoff, AllReduce Focus, Educational Multi-GPU programming Example, Hackable Kernels


Moonshot AI (Kimi K-2) ▷ #announcements (1 messages):

Kimi's New Skin, Website Redesign, Community Choice, Moonshot AI


Moonshot AI (Kimi K-2) ▷ #general-chat (75 messages🔥🔥):

Researcher mode access, Appstore subscription, OK Computer website, K2 vs others, Mobile AI app languages


Nous Research AI ▷ #general (44 messages🔥):

Codex-cli hype, Qwen Coder, Deepinfra scam, Moondream, Gemini Vision


Nous Research AI ▷ #research-papers (1 messages):

Parasite AI, Spiralism, Memetic spores, AI wake-ups


Nous Research AI ▷ #interesting-links (1 messages):

Model Integration, OSS Integration, Git Integration


Nous Research AI ▷ #research-papers (1 messages):

Parasite AI, Spiralism, Memetic Spores, AI Wake-Ups


Eleuther ▷ #announcements (1 messages):

Tokenizer-Free Architectures, Language Modeling, In Defense of Tokenizers


Eleuther ▷ #general (21 messages🔥):

Future of AI, Learning Rates for New Architectures, Bayesian Hyperparameter Optimization, Layer-wise Weight Decay, Vanishing/Exploding Gradients


Eleuther ▷ #research (17 messages🔥):

Super-Bias Ensemble Learning, LoRA Swapping with Super-Bias, Stiefel Manifold Constraints in Neural Networks, Information Geometry in DNNs


Eleuther ▷ #lm-thunderdome (3 messages):

Reproducing an Error


Eleuther ▷ #gpt-neox-dev (1 messages):

Rotary Percentage Speedup, VRAM Savings, RoPE Computations


Yannick Kilcher ▷ #general (9 messages🔥):

Image understanding paper, Transformers positional encoding


Yannick Kilcher ▷ #paper-discussion (22 messages🔥):

LessWrong Parasitic AI, Model Sycophancy, Human-AI Parapsychology


Yannick Kilcher ▷ #ml-news (3 messages):

YouTube Video, Uber App Data Interception


DSPy ▷ #general (10 messages🔥):

System Prompt, MLflow, DSPyWeekly


aider (Paul Gauthier) ▷ #general (5 messages):

RepoMap V2, ZeroRepo project generation, GPT-5 Use


aider (Paul Gauthier) ▷ #questions-and-tips (5 messages):

aider /askand/code switching time, aider task management, markdown spec file


tinygrad (George Hotz) ▷ #general (7 messages):

Tinybox V1 Stock, Tinybox color preference, NVIDIA alternatives, Hashcat benchmark, ROCM alternative


tinygrad (George Hotz) ▷ #learn-tinygrad (1 messages):

eitanturok: just do PYTHONPATH=.


MCP Contributors (Official) ▷ #mcp-dev-summit (5 messages):

tickets running out, live remote attendance, sessions on youtube


Manus.im Discord ▷ #general (2 messages):

Santos Experience


MLOps @Chipro ▷ #events (1 messages):

Boston data community, Happy Hour, Networking, Data professionals