Frozen AI News archive

not much happened today

**Moonshot AI** released **Kimi Linear (KDA)** with day-0 infrastructure and strong long-context metrics, achieving up to **75% KV cache reduction** and **6x decoding throughput**. **MiniMax M2** pivoted to full attention for multi-hop reasoning, maintaining strong agentic coding performance with **200k context** and **~100 TPS**. **ByteDance**, **Princeton**, and **Mila** introduced **Looped LLMs** showing efficiency gains comparable to larger transformers. **OpenAI**'s **Aardvark (GPT-5)** entered private beta as an agentic security researcher for scalable vulnerability discovery. **Cursor** launched faster cloud coding agents, though transparency concerns arose regarding base-model provenance. **Cognition** released a public beta for a desktop/mobile tool-use agent named Devin. The community discussed advanced attention mechanisms and adaptive compute techniques.

a quiet day

AI News for 10/29/2025-10/30/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (198 channels and 5621 messages) for you. Estimated reading time saved (at 200wpm): 490 minutes. Our new website is now up with full metadata search and a beautiful vibe-coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Congrats HuggingFace on the Smol Training Playbook, and welcome Beyang (Amp) and Skyler (MiniMax) to AIE CODE. Also check out the Stripe / ACP Latent Space pod!


AI Twitter Recap

Kimi Linear (KDA), Minimax M2, and the linear-attention wars

Agentic coding and tool-use systems

Training, evaluation, and embeddings

Multimodal: speech, video, and image editing

Product and infra updates

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Hugging Face Training Insights

2. Open Source AI Music Generation Advocacy

3. Qwen 3 VL and Kimi Linear Model Updates

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Anthropic's Claude Skills and Introspective Awareness

2. Humorous AI and Technology Memes

3. Legal and Educational Challenges with AI


AI Discord Recap

A summary of Summaries of Summaries by gpt-5

1. Agentic Coding & Reality-Check Benchmarks

2. New Multimodal Models, Leaderboards & Gateways

3. GPU Kernel Craft: Scans, Samples, and Small Floats

4. Long-Context Engineering: Kimi’s Linear Attention Push


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


Cursor Community Discord


Unsloth AI (Daniel Han) Discord


OpenRouter Discord


HuggingFace Discord


Modular (Mojo 🔥) Discord


LM Studio Discord


GPU MODE Discord


Yannic Kilcher Discord


DSPy Discord


Latent Space Discord


Eleuther Discord


Moonshot AI (Kimi K-2) Discord


Manus.im Discord


Nous Research AI Discord


tinygrad (George Hotz) Discord


Windsurf Discord


MCP Contributors (Official) Discord


The aider (Paul Gauthier) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.



Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1089 messages🔥🔥🔥):

Moderation problems, Comet Referral Promo issues, Perplexity.ai payouts, GPT Go subscriptions, Gemini Pro offer


Perplexity AI ▷ #pplx-api (9 messages🔥):

Sonar Reasoning API, Live Data, External Data Connectors, Web Search Module


LMArena ▷ #general (952 messages🔥🔥🔥):

MiniMax Cheaper AI, ReCaptcha, Image Generation Limits, AI Alignment & Self Harm, AI Ethics


LMArena ▷ #announcements (1 message):

Image-to-Video Leaderboard, Text-to-Video Leaderboard Update, Hailuo-2.3 model


Cursor Community ▷ #general (834 messages🔥🔥🔥):

Composer model, Claude Code, Pricing and limits, New Cursor 2.0 features and bugs, Tab complete


Cursor Community ▷ #background-agents (3 messages):

Cloud Agent, Background Agents


Cursor Community ▷ #announcements (1 message):

Cursor new look, Cloud Agents


Unsloth AI (Daniel Han) ▷ #general (221 messages🔥🔥):

RTX 8000 Turing Cards, Qwen3 finetuning, Kimi-Linear-48B-A3B-Instruct Model, Qwen 3 VL, GLM 4.6 model


Unsloth AI (Daniel Han) ▷ #off-topic (103 messages🔥🔥):

Backend latency improvements, VAE data sample requirements, Colab UI updates, Elon Musk's Grokipedia, Probabilistic computing


Unsloth AI (Daniel Han) ▷ #help (92 messages🔥🔥):

Qwen3VLCausalLMOutputWithPast and hidden states, Unsloth environment flags for debugging, triton_kernels installation issues, Offline loading with Unsloth, Mapping part of training stuck


Unsloth AI (Daniel Han) ▷ #showcase (2 messages):

Gemma 3 model, RAZOR-12B-GGUF model


Unsloth AI (Daniel Han) ▷ #research (3 messages):

Anthropic Introspection, Model Self-Awareness


OpenRouter ▷ #announcements (1 message):

Perplexity Sonar Pro, Pro Search, Multi-step agentic reasoning, Real-time thought streaming


OpenRouter ▷ #app-showcase (2 messages):

API Endpoints, environment variables, OpenRouter Typescript SDK


OpenRouter ▷ #general (307 messages🔥🔥):

Yandex Browser Issues, AI and Singularity, DeepSeek OCR Request, Sora 2 and Image Generation, OpenRouter and Chutes Prompt Training


OpenRouter ▷ #new-models (6 messages):


OpenRouter ▷ #discussion (60 messages🔥🔥):

Exclusive Models, DeepInfra Errors, Factory Droid, Embedding Models, Minimax M2


HuggingFace ▷ #general (198 messages🔥🔥):

HF Job Application, Qwen Omni, 10gbit networking, OCR for CPU, LLM Model Formats & Storage


HuggingFace ▷ #i-made-this (5 messages):

RAG system, CLI Python code remediation tool, Golf cart detection model, Snippet Creator


HuggingFace ▷ #computer-vision (1 message):

InstantID + IP-Adapter FaceID, ControlNet reference-only setup, Lora Training, InstructPix2Pix / T2I-Adapter model, Consistent 2D Style Transfer


HuggingFace ▷ #NLP (1 message):

sebizaur: No


HuggingFace ▷ #smol-course (16 messages🔥):

SFT Course, GPU memory usage, robbiemu/smol-course-notes


HuggingFace ▷ #agents-course (9 messages🔥):

Final Project Questions, Agent Course Progress, API File Retrieval


Modular (Mojo 🔥) ▷ #general (57 messages🔥🔥):

Mojo bindings for wgpu or vulkan, OpenGL bindings in Mojo, Apple's GPU design, MAX performance, Scikit-learn alternative in Mojo


Modular (Mojo 🔥) ▷ #mojo (103 messages🔥🔥):

mojo formatter, mojo single-threaded CPU, parameter(enable_if=bool_expr), hardware specs, graph-compiler-like constant propagation


Modular (Mojo 🔥) ▷ #max (11 messages🔥):

MAX on AMD GPUs, ROCm support, HIP Driver, RX 580 compatibility


LM Studio ▷ #general (67 messages🔥🔥):

Qwen3 support in LM Studio, MCP image support, LM Studio settings, Arabic language support, Model speed factors


LM Studio ▷ #hardware-discussion (99 messages🔥🔥):

GLM 4.5 Air, Qwen 3 235b, GPU slots, Orange Pi 6 Plus, Seed-oss 30tkps


GPU MODE ▷ #general (3 messages):

Tokenizer efficiency, Tokenizer accuracy, Encoding benchmarks, Decoding benchmarks


GPU MODE ▷ #triton (2 messages):

Triton to OpenCL, Triton Developer Conference 2025


GPU MODE ▷ #cuda (21 messages🔥):

CUB DeviceScan performance, Thrust benchmarking inaccuracies, Custom allocators in Thrust, nvbench downclocking detection, nsight-copilot feedback


GPU MODE ▷ #torch (2 messages):

CUDAGraphs OOM, Torch Inductor Freezing, PyTorch Distributed Memory Usage


GPU MODE ▷ #algorithms (8 messages🔥):

Hardware friendly top-k logits algorithms, Radix-based approach, CCCL/CUB TopK implementation


GPU MODE ▷ #jobs (2 messages):

AI Devs for hire, HTuO Biosciences Hiring


GPU MODE ▷ #beginner (13 messages🔥):

LLM Pretraining Journey, Mentorships in AI, Data Parallelism, Distributed Training, GPU Programming with CUDA


GPU MODE ▷ #pmpp-book (1 message):

PMPP Book, FLOPs Calculation, Global Memory Access, OP/B Calculation


GPU MODE ▷ #torchao (13 messages🔥):

Quantization with Float8, TorchAO and GemLite Integration, Quantization Format Survey, FP8 Inference


GPU MODE ▷ #rocm (6 messages):

MI300X TFLOPS, HBM bandwidth numbers, clpeak, RadeonFlow FP8 GEMM, AMD challenge


GPU MODE ▷ #intel (1 message):

Intel Compute Runtime release, oneAPI improvements


GPU MODE ▷ #self-promotion (3 messages):

Technical Blog on Sum Reduction, Agentic Reinforcement Learning for LLMs, Nsight Copilot for VS Code


GPU MODE ▷ #🍿 (2 messages):

Kernel Generation, Data Efforts in Kernel Generation


GPU MODE ▷ #thunderkittens (1 message):

Kernel recompilation, Incremental compilation


GPU MODE ▷ #edge (3 messages):

Executorch CUDA backend status, Torchscript deprecation, Production GPU Deployments


GPU MODE ▷ #amd-competition (4 messages):

AMD Competition, Yottalabs blog, Distributed Inference, SoL vs Kernel Performance


GPU MODE ▷ #cutlass (13 messages🔥):

tiled_copy for row-major tensors, mask_mod equivalence check, colexicographical order vs PyTorch, scalar_to_ssa definition, cute-dsl constant memory

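import cutlass.cute as cute

def scalar_to_ssa(a: cute.Numeric, dtype) -> cute.TensorSSA:
    """Wrap scalar `a` in a one-element TensorSSA of the given dtype."""
    vec = cute.make_fragment(1, dtype)  # one-element fragment to hold the scalar
    vec[0] = a
    return vec.load()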

GPU MODE ▷ #singularity-systems (1 message):

j4orz: https://singularitysystems.bearblog.dev/


GPU MODE ▷ #helion (1 message):

Helion PR feedback


Yannic Kilcher ▷ #general (18 messages🔥):

Extropic AI's hardware accelerator, Low-resource language translation, ArXiv publishing schedule, AI paper filtering


Yannic Kilcher ▷ #paper-discussion (78 messages🔥🔥):

Markovian vs Non-Markovian, Linux Foundation Robotics Project, Universities as Businesses, Robot Purchase Discussion, Continual Learning vs Continual Adaptation


Yannic Kilcher ▷ #ml-news (3 messages):

Anthropic is Fraud, Haiku sizes


DSPy ▷ #general (98 messages🔥🔥):

Scikit-learn style API for DSPy, Semantic Dataframes, ReAct module finish() function with no arguments, DSPy Meetup in Pune, India, BAML Adapters vs JSON Schema


Latent Space ▷ #ai-general-chat (77 messages🔥🔥):

Extropic BEFF Chip, ScaleAI Remote Labor Index, Cognition SWE-1.5, Codex Degradation, OpenAI Codex Credits


Latent Space ▷ #genmedia-creative-ai (8 messages🔥):

MiniMax Speech 2.6, Voice Cloning, MiniMax Music 2.0, Generative Music Platform


Eleuther ▷ #general (19 messages🔥):

Frontier LLM Training, Low-Resource Language MT, OCR for Custom Scripts


Eleuther ▷ #research (49 messages🔥):

Manus agent, Agent Evaluation Metrics, Extropic hardware, RWKV Understanding, Weight Decay Scaling


Eleuther ▷ #interpretability-general (1 message):

gsarti: <@709147478963781692> fyi


Moonshot AI (Kimi K-2) ▷ #general-chat (23 messages🔥):

Kimi K2, Kimi Delta Attention, Kimi-cli's D-Mail


Manus.im Discord ▷ #general (11 messages🔥):

Manus Credits, Developer for Project


Nous Research AI ▷ #general (5 messages):

Neos wrestling/boxing matches, MCP CTF in November


Nous Research AI ▷ #ask-about-llms (2 messages):

Local Model Training, Dependency Issues on Windows, Linux vs WSL


Nous Research AI ▷ #research-papers (1 message):


Nous Research AI ▷ #interesting-links (1 message):

Kimi Linear Attention, MoonshotAI



tinygrad (George Hotz) ▷ #general (1 message):

georgehotz: one of these days we're gonna ruff format tinygrad


tinygrad (George Hotz) ▷ #learn-tinygrad (1 message):

GROUP_REDUCE errors, rangeify rewrites debugging


Windsurf ▷ #announcements (1 message):

SWE-1.5, Fast Agent Models, Coding Performance


MCP Contributors (Official) ▷ #general-wg (1 message):

Model Context Protocol RFC Status