Frozen AI News archive

not much happened today

**Meta** released **MobileLLM-R1**, a sub-1B parameter reasoning model family on Hugging Face with strong small-model math accuracy, trained on 4.2T tokens. **Alibaba** introduced **Qwen3-Next-80B-A3B** with hybrid attention, 256k context window, and improved long-horizon memory, priced competitively on Alibaba Cloud. **Meta AI FAIR** fixed a benchmark bug in SWE-Bench affecting agent evaluation. LiveMCP-101 benchmark shows frontier models like **GPT-5** underperform on complex tasks with common failure modes cataloged. OpenAI highlights hallucination issues due to benchmark incentives, proposing calibration improvements. Community demos and tooling updates continue to evolve.

Canonical issue URL

a quiet day.

AI News for 9/11/2025-9/12/2025. We checked 12 subreddits, 544 Twitters and 22 Discords (189 channels, and 5258 messages) for you. Estimated reading time saved (at 200wpm): 464 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Happy o1 anniversary. Congrats to Naveen Rao and Interaction on buzzy new fundraises.


AI Twitter Recap

Edge Reasoning on-device: Meta’s MobileLLM-R1 (sub‑1B) goes open on HF

Qwen3‑Next‑80B (A3B): hybrid attention, 256k context, and heavy infra implications

Agents, evaluation fixes, and failure forensics

Tooling, infra, and libraries

Frontier access, SDKs, and safety collaborations

Vision models and leaderboards

Privacy-preserving pretraining

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Meta MobileLLM-R1 Release + Weekly LocalLLaMA Model/Dataset Roundup (Sep 12)

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. Seedream/Seedance 4.0 Image Model Releases and Benchmarks

2. UK Government AI Adoption Coverage

3. ChatGPT Ads, Gemini 3 Release Delay, and Feature Gap Debate


AI Discord Recap

A summary of Summaries of Summaries by X.ai Grok-4

Theme 1: Fresh Models Flex Muscles in Arenas

Theme 2: Throughput Wars Heat Up Hardware

Theme 3: Training Tricks Tackle Data Dilemmas

Theme 4: Deployment Demons Dog Engineers

Theme 5: Tools Twist Creative and Coding Flows


Discord: High level Discord summaries

Unsloth AI (Daniel Han) Discord


Perplexity AI Discord


LMArena Discord


HuggingFace Discord


Cursor Community Discord


Moonshot AI (Kimi K-2) Discord


OpenRouter Discord


Nous Research AI Discord


Eleuther Discord


GPU MODE Discord


Latent Space Discord


LM Studio Discord


Modular (Mojo 🔥) Discord


OpenAI Discord


Yannick Kilcher Discord


DSPy Discord


tinygrad (George Hotz) Discord


aider (Paul Gauthier) Discord


Manus.im Discord Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Unsloth AI (Daniel Han) ▷ #general (1276 messages🔥🔥🔥):

GPT-OSS 120B, Qwen3 model, Local AI, llama.cpp, Telemetry collection


Unsloth AI (Daniel Han) ▷ #introduce-yourself (3 messages):

Partnership Opportunity, Introduction of Anand


Unsloth AI (Daniel Han) ▷ #off-topic (125 messages🔥🔥):

Promptwright DAG dataset generation, Curriculum two-stage training for datasets, RTX 3080 language model training speed, NVIDIA DGX Spark reservations, Android sideloading restrictions and alternatives


Unsloth AI (Daniel Han) ▷ #help (125 messages🔥🔥):

Unsloth's save_pretrained_merged method, Docker image compatibility issues with H100 GPUs, Deploying Unsloth models in production, GRPO with Qwen 4B, 4-bit BNB model deployment with vLLM


Unsloth AI (Daniel Han) ▷ #showcase (8 messages🔥):

Kimi-K2-Instruct (FP8), vllm plugin


Unsloth AI (Daniel Han) ▷ #research (8 messages🔥):

LLM inference determinism, Synthetic data in LLM training, Gemma 3 performance, AI humanizers scam


Perplexity AI ▷ #announcements (1 messages):

Perplexity Finance on iOS & Android, Hotel loyalty support for bookings, Streamlined PDFs in Labs & Research modes


Perplexity AI ▷ #general (790 messages🔥🔥🔥):

Comparing Perplexity to ChatGPT and Gemini, Comet Browser, Perplexity Pro, Gemini Pro photo editing, AI Model Leaks


Perplexity AI ▷ #sharing (13 messages🔥):

Perplexity AI Referral Codes, Shareable Threads, CaviraOSS/neuropilot


Perplexity AI ▷ #pplx-api (1 messages):

anshuman_.9: hi


LMArena ▷ #general (736 messages🔥🔥🔥):

Qwen3 80B, Seedream 4, Gemini 3, DeepSeek slowness, Open Source AI vs Closed Source AI


LMArena ▷ #announcements (2 messages):

Hunyuan-image-2.1, Seedream-4-high-res


HuggingFace ▷ #general (185 messages🔥🔥):

n8n freelance jobs, Transformer architecture fine-tuning, GPU for fine-tuning, OpenAI investing in Hugging Face, Local LLM Linux box parts


HuggingFace ▷ #cool-finds (50 messages🔥):

Direct/Inverse FFT, QKV Calculations, Runaway Loss Value Recovery, Android Audio Implementation, NWaves DSP Library


HuggingFace ▷ #i-made-this (5 messages):

Hexagen.WorldAerelyth Game, Aerelyth Intelligence, FluentlyQwen3 Models, Nano Banana Editor


HuggingFace ▷ #NLP (2 messages):

Paid collaboration, Freelance developers


HuggingFace ▷ #smol-course (20 messages🔥):

Colab and HF Free Tiers for Fine-Tuning, Kaggle GPU Availability, Study Groups, PEFT/LoRA for Colab, DataCollatorForCompletionOnlyLM ImportError


HuggingFace ▷ #agents-course (2 messages):

First Hugging Face Course, Building First Agent


Cursor Community ▷ #general (249 messages🔥🔥):

Smart Resume, Cursor Pricing, Background Agents, Netlify account


Cursor Community ▷ #background-agents (4 messages):

Cursor unauthorized error, Background agent docker issues


Moonshot AI (Kimi K-2) ▷ #general-chat (203 messages🔥🔥):

Kimi K2, GPT-5 (Medium), Qwen3-Max, creative writing, Ao3


OpenRouter ▷ #general (185 messages🔥🔥):

Dropshipping, Gemini API's, OpenRouter API, Kimi-k2


OpenRouter ▷ #discussion (1 messages):

fn5io: https://openai.com/index/joint-statement-from-openai-and-microsoft/


Nous Research AI ▷ #general (155 messages🔥🔥):

Qwen 3 80B Model Details, TypeScript Provider Adapter Interface, Nous Hermes Agentic Oracle, Merging Discord Servers, Tucker Carlson Interview with Sam Altman


Nous Research AI ▷ #ask-about-llms (5 messages):

Claude Alignment Issues, Client Strategy Workflows, Anthropic's Acknowledgement of Bugs


Nous Research AI ▷ #research-papers (5 messages):

Herme3 Evaluation, LLM Preferences Probing, Complex Terminology in Research Paper


Nous Research AI ▷ #research-papers (5 messages):

Herme3 Evaluations, LLM Preferences, Probing LLM Preferences


Eleuther ▷ #general (28 messages🔥):

Crank detection questions, editable vector memory systems, Therapeutic tool released into the wild, Low bit training of pythia, Training data for language models


Eleuther ▷ #research (123 messages🔥🔥):

Fluid Dynamics Computers, Analog Computers, Mortality and Unreproducibility in Analog Models, Gated Delta Rule Expressiveness, Photonic Neuromorphic Computing


Eleuther ▷ #multimodal-general (2 messages):

Discord Channel Link, User Agreement


GPU MODE ▷ #general (9 messages🔥):

lium.io GPU marketplace, AWS L40s GPUs, IRL hackathon teams, Iris SHMEM in Triton


GPU MODE ▷ #triton (2 messages):

Gluon, Triton attention implementation, OpenAI's Triton usage


GPU MODE ▷ #cuda (2 messages):

logsumexp, fused kernels, NCU profiling


GPU MODE ▷ #torch (16 messages🔥):

vLLM uv pip, Torch Nightly troubles, Gemma3 from scratch, F.interpolate with vmap


GPU MODE ▷ #announcements (1 messages):

Nebius, B200 GPUs, SF hackathon, Multi-GPU programming


GPU MODE ▷ #jobs (4 messages):

AI Engineer - Graph-Based Learning Systems, AI Infra Startup Hiring, Zig for AI


GPU MODE ▷ #beginner (8 messages🔥):

P104-100 BIOS Flash, Data Parallel Training, CUDA vs Triton for Data Scientists, RAPIDS and CUDA-X


GPU MODE ▷ #irl-meetup (6 messages):

Triton Conference, PyTorch Conference, Open Source PRs Selection


GPU MODE ▷ #rocm (30 messages🔥):

Free tech, ROCm Development, AMD vs Nvidia, StreamHPC


GPU MODE ▷ #intel (12 messages🔥):

Intel optimizations on AMD, AVX512 promotion to AMX, SGLang AMX Usage, PyTorch and MKL integration


GPU MODE ▷ #self-promotion (2 messages):

CUDA PTX, MCP AI Agents Hackathon, Bright Data, TigerData, Redis


GPU MODE ▷ #thunderkittens (1 messages):

Llama-3B, Megakernel, H100


GPU MODE ▷ #gpu模式 (1 messages):

carson_62312: 请问有推荐的金融财务岗位么,在深圳,>2.5w/month


GPU MODE ▷ #submissions (22 messages🔥):

MI300x8 leaderboards, Submitting to amd-all2all


GPU MODE ▷ #factorio-learning-env (2 messages):

Meeting Missed, Call Happening


GPU MODE ▷ #amd-competition (9 messages🔥):

IRIS, ROCm, Torch, Triton, TorchDistributed


GPU MODE ▷ #cutlass (1 messages):

CuTeDSL, PTX Documentation Discrepancy, Swizzling Atoms, TF32 Datatype


GPU MODE ▷ #low-bit-training (5 messages):

NCCL CE Collectives, Copy Engine, symmem, vLLM


GPU MODE ▷ #irl-accel-hackathon (18 messages🔥):

Accel SF hackathon organization, Compute budget and team formation, Acceptance timeline, GPU focus for winning, Horace as a mentor


Latent Space ▷ #ai-general-chat (106 messages🔥🔥):

gpt-oss optimizations, Palmyra-mini models, LLM agent tools, Cursor Tab model, ChatGPT discount code finder


Latent Space ▷ #private-agents (7 messages):

Local Text-to-Speech, Speaker Detection, Parakeet, Deepgram, Diarization models


Latent Space ▷ #genmedia-creative-ai (5 messages):

AI video startup Higgsfield, Higgsfield Ventures, Gen Z founders


LM Studio ▷ #general (81 messages🔥🔥):

Limiting Download Speed, Flash Attention Broken in Gemma Models on Vulkan, PSU Wattage Calculations, Sharing Formatted Conversations, Grok Powered GF System Prompt


LM Studio ▷ #hardware-discussion (16 messages🔥):

PCI-E ASPM, Secondary GPU sleep state, Power supply issues, AI for electronics design, Max+ 395 vs 3090 for Home Assistant


Modular (Mojo 🔥) ▷ #general (6 messages):

Mojo Dev Container, ExplicitlyCopyable switch, Oracle Cloud partnership


Modular (Mojo 🔥) ▷ #mojo (66 messages🔥🔥):

DPDK use cases, clang AST Parser for Mojo, Ember JSON fix, Mojo on Windows


OpenAI ▷ #ai-discussions (61 messages🔥🔥):

Chatgpt years of use, Albania governmental chatbot, GPT-5 coding games, OAI academy transcripts, Qwen-code vs Qwen-coder


OpenAI ▷ #gpt-4-discussions (3 messages):

GPT-5 PDF Downloads, Google AI Studio, Nano Banana


OpenAI ▷ #prompt-engineering (2 messages):

AI Self Help Tool, Relational Prompting, Conceptual Networks


OpenAI ▷ #api-discussions (2 messages):

AI Self Help Conversation Analyzer, Relational Prompting, Knowledge Mapping


Yannick Kilcher ▷ #general (26 messages🔥):

Active Inference, Machine Learning Street Talk, AI for understanding mathematics and universe, fixupx pre-print


Yannick Kilcher ▷ #paper-discussion (9 messages🔥):

HuMo, Disinformation use-cases


Yannick Kilcher ▷ #ml-news (4 messages):

Albania AI Minister, Qwen Blog, MobileLLM-R1-950M


DSPy ▷ #show-and-tell (1 messages):

ankurgupta_24936: DSPyWeekly Issue No #2 is out https://dspyweekly.com/newsletter/2/


DSPy ▷ #general (26 messages🔥):

DSPy generating sections, Databricks_genai and DSPy, ARC-AGI2 in-context test time training, Modaic declarative AI programming


tinygrad (George Hotz) ▷ #general (15 messages🔥):

Remove realize from setitem bounty, Assign operation is deeply broken, GEMM TFLOP measurement on RTX 4090


tinygrad (George Hotz) ▷ #learn-tinygrad (4 messages):

tinygrad documentation, company meeting


aider (Paul Gauthier) ▷ #general (4 messages):

RepoMap Benchmarks, Real World Benchmarks, Aider Repomap Use


aider (Paul Gauthier) ▷ #questions-and-tips (5 messages):

C to Rust Migration with Aider, Aider always start in /ask mode


Manus.im Discord ▷ #general (9 messages🔥):

WordPress to Next.js conversion, Manus AI Basic Plan, Mount free credits, Manus interlink knowledge, Manus credits rollover