Frozen AI News archive

not much happened today

**Mistral** released **Voxtral**, claimed as the world's best open speech recognition models, available via API and Hugging Face. **Moonshot AI** launched **Kimi K2**, a trillion-parameter **Mixture-of-Experts (MoE)** model, outperforming **GPT-4.1** on benchmarks with 65.4% on SWE-Bench Verified and achieving 200 tokens/second inference speed on **Groq** hardware. **Nous Research** open-sourced the **Hermes 3** dataset with 1 million samples, aiding SOTA models on the **Llama-3** series. **Google DeepMind** introduced the **Mixture-of-Recursions (MoR)** architecture promising 2x inference speed and 50% parameter reduction but faced skepticism. **Goedel-Prover V2** topped the **PutnamBench** theorem proving benchmark. AtCoder World Finals saw a human winner with **OpenAI** placing second. Research highlights include **Jason Wei**'s insights on **reinforcement learning** and the "Verifier's Law" emphasizing the asymmetry of verification in AI training.

Canonical issue URL

a quiet day

AI News for 7/15/2025-7/16/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (226 channels, and 5810 messages) for you. Estimated reading time saved (at 200wpm): 481 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

there was a eyebrow raising HR move if you care about Claude Code's future or Anthropic's $100b fundraise, Fal's leaked $1.5b Series C, or otherwise you could just tune in to the first ever podcast with Cline.


AI Twitter Recap

Model Releases, Performance & Benchmarks

AI Research, Techniques & Theory

AI Agents, Tooling & Frameworks

Industry Trends, Talent & Companies

Infrastructure & Datasets

Humor & Memes


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Recent AI Model and Framework Launches (Dream 7B, T5Gemma, llama.cpp Diffusion)

2. Hardware and Accelerator Advancements for AI (AMD Radeon, MLX CUDA)

3. Critical Industry Perspectives: Meta's ASI Team and Benchmark Skepticism

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Meta's Recruitment of Top OpenAI Talent and Industry Reactions

2. Latest Video and LoRA Model Releases and Community Updates

3. Claude Code Advanced Usage, Workflow Innovations, and User Experiences


AI Discord Recap

A summary of Summaries of Summaries by X.ai Grok 4

Theme 1. Kimi K2 Hype Ignites Model Wars

Theme 2. GPU Optimization Tricks Steal the Spotlight

Theme 3. Research Papers Drop Bombshells on Efficiency

Theme 4. Tools and Frameworks Level Up Agentic AI

Theme 5. Benchmarks and Evaluations Face Reality Checks


Discord: High level Discord summaries

OpenAI Discord


Perplexity AI Discord


Cursor Community Discord


Unsloth AI (Daniel Han) Discord


Eleuther Discord


OpenRouter (Alex Atallah) Discord


LMArena Discord


Latent Space Discord


GPU MODE Discord


Nous Research AI Discord


LM Studio Discord


Torchtune Discord


HuggingFace Discord


Yannick Kilcher Discord


aider (Paul Gauthier) Discord


Notebook LM Discord


Manus.im Discord Discord


Modular (Mojo 🔥) Discord


tinygrad (George Hotz) Discord


LlamaIndex Discord


DSPy Discord


LLM Agents (Berkeley MOOC) Discord


MCP (Glama) Discord


Nomic.ai (GPT4All) Discord


MLOps @Chipro Discord


Codeium (Windsurf) Discord


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

OpenAI ▷ #annnouncements (1 messages):

OpenAI: @everyone


OpenAI ▷ #ai-discussions (1222 messages🔥🔥🔥):

DeepSeek Censorship, IQ Testing, AI's Role in Society, GPT-5 Speculation, AI and North Korea


OpenAI ▷ #gpt-4-discussions (5 messages):

Coding Libraries for AI, Future OpenAI API Integrations, Pro Membership Value


Perplexity AI ▷ #announcements (1 messages):

Aravind and Leonid Reddit AMA, Comet browser


Perplexity AI ▷ #general (1020 messages🔥🔥🔥):

Comet Browser, Image Generation Issues, Samsung Galaxy Store Free Pro, Grok 4 Availability, Comet Agent Saved Interactions


Perplexity AI ▷ #sharing (5 messages):

Shareable threads, Audio Overviews


Perplexity AI ▷ #pplx-api (2 messages):

Perplexity Pro, API access


Cursor Community ▷ #general (552 messages🔥🔥🔥):

Cursor billing and pricing, Kimi K2 model discussion, Cursor performance issues and troubleshooting, Multi-agent collaboration and code management, Context engineering for effective AI use


Cursor Community ▷ #background-agents (8 messages🔥):

Cursor runtime env, Customize PR Title, Start Script output not shown, Reconfigure manual snapshot error, Cursor fetching background agents


Unsloth AI (Daniel Han) ▷ #general (267 messages🔥🔥):

BF16 vs FP32 for LoRA fine-tuning, Qlora as a constraint, Gemma 3 pretraining, Kimi audio distillation, Overfitting solutions


Unsloth AI (Daniel Han) ▷ #off-topic (18 messages🔥):

OpenPipe ART, LLM as a Judge, Agentic Models, ARTwell RULER, Model Finetuning


Unsloth AI (Daniel Han) ▷ #help (166 messages🔥🔥):

Unsloth fixes, VLLM cache, Kaggle Mistral notebook errors, Huge VRAM recommendations, Llama.cpp multi GPU config


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

Podcast Announcement, Community Engagement


Unsloth AI (Daniel Han) ▷ #research (13 messages🔥):

ETHOS paper, LLM psychosis, Independent research challenges, TTS models playground


Unsloth AI (Daniel Han) ▷ #unsloth-bot (24 messages🔥):

Llama.cpp Quantization Errors, VLLM Cache Directory Configuration, Qwen 2.5 7B Inference, Torch Cache Storage and Corruption, Model Size of Qwen 2.5 7B


Eleuther ▷ #general (66 messages🔥🔥):

ZR1-1.5B Model, Pythia 12B vs 2.8B, TensorFlow Decline, AI Research Management


Eleuther ▷ #research (316 messages🔥🔥):

nanoGPT speedrunning, recursion papers, tuning LLM inference-time hyper-parameters, peer review on research, MoE like things in RWKV-8


Eleuther ▷ #interpretability-general (8 messages🔥):

Function Vectors, nnterp package, Transformer models


Eleuther ▷ #lm-thunderdome (1 messages):

Harness Evaluation, IFEval Suite


Eleuther ▷ #gpt-neox-dev (25 messages🔥):

Transformer Engine performance, Slurm and containers with GPT-NeoX, CUDA drivers in NGC containers, DeeperSpeed Slurm runner


OpenRouter (Alex Atallah) ▷ #announcements (3 messages):

o1-preview Deprecation


OpenRouter (Alex Atallah) ▷ #general (357 messages🔥🔥):

Deepseek R1 Quality Drop, OpenRouter AutoRouter Privacy Concerns, Model Deprecation Notices, GPT 3.5 Turbo Endpoint Gone, Claude Opus 4 Weekly Token Usage


OpenRouter (Alex Atallah) ▷ #discussion (41 messages🔥):

Quality Certification Badges, Speed Certification Badges, Eval Harness for Model Benchmarks, Context Compression Shenanigans, Tool Use Benchmarks


LMArena ▷ #general (337 messages🔥🔥):

Polymarket Bet Failures, Prediction Markets Legality, Grok4 Performance, Kimi K2 Performance and Pricing, LMArena Issues and Feedback


LMArena ▷ #announcements (1 messages):

UI Improvements, Leaderboard Navigation, Streamlined Interface, Compact Sidebar


Latent Space ▷ #ai-general-chat (220 messages🔥🔥):

Kimi UI, Atlassian Rovo Dev AI Agent, Flux Pro and Seedance Realistic AI Videos, Anthropic hires back Cursor developers, OpenAI API Face Editing


Latent Space ▷ #ai-announcements (1 messages):

YouTube video


GPU MODE ▷ #general (71 messages🔥🔥):

Radix Sort on GPU, Serverless GPU Platforms, Profiling CUDA Kernels, Industry implementations of RMSNorm and Reduce operators, PyTorch vs efficient CUDA kernels


GPU MODE ▷ #triton (3 messages):

triton-autodiff tool


GPU MODE ▷ #torch (6 messages):

Torch Compile Debugging, Torch Inductor Issues


GPU MODE ▷ #algorithms (3 messages):

Parallel Radix Sort, Fluid Simulation in OpenGL with CUDA


GPU MODE ▷ #jobs (3 messages):

GPU Engineering Book, Software Engineer Hiring, Technical Reviewers


GPU MODE ▷ #beginner (2 messages):

Colab GPU, Numba, MLE, ML performance engineering


GPU MODE ▷ #off-topic (12 messages🔥):

China H20, H100 vs H20, NVL72 vs NVL144, Ascend GPUs


GPU MODE ▷ #self-promotion (8 messages🔥):

SemiAnalysis Podcast, LLM RL environment framework


GPU MODE ▷ #gpu模式 (1 messages):

complexfilterr: Half of CVPR accepted papers' authors are from China.


GPU MODE ▷ #hardware (6 messages):

GB300 Availability, Coreweave's GB300 Capacity, Nvidia hardware purchase prioritization, DGX vs HGX, B200 Availability


GPU MODE ▷ #factorio-learning-env (3 messages):

Data Backup, Phone Theft, Learning from Mishaps


GPU MODE ▷ #cutlass (3 messages):

CuTeDSL, Jetson series, Jetson Orin, Jetson Thor, CUTLASS Python support for NVIDIA GPUs


Nous Research AI ▷ #general (88 messages🔥🔥):

Kimi K2, H200, B200, Manus release, Model Ownership


Nous Research AI ▷ #ask-about-llms (1 messages):

Model Context Size, Adding Personality to Models, Letta (MemGPT) Personas


Nous Research AI ▷ #interesting-links (5 messages):

LLM RL environment framework, Atropos compatibility, Unsloth RL guide


LM Studio ▷ #general (28 messages🔥):

Image Generation, Model Search Repo URL, LM Studio Development Roadmap, Memory Features


LM Studio ▷ #hardware-discussion (39 messages🔥):

LG's EXAONE License, Thunderbolt eGPUs for VRAM Expansion, AMD NPU Support in llama.cpp, PCI-Express Atomics Support


Torchtune ▷ #announcements (1 messages):

Future of Torchtune, Torchtune Project, Discord and Github support


Torchtune ▷ #general (53 messages🔥):

Torchtune future, HuggingFace TRL License, Quantum Computing in Ohio, Checkpointing via NFT


HuggingFace ▷ #general (32 messages🔥):

Kimi K2 Open Source Model, Linux for Home Lab, Azure Speech Services SDK, Qwen-1.5 Inference Technologies, SmolVLM2 Technical Report


HuggingFace ▷ #today-im-learning (1 messages):

Model Training, 1.5 bit research


HuggingFace ▷ #cool-finds (1 messages):

tonic_1: https://gpuhammer.com/ wake up babe ! new exploit just dropppppped !


HuggingFace ▷ #i-made-this (7 messages):

LLM Quantization, Desktop App for Plural Identification, French Deep Learning Course, English to Ukrainian Machine Translation Model, LunarisCodex LLM


HuggingFace ▷ #reading-group (3 messages):

Pipeline Parallelism


HuggingFace ▷ #computer-vision (1 messages):

SmolDocLing Finetuning, IDEFICS3ImageProcessor Error


HuggingFace ▷ #agents-course (1 messages):

smolagents, Multi-Agent System, AI Agent Message Flow


Yannick Kilcher ▷ #general (19 messages🔥):

Meta's Open Source Policies, Behemoth Model, Inferring closed models from open weights, Actual ML


Yannick Kilcher ▷ #paper-discussion (5 messages):

GPUHammer paper, memory corruption, data structures


Yannick Kilcher ▷ #ml-news (18 messages🔥):

Muon Optimizer, Mixture of Experts memory optimization, Amazon Cursor competitor


aider (Paul Gauthier) ▷ #general (26 messages🔥):

Vertex AI Thinking Output, Terminal Recommendations for Aider, Local Model Alternatives to Claude Code, Kimi K2 with Groq, Aider benchmark updates


aider (Paul Gauthier) ▷ #questions-and-tips (11 messages🔥):

Aider Debugging, OpenRouter Models, Gemini Flash, Architect Mode, Thinking Mode


aider (Paul Gauthier) ▷ #links (2 messages):

Switchpoint Router, OpenRouter AI, aider polyglot benchmark


Notebook LM ▷ #use-cases (2 messages):

Google Docs Tab Feature, uBlock browser extension, Copying news articles into Google Docs


Notebook LM ▷ #general (23 messages🔥):

PC version of NotebookLM, Featured Notebooks Removal, Video Overviews Release, Custom Podcast Intros, Public Notebooks Location


Manus.im Discord ▷ #general (20 messages🔥):

Manus AI mobile app creation, Vehicle creation for OMSI 2, AI outperforming Manus


Modular (Mojo 🔥) ▷ #general (2 messages):

Discord Channels, Community Showcase


Modular (Mojo 🔥) ▷ #mojo (15 messages🔥):

Mojo native requests library, TLS support in Mojo, Escaping keyword usage in Mojo, @parameter functions and runtime closures


tinygrad (George Hotz) ▷ #general (12 messages🔥):

setitem PR, tensor.py, assign parameter, kernel fusion, remove realize()


tinygrad (George Hotz) ▷ #learn-tinygrad (1 messages):

tensor lvl hooks, LLM hidden states, Fetching hidden states


LlamaIndex ▷ #blog (4 messages):

Amsterdam Meetup, UiPath Integration, Production-Ready RAG, ODSC Agentic AI Summit


LlamaIndex ▷ #general (5 messages):

LLM Fine-tuning Guide, Multi-agent workflow using LlamaIndex, AI Engineer Opportunities


DSPy ▷ #papers (5 messages):

IReRa for hierarchical labels, Multiple modules for hierarchy, Vanilla DSPy for parent-child identification


DSPy ▷ #general (3 messages):

aws/nova-prompt-optimizer, Lean 4


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (8 messages🔥):

Certificate Declaration Form, Lab Submission Feedback


MCP (Glama) ▷ #general (5 messages):

Anthropic Connectors Directory, Docker MCP Toolkit, MCP Inspector


MCP (Glama) ▷ #showcase (2 messages):

AI agents, Model Context Protocol, Autonomous Orchestration, Parallel Execution, Anthropic Claude Sonnet-4


Nomic.ai (GPT4All) ▷ #general (4 messages):

Cloudflare R2, GPT4ALL logic, AI and Web3


MLOps @Chipro ▷ #events (1 messages):

DeepSeek in Production Event, MoE, MLA, FP8, MTP


MLOps @Chipro ▷ #general-ml (1 messages):

Financial Keyword Extraction with BERT, BERT for Key Sentence Extraction, Cosine Similarity for Keyword Identification, Improving BERT-based Keyword Extraction


Codeium (Windsurf) ▷ #announcements (1 messages):

Claude Sonnet 4, Anthropic, Discounted Credit Rate


Codeium (Windsurf) ▷ #content (1 messages):

Wave 11 inclusion