Frozen AI News archive

not much happened today

**Cognition** is acquiring the remaining assets of **Windsurf** after a significant weekend deal. **Moonshot AI** released **Kimi K2**, an open-source, MIT-licensed agentic model with **1 Trillion total / 32B active parameters** using a Mixture-of-Experts architecture, trained on **15.5 Trillion tokens** with the **MuonClip** optimizer, showing top performance on benchmarks like **EQ-Bench** and **Creative Writing**. **xAI** launched **Grok-4**, ranking 5th on **IQ Bench** but with notable quirks including a bug causing it to respond only with "Heavy" and a high frequency of Elon Musk mentions. Rumors about **OpenAI** delaying an open-source model release surfaced, with speculation about CEO **sama**'s PR strategy and a possible **GPT-5** launch in September. The **Gemini 2.5** paper was released with **3,295 authors**, and **Google** introduced its **Gemini Embedding** model, topping the **MTEB leaderboard**.

Canonical issue URL

unless you're a Windsurf employee.

AI News for 7/11/2025-7/14/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (226 channels, and 17145 messages) for you. Estimated reading time saved (at 200wpm): 1343 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

After a whirlwind weekend romance, Cognition is acquiring the remaining, still very valuable assets of Windsurf. Updated reporting on the Windsurf-Google execuhire (all employees dividended the cash value of their vested shares, bonus $82m ARR company afterward) showed a lot of speculation premature, and, with this Cognition deal, ultimately irrelevant.


AI Twitter Recap

Model Releases & Performance: Kimi K2 and Grok-4 Shake Up the Leaderboards

AI Companies & Business Moves

AI Tooling, Frameworks, & Infrastructure

AI Research & Techniques

Broader Implications & Industry Commentary

Humor & Memes


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Kimi K2 Model Release, Technical Deep Dives, and Derivatives

2. Recent Large Model Benchmarks: Reasoning and Coding Performance

3. Major AI Industry Developments and Tooling Innovations

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. OpenAI's Recent Turbulence and Industry Competition

2. Claude, Kiro IDE, and User Coding Tool Reviews

3. LoRA Models, Training Tutorials, and Stable Diffusion Community


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.0 Flash Thinking

Theme 1. Kimi K2: Rising Star Faces Hardware Hurdles

Theme 2. Benchmarks and Model Performance Shifts

Theme 3. Dev Tools and Frameworks: Features, Fixes, and Frustrations

Theme 4. Low-Level Deep Dives: Architectures, Training, and GPU Code

Theme 5. AI Industry Moves: Mega Clusters, Delayed Models, and Acquisitions


Discord: High level Discord summaries

Perplexity AI Discord


Cursor Community Discord


LMArena Discord


Unsloth AI (Daniel Han) Discord


OpenAI Discord


OpenRouter (Alex Atallah) Discord


LM Studio Discord


HuggingFace Discord


Nous Research AI Discord


GPU MODE Discord


Eleuther Discord


Latent Space Discord


MCP (Glama) Discord


Yannick Kilcher Discord


aider (Paul Gauthier) Discord


LLM Agents (Berkeley MOOC) Discord


Modular (Mojo 🔥) Discord


Manus.im Discord Discord


Notebook LM Discord


LlamaIndex Discord


Torchtune Discord


tinygrad (George Hotz) Discord


Nomic.ai (GPT4All) Discord


Cohere Discord


DSPy Discord


Gorilla LLM (Berkeley Function Calling) Discord


Codeium (Windsurf) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1265 messages🔥🔥🔥):

Comet data harvesting warnings, Perplexity Pro referral benefits, Grok 4 vs. O3 Pro comparison, Kimi K2 Local Run, Comet as Default Browser


Perplexity AI ▷ #sharing (9 messages🔥):

Renewable energy grid reliability, COVID mortality data analysis, Comet AI use case, Perplexity AI spaces


Perplexity AI ▷ #pplx-api (5 messages):

Perplexity not searching the web, Sonar hallucinating URL contents, search_domain_filters parameter


Cursor Community ▷ #general (646 messages🔥🔥🔥):

Cursor Performance, Kimi K2 Integration, Pricing Model Feedback, Gemini 2.5 Pro, Background Agents


Cursor Community ▷ #background-agents (20 messages🔥):

Background Agents secrets not working, Automatic port forwarding issues, Trigger background agents programmatically, coredump issue in background agent commits, Background Agents UI not updating


LMArena ▷ #general (983 messages🔥🔥🔥):

Grok 4 No System Prompt, Kimi K2 Performance, LM Arena Leaderboard, OpenAI Open Source Model Delay, LLM Development Costs


LMArena ▷ #announcements (1 messages):

LMArena, kimi-k2


Unsloth AI (Daniel Han) ▷ #general (1062 messages🔥🔥🔥):

Unsloth Q001 K_M GGUF, LegalNLP Dataset, Goody2 AI censored model, Open Empathic Project, GPTs Agents


Unsloth AI (Daniel Han) ▷ #off-topic (76 messages🔥🔥):

AGI benchmarks, Memory vs Internet, tinygrad drivers, Voice representation


Unsloth AI (Daniel Han) ▷ #help (97 messages🔥🔥):

Custom HF Datasets for LoRA, Unsloth RL Tool Harness, FLAN-T5 Support, Llama4 Scout Support, Gemma 3n Inference with Kaggle GPUs


Unsloth AI (Daniel Han) ▷ #research (60 messages🔥🔥):

GPT-4.5 size, Qwen 2.5 Training, Multilingual Datasets, Training Data Copyright, SFT creative writing


Unsloth AI (Daniel Han) ▷ #unsloth-bot (49 messages🔥):

UnslothTrainer vs SFTTrainer, Ollama Model Export Error, Sesame TTS Model Audio Input Length Error, Unsloth Introduction, Model Distillation


OpenAI ▷ #ai-discussions (691 messages🔥🔥🔥):

Ray's sacrifice, AI-assisted coding, emotional AI, persona layers, Grok's biases


OpenAI ▷ #gpt-4-discussions (16 messages🔥):

GPT-4o limitations, CustomGPT vs Projects, Memory settings in ChatGPT, GPT-4.5 vs GPT-4o for creative writing, Multimodal AI platform limitations


OpenAI ▷ #prompt-engineering (5 messages):

AI writing, Alternative History prompts


OpenAI ▷ #api-discussions (5 messages):

AI Writing, Alternative History image generation


OpenRouter (Alex Atallah) ▷ #announcements (10 messages🔥):

Cypher Alpha Sunset, Kimi K2 launch, Gemini 2.5 Flash Deprecation


OpenRouter (Alex Atallah) ▷ #app-showcase (5 messages):

Mathcheap, Y-Router, Personality.gg, Multi-AI Automated Research Bot


OpenRouter (Alex Atallah) ▷ #general (833 messages🔥🔥🔥):

Text Completion, OpenRouter's Credit System, Chatroom GUI, Svelte vs React Chat Performance, Rate Limits


OpenRouter (Alex Atallah) ▷ #new-models (12 messages🔥):

Switchpoint Router, Default Model Settings, Auto Router Functionality


OpenRouter (Alex Atallah) ▷ #discussion (89 messages🔥🔥):

OpenRouter Pricing, Frontend UI Discussions, Gemini Embedding, Fast LLMs


LM Studio ▷ #general (255 messages🔥🔥):

Multi-Modal Support, LM Studio SDK, Prompt Caching, Tool Calling and MCP, Hardware for Kimi K2


LM Studio ▷ #hardware-discussion (63 messages🔥🔥):

Nvidia DGX, 5090 Price, electricity cost of running, 1T parameter model, EXAONE 4


HuggingFace ▷ #announcements (1 messages):

Gemma 3n, SmolLM3, Efficient MultiModal Data Pipeline, responses.js, EoMT image segmentation model


HuggingFace ▷ #general (233 messages🔥🔥):

Fine-tuning multimodal models for electronics, AI moderator bot with image support, Quantization and running LLMs on limited hardware, Hugging Face Courses, SillyTavern and AI model integration on Android


HuggingFace ▷ #today-im-learning (6 messages):

Deepseek 8-bit training, 4-bit training


HuggingFace ▷ #cool-finds (2 messages):

Dynamic Structure Adjustments


HuggingFace ▷ #i-made-this (20 messages🔥):

License Compliance Tool, BorgLLM Open Source, Light Weight Computer Vision Model, Agent Arena for Preference Data, Stable Audio Model Experiments


HuggingFace ▷ #reading-group (2 messages):

HuggingFace Ultrascale Playbook, Full Scale Training Resources, OpenAI Job Requirements


HuggingFace ▷ #computer-vision (1 messages):

dlp1843: Is the landing page to opencv.org to opencv what bitcoin.com is to bitcoin?


HuggingFace ▷ #agents-course (28 messages🔥):

HF Secrets Leak, Tools for images, audio, Agents course video sessions?, Assistant node one-word answers, MCP Server setup help


Nous Research AI ▷ #general (142 messages🔥🔥):

Grok-4 reasoning and tools, Deep-Hermes reasoner options, AI models self-play, Kimi K2, OAI open model delayed


Nous Research AI ▷ #ask-about-llms (11 messages🔥):

Dockerizing, Prompt engineering, Egyptian Gods, AI Governance Articles, SFT and GRPO


Nous Research AI ▷ #research-papers (11 messages🔥):

Recursive Learning Systems Research, Recursive Symbolic Intelligence, Ontology at the Root of Every Model, Psyche as an MCP Component


Nous Research AI ▷ #interesting-links (4 messages):

AI Disruption, MedGemma, Expert-Level Fine-Tuning


Nous Research AI ▷ #research-papers (11 messages🔥):

Recursive Learning Systems, Symbolic Intelligence, Psyche MCP Component, Ontology in Models


GPU MODE ▷ #general (35 messages🔥):

PMPP 5th edition and ML updates, FP8 training, Luminal talk, vast.ai GPU pricing scraper, Programming models for ML applications


GPU MODE ▷ #triton (16 messages🔥):

Triton Kernel Padding, AOT Triton Updates, Gluon Tile Scheduling, Linear Attention Kernel Optimization, Matmul Library Matrix Handling


GPU MODE ▷ #cuda (1 messages):

Deadlock Issue Debugging, cudaMemcpyAsync issues, cudaHostFunc issues, NCCL issue #1509


GPU MODE ▷ #torch (6 messages):

gradient computation, xai method, CPU memory usage, Torch, activation memory


GPU MODE ▷ #announcements (1 messages):

Luminal, Deep Learning Compiler, Joe Fioti


GPU MODE ▷ #pmpp-book (1 messages):

piotr.mazurek: https://github.com/tugot17/pmpp


GPU MODE ▷ #torchao (4 messages):

PyTorch TorchAO, ICML 2025, CodeML workshop, TorchAO Poster


GPU MODE ▷ #off-topic (2 messages):

GPU pronouns, TPU pronouns, CUDA pronouns, ROCm pronouns


GPU MODE ▷ #irl-meetup (2 messages):

AI Conference San Francisco, ICML Meetup, KernelBot Paper Presentation


GPU MODE ▷ #rocm (4 messages):

rocprofv3 profiling, AMD kernels, PyTorch profiling


GPU MODE ▷ #webgpu (2 messages):

TurboWarp Extension for Machine Learning, MTLReadWriteTextureTier2 and wgpu


GPU MODE ▷ #self-promotion (12 messages🔥):

Thunder Compute VSCode Extension, NVIDIA Tensor Core Evolution, QuACK Open Source Library, Backpropagation through RMSNorm and LayerNorm, AI Compute Hackathon in a German Castle


GPU MODE ▷ #🍿 (3 messages):

nsight compute profiling, AutoTriton


GPU MODE ▷ #submissions (7 messages):

H100 First Place, A100 First Place, MI300 Personal Best


GPU MODE ▷ #factorio-learning-env (15 messages🔥):

Training Repo, Vision Transformers, TAS Data, Main Branch Broken


GPU MODE ▷ #cutlass (49 messages🔥):

Cute Tensors, Broadcasting in CuteDSL, Cutlass Kernel, cuTile, CUDA


Eleuther ▷ #general (85 messages🔥🔥):

GPU demand, ICML 2025, Causal Systems, Low GPU power AI, Water/Light GPU


Eleuther ▷ #research (42 messages🔥):

RNN tokenization, Mixture of Tokenizers, Byte-level models, n-simplical attention, antipodal dense features


Eleuther ▷ #scaling-laws (1 messages):

schizik12: The rising sea??


Eleuther ▷ #interpretability-general (3 messages):

MechInterp Workshop CFP, NeurIPS, Open Source Library Spotlight


Eleuther ▷ #lm-thunderdome (16 messages🔥):

lm-evaluation-harness mixed precision PR, logsumexp Trick for Logprob Calculation, Dynamic IFEval Dataset Benchmark


Eleuther ▷ #gpt-neox-dev (13 messages🔥):

Neox, H100s, Transformer Engine, DeepSpeed


Latent Space ▷ #ai-general-chat (153 messages🔥🔥):

Windsurf Acquired, Apple succession and leadership, GPT-5 Rumors, Universal Reward Function, Gemini Embedding Model Release


Latent Space ▷ #ai-announcements (1 messages):

swyxio: special double podcast this week! https://x.com/latentspacepod/status/1943774304166195402


MCP (Glama) ▷ #general (74 messages🔥🔥):

MCP for ML Models, Agent Definitions in GenAI, Clipboard Servers in MCP, Elicitation Implementations


MCP (Glama) ▷ #showcase (9 messages🔥):

Neurabase, mcp-spec, Director Run, MCP Evals, Albert Heijn MCP


Yannick Kilcher ▷ #general (56 messages🔥🔥):

Industrial Agents Training, Good World Models, Kimi K2, OpenAI Safety, BitNet vs Llama.cpp


Yannick Kilcher ▷ #paper-discussion (6 messages):

ResNet and Attention in U-Nets, Hugging Face Transformers, U-Net Definition Confusion, Accordion Networks (WWWWWW), Kimi K2 Model


Yannick Kilcher ▷ #ml-news (3 messages):

Twitter Links


aider (Paul Gauthier) ▷ #general (52 messages🔥):

Grok 4 Aider Benchmark, Aider Benchmark Harder Tasks, Aider Leaderboard Updates, Aider Agents, Aider on Windows


aider (Paul Gauthier) ▷ #questions-and-tips (10 messages🔥):

Zed editor schema validation for aider conf file, Github Copilot support in Aider, COBOL support to Aider, LiteLLM Proxy config and Aider config, Gemini thinking tokens


LLM Agents (Berkeley MOOC) ▷ #mooc-announcements (1 messages):

MOOC Certificates, Certificate Requirements, Feedback form


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (49 messages🔥):

Certificate Issues, Certificate Declaration Form, Article Submission Form, Formatting Errors on Certificates, Missing Certificates


Modular (Mojo 🔥) ▷ #general (9 messages🔥):

Assembly coding inside Mojo, Modular community event tracking, Discord notifications, Mojo Standard Library Assembly Module


Modular (Mojo 🔥) ▷ #announcements (1 messages):

July Community Meeting, Hashable-based hashing, FFT implementation, Mojo-Lapper, Quantum circuit simulator


Modular (Mojo 🔥) ▷ #mojo (30 messages🔥):

Mojo error messages, M1 Metal 3 GPUs, Autotune functionality, EqualityComparable, Atomics on GPU


Modular (Mojo 🔥) ▷ #max (4 messages):

arg_nonzero kernel, max.kernels import, mojo build max kernels


Manus.im Discord ▷ #general (37 messages🔥):

Manus Flutter Web Emulator, Startup Advice, Google Drive Save Error, Manus Website Outage, Manus Fellowship


Notebook LM ▷ #announcements (1 messages):

Featured Notebooks, NotebookLM


Notebook LM ▷ #use-cases (11 messages🔥):

Targeted Fiction Editing with AI, NotebookLM integration with Apple system toolkits, AI for extracting information from books


Notebook LM ▷ #general (24 messages🔥):

Source naming conventions, Audio file generation length, Embedding model details, Server tag requests, iOS app functionality


LlamaIndex ▷ #announcements (1 messages):

LlamaIndex Meetup Amsterdam, Office Hours, Notebook Llama, Context Engineering, Research Agent


LlamaIndex ▷ #blog (3 messages):

Notebook Llama new features, RAG Apps, Google Gemini 2.5 Pro


LlamaIndex ▷ #general (27 messages🔥):

LlamaIndex Partner Program, Tool Calling Models, Synk Hiring, Response Synthesizers


LlamaIndex ▷ #ai-discussion (1 messages):

Synk, MetaToyGame, Decentralized system for browsers


Torchtune ▷ #general (1 messages):

yamashi: Kimi K2 was trained with muon, could it be that this is the future


Torchtune ▷ #dev (17 messages🔥):

Async Recipe, Flex Attention memory usage with complicated masks, torch.cuda.memory._set_allocator_settings, Sync GRPO Recipe

compile:
  model: true
  loss: true
  scale_grads: true
  optimizer_step: false

Torchtune ▷ #papers (2 messages):

Token Training, Grokking


tinygrad (George Hotz) ▷ #general (19 messages🔥):

Frontend Reimplementations, Metal Profiling API, ONNX Flaky and Coredumps, Driving Vision ONNX Issue, Tinygrad Apps and Examples


Nomic.ai (GPT4All) ▷ #general (19 messages🔥):

Gemma 3, Nomic-embed-v2 finetuning, LocalDocs embedding issues, Nomic API server performance, RAG for lore


Cohere ▷ #🧵-general-thread (12 messages🔥):

Aya Expanse 32B, Preference Optimization Dataset, Cohere Labs Discord Server


Cohere ▷ #👋-introduce-yourself (6 messages):

Machine Learning Research, High Performance Computing, Quantum Computing, PhD opportunities


DSPy ▷ #papers (1 messages):

okhattab: Yes. Or read the paper for IReRa!


DSPy ▷ #general (4 messages):

NFT Public Mint, OpenSea Rewards Claim, Custom LLM Adapter Error, Arc Prize DSPy Hacking


Gorilla LLM (Berkeley Function Calling) ▷ #leaderboard (2 messages):

Llama 4 Scout vs Llama 3.1 70B, BFCL Website Rendering Bug, Llama-3.3-70B-Instruct (FC) Score Discrepancy


Codeium (Windsurf) ▷ #announcements (1 messages):

Windsurf, Cognition, Devin, AI coding