Frozen AI News archive

not much happened today

**Meta** has hired **Scale AI CEO Alexandr Wang** as its new **Chief AI Officer**, acquiring a **49% non-voting stake** in **Scale AI** for **$14.3 billion**, doubling its valuation to **~$28 billion**. This move is part of a major talent shuffle involving **Meta**, **OpenAI**, and **Scale AI**. Discussions include the impact on **Yann LeCun**'s influence at **Meta** and potential responses from **OpenAI**. In model news, **Gemma 3N** faces technical issues like vision NaNs and FP16 overflows, with fixes from **UnslothAI**. Chinese open-source models like **GLM-4.1V-Thinking** by **Zhipu AI** and **DeepSeek R1T2** show strong performance and speed improvements. **Huawei** open-sourced a **72B MoE** model with a novel load balancing solution. The **MiniMax-M1** hybrid MoE model leads math benchmarks on the **Text Arena leaderboard**. **AllenAI** launched **SciArena** for scientific literature evaluation, where **o3** outperforms others. Research from **Sakana AI Labs** introduces **AB-MCTS** for code generation, improving synthesis benchmarks.

Canonical issue URL

a quiet day

AI News for 7/1/2025-7/2/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (220 channels, and 7625 messages) for you. Estimated reading time saved (at 200wpm): 603 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Today's Current Thing was Soham Parekh stories, which only affect a small number of startups. If you're seeking interesting AI stories, perhaps you could consider buying your own personal open source humanoid robot, available today.


AI Twitter Recap

The Great AI Talent Shuffle: Meta, OpenAI, and Scale AI

Model Releases, Benchmarks, and Performance

Agent Tooling, Frameworks, and Infrastructure

Robotics and Embodied AI

Broader Tech & Societal Implications

Humor & The Soham Parekh Saga


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. New Open-Source AI Model Announcements and Benchmarks

2. Open-Source AI Model Applications and User Projects

3. Cutting-Edge Multi-Modal/Thinking Model Previews

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Veo 3 AI Video Generation Impact and Creative Uses

2. Kontext & ComfyUI Advanced Reference and Workflow Techniques

3. AI-Generated Influencers and Virtual Personas in Social Media


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1. The AI Arms Race: New Models, Performance Showdowns, and Talent Wars

Theme 2. Innovations in Model Architecture and Fine-Tuning

Theme 3. Developer Tooling, Workflows, and GPU Nightmares

Theme 4. The Business of AI: Pricing, Outages, and Poaching

Theme 5. The Guardrail Gauntlet and the Perils of AI Hallucinations


Discord: High level Discord summaries

OpenAI Discord


Perplexity AI Discord


Cursor Community Discord


Unsloth AI (Daniel Han) Discord


LMArena Discord


OpenRouter (Alex Atallah) Discord


LM Studio Discord


HuggingFace Discord


Eleuther Discord


Yannick Kilcher Discord


GPU MODE Discord


Latent Space Discord


MCP (Glama) Discord


aider (Paul Gauthier) Discord


Notebook LM Discord


Cohere Discord


Modular (Mojo 🔥) Discord


Nous Research AI Discord


Nomic.ai (GPT4All) Discord


LlamaIndex Discord


Manus.im Discord Discord


tinygrad (George Hotz) Discord


DSPy Discord


AI21 Labs (Jamba) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Torchtune Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

OpenAI ▷ #ai-discussions (1340 messages🔥🔥🔥):

Champ's guardrail insights, zark muckerberg images, grok vs chatgpt avm, rule-based rewards, American models


OpenAI ▷ #gpt-4-discussions (1 messages):

metacire: My operator broken


OpenAI ▷ #prompt-engineering (5 messages):

Strategic Project Matrix, Deep Research Prompt, AI-Driven Project Triage


OpenAI ▷ #api-discussions (5 messages):

Strategic Project Matrix, Deep Research Prompt, AI-driven Obligation Triage, Cross-functional Team Coordination, Mnemonic Prioritization Framework


Perplexity AI ▷ #announcements (1 messages):

kesku: https://fixvx.com/perplexity_ai/status/1940443479710257226 <@&1105626802732404746>


Perplexity AI ▷ #general (1364 messages🔥🔥🔥):

File Uploads on Pro, Selling Perplexity Pro Accounts, Manus AI Gemini, Comet Browser, Perplexity Max


Perplexity AI ▷ #sharing (4 messages):

Siri Overhaul, Education, Family, and Fate Cult, Silk Road Origins, DIY Thermal Optimization


Perplexity AI ▷ #pplx-api (4 messages):

API subdomain exclusion, sonar-reasoning-pro <think> tag


Cursor Community ▷ #general (742 messages🔥🔥🔥):

Agent performance, Pricing changes, New Features (queue), Model performance (Gemini vs Claude), Background Agents


Cursor Community ▷ #background-agents (52 messages🔥):

Snapshot visibility issues, Docker in Docker setup, Apply Changes Locally UX, Background Agent vs Cursor chat behavior, NPM setup in Dockerfile


Unsloth AI (Daniel Han) ▷ #general (602 messages🔥🔥🔥):

Training GPTs Agent, Custom dataset for fine tuning, Benefits of unsloth quant vs other quant methods, e-prime, CUDA cores vs Tensor cores


Unsloth AI (Daniel Han) ▷ #off-topic (38 messages🔥):

OCR Model for Fast Inference, Intel Arc Pricing, OSS Alternative to 11labs Scribe V1, Fine tuning failures, ChessFish.io


Unsloth AI (Daniel Han) ▷ #help (108 messages🔥🔥):

Qwen3-32B Saving Issues, ModuleNotFoundError: No module named 'unsloth', Unsloth and math problems, Quantization Issue, Llama 4 Quantized Issue


Unsloth AI (Daniel Han) ▷ #research (2 messages):

MoE Model, Ascend GPUs, Qwen3-32b


LMArena ▷ #general (740 messages🔥🔥🔥):

Cypher Alpha evaluation, Grok-4-0629 release date, OpenAI's open-weights model license, Gemini CLI limits, Deepseek R2 model


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

DeepSeek V3, Configuration Mistake, Downtime Apology


OpenRouter (Alex Atallah) ▷ #app-showcase (5 messages):

AI-powered dictionary app, Free roleplay website


OpenRouter (Alex Atallah) ▷ #general (561 messages🔥🔥🔥):

Deepseek 0324 outage, Cypher Model Details, cometapi using OpenRouter data, Grok-4-code-0629, Contributing to OpenRouter


LM Studio ▷ #general (333 messages🔥🔥):

LLMs and AI Hallucinations, LLM Output Trust, Local LLM Use Cases, RAG Implementation, LM Studio and RAG


LM Studio ▷ #hardware-discussion (157 messages🔥🔥):

GPU VRAM, LM Studio Accuracy, APA 7 Citations, Shared VRAM


HuggingFace ▷ #general (54 messages🔥):

HuggingChat Shutdown, GPT4All model recommendations for Building Design, HF Inference Client Backward Compatibility, Exporting HuggingChat Data, MCP Server on Claude Desktop Error


HuggingFace ▷ #today-im-learning (1 messages):

alperugurcan: https://www.coursera.org/learn/generative-ai-for-everyone


HuggingFace ▷ #cool-finds (4 messages):

step1 landing page builder, Lovable/Cursor Alternatives, Selling side projects on Fiverr


HuggingFace ▷ #i-made-this (11 messages🔥):

OCR Demo MCP Server, HF Dataset LLM Key, LoRMA for LLMs


HuggingFace ▷ #core-announcements (1 messages):

Flux Optimization, H100, PyTorch, torch.compile()


HuggingFace ▷ #computer-vision (1 messages):

VideoMAE, Domain-Adaptive Pretraining, Video Classification


HuggingFace ▷ #NLP (1 messages):

SentenceTransformers, bi-encoder setup, similarity search, Dynamic K


HuggingFace ▷ #smol-course (1 messages):

jiji3369: It is 10 dollars for one run or that's 10 dollars in total for all the runs you made?


HuggingFace ▷ #agents-course (22 messages🔥):

GenAI Solution Consultant, Smolagents Course, Hugging Face Inference Endpoints, Llama-3.3-70B-Instruct Issues


Eleuther ▷ #general (58 messages🔥🔥):

Diffusion World Models, OpenWebText (OWT) Quality, RLHF Packages, Conference Travel Grants, Independent Research Mentoring


Eleuther ▷ #research (9 messages🔥):

Transition Matching, NeurIPS Ethics Review, Open Research Hackathon, Single Layer Transformer, KV Caching


Eleuther ▷ #interpretability-general (2 messages):

Context Engineering, Open Research Hackathon


Eleuther ▷ #lm-thunderdome (17 messages🔥):

lm-evaluation-harness library standardization, lm-evaluation-harness init script optimization, lm-evaluation-harness task discoverability, Lazy-loading modules in lm-evaluation-harness, lm_eval startup speed


Eleuther ▷ #multimodal-general (3 messages):

Kaiming He, Mean Flow Matching


Yannick Kilcher ▷ #general (32 messages🔥):

Dynamic K in similarity search, RNNs and LSTM, Universal Function Approximators, SSMs, BPTT


Yannick Kilcher ▷ #paper-discussion (3 messages):

Linear Transformers Parallelization, Delta Rule over Sequence Length, RWKV-7 Equation 18, DeltaNet Performance


Yannick Kilcher ▷ #ml-news (54 messages🔥):

Healthcare Decisions, Immunotherapy Development, American vs European Food, Transition Matching


GPU MODE ▷ #general (17 messages🔥):

nsys and torch.compile, cursor and windsurf, GPT vs Claude, Work-Life Balance, European Work


GPU MODE ▷ #triton (1 messages):

Triton Nightly Wheel Builds, TensorDescriptor Use


GPU MODE ▷ #cool-links (1 messages):

gau.nernst: https://x.com/davisblalock/status/1939956579698094166


GPU MODE ▷ #beginner (43 messages🔥):

CUDA for deep learning tasks, Implementing custom ML algorithms, Docker image for CUDA, Contributing to existing libraries


GPU MODE ▷ #torchao (1 messages):

FSDP 2.0, DTensors, Sharding


GPU MODE ▷ #off-topic (3 messages):

GPU, CUDA, Interview prep, Cram resources, YouTube tutorials


GPU MODE ▷ #rocm (1 messages):

Compiler register lifetime, Avoiding register spills


GPU MODE ▷ #self-promotion (2 messages):

Recipe Index Launch, Google Meet Link


GPU MODE ▷ #thunderkittens (1 messages):

Apple Silicon, Thunderkitten


GPU MODE ▷ #reasoning-gym (1 messages):

FSDP Config, model_dtype parameter, Qwen2.5


GPU MODE ▷ #factorio-learning-env (2 messages):

FLE talk, Pre-training


GPU MODE ▷ #cutlass (1 messages):

Cutlass Kernel Performance Prediction, Analytical Cost Models vs. Autotuning, GEMM Kernel Performance Predictability


Latent Space ▷ #ai-general-chat (66 messages🔥🔥):

Anysphere/Cursor hires Anthropic Claude Code Leaders, Anthropic's $4B ARR, Meta's Aggressive Poaching of AI Talent from OpenAI, Luma Labs AI Modify Video Tool, Perplexity New Subscription Tier


Latent Space ▷ #ai-announcements (4 messages):

Information Theory, Jack Morris, LLM Inversion


MCP (Glama) ▷ #general (63 messages🔥🔥):

O'Reilly Book on MCP, Storing Docs for LLM, MCP Inspect and Badge Issues, Claude Hooks for Git, MCP Routing Layer


MCP (Glama) ▷ #showcase (2 messages):

MCP, tip.md, x402, CDP SDK, Coinbase Hackathon


aider (Paul Gauthier) ▷ #general (24 messages🔥):

Cypher Alpha performance, Claude Sonnet ranking, Openrouter Oauth issues, Aider API key problems, Claude Code comparison


aider (Paul Gauthier) ▷ #questions-and-tips (27 messages🔥):

aider /architect mode, Local Model Recommendations, aider auto test, aider --yes-always and --no-always, Quantized Models


Notebook LM ▷ #use-cases (4 messages):

NotebookLM use cases, NotebookLM as a personal daily journal, Audio overview function


Notebook LM ▷ #general (21 messages🔥):

NotebookLM sources, Podcast generation, Gas plant power factor, Pro vs Free accounts, Opening PDF sources


Cohere ▷ #🧵-general-thread (5 messages):

Cohere open model weight release, CMD-R model, tool/agent frameworks, ML Summer School channel


Cohere ▷ #🔌-api-discussions (4 messages):

Cohere Embedding Model, Trial key, Rate limits, Production key, Monthly limit


Cohere ▷ #👋-introduce-yourself (10 messages🔥):

ML Summer School, Agentic AI, Computer Vision, Water Quality Monitoring


Cohere ▷ #🔬-research (3 messages):

Secure ML, Privacy Preservation, AGI is here


Modular (Mojo 🔥) ▷ #mojo (15 messages🔥):

Mojo Origin Tracking System, Ownership and Life Cycles in Mojo, Mojo structs vs classes, GPU puzzles in Mojo, Dependent Type System in Mojo


Modular (Mojo 🔥) ▷ #max (2 messages):

Mojo Offline Inference, QuantizationEncoding, LLM on M1 Mac, Nightly vs Stable Builds


Nous Research AI ▷ #general (12 messages🔥):

Making friends online, Manus AI new unlimited plan, NFT Scams


Nous Research AI ▷ #research-papers (2 messages):

Mentorship for Independent Research


Nous Research AI ▷ #research-papers (2 messages):

Mentorship Request


Nomic.ai (GPT4All) ▷ #general (13 messages🔥):

Floor plan analysis with GPT4All, Image recognition limitations, LM Studio image acceptance, ChatGPT image analysis


LlamaIndex ▷ #blog (3 messages):

LlamaExtract, LlamaCloud, Enterprise RAG, Multi-modal indexing


LlamaIndex ▷ #general (6 messages):

OpenAI Batch API, LlamaIndex Workflows 1.0, Embedding OpenAI API Key, Developer Collaboration


Manus.im Discord ▷ #general (6 messages):

Manus Prompt, MCP Server, Claude Opus 4, Qwen 3, 32B model


tinygrad (George Hotz) ▷ #general (3 messages):

haldie style viz, tile viz approach, shared mem buffers, global buffers


tinygrad (George Hotz) ▷ #learn-tinygrad (1 messages):

.pyrophoric.: hi is there a cli tool to automatically format code in the tinygrad style?


DSPy ▷ #general (1 messages):

chiggly007: What do you mean by this?


AI21 Labs (Jamba) ▷ #general-chat (1 messages):

Autonomous Agents, Multi-Agent Systems, LangChain, AutoGen, AI Assistants