Frozen AI News archive

not much happened today

**Meta** makes a major AI move by hiring **Scale AI** founder **Alexandr Wang** as Chief AI Officer and acquiring a 49% non-voting stake in **Scale AI** for **$14.3 billion**, doubling its valuation to about **$28 billion**. **Chai Discovery** announces **Chai-2**, a breakthrough model for zero-shot antibody discovery and optimization. The US government faces budget cuts threatening to eliminate a quarter million science research jobs by **2026**. Data access restrictions intensify as companies like **Atlassian**, **Notion**, and **Slack** block web crawlers including **Common Crawl**, raising concerns about future public internet archives. **Hugging Face** shuts down **HuggingChat** after serving over a million users, marking a significant experiment in open-source LLMs. **Sakana AI** releases **AB-MCTS**, an inference-time scaling algorithm enabling multiple models like **Gemini 2.5 Pro** and **DeepSeek-R1-0528** to cooperate and outperform individual models.

Canonical issue URL

a quiet day.

AI News for 6/30/2025-7/1/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (220 channels, and 7874 messages) for you. Estimated reading time saved (at 200wpm): 647 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Lots of small stories - Wired confirms 8 figure offers from Meta Superintelligence, Cursor poached Claude Code's leads from Anthropic, Cloudflare is blocking CommonCrawl, Grammarly acquired Superhuman.


AI Twitter Recap

Industry, Corporate Moves, and Funding

AI Models, Research, and Benchmarks

Agent Development, Frameworks, and Tooling

Infrastructure, Efficiency, and Developer Tools

Broader Implications and Commentary

Humor and Memes


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Major Open Weight Model Launches: Huawei Pangu Pro 72B

2. Gemma 3n and Unsloth: Fine-Tuning Performance and Fixes

3. Community Projects and MLX Rumors: LLM Client for PS Vita and Apple MLX Speculation

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Major AI Executive Moves and Industry Talent Wars

2. Anthropic Claude Code: Guides, Features, and User Experiences

3. AI Model Behavior and Autonomous Risk Studies


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Flash Preview

Theme 1. Model Performance & New Releases

Theme 2. Platform Pricing Strikes Back

Theme 3. Cracking the Code: AI Development & Research Deep Dive

Theme 4. GPU Power Plays and Hardware Hacks

Theme 5. AI Ecosystem Connects, Acquires, and Automates


Discord: High level Discord summaries

Perplexity AI Discord


Cursor Community Discord


Unsloth AI (Daniel Han) Discord


LMArena Discord


LM Studio Discord


Yannick Kilcher Discord


HuggingFace Discord


Nous Research AI Discord


aider (Paul Gauthier) Discord


Latent Space Discord


Eleuther Discord


GPU MODE Discord


MCP (Glama) Discord


Notebook LM Discord


LlamaIndex Discord


Cohere Discord


Modular (Mojo 🔥) Discord


Nomic.ai (GPT4All) Discord


Manus.im Discord Discord


DSPy Discord


AI21 Labs (Jamba) Discord


LLM Agents (Berkeley MOOC) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Torchtune Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1099 messages🔥🔥🔥):

Apple Claude Siri, Gemini vs Sonnet, Context Window Limit, BlackBox AI, Perplexity Max


Perplexity AI ▷ #sharing (3 messages):

China's countryside, Google's story, Siri overhaul


Perplexity AI ▷ #pplx-api (12 messages🔥):

Sonar models base, Spending limits, finance search, API credits


Cursor Community ▷ #general (967 messages🔥🔥🔥):

Cursor's Pricing Changes, New Pro+ Plan, Rate Limits and API Usage, Warp vs Cursor, Claude Code


Cursor Community ▷ #background-agents (62 messages🔥🔥):

GitLab Integration, MCP Server/API for Background Agents, Background Agents and Linear Integration, Docker in Docker with Background Agents, Snapshot Visibility and Environment Setup


Unsloth AI (Daniel Han) ▷ #general (643 messages🔥🔥🔥):

Training Cost, Speech to Speech Models, GPTs Training, Multilingual Knowledge, Unsloth Gradient Checkpointing


Unsloth AI (Daniel Han) ▷ #announcements (1 messages):

Gemma 3n, TTS Models, Unsloth Updates, DeepSeek-R1-0528, Mistral Models


Unsloth AI (Daniel Han) ▷ #off-topic (28 messages🔥):

Intel Arc Pro B60 Pricing, GPU VRAM Management in PyTorch, Unsloth Open Source Contribution, OCR Model for Fast Inference, Alternatives to 11labs Scribe V1


Unsloth AI (Daniel Han) ▷ #help (186 messages🔥🔥):

Qwen 14B Training in Colab, SFTTrainer Sequence Truncation, Model Saving after Training, Gemma 3n Fine-tuning Guidelines, Multimodal RL with Unsloth


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

GRPO, Reward Function Generator, Logic-based evaluator, TrebuchetNetwork


Unsloth AI (Daniel Han) ▷ #research (27 messages🔥):

Identity mixture in LLMs, Catastrophic forgetting mitigation, Context management in LLMs, Knowledge decay and graph storage, MoE model trained on Ascend GPUs


LMArena ▷ #general (583 messages🔥🔥🔥):

PolyMarket welcomes US users, Perplexity Sub vs Vendor Subs, LMArena Update and Test Garden News, Cypher Alpha Model Analysis, Grok 4 launch and hype


LM Studio ▷ #general (222 messages🔥🔥):

Memory Management of Multiple Models, Llama.cpp WebUI, Local LLMs, MCP and LM Studio


LM Studio ▷ #hardware-discussion (15 messages🔥):

GDDR7, NVIDIA 5080, AMD 9080 XT, Memory Bus


Yannick Kilcher ▷ #general (48 messages🔥):

LLM Finetuning, Hierarchical Reasoning Model, Test Time Training, Test Time Training Done Right, Inner and outer layer


Yannick Kilcher ▷ #paper-discussion (3 messages):

RWKV-7, Arxiv paper


Yannick Kilcher ▷ #ml-news (78 messages🔥🔥):

Intelligence vs Statistics, Healthcare as a human right, UnitedHealthcare lawsuit, Cigna claim denials, Transition Matching by Meta


HuggingFace ▷ #general (46 messages🔥):

Zero-shot labeling models, Hugging Face Chat Bot suggestions, On-demand GPU cluster service, Hugging Face Hub new category, Fine-tuned GGUF model uploads to inference endpoints


HuggingFace ▷ #today-im-learning (1 messages):

alperugurcan: https://www.coursera.org/learn/generative-ai-for-everyone


HuggingFace ▷ #i-made-this (22 messages🔥):

symbolic music AI frontend, rust crate for local models, embedder models, OCR dataset, PDF support in dataset viewer


HuggingFace ▷ #computer-vision (4 messages):

HF CV course, Fine-tuning internvl3, LayoutLMv3 with is_split_into_words, Predict float value a grayscale image


HuggingFace ▷ #NLP (1 messages):

kaafi_aalsi: hi all, has anyone here finetuned internvl3 model? need a bit of help😩


HuggingFace ▷ #smol-course (2 messages):

Agents Course, Course Completion Certificate


HuggingFace ▷ #agents-course (26 messages🔥):

Hugging Face Course Progress, DETR Training Help, HF Account Creation Issues, Agent Course Completion, Final Challenge Details


Nous Research AI ▷ #general (93 messages🔥🔥):

SaaS sales job leading to selling own SaaS, Poor man's SaaS, Automated AB testing for dating profiles, AI and Dating Apps, Ethics of AI in dating


Nous Research AI ▷ #ask-about-llms (3 messages):

Lora Training, Axolotl, philosophical lore-trained companion


Nous Research AI ▷ #interesting-links (1 messages):

Pivotal Token Search, OptiLLM Inference


aider (Paul Gauthier) ▷ #general (52 messages🔥):

Aider Workspaces, Model Overfitting, OpenAI Response API, Cypher Alpha


aider (Paul Gauthier) ▷ #questions-and-tips (28 messages🔥):

Gemini streaming issues, aider task automation, feeding rust docs into aider, context7 tool, aider and make test


Latent Space ▷ #ai-general-chat (75 messages🔥🔥):

Custom UIs, Context Engineering, Multimodal Preference Training, Grammarly Acquires Superhuman, Llama-4 Scores


Eleuther ▷ #general (38 messages🔥):

GPT-4o, Common Pile v0.1 subsets, ICML workshops, Diffusion World Models, OLMO models


Eleuther ▷ #research (32 messages🔥):

Qwen 1.7B diffusion LM, NAACL 2026 cancellation rumors, Immiscible Diffusion, Transition Matching attack, NeurIPS Ethics Reviewers


Eleuther ▷ #interpretability-general (5 messages):

Model Diffing, Crosscoders Hallucinations, SAE Training, Refusal Detection, Interpretability Conference in Boston


GPU MODE ▷ #general (29 messages🔥):

TorchServe deprecation, PyTorch model serving, NVIDIA Dynamo, nvml-tool for fan control, nsys and torch.compile


GPU MODE ▷ #torch (1 messages):

``


GPU MODE ▷ #cool-links (8 messages🔥):

Halide Thesis, Triton Docs, TVM Approach, Halide's Downfall, Image Processing Focus


GPU MODE ▷ #jobs (4 messages):

CUDA Kernels, LLM inference engines, vLLM module, LinearMethodBase, custom_op


GPU MODE ▷ #off-topic (1 messages):

Eth Foundation, Frontier Tower, LinkedIn


GPU MODE ▷ #thunderkittens (9 messages🔥):

Thundermittens Retirement, HazyResearch's ThunderKittens Repo, Broken Blog Links


GPU MODE ▷ #reasoning-gym (1 messages):

Verl, model_dtype parameter, fsdp_config, Qwen2.5


GPU MODE ▷ #general (4 messages):

Beginner Leaderboards Closing, VectorAdd Leaderboard, Releasing polished versions of problems, test, benchmark, profile commands


GPU MODE ▷ #cutlass (2 messages):

Data movement, Warp optimization, Resource management


MCP (Glama) ▷ #general (55 messages🔥🔥):

MCP Server Discovery, Glama Features, Structured vs Unstructured Content in MCP, Atuin MCP server


MCP (Glama) ▷ #showcase (3 messages):

Recipes automation, MCP Workflows, New MCP Updates


Notebook LM ▷ #use-cases (5 messages):

Cognitive Clones, Neurodivergent Minds, NotebookLM Tool


Notebook LM ▷ #general (36 messages🔥):

NotebookLM Free vs Paid, NotebookLM Image Support, NotebookLM Audio Support, NotebookLM Copying Notebooks, NotebookLM Obsidian Import


LlamaIndex ▷ #blog (3 messages):

LlamaIndex Agent Tool, LlamaCloud MCP Server, LlamaExtract


LlamaIndex ▷ #general (12 messages🔥):

Custom Memory Block for HITL Workflow, Google GenAI Integration, AsyncClient Usage, AgentWorkflow subclassing


Cohere ▷ #🧵-general-thread (6 messages):

Cohere Summer School, ReRanker pricing


Cohere ▷ #👋-introduce-yourself (7 messages):

Recommendation Systems, LLM-based Project, Diffusion-LMs, Applied ML, Generative AI


Modular (Mojo 🔥) ▷ #general (8 messages🔥):

GPU puzzles, Mojo and MAX adoption, Modular roadmap


Modular (Mojo 🔥) ▷ #mojo (4 messages):

Stringable Conformance, PythonObject return, Mojo borrow checker


Nomic.ai (GPT4All) ▷ #general (11 messages🔥):

GPT4All Release, Future features for GPT4All, Image generation in LLMs, Brave RAG Search


Manus.im Discord ▷ #general (8 messages🔥):

Let's Defend Soc analysis training, Account feedback function, Issue resolution


DSPy ▷ #general (6 messages):

Audio-Native LLMs, Gemini Live models


AI21 Labs (Jamba) ▷ #general-chat (2 messages):

HON disabled, AI Engineer, LangChain, AutoGen, CrewAI


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (1 messages):

Reinforcement Learning Resources, LLM Fine-tuning for Tool Calling