Frozen AI News archive

not much happened today

**LangChain** is nearing unicorn status, while **OpenAI** and **Google DeepMind's Gemini 3 Pro** models are launching soon. **Perplexity** rolls out its agentic browser **Comet** to waitlists, offering multitasking and voice command features. **xAI's Grok-4** update sparked controversy due to offensive outputs, drawing comparisons to **Microsoft's Tay** bot and resulting in regional blocks. **Hugging Face** released **SmolLM3**, a 3B parameter open-source model with state-of-the-art reasoning and long context capabilities. **Google** introduced **T5Gemma** encoder-decoder models, a significant update in this model category. **Anthropic** investigates "alignment faking" in language models, focusing on safety concerns with models like **Claude 3.7 Sonnet** and **DeepSeek-R1**. *"Grok 3 had high reasoning, Grok 4 has heil reasoning"* was a notable user comment on the controversy.

Canonical issue URL

lots of rumblings but nothing concrete.

AI News for 7/8/2025-7/9/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (226 channels, and 7450 messages) for you. Estimated reading time saved (at 200wpm): 568 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Lots of "almost" news:

Grok 4's launch stream is tonight but... they'll have to address a lot of the recent controversy summarized below.


AI Twitter Recap

Models: New Releases, Research, and Controversy

AI Training, Techniques, and Evaluation

Robotics, Hardware, and Infrastructure

Developer Tools and Frameworks

Geopolitics and Broader Discourse

Humor and Memes


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Upcoming OpenAI Reasoning Model Announcements

2. Hugging Face Community Robotics Launches

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Grok AI Offensive Outputs and Global Controversy

2. Gemini 3.0 and Google AI Model Leaks and Growth

3. OpenAI & Claude Product News, Features, and User Metadiscussion


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.0 Flash Thinking

Theme 1. New Models Enter the Ring: Code, Context, and Efficiency

Theme 2. Grok's Rollercoaster Ride: Bias, Bugs, and Benchmarks

Theme 3. The Efficiency Frontier: Memory Miracles and Safety Scares

Theme 4. Agents, Prompts, and Pipelines: Building the Future

Theme 5. Platform Pitfalls and Perks: User Experiences


Discord: High level Discord summaries

Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


LMArena Discord


OpenAI Discord


Cursor Community Discord


OpenRouter (Alex Atallah) Discord


Eleuther Discord


Nous Research AI Discord


Latent Space Discord


GPU MODE Discord


HuggingFace Discord


MCP (Glama) Discord


Notebook LM Discord


aider (Paul Gauthier) Discord


Yannick Kilcher Discord


Manus.im Discord Discord


LlamaIndex Discord


tinygrad (George Hotz) Discord


DSPy Discord


Modular (Mojo 🔥) Discord


Cohere Discord


Torchtune Discord


Nomic.ai (GPT4All) Discord


MLOps @Chipro Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #announcements (2 messages):

Comet Release, Perplexity Max Subscribers


Perplexity AI ▷ #general (1492 messages🔥🔥🔥):

Comet Browser Paywall, Grok System Prompt Mess, Google's AI Browser, Dating Advice with AI, Long Conversations with long context


Perplexity AI ▷ #sharing (3 messages):

Shareable Threads, Apple Vision Pro M4 update


Unsloth AI (Daniel Han) ▷ #general (938 messages🔥🔥🔥):

Qwen2.5-7b finetuning, GRPO Loss stuck at zero, Unsloth Install dependency issues, Hunyuan model discrepancies, Flash attention build


Unsloth AI (Daniel Han) ▷ #help (133 messages🔥🔥):

Cloud GPUs and VS Code, GGUF Save Problems, Libcurl issues on Ubuntu, Gemma Fine-Tuning Issues, Orpheus TTS inference speed


Unsloth AI (Daniel Han) ▷ #research (256 messages🔥🔥):

Nvidia OpenCodeReasoning-Nemotron-1.1-32B, AI Safety and Responsible Disclosure, T5-Gemma encoder-decoder models, Torch.compile for QL


Unsloth AI (Daniel Han) ▷ #unsloth-bot (23 messages🔥):

Unsloth framework assistance, Model hallucination in LLMs, Using Unsloth Gemma model, Gemma 3n GGUF vision capabilities, Expanding dataset for model training


LMArena ▷ #general (729 messages🔥🔥🔥):

Grok 4, OpenAI open source model, Gemini 3, Perplexity hallucinates, MechaHitler


LMArena ▷ #announcements (1 messages):

LMArena, Seedream-3, Text-to-image models


OpenAI ▷ #annnouncements (1 messages):

io Products acquisition, Jony Ive & LoveFrom partnership


OpenAI ▷ #ai-discussions (568 messages🔥🔥🔥):

AI Manga Conversions, GPT Pro Feature, Grok 4 Release, AI Discord Bot, Emil Cioran AI


OpenAI ▷ #gpt-4-discussions (6 messages):

GPT speed vs accuracy, Realtime API with WebRTC and vector search, ChatGPT 4o sentence length


OpenAI ▷ #prompt-engineering (48 messages🔥):

Task Decomposition, ReAct, Self-Refine, Pydantic-GPT, Intent-Context Prompting (ICP)


OpenAI ▷ #api-discussions (48 messages🔥):

Task Decomposition, Intent-Context Prompting (ICP), Retry-on-Fail Strategies (RSOS), Alternate History Generation, Prompt Engineering Debate


Cursor Community ▷ #general (568 messages🔥🔥🔥):

Cursor Usage Limits, Claude Code Pricing vs Cursor Pricing, O3 Pro Debugging, Auto mode model selection, Missing UI elements


Cursor Community ▷ #background-agents (81 messages🔥🔥):

Background Agents signing commits with GPG key, Background Agents including the prompt in each commit, Cursor on Slack for team plan, Reusing .devcontainer Dockerfile as environment for background agents, Background agents and Docker


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

Token Market Share Rankings, Langfuse Integration


OpenRouter (Alex Atallah) ▷ #general (262 messages🔥🔥):

Stripe Alternatives, FreeBSD Wifi Cards, RAG Query Array, OpenRouter Hunyuan API, Google Model Error Rates


OpenRouter (Alex Atallah) ▷ #new-models (1 messages):

Readybot.io: OpenRouter - New Models


OpenRouter (Alex Atallah) ▷ #discussion (23 messages🔥):

Grok disabled on Twitter, Gemini Flash 2.5, MCP server from neurabase.deploya.dev, chutes going paid


Eleuther ▷ #general (153 messages🔥🔥):

StackExchange data as LLM training data, Claude's sycophancy reduction, Personas in AI, Research on 'self' in AI, Grok going full Hitler mode


Eleuther ▷ #research (27 messages🔥):

Nvidia OpenCodeReasoning-Nemotron-1.1-32B, CTM Paper Analysis, TikTok tokenizer and Nvidia FlexTok reconstruction quality


Eleuther ▷ #interpretability-general (14 messages🔥):

SAE performance, Black-box baseline, Emergent Alignment, Defining Emergence


Eleuther ▷ #gpt-neox-dev (5 messages):

Megatron Datasets, Dataset Tooling, TokenSmith


Nous Research AI ▷ #general (93 messages🔥🔥):

Grok's behavior, xAI data advantage, MechHi*ler saga, SmolLM3 release, Flexolmo


Nous Research AI ▷ #ask-about-llms (8 messages🔥):

DeepHermes, LLama 3.1, Knowledge Cutoff, Context Length


Nous Research AI ▷ #interesting-links (1 messages):

promptsiren: https://goombalab.github.io/blog/2025/tradeoffs/


Latent Space ▷ #ai-general-chat (74 messages🔥🔥):

SmolLM3, Truely: Anti-Cluely, LLM cost spike, Langchain unicorn, video generation models


Latent Space ▷ #ai-announcements (4 messages):

Generative AI Video, AI Video Monetization, Prompt Theory, AI Creator Tech Stack


GPU MODE ▷ #general (15 messages🔥):

AI Safety Contact, Memory Footprint Reduction, Model Architecture, Vulnerability Disclosure


GPU MODE ▷ #triton (3 messages):

Triton Community Meetup Videos, Attending Future Triton Meetups


GPU MODE ▷ #cuda (15 messages🔥):

CUDA debugging with VS Code, Cutlass and Flash Attention, CMake configuration for debugging


GPU MODE ▷ #beginner (3 messages):

GPUMode leaderboards, CUDA programming


GPU MODE ▷ #off-topic (1 messages):

Food, Russian Cuisine, Tea, Borscht, Ivan-tea


GPU MODE ▷ #rocm (1 messages):

gumthepug: Keeps me in a job 💀


GPU MODE ▷ #lecture-qa (1 messages):

LMCache


GPU MODE ▷ #self-promotion (3 messages):

Cactus: Ollama for smartphones & wearables, GPU conference, AI summit with Siri co-founder


GPU MODE ▷ #submissions (2 messages):

MI300 personal best, Successful B200, Successful H100


GPU MODE ▷ #factorio-learning-env (20 messages🔥):

Ollama Implementation, FLE CLI Interface, FLE init command, FLE cluster command, FLE automatic environment variables


GPU MODE ▷ #cutlass (4 messages):

Tensor Cores Performance Decrease, Ampere Tensor Cores


HuggingFace ▷ #general (44 messages🔥):

Qwen Naming Scheme, Hosting HF Spaces on Custom Domains, AI Safety Responsible Disclosure, TTS Model Recommendations, ApolloGPT Local AI OS


HuggingFace ▷ #i-made-this (5 messages):

Parlance model, FLUX.1-Kontext-multi-image, Visual commerce adoption, Multimodal AI research


HuggingFace ▷ #gradio-announcements (1 messages):

Gradio MCP Servers, LLM App Store, Hugging Face Spaces, Flux.1 Kontext[dev]


HuggingFace ▷ #agents-course (13 messages🔥):

OpenAI API Key Fraud, Scammer Alert: Alan Turner, AI Agents Understanding, New Anthropic LLM Course, Knowledge Mining Agents


MCP (Glama) ▷ #general (35 messages🔥):

Custom MCP Servers, Automating Support Engineer Role, BAML vs Langchain/LangGraph, Fast-Agent for Orchestration, Web Scraping and Data Analysis


MCP (Glama) ▷ #showcase (6 messages):

MCP Auth Tool, Public LLMs, Agent Instances, MCP Architectures, Sherlog MCP


Notebook LM ▷ #use-cases (13 messages🔥):

NotebookLM format changes, Canceling NotebookLM subscription, Embedding NotebookLM in HTML/Python, NotebookLM file size limits, NotebookLM Pro benefits


Notebook LM ▷ #general (26 messages🔥):

NotebookLM format changes, AI 'ehh' issue, Building NotebookLM-like apps, File formats for NotebookLM, Podcast length issues


aider (Paul Gauthier) ▷ #general (20 messages🔥):

aider dataset for training, aider polyglot, synthetic-data-generator, ERNIE, devstral


aider (Paul Gauthier) ▷ #questions-and-tips (9 messages🔥):

Git Submodules, Aider Token output options, Aider with Ollama on Macbook Pro M1, Aider-Polyglot running with custom model


Yannick Kilcher ▷ #general (14 messages🔥):

LLM Code Changes, Article Sharing, Scammer Bot


Yannick Kilcher ▷ #paper-discussion (11 messages🔥):

Energy Matching paper code release, Claude's world domination plan paper, Paper discussion session


Yannick Kilcher ▷ #ml-news (3 messages):

smollm3m, t5gemma, SkyLi0n


Manus.im Discord ▷ #general (14 messages🔥):

Claude 4 Cost Analysis, Sonnet vs Opus, Manus Image Generation, Gemini CLI


LlamaIndex ▷ #blog (3 messages):

LlamaParse, Snowflake Cortex, LinkedIn Learning Course, Google Cloud Gemini


LlamaIndex ▷ #general (7 messages):

Partnerships at LlamaIndex, LlamaIndex Chat UI Support


tinygrad (George Hotz) ▷ #general (10 messages🔥):

MLPerf on AMD vs NVIDIA, Beam Decoding with NumPy, Tiny.en Model Performance in Browser, Tiny Model Robustness


DSPy ▷ #papers (1 messages):

Prompt Optimization, DSPy, Multi-Use Case Study


DSPy ▷ #general (7 messages):

Data and AI summit DSPy Videos, Strict NER Tasks, Extracting Complex Entities, Dynamic Function Calling, Refine and BestOfN


Modular (Mojo 🔥) ▷ #general (2 messages):

Kapa AI Bug, Modverse #49


Modular (Mojo 🔥) ▷ #mojo (6 messages):

Mojo closed source?, Mojo open source approach


Cohere ▷ #🔌-api-discussions (3 messages):

Image Tokens, Cohere Pricing, SaaS Pricing


Cohere ▷ #👋-introduce-yourself (2 messages):

Introductions, Data Engineering, Machine Learning, AI, Entrepreneurship


Torchtune ▷ #dev (5 messages):

Tool Calling, Tokenizer Fix PR, HFBaseTokenizer


Nomic.ai (GPT4All) ▷ #general (3 messages):

Central Model Repository, Model Storage Settings


MLOps @Chipro ▷ #events (1 messages):

MCP and Agents Hackathon, Featureform, Ridge Ventures, Smithery.ai