Frozen AI News archive

not much happened today

**Ilya Sutskever** confirmed his role as CEO of **Safe Superintelligence Inc. (SSI)** with **Daniel Levy** as President, dismissing acquisition rumors and emphasizing their strong team and compute resources. **Perplexity AI** expanded its data integrations by adding **Morningstar's** financial research and hinted at new product features for Pro users. **Meta AI FAIR** clarified its research structure, distinguishing its small lab from larger model training groups, and welcomed **Nat Friedman** to enhance AI product development. **Midjourney** and **Sakana AI** announced hiring for research and applied engineering roles. **Cohere** expanded its presence in Montréal, receiving praise from Canadian officials. On the model front, **Google DeepMind's Gemini Pro** released the **Veo 3** video generation model globally. **DeepSeek** launched the faster **DeepSeek R1T2** model using an Assembly of Experts approach, available under an MIT license. **Kling AI** showcased cinematic video generation capabilities. **OpenAI** introduced a high-cost **Deep Research API** with pricing up to **$30 per call**. **Together AI** announced the release of the **DeepSWE agent**.

Canonical issue URL

a quiet day.

AI News for 7/2/2025-7/3/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (220 channels, and 8382 messages) for you. Estimated reading time saved (at 200wpm): 703 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

We'll also be taking tomorrow off, unless rumors of a Grok 4 release on July 4 come true.


AI Twitter Recap

Company & Leadership News

Model Releases & Research Updates

AI Engineering, Frameworks, & Tooling

Hardware, Infrastructure, & Efficiency

The "Soham Parekh" Affair & Tech Hiring Culture

Broader Implications & Humor


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Kyutai and DeepSWE: New Open-Source AI Model Releases and Benchmarks

2. Running and Experimenting with Large Language Models on Consumer Hardware

3. Local-First AI Applications and Framework Launches

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Emerging Model and TTS/Avatar Technology Announcements

2. AI's Impact on Human Identity, Longevity, and Brain/Mental Health

3. Public Figures, Personas, and Debates around AGI/ASI/Prompt Theory


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Flash Preview

Theme 1. Model Performance, Evaluation, and Capabilities

Theme 2. Hardware and Performance Optimization

Theme 3. AI Development Tools and Ecosystem

Theme 4. Industry Dynamics: Open Source, Companies & Market Shifts

Theme 5. Core AI Research & Concepts


Discord: High level Discord summaries

OpenAI Discord


Cursor Community Discord


Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


OpenRouter (Alex Atallah) Discord


LMArena Discord


HuggingFace Discord


Eleuther Discord


LM Studio Discord


GPU MODE Discord


Nous Research AI Discord


Latent Space Discord


MCP (Glama) Discord


Yannick Kilcher Discord


Notebook LM Discord


Modular (Mojo 🔥) Discord


Cohere Discord


aider (Paul Gauthier) Discord


Nomic.ai (GPT4All) Discord


DSPy Discord


Manus.im Discord Discord


Torchtune Discord


tinygrad (George Hotz) Discord


LLM Agents (Berkeley MOOC) Discord


AI21 Labs (Jamba) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

OpenAI ▷ #ai-discussions (990 messages🔥🔥🔥):

AI model for content creation, AI's Potential and Limitations, Solving Photonic Computing Memory Storage Problem with AI, Interpreting AI Models' Outputs and Hallucinations, Current state of AI image and video generation


OpenAI ▷ #gpt-4-discussions (4 messages):

Channel restarting issues, GPT-4 for learning, GPT-5 release rumors


OpenAI ▷ #prompt-engineering (10 messages🔥):

World Building Instructions, Math Problem Solving with O3, Human-like Memory Storage, Context for World Building


OpenAI ▷ #api-discussions (10 messages🔥):

World Building Folder Instructions, Human-like Memory Storage, O3 Math Problem Challenge, OpenAI Math challenge


Cursor Community ▷ #general (930 messages🔥🔥🔥):

Rate Limits in Cursor, Claude Code vs Cursor, Using Gemini CLI, The Auto Agent in Cursor, Frontend vs Backend


Cursor Community ▷ #background-agents (69 messages🔥🔥):

Cursor Agent Docker Cache Issues, Background Agents and Slack Integration Problems, Background Agents and GitHub Action Monitoring, Background Agent Infrastructure Improvements, Best Use Cases for Background Agents


Cursor Community ▷ #announcements (1 messages):

Cursor 1.2 Release, To-Do Lists in Cursor, PR Search in Cursor, Tab Speed Improvements


Perplexity AI ▷ #general (1187 messages🔥🔥🔥):

ChatGPT Free Tier, Gemini Privacy Policy, O3 Pro Budget, Image Uploads to Perplexity, AI Tool for Image Scraping


Perplexity AI ▷ #sharing (3 messages):

Banana in space, Who is soham parekh, House passes GOP megabill


Perplexity AI ▷ #pplx-api (12 messages🔥):

Sonar models, LinkedIn access, Caching responses


Unsloth AI (Daniel Han) ▷ #general (526 messages🔥🔥🔥):

CUDA cores vs Tensor cores, GGUF models inference, GRPO code update, Gemma3n issues, Unsloth Pro pricing


Unsloth AI (Daniel Han) ▷ #off-topic (7 messages):

Cloud Fees, ChessFish.io, LoRA finetuning, FlashAttention (FA) on T4 GPUs


Unsloth AI (Daniel Han) ▷ #help (544 messages🔥🔥🔥):

Unsloth Sesame CSM-1B notebook errors, Tokenizer issues after adding new tokens, Fine-tuning for translation, Mistral-common tokenization in Unsloth, Vision model error


Unsloth AI (Daniel Han) ▷ #research (16 messages🔥):

Llama 3.1-70B, Psych 101 dataset, Emergent Properties, fMRI scans, Human decisions


OpenRouter (Alex Atallah) ▷ #announcements (6 messages):

Airdrops, Cryptocurrency


OpenRouter (Alex Atallah) ▷ #app-showcase (3 messages):

Roleplay Website, personality.gg, character.ai alternative, janitorai.com alternative


OpenRouter (Alex Atallah) ▷ #general (540 messages🔥🔥🔥):

OpenRouter provider selection, Contribution to OpenRouter, Chutes paywall, OpenRouter Trivia, Gemini 2.5 Pro


LMArena ▷ #general (346 messages🔥🔥):

Google AI strategy, Gemini pricing vs OpenAI, Claude's Context Handling, DeepSeek R2 delay, Grok 4 release


LMArena ▷ #announcements (1 messages):

Image Edit Leaderboard, Community Driven Leaderboard


HuggingFace ▷ #general (104 messages🔥🔥):

Inference Bug, HF's MCP server to Claude Desktop on Windows, Azure Text-to-Speech, OpenAI's whisper large v3 turbo, Synthetic data creation


HuggingFace ▷ #cool-finds (3 messages):

HuggingFace Server, Piracy


HuggingFace ▷ #i-made-this (30 messages🔥):

Rust AI library, HuggingChat alternative, LLMs speak structured data, Godtier Prompts


HuggingFace ▷ #NLP (16 messages🔥):

Cross Encoders, Asymmetric vs Symmetric Semantic Search, Thresholding with Cross Encoders, Bi-encoders and Cross-encoders


HuggingFace ▷ #agents-course (16 messages🔥):

Hugging Face Inference Endpoints, Public Inference Endpoints, Generative AI Article, Unit 1 Course Certificate, Smolagents CodeAgent


Eleuther ▷ #general (17 messages🔥):

Open Research Hackathon, Conference Travel Funding, Independent Research Mentoring


Eleuther ▷ #research (93 messages🔥🔥):

Open Research Hackathon, 1-layer transformer, KV caching, TinyStories paper, llama.cpp


Eleuther ▷ #interpretability-general (1 messages):

Open Research Hackathon, Community research projects


Eleuther ▷ #lm-thunderdome (21 messages🔥):

lm-evaluation-harness standardization, lm_eval init optimization, task discoverability, gpqa benchmark details, Optimizing lm_eval startup time


Eleuther ▷ #multimodal-general (3 messages):

Kaiming's talk, Mean flow matching


LM Studio ▷ #general (65 messages🔥🔥):

VPN setup for LM Studio, Serving LLMs without LM Studio UI, Trusting Hugging Face models, Running LM Studio headless, AnythingLLM mobile app


LM Studio ▷ #hardware-discussion (36 messages🔥):

GPU driver update, Shared VRAM, AMD vs Nvidia, Run 24B param on RTX 4080, Run LLAMA 3.3 70B


GPU MODE ▷ #general (18 messages🔥):

Industrial PhD in Denmark, SWE vs MLE role, Work-life balance in Europe, Pursuing CUDA, Perfectionist mindset


GPU MODE ▷ #triton (14 messages🔥):

Torch Compile, Autotuning, PTX, SASS, CUDA


GPU MODE ▷ #cuda (6 messages):

Kernel Benchmarking for LLM Inference, Warm-up Iterations for Kernel Benchmarking, Compiler Explorer's NVCC Support Delay, PTX Instruction Availability


GPU MODE ▷ #cool-links (1 messages):

simon_57893: https://semianalysis.com/2025/07/03/deepseek-debrief-128-days-later/


GPU MODE ▷ #beginner (16 messages🔥):

Oldest GPU for beginners, Compute vs Memory bound kernel, Renting GPU time, Second hand RTX 3060


GPU MODE ▷ #torchao (4 messages):

torch.distributed.checkpoint.StateDictOptions, Sharded Parameters, Dtensor


GPU MODE ▷ #rocm (6 messages):

Register lifetime, Avoiding register spills, Inline ASM, Kernel hacking, Compiler optimization


GPU MODE ▷ #self-promotion (1 messages):

CuTeDSL, WGMMA, TMA, Hopper Architecture, TV-Layouts


GPU MODE ▷ #🍿 (1 messages):

Project Popcorn, Weights&Biases conference, PyTorch PM


GPU MODE ▷ #gpu模式 (1 messages):

leung3035: 作为杭州人,很负责的告诉你:杭州玩的地方蛮多,但是,吃就算了,简直就是美食荒漠。


GPU MODE ▷ #general-leaderboard (1 messages):

Handling Missing Data, Replacing Zero Values with Mean, Dropping Rows, Handling missing data


GPU MODE ▷ #submissions (1 messages):

Leaderboard Submission, A100 performance


GPU MODE ▷ #factorio-learning-env (3 messages):

Factorio Client Desync Logs, Github review interface


GPU MODE ▷ #amd-competition (1 messages):

MI300 Access, Competition Leaderboard Resources


GPU MODE ▷ #cutlass (9 messages🔥):

Cutlass Analytical Cost Model, GEMM Kernels, cuBLASLt Heuristics, Claude Code CLI for Cutlass, PyTorch Autotuner Model


GPU MODE ▷ #singularity-systems (2 messages):

c\/cuda c compiler, codegen, instruction selection, instruction scheduling, register allocation


Nous Research AI ▷ #general (46 messages🔥):

Open Source Industry Dying, Nous Research's Open Source Commitment, Meta's Open Source Future, Rejection Sampling definition, Llama 4 Failure


Nous Research AI ▷ #research-papers (5 messages):

Independent Research Mentoring, Reproducing Research Results


Nous Research AI ▷ #interesting-links (4 messages):

Symbolic Intelligence architecture, AREU Codex framework, Interpretability and alignment, Narrative Destabilization


Nous Research AI ▷ #research-papers (5 messages):

Independent Research Mentoring, Reproducing Research Results


Latent Space ▷ #ai-general-chat (47 messages🔥):

Anthropic Experimental APIs, Microsoft Layoffs, DeepSWE RL Agent, Chamath Palihapitiya & Tobi Lütke on AI, GPT for summarizing news


MCP (Glama) ▷ #general (44 messages🔥):

MCP as the Application, MCP servers, Resources and Prompts in MCP, Connecting to MCP servers, Remote MCP server issue


MCP (Glama) ▷ #showcase (2 messages):

Hypermode Agents Bootcamp, Agent Sandboxing Marketplace


Yannick Kilcher ▷ #general (40 messages🔥):

LSTM comeback, Universal Function Approximators, Semantic Search with Cross Encoders, Diffusion-based VLMs, Tokenizer Rebalancing


Yannick Kilcher ▷ #paper-discussion (3 messages):

Linear Transformers, Delta Rule, RWKV Optimization


Yannick Kilcher ▷ #ml-news (2 messages):

The Atlantic, Eleven Labs


Notebook LM ▷ #use-cases (14 messages🔥):

NotebookLM setup, Readwise style workflow, NotebookLM audio overview function, interactive PDF mind maps


Notebook LM ▷ #general (15 messages🔥):

Edit capability request, NotebookLM access issues, Combine Notebooks, Family Plan Limits, Latex rendering


Modular (Mojo 🔥) ▷ #general (7 messages):

Modular Customers, Native Network Programming, GPU HTTP server


Modular (Mojo 🔥) ▷ #mojo (17 messages🔥):

Dependent Type Systems in Mojo, NumPy Array Conversion to LayoutTensor, ExtraMojo Package for I/O, Mojo Compiler Hanging Issue, UnsafePointer.alloc Alignment


Modular (Mojo 🔥) ▷ #max (4 messages):

Modular Max Offline Inference, Quantization Encoding, Apple MLX Support


Cohere ▷ #🧵-general-thread (11 messages🔥):

ML Summer School Channel, Cohere Labs Open Weights, AYA Vision Models, ML Summer School Recordings


Cohere ▷ #🔌-api-discussions (4 messages):

Cohere Embedding Model, Trial key, Rate Limits, Production Keys, Monthly Limits


Cohere ▷ #👋-introduce-yourself (7 messages):

Cohere Summer School, New member introductions, Support channels, Community Discord Server


aider (Paul Gauthier) ▷ #general (9 messages🔥):

Claude overloaded, Polyglot benchmark speed, Gemini-cli performance, API token rate limits


aider (Paul Gauthier) ▷ #questions-and-tips (5 messages):

Local Model Performance, Aider --no-always option, Switching Model Edit Formats


aider (Paul Gauthier) ▷ #links (1 messages):

claude-code-api, api/providers


Nomic.ai (GPT4All) ▷ #general (10 messages🔥):

Llama 3, Android local LLMs, Multimind SDK, AI News Sources, r/LocalLLaMA


DSPy ▷ #general (8 messages🔥):

DSPy module creation, LLM-RAG-Agent with DSPy, Recipes for starting with little to no data, dspy.Tool and dspy.ToolCalls vs OpenAI functions/tools, Weaviate vectordb multi tenancy fix


Manus.im Discord ▷ #general (6 messages):

Usage visibility, Video generation, Manus down, Big update


Torchtune ▷ #dev (3 messages):

Generic Tokenizer Parity, HF Tokenizer Parity, Special Tokens, Chat Templates


tinygrad (George Hotz) ▷ #general (2 messages):

Tensor.stack Tuple Support, SDPA Enable GQA


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (1 messages):

Securing OpenAI API Keys, Tracking API Usage, Multi-Service Key Access