Frozen AI News archive

OpenAI releases Deep Research API (o3/o4-mini)

**OpenAI** has launched the **Deep Research API** featuring powerful models **o3-deep-research** and **o4-mini-deep-research** with native support for MCP, Search, and Code Interpreter, enabling advanced agent capabilities including multi-agent setups. **Google** released **Gemma 3n**, a multimodal model optimized for edge devices with only 3GB RAM, achieving a top score of 1300 on LMSys Arena, featuring the new MatFormer architecture and broad ecosystem integration. **Black Forest Labs** introduced **FLUX.1 Kontext [dev]**, a 12B parameter rectified flow transformer for instruction-based image editing, comparable to **GPT-4o**. **DeepMind** unveiled **AlphaGenome**, an AI model capable of reading 1 million DNA bases for gene function prediction, marking a breakthrough in AI biology. **Sakana AI** presented Reinforcement-Learned Teachers (RLTs) to enhance LLM reasoning, achieving 86.1% on MiniF2F with efficient compute. **Higgsfield AI** released **Higgsfield Soul**, a high-aesthetic photo model with 50+ presets for fashion-grade realism. Additionally, **Google** launched the **Gemini CLI**, an open-source AI agent for terminal use with free Gemini 2.5 Pro requests.

Canonical issue URL

Deep Research is all you need.

AI News for 6/25/2025-6/26/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (220 channels, and 5509 messages) for you. Estimated reading time saved (at 200wpm): 472 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

While Google had announced their intentions to release a Deep Research API, it seems OpenAI has chosen today to scoop them by actually releasing their Deep Research API in a relatively lowkey announcement:

We will not mince words - o3-deep-research and o4-mini-deep-research are probably the most powerful LLMs for powering agents in the world right now. This is thanks to the native support for MCP, Search and Code Interpreter which are 3 of the Big 5 LLM OS primitives.

Apart from the new webhook modality, you should not miss the cookbooks released today:


AI Twitter Recap

Model Releases & Updates

Tooling, Frameworks, and Infrastructure

Company & Industry News

Research, Techniques, and Commentary

Broader Implications

Humor & Memes


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Gemma 3n Model Launch and Community Tooling

2. Latest Open Weights and Reasoning Model Releases

3. DeepSeek R2 Launch Delays and Market Constraints

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/aivideo, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Major AI Company Leadership Moves and Open-Source Model Hype

2. Higgsfield Soul and Flux Models: Hyperrealistic AI Image Generation

3. Anthropic's Jack Clark and AI Regulation Discourse


AI Discord Recap

A summary of Summaries of Summaries by chatgpt-4o-latest

1. OpenRouter's Funding and Tooling Expansion

2. DSPy's Ruby Port and Language Expansion

3. AI-Generated GPU Programming and Mirage Launch

4. Gemini CLI and Agentic IDEs Catch Heat

5. Growing Tooling Ecosystem: Doppl, Deep Research API, and Dopamine Boosts


Discord: High level Discord summaries

Perplexity AI Discord


OpenAI Discord


Unsloth AI (Daniel Han) Discord


Cursor Community Discord


LM Studio Discord


GPU MODE Discord


LMArena Discord


OpenRouter (Alex Atallah) Discord


HuggingFace Discord


Yannick Kilcher Discord


aider (Paul Gauthier) Discord


tinygrad (George Hotz) Discord


Latent Space Discord


DSPy Discord


Eleuther Discord


LlamaIndex Discord


Modular (Mojo 🔥) Discord


Notebook LM Discord


MLOps @Chipro Discord


Manus.im Discord Discord


Torchtune Discord


MCP (Glama) Discord


Cohere Discord


Nomic.ai (GPT4All) Discord


Gorilla LLM (Berkeley Function Calling) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1121 messages🔥🔥🔥):

Android vs iPhone, Perplexity AI pricing/plans, Iphone cooling, GPT-5, Doppl app


Perplexity AI ▷ #sharing (7 messages):

HSK 2.0 vs 3.0, Ta She, Guan Yu, Killing Games, Deepseek


Perplexity AI ▷ #pplx-api (3 messages):

Sonar Deep Research Documentation, Credits pending


OpenAI ▷ #annnouncements (1 messages):

OpenAI DevDay 2025, San Francisco Event, Livestreamed Keynote, Hands-on Building, New Models and Tools


OpenAI ▷ #ai-discussions (943 messages🔥🔥🔥):

NotaGen release, Codex rate limits, BS detector benchmark, Gödel’s incompleteness theorem, Minimax benchmark


OpenAI ▷ #gpt-4-discussions (3 messages):

ChatGPT business plan PDF issues, AI Proper Usage Learning


OpenAI ▷ #prompt-engineering (32 messages🔥):

3D to 2D texture conversion, Tileable textures, Kaleidoscopic reflection, Python for seamless tiling


OpenAI ▷ #api-discussions (32 messages🔥):

3D to 2D texture, Kaleidoscopic reflection, Tileable textures with Python, Non-tileable textures


Unsloth AI (Daniel Han) ▷ #general (590 messages🔥🔥🔥):

Local LLM Security, Tool Creation with Claude, Copilot vs Cline, Unsloth and Gemma, GGUF Conversion


Unsloth AI (Daniel Han) ▷ #off-topic (4 messages):

Electrolyte Labs Job Postings, AI-Generated Introductory Video, Open-Source Model Inquiry


Unsloth AI (Daniel Han) ▷ #help (155 messages🔥🔥):

Gemma3 .gguf saving issues, LLM output issues, Unsloth Mistral Small 3.2 quants, Qwen3 image vision, SSML finetunned models


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

``


Unsloth AI (Daniel Han) ▷ #research (2 messages):

YouTube video, arXiv paper


Cursor Community ▷ #general (524 messages🔥🔥🔥):

Gemini CLI, Claude Code, Rate Limits, Cursor Pricing, MCP Errors


Cursor Community ▷ #background-agents (47 messages🔥):

Background Agent Connection Errors, Background Agent Network Security, Python 3.11 Setup, Environment.json schema URL, Background Agent Token Limits


LM Studio ▷ #general (226 messages🔥🔥):

Downgrading LM Studio, Cybersecurity LLMs, LM Studio Context Length Limits, Local LLM Hosting for Friends, LM Studio MCP setup


LM Studio ▷ #hardware-discussion (97 messages🔥🔥):

GPU zip tie mounting, DDR5 Memory Temp Reporting, Deepseek 671B vs 70B Model Speed, Motherboards bolted to wooden boards, Open bench PC fire safety


GPU MODE ▷ #general (28 messages🔥):

GCC as build system, Bazel, CMake


GPU MODE ▷ #triton (7 messages):

Triton Community Meetup, LinearLayout usage change, Gluon update, Nightly performance regression suite, Triton developer's summit update


GPU MODE ▷ #cuda (5 messages):

CUDA barrier parity parameter, ABA problem in circular buffers, Tensor Cores Usage


GPU MODE ▷ #torch (1 messages):

CUDA graphs blocking execution, SGL updates, Kernel execution


GPU MODE ▷ #rocm (28 messages🔥):

HIP on Debian Nvidia, Building HIP from source, ROCm Clang necessity, HIP cross-platform support


GPU MODE ▷ #self-promotion (1 messages):

CuTeDSL, SGEMM, Ampere architecture


GPU MODE ▷ #🍿 (6 messages):

Mirage compiler, GPU kernels, Kernel generation using LLMs, Benchmarking tools


GPU MODE ▷ #general (1 messages):

FP32 vs Tensor Cores, Nvidia hardware optimization


GPU MODE ▷ #submissions (72 messages🔥🔥):

vectorsum leaderboard, vectoradd leaderboard, sort leaderboard, trimul leaderboard, H100 performance


GPU MODE ▷ #factorio-learning-env (81 messages🔥🔥):

LuaSurface vs LuaPlayer, Mining Drill API Comparison, Test Environment Issues, Manual Data Collection, Teleportation


GPU MODE ▷ #mojo (1 messages):

GPU Hackathon, GPU Programming Workshop


LMArena ▷ #general (211 messages🔥🔥):

GEM CLI first impressions, Copyright impact on AI training, Undepressing Gemini, Adding a changelog channel, GPT-5 Release Prediction


OpenRouter (Alex Atallah) ▷ #announcements (3 messages):

Database Downtime, Frontend Authentication Outage, Presets Launch, LLM Configuration Management


OpenRouter (Alex Atallah) ▷ #general (199 messages🔥🔥):

OpenRouter raise, Gemini roasting, Clerk outage, Free Mistral version


OpenRouter (Alex Atallah) ▷ #new-models (1 messages):

Readybot.io: OpenRouter - New Models


HuggingFace ▷ #general (101 messages🔥🔥):

Llama 3.1 8B, Macbook M1/M2 Performance, Groq LPU, AI Agents, Model Context Protocol


HuggingFace ▷ #today-im-learning (4 messages):

RAG resources, Time Titans paper


HuggingFace ▷ #cool-finds (1 messages):

SAM, Segment Anything, Model Import


HuggingFace ▷ #i-made-this (32 messages🔥):

Fine-tuning model for scientific research, Huggingface File System Explorer, Streaming for local LLM Rust Crate, Native French Q&A Dataset, Command Line Web Browser


HuggingFace ▷ #NLP (1 messages):

User Profile Similarity with Opinion Pieces, Embedding Strategies for User Data, Cosine Similarity for Opinion Alignment


HuggingFace ▷ #smol-course (1 messages):

maik0z: Hey did you find it ?


HuggingFace ▷ #agents-course (16 messages🔥):

DuckDuckGoSearchException, AI Agent Course Certification Deadline, Accessing models, Hugging Face Introductions, Deprecated Langchain Issues


Yannick Kilcher ▷ #general (39 messages🔥):

Fair Use in AI Training, Claude 4 Spiritual Bliss Attractor, Anthropic LLM Welfare Team, Common Crawl Handling


Yannick Kilcher ▷ #paper-discussion (53 messages🔥):

Your Brain on ChatGPT EEG Findings, Deepseek V3 and R1 models, RWKV G Gate, BNPO and Dr.GRPO


Yannick Kilcher ▷ #ml-news (3 messages):

Yuchenj_UW tweet, Zuckerberg, Deepseek R2 launch


aider (Paul Gauthier) ▷ #general (44 messages🔥):

Local AI coding setup, Gemini CLI, ASI, Aider timeout


aider (Paul Gauthier) ▷ #questions-and-tips (15 messages🔥):

VRAM limitations with long contexts, Qwen3 model performance on different GPUs, Piping command output into aider, Killing aider process after idle timeout


tinygrad (George Hotz) ▷ #general (4 messages):

tinygrad PR closed, debugging tinygrad, detect and warn tinygrad


tinygrad (George Hotz) ▷ #learn-tinygrad (46 messages🔥):

WebGPU Stable Diffusion on Windows, ShaderF16 Feature, DXC Compiler and F16 Support, WebGPU Backends, Realtime Diffusion in Browser


Latent Space ▷ #ai-general-chat (38 messages🔥):

OpenRouter Funding, Foundation Model Report 2025, BFL Kontext Weights Released, OpenAI API Deep Research & Webhooks, Google Doppl AI Fashion App


DSPy ▷ #general (23 messages🔥):

DSPy in Ruby, Desiru Project, Naming Conventions for DSPy Ports, Persistence Layer in Desiru, Async Background Processing in Desiru


Eleuther ▷ #general (8 messages🔥):

HeroDevs Sustainability Fund, OG Ersatz, Consciousness emerging property


Eleuther ▷ #research (9 messages🔥):

Jianlin Su's Weight Decay, Sigma Reparam, SVD Approximation, Power Iteration Complexity


Eleuther ▷ #interpretability-general (2 messages):

Order Statistics Learning in Models, Frequency Bias Mitigation Techniques


Eleuther ▷ #lm-thunderdome (3 messages):

Codex, TyDiQA, lm-evaluation-harness


LlamaIndex ▷ #blog (4 messages):

Zoom RTMS, AI agent, Observability Tools, Klavis AI's MCP servers


LlamaIndex ▷ #general (16 messages🔥):

Azure OpenAI Responses API, LlamaIndex Docs Sync Script, Agent Workflow and React Agent


Modular (Mojo 🔥) ▷ #general (1 messages):

shalokshalom: Its at 48:42


Modular (Mojo 🔥) ▷ #announcements (1 messages):

Modular Hack Weekend, GPU Programming Workshop, NVIDIA Sponsorship, Lambda Compute Credits


Modular (Mojo 🔥) ▷ #mojo (10 messages🔥):

InlineArray move semantics, VariadicPack.each() removal, CNN model in Mojo using LayoutTensor


Modular (Mojo 🔥) ▷ #max (7 messages):

TS deprecated, ONNX support message, ONNX, TorchScript deprecated


Notebook LM ▷ #use-cases (1 messages):

Notebook LM for Customer Discovery, Pattern Recognition in Customer Conversations, Reliance on AI in Hypothesis Validation


Notebook LM ▷ #general (18 messages🔥):

Multilingual Notebook LM, Notebook LM page limits, PDF Format preference, Notebook LM model details, Nested wrapper issue


MLOps @Chipro ▷ #events (1 messages):

EnrichMCP, Agents connecting to data, Webinar, Data access, ML Engineers


MLOps @Chipro ▷ #general-ml (12 messages🔥):

Game-playing Bot API, Game State Capture, RL-based game playing bot, Git repositories for projects


Manus.im Discord ▷ #general (13 messages🔥):

Premium Account Sharing, Quality Agent Launch, Comic Actor Alvaro Vitali Death, Manus Browser Issues


Torchtune ▷ #general (1 messages):

dizzy7948: yeah will do, hope i can contribute some time


Torchtune ▷ #dev (8 messages🔥):

Liger CE PR, memory increase after set self.mask_ignored_tokens = False, packed and seq_len=4096, iterable dataset + on the fly packing + dataset logging


Torchtune ▷ #papers (3 messages):

Tiled MLP, Chunked CE Loss, Sequence Parallelism, Tensor Parallelism, Ring Attention


MCP (Glama) ▷ #general (5 messages):

Hugging Face Authentication, Reddit Moderators, PlayMCP browser


MCP (Glama) ▷ #showcase (1 messages):

Rust Docs MCP Server, Agent Hallucination


Cohere ▷ #🧵-general-thread (1 messages):

Cohere Newcomers, Cohere Support Channels, Cohere Labs


Cohere ▷ #📣-announcements (1 messages):

AWS, Pinecone, agentic applications, financial semantic search


Cohere ▷ #👋-introduce-yourself (3 messages):

NLP, Animal Linguistics, Deep Learning


Nomic.ai (GPT4All) ▷ #general (5 messages):

Qt requirement in GPT4All, Microsoft 1.58B 2B4T model, LM Studio vs GPT4All, GPT4all outdated


Gorilla LLM (Berkeley Function Calling) ▷ #leaderboard (2 messages):

Leaderboard Inclusion, LLM Evaluation with Thinking Mode