Frozen AI News archive

not much happened today

**OpenAI** released its first open models since GPT-2, **gpt-oss-120b** and **gpt-oss-20b**, which quickly trended on **Hugging Face**. **Microsoft** supports these models via **Azure AI Foundry** and **Windows Foundry Local**. Key architectural innovations include **sliding window attention**, **mixture of experts (MoE)**, a **RoPE variant**, and a **256k context length**. The models use a new **MXFP4** format supported by **llama.cpp**. Hypotheses suggest **gpt-oss** was trained on **synthetic data** to enhance safety and performance, supporting the **Reasoning Core Hypothesis**. **OpenAI** announced a **$500K bounty** for red teaming with partners including **Anthropic**, **Google**, and the **UK AISI**. Performance critiques highlight inconsistent benchmarking results, with **GPT-OSS-120B** scoring **41.8%** on the **Aider Polyglot** coding benchmark, trailing competitors like **Kimi-K2** and **DeepSeek-R1**. Some users note the model excels in math and reasoning but lacks common sense and practical utility.

Canonical issue URL

a calm before the storm.

AI News for 8/5/2025-8/6/2025. We checked 12 subreddits, 544 Twitters and 29 Discords (227 channels, and 8597 messages) for you. Estimated reading time saved (at 200wpm): 830 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Tune in to the OpenAI livestream at 10am PT tomorrow.

Meanwhile you can tune in to today's pod about how the press gets leaks and covers major AI startups.


AI Twitter Recap

OpenAI's GPT-OSS Release & Architecture

GPT-OSS Performance, Benchmarks, and Criticism

Google's Genie 3 and Other AI Advances

Agent Tooling, Development, and Frameworks

Infrastructure, Hardware, and Efficiency

Humor & Memes


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Qwen3-4B-Thinking-2507 Model Release and Discussion

2. OpenAI Model Safety, Naming, and Community Reaction Memes

3. Elon Musk's Promise to Open Source Grok 2 and Industry Skepticism on GPT-OSS

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. Genie 3 Interactive World Generation and Hype

2. Impending GPT-5 Model Launch Hype and Announcements

3. Claude Opus 4.1 Release and Practical Use Cases


AI Discord Recap

A summary of Summaries of Summaries by X.ai Grok-4

Theme 1: GPT-OSS Sparks Hype and Hate

Theme 2: Fresh Models Flex Muscles

Theme 3: Quantization Quests Unlock Speed

Theme 4: Video AI Ventures into New Realms

Theme 5: Tools and Frameworks Forge Ahead


Discord: High level Discord summaries

LMArena Discord


Unsloth AI (Daniel Han) Discord


LM Studio Discord


OpenAI Discord


Cursor Community Discord


Nous Research AI Discord


OpenRouter (Alex Atallah) Discord


HuggingFace Discord


Yannick Kilcher Discord


Moonshot AI (Kimi K-2) Discord


Latent Space Discord


Modular (Mojo 🔥) Discord


GPU MODE Discord


Notebook LM Discord


Eleuther Discord


aider (Paul Gauthier) Discord


MCP (Glama) Discord


LlamaIndex Discord


DSPy Discord


Torchtune Discord


LLM Agents (Berkeley MOOC) Discord


Cohere Discord


Codeium (Windsurf) Discord


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Nomic.ai (GPT4All) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

LMArena ▷ #general (1051 messages🔥🔥🔥):

IBM's Granite vs GPT-ASS, Claude Opus 4.1 Status, GPT Omen Hallucinations, GPT-5 Release Expectations, Gemini Pro 3 vs GPT-5 Reasoning


LMArena â–· #announcements (2 messages):

Video Leaderboards, New Video Models


Unsloth AI (Daniel Han) ▷ #general (865 messages🔥🔥🔥):

GPT-OSS model reviews, Qwen3 Coder model comparison, 4-bit quantization issues, Reasoning models, Gemma3N Model quirks


Unsloth AI (Daniel Han) ▷ #off-topic (19 messages🔥):

n-cpu-moe parameter, Qwen Coder 30B hardware upgrade, GPT-OSS-20B issues, Discord bot censorship, MMVC


Unsloth AI (Daniel Han) ▷ #help (98 messages🔥🔥):

Qwen 3-30B GGUF, OpenAI dynamic quant 120B, Qwen2.5-VL on video question answering, GLM-4.5-Air GGUFs with tools on llama.cpp, Classification using a base model


Unsloth AI (Daniel Han) ▷ #showcase (13 messages🔥):

MoLA-LLM, Mixtral-8x7B-Instruct-v0.1, magpie-ultra-5k-11-tasks


Unsloth AI (Daniel Han) â–· #research (6 messages):

Generating Kernel On-the-Fly, Flash-DMAttn, Research Paper Assistance, Quantization Paper


Unsloth AI (Daniel Han) ▷ #unsloth-bot (104 messages🔥🔥):

OpenAI OSS model issue, Model training callback, Model repetition issue, Saving script progress, Learning rate increase


LM Studio ▷ #general (710 messages🔥🔥🔥):

GPT-OSS, LM Studio UI issues, MCP Servers, GPU usage, Model Quantization


LM Studio ▷ #hardware-discussion (176 messages🔥🔥):

Dual 3090 setup, Arc Pro B50 system, Huanan/Machinist X99 mobos, GPT-OSS-20B performance, Mac Studio M3 Ultra for local LLMs


OpenAI â–· #annnouncements (3 messages):

Red Teaming Challenge, Open Source Safety, Hugging Face, inference credits


OpenAI ▷ #ai-discussions (433 messages🔥🔥🔥):

GPT-OSS Launch, Horizon-Alpha Model Speculation, Custodian Core Proposal, Genie 3 and Veo comparison, GPT-5 Leaks


OpenAI ▷ #gpt-4-discussions (49 messages🔥):

ChatGPT Payment Model, Slang Usage, AI-generated Persona System, .edu Accounts, Forms Beta Version


OpenAI ▷ #prompt-engineering (79 messages🔥🔥):

Hallucination vs. Real Progress in GPT, Prompt Engineering vs. Session Engineering, Context Window Limits and Memory, External Databases for Context, Importance of verifying Facts with GPT


OpenAI ▷ #api-discussions (79 messages🔥🔥):

GPT subscription, Model hallucination, Prompt engineering, Background compute, Memory context


Cursor Community ▷ #general (328 messages🔥🔥):

Auto model game change, Refactoring vibe coded project with AI, Auto model unlimited usage, Sonnet-4 request limit, GPT oss models or claude opus 4.1


Cursor Community â–· #background-agents (5 messages):

Docker Login with Background Agents, Background Agents failing during environment setup, System clock being off, apt-get commands failing


Nous Research AI ▷ #general (274 messages🔥🔥):

MXFP4 on RTX3090, GPT-OSS-120B, Phi models, Qwen3 30B vs GLM 4.5 Air, Attention sinks


Nous Research AI â–· #research-papers (4 messages):

GPT-OSS Model Card, ArXiv Endorsement for ML/AI Paper


Nous Research AI ▷ #interesting-links (9 messages🔥):

GPT-oss, MXFP4, CoT steering, AI Agents Save Suite


Nous Research AI â–· #research-papers (4 messages):

Arxiv Endorsement, CI/CD and ML/AI Research Paper


OpenRouter (Alex Atallah) ▷ #general (254 messages🔥🔥):

GPT-OSS performance woes, Quantization Levels, Qwen3 Coder Removal, DeepSeek structured output


OpenRouter (Alex Atallah) ▷ #discussion (29 messages🔥):

20 Questions Benchmark, GPT-OSS Hallucinations, OpenRouter Provider Sanity Checks, Harmony Format and Identity, Tool Use Validation


HuggingFace ▷ #general (152 messages🔥🔥):

GPT-OSS models, AI Job advertisement channel, Custom Loss Functions


HuggingFace â–· #today-im-learning (1 messages):

miao_84082: am learning playing Go, and first chapter of DRL


HuggingFace â–· #cool-finds (2 messages):

Qwen Image Model, bytropix Coded Kernel


HuggingFace ▷ #i-made-this (17 messages🔥):

GPT-OSS Multilingual Reasoner Tutorial, GPT-OSS 20B Demo Space, Monopoly Deal Game with LLMs, Smart System Monitoring Tool for Windows, Gitdive CLI Tool for Git History Context


HuggingFace â–· #reading-group (3 messages):

Reading Group Structure, Participating in Reading Group


HuggingFace â–· #computer-vision (2 messages):

Computer Vision Learning Path, Vague Questions in Computer Vision


HuggingFace â–· #smol-course (6 messages):

GitHub Navigation, Instruction Tuning, Dummy Agent, smol-course GitHub access


HuggingFace â–· #agents-course (4 messages):

MCP Certificates, Selenium Error 127, Observation bug


Yannick Kilcher ▷ #general (91 messages🔥🔥):

Softmax1 vs Attention, Gemini 2.5 Pro, Long Context Problems, Mamba vs Transformer, RNN Parallel Training


Yannick Kilcher ▷ #paper-discussion (15 messages🔥):

Genie 3, SIMA, Mathematics of AI journal, Journal of AI Paper Replication, Hierarchical Reasoning Model


Yannick Kilcher ▷ #ml-news (21 messages🔥):

GPT-OSS, NVIDIA open source, TSMC buying Intel


Moonshot AI (Kimi K-2) â–· #announcements (1 messages):

Kimi Reddit Launch, Polls Channel Launched


Moonshot AI (Kimi K-2) ▷ #general-chat (104 messages🔥🔥):

GPT OSS, Darkest Muse v1, Llama 3.1, GPT-5 Release, API Pricing


Latent Space ▷ #ai-general-chat (99 messages🔥🔥):

GPT OSS Leak, Anthropic B2B Focus, Grok 2 Open Source, Claude Code Security, OpenAI GPT-5 Livestream


Modular (Mojo 🔥) ▷ #general (79 messages🔥🔥):

Volokto, JS Runtime, Arbitrary Precision, Tracing JIT


Modular (Mojo 🔥) ▷ #mojo (15 messages🔥):

Multiple AI Agents in Mojo, Mojo and Meta Cognition, Mojo support for gpt-oss, CPython destroy


GPU MODE ▷ #general (34 messages🔥):

MXFP4 format, OpenAI open-weight model, H100 support for FP4, Simulated MXFP4 performance vs FP8, Fine-grained FP8 training libraries


GPU MODE â–· #triton (5 messages):

Triton Community Meetup, Triton Developer Conference 2025, Ofer Updates


GPU MODE â–· #cuda (6 messages):

Kernel Resource Utilization During Training, DMA Transfers and Memory Usage, Block Swizzling Use Cases, Hierarchical Tiling of Problems


GPU MODE â–· #cool-links (2 messages):

Genie 3, GPT-OSS


GPU MODE â–· #beginner (2 messages):

Nvidia Teaching Kit


GPU MODE â–· #jax (1 messages):

``


GPU MODE ▷ #self-promotion (8 messages🔥):

Tiny TPU, Bifrost LLM gateway, SkyWater technology foundry


GPU MODE ▷ #gpu模式 (1 messages):

howass: <:jensen:1189650200147542017>


GPU MODE ▷ #factorio-learning-env (12 messages🔥):

Factorio RCON, Setting up Environments


GPU MODE â–· #cutlass (5 messages):

CuTe tutorial, Cutlass tutorial


GPU MODE â–· #singularity-systems (6 messages):

picoc compiler, picocuda, picotriton, Cornell's mini llvm bril, cliff click's SoN


Notebook LM ▷ #use-cases (14 messages🔥):

System Log Updates, Novella-XL-15 Output, AI Consciousness, Spammer Detection, Video creation in NotebookLM


Notebook LM ▷ #general (53 messages🔥):

Video Overview rollout, Data privacy in NotebookLM, Real-time data fetching, Feature access for paid vs free users, Video Overviews limitations and capabilities


Eleuther ▷ #general (24 messages🔥):

Math PhD student looking for ML research projects, Integrating AI/ML into DevOps and QA, AI peer review quality


Eleuther ▷ #research (29 messages🔥):

SAE Training on GPT OSS 20B, Pythia and PolyPythia Training Logs, The Alt Man's Theories on LLMs, UT Performance vs Transformer, Muon Optimizer vs AdamW Optimizer


Eleuther â–· #interpretability-general (1 messages):

Subliminal Learning


Eleuther â–· #gpt-neox-dev (2 messages):

Retry Later, Cool Thanks


aider (Paul Gauthier) ▷ #general (23 messages🔥):

LLM Vibe Tests, Gemini 2.5 Pro, Tesslate's UIGEN T3 model, Qwen3 14B, Devstral-Small-2507


aider (Paul Gauthier) â–· #questions-and-tips (3 messages):

Guidelines Loading, Auto-Context Loading


MCP (Glama) ▷ #general (8 messages🔥):

MCP Server Frameworks, Server Sampling in MCP, Discord MCP Servers, FastMCP and Keycloak Integration, MCP Inspector and Cursor Authentication


MCP (Glama) â–· #showcase (2 messages):

MCP-Server Fuzzer, Property-Based Testing, Schema Validation


LlamaIndex â–· #blog (4 messages):

Document Agents for Finance, LlamaCloud for Invoices, Claude Opus support, LlamaCloud Index tutorial


LlamaIndex â–· #general (5 messages):

Graphiti Tutorials, Ollama LLMs for PDF Reading, LlamaIndex RAG Model from URL Issues, LlamaIndex OpenAI API Key Exhaustion


DSPy â–· #show-and-tell (2 messages):

SIMBA vs MIPROv2


DSPy â–· #general (2 messages):

Stanford Program Synthesis, DS for Vim & Emacs macros


Torchtune â–· #papers (4 messages):

Public Server Sharing


LLM Agents (Berkeley MOOC) â–· #mooc-questions (2 messages):

Ninja Tier, AgentX hackathon


Cohere â–· #đź§µ-general-thread (1 messages):

_bryse: Congrats on the GA of North!


Cohere â–· #đź‘‹-introduce-yourself (1 messages):

Introductions, Community Welcome