Frozen AI News archive

NVIDIA to invest $100B in OpenAI for 10GW of Vera Rubin rollout

**NVIDIA** and **OpenAI** announced a landmark strategic partnership to deploy at least **10 gigawatts** of AI datacenters using NVIDIA's systems, with NVIDIA investing up to **$100 billion** progressively as each gigawatt is deployed, starting in the second half of 2026 on the Vera Rubin platform. This deal significantly impacts the AI infrastructure funding landscape, potentially supporting OpenAI's $300 billion commitment to Oracle. The announcement caused major stock market reactions, with NVIDIA's market cap surging by $170 billion. Additionally, advancements in deterministic inference for reinforcement learning and FP8 precision gains in GPU performance were highlighted by AI practitioners.

Canonical issue URL

What is going on?

AI News for 9/22/2025-9/23/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (193 channels, and 3072 messages) for you. Estimated reading time saved (at 200wpm): 236 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

We would normally feature the remarkable velocity of Qwen (headlined by today's Qwen3-Omni model) or the new DeepSeek V3.1 update, but really today belongs again to NVIDIA, which over the last week has deployed billions into Intel ($5b) and Enfabrica's execuhire ($900m) and Wayne ($500m).

The relevant details of the press release are all we know:

News

  • Strategic partnership enables OpenAI to build and deploy at least 10 gigawatts of AI datacenters with NVIDIA systems representing millions of GPUs for OpenAI’s next-generation AI infrastructure.
  • To support the partnership, NVIDIA intends to invest up to $100 billion in OpenAI progressively as each gigawatt is deployed.
  • The first gigawatt of NVIDIA systems will be deployed in the second half of 2026 on NVIDIA’s Vera Rubin platform.

**San Francisco and Santa Clara—September 22, 2025—**NVIDIA and OpenAI today announced a letter of intent for a landmark strategic partnership to deploy at least 10 gigawatts of NVIDIA systems for OpenAI’s next-generation AI infrastructure to train and run its next generation of models on the path to deploying superintelligence. To support this deployment including datacenter and power capacity, NVIDIA intends to invest up to $100 billion in OpenAI as the new NVIDIA systems are deployed. The first phase is targeted to come online in the second half of 2026 using NVIDIA’s Vera Rubin platform.

We don't know this for a fact but this $100B deal is likely a big part of how OpenAI is funding their $300B commit to Oracle from 2 weeks ago (whose stock is back up at all time highs, seeming to support this theory).

Side note: it hasn't escaped observers that somehow all the stocks involved - ORCL, OpenAI, and NVIDIA - are all jumping disproportionately on this money going from one to the other. NVIDIA's stock gained $170B today after announcing this $100B investment to secure their revenue, OpenAI's stock is now presumably valued more than the most recent $500B after this deal as well, and ORCL is still $250B higher than it was before the announcement. Are there -ANY- losers here?

From The Information, we also have some insight on the breathtaking scale of OpenAI's intended infra spend, which includes about $150B more in existing + unaccounted spend.


AI Twitter Recap

Compute, Inference, and Systems: OpenAI–NVIDIA, FP8, and cross‑vendor GPU portability

Major model drops: Qwen3 Omni family, Grok‑4 Fast, DeepSeek V3.1 Terminus, Apple Manzano, Meituan LongCat

Coding agents, evals, and scaffolds: SWE‑Bench Pro, GAIA‑2/ARE, ZeroRepo, Perplexity Email Assistant

Safety, governance, and agent security

Research highlights: JEPA debate, synthetic data pretraining, memory for latent learning

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. DeepSeek-V3.1-Terminus Launch & Online Upgrade

2. Qwen3-Omni Multimodal Release & Open-Source Models

3. Qwen-Image-Edit-2509 Release: Multi-Image Editing & ControlNet

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. OpenAI–NVIDIA 10 GW Supercomputer Partnership Announcements

2. Qwen-Image-Edit-2509 Release and Gemini/ChatGPT Multimodal Demos

3. Robot Uprising Memes and Unitree G1 Agility Clips


AI Discord Recap

A summary of Summaries of Summaries by gpt-5

1. DeepSeek v3.1 Terminus and Qwen3 Releases

2. Diffusion Sampling & Data Efficiency Breakthroughs

3. Compute Megadeals & GPU Systems

4. Agent Protocols & Constrained Outputs

5. Open-Source Platforms, DBs, and Communities


Discord: High level Discord summaries

Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


LMArena Discord


Cursor Community Discord


GPU MODE Discord


HuggingFace Discord


Latent Space Discord


Eleuther Discord


Nous Research AI Discord


aider (Paul Gauthier) Discord


Modular (Mojo 🔥) Discord


Yannick Kilcher Discord


DSPy Discord


MCP Contributors (Official) Discord


tinygrad (George Hotz) Discord


Moonshot AI (Kimi K-2) Discord


Manus.im Discord Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1175 messages🔥🔥🔥):

Comet Browser invitation, GPTs Agents training, OpenAI Platform's sidebars, Comet availability for ipadOS, AI winter


Perplexity AI ▷ #sharing (5 messages):

Comet Invitation, Shareable Threads, Trustworthy Data, Invitation Request


Unsloth AI (Daniel Han) ▷ #general (374 messages🔥🔥):

VRAM Usage for 1M Context Length, GRPO Fine-Tuning for GPT-OSS-20B, DeepSeek V3.1 Terminus and Huawei Ascent Chips, Qwen3 and Data Privacy Concerns, QAT and GGUF Quants


Unsloth AI (Daniel Han) ▷ #introduce-yourself (1 messages):

Collaboration Opportunities, Software Engineering, Small Business Ventures


Unsloth AI (Daniel Han) ▷ #off-topic (41 messages🔥):

Loss Curve Success, New iPhone Acquisition, CS Uni vs Bootcamps, Gacha Game Ratios, DataSeek Tool


Unsloth AI (Daniel Han) ▷ #help (23 messages🔥):

OOM Errors & USDT, Blackwell CUDA Issues, Orpheus TTS Fine-Tuning


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

LLMs, Fine Tuning, Task-Specific Models


Unsloth AI (Daniel Han) ▷ #research (13 messages🔥):

Diffusion vs Autoregressive, Data Repeating, Paper Citation, Peer Review


LMArena ▷ #general (446 messages🔥🔥🔥):

Video generation from photos in Indonesian, Grok 4 Fast performance, Seedream 4 2k vs High-Res, AI in medical field, Gemini 3.0 Flash rumors


LMArena ▷ #announcements (1 messages):

Seedream-4, LMArena Models


Cursor Community ▷ #general (419 messages🔥🔥🔥):

Token Usage, Kaspersky Malware Flag, Chat Exports, GPT-5 Pricing


GPU MODE ▷ #general (13 messages🔥):

vLLM affiliation, Image/Video Gen in vLLM, Sliding/Striding Multi-Node DiT Kernel, GB300s for High Compute Scale, Magnetohydrodynamics and Loop Quantum Gravity Modeling


GPU MODE ▷ #triton (1 messages):

exquisite_lemur_80905: There's also TRITON_ALWAYS_COMPILE to ignore the cache


GPU MODE ▷ #cuda (1 messages):

MLPerf Inference, CPU Bottleneck, GPU Utilization


GPU MODE ▷ #torch (2 messages):

Speeding up pip install, Setting TORCH_LOGS


GPU MODE ▷ #cool-links (1 messages):

Tianqi Chen Interview, Machine Learning Systems, XGBoost, MXNet, TVM


GPU MODE ▷ #jobs (1 messages):

Remote Research Intern, Deep Learning, New Models, Model Building, Stipend Information


GPU MODE ▷ #beginner (1 messages):

nwyin: https://jax-ml.github.io/scaling-book/roofline/


GPU MODE ▷ #off-topic (6 messages):

NVIDIA Tech Demos, Tailscale Interface & Pricing, VPN Business Models


GPU MODE ▷ #self-promotion (1 messages):

CowabugaAI, LeapfrogAI, Open Source AI, Military-Grade AI, Commercial AI Support


GPU MODE ▷ #🍿 (2 messages):

vLLM's guided decoding, grammars for automated code generation, kernel generation LLMs, KernelBench 0-shot evals


GPU MODE ▷ #thunderkittens (2 messages):

GPU memory sharing, NVSwitch reduction


GPU MODE ▷ #edge (1 messages):

radiation-hardened chips, Jetson usage in space, chips in the magnetosphere


GPU MODE ▷ #submissions (13 messages🔥):

MI300x8, amd-all2all leaderboard, amd-gemm-rs leaderboard, Personal Bests


GPU MODE ▷ #factorio-learning-env (8 messages🔥):

Sweep for Qwen 2 35b, Deepseek 3.1, GPT-oss progress, Release work


GPU MODE ▷ #amd-competition (7 messages):

AMD Contractor Prize Eligibility, All2All Optimizations


GPU MODE ▷ #cutlass (7 messages):

CUTLASS MLP Accuracy, CuTe Layouts


GPU MODE ▷ #singularity-systems (2 messages):

PyTorch autograd, JAX autograd, tinygrad autograd, torch dynamo, bytecode interception


HuggingFace ▷ #general (48 messages🔥):

Hugging-Science Discord Launch, HF Inference Providers Quality, Gradients Clipping, gguf conversion with llama cpp, smollm's goals


HuggingFace ▷ #cool-finds (1 messages):

Diffusion ODE Solver, DPM++2m, WACV 2025, Hyperparameter-is-all-you-need


HuggingFace ▷ #i-made-this (7 messages):

golang vectorDB, AI agent trust challenges, AgentXTrader, protein prediction dataset


HuggingFace ▷ #computer-vision (1 messages):

fingaz_ai: i havent but im also looking into that same feature i just havent isolated one yet.


HuggingFace ▷ #smol-course (2 messages):

In-Person Meetup in NYC, GSM8k Eval on Trained Model


HuggingFace ▷ #agents-course (4 messages):

Starting the Agents Course, Backgrounds of new course members


Latent Space ▷ #ai-general-chat (44 messages🔥):

DeepSeek Terminus, Claude resumable streaming, Untapped Capital Fund II, Alibaba Qwen3-TTS, OpenAI NVIDIA deal


Latent Space ▷ #genmedia-creative-ai (4 messages):

Google Gemini, Runway AI, Runway Gen-2


Eleuther ▷ #general (34 messages🔥):

Text-davinci-003 origin story, ChatGPT model finetuning, UChicago ML research community, GPT-3.5 series


Eleuther ▷ #research (9 messages🔥):

Prefilling vs Decoding Intuition, Diffusion ODE Solver, WACV 2025 Submission


Eleuther ▷ #lm-thunderdome (2 messages):

MMLU pro benchmark, lm-eval


Nous Research AI ▷ #general (34 messages🔥):

HuggingFace comments, OLMo-3 safetensors, Qwen3 Omni, Realtime Perceptual AI, SVG coding among various LLMs


Nous Research AI ▷ #ask-about-llms (7 messages):

LLM Training, LoRA training, Consumer hardware for LLMs, Bandwidth and Latency for LLMs


aider (Paul Gauthier) ▷ #general (19 messages🔥):

Navigator Mode in Aider Forks, aider-ce Package, Augment CLI, Deepseek V3.1 Setup, Web Search Tools in Aider


aider (Paul Gauthier) ▷ #questions-and-tips (5 messages):

Running multiple aider agents, aider asks to edit files, LLM confusion with prompt files


Modular (Mojo 🔥) ▷ #general (15 messages🔥):

FFI, Rust, C ABI, C header, Mojo binding generators


Modular (Mojo 🔥) ▷ #announcements (1 messages):

Modular Platform 25.6, NVIDIA Blackwell, AMD MI355X, Consumer GPUs


Modular (Mojo 🔥) ▷ #mojo (7 messages):

Mojo MAX .mojopkg requirement, Mojo nightly install command, Variadic args binding in Mojo


Yannick Kilcher ▷ #paper-discussion (16 messages🔥):

New Paper Discussion, Yann LeCun's Work, Joint Embedding Predictive Architecture, Paper Presentation Opportunity


Yannick Kilcher ▷ #ml-news (3 messages):

GPT parsing philosophy, Prompting improvements


DSPy ▷ #general (16 messages🔥):

MCP Secrets, Trace IDs, GEPA for ReAct


MCP Contributors (Official) ▷ #general (7 messages):

MCP Sampling Protocol, response_schema addition, Claude models constrained output


MCP Contributors (Official) ▷ #general-wg (6 messages):

Model Context Protocol Registry, Publishing MCP Servers, Remote Server Configurations, MCP Install Instructions


tinygrad (George Hotz) ▷ #general (6 messages):

CuTe DSL, RANGEIFY status, Company update, ThunderKittens project


Moonshot AI (Kimi K-2) ▷ #general-chat (6 messages):

Kimi K-2, Prompt Injection, Claude's Lobotomization