Frozen AI News archive

Oracle jumps +36% in a day after winning $300B OpenAI contract

**Oracle's OCI division** reported a stunning **+359% revenue bookings growth to $455B** with cloud revenue guidance of **$144B by 2030**, driven significantly by a large deal with **OpenAI** amid tensions with **Microsoft**. On AI infrastructure, **Moonshot AI** released **Kimi’s checkpoint-engine**, enabling rapid weight updates on 1T-parameter models across thousands of GPUs, integrating with **vLLM**. **RLFactory** introduced a plug-and-play reinforcement learning framework for tool-using agents, showing smaller models outperforming larger ones. **TRL v0.23** added context parallelism for long-context training. **Thinking Machines Lab** published research on deterministic inference pipelines, making **vLLM** deterministic for **Qwen** models. **Meta** launched **BackendBench**, a PyTorch benchmarking tool.

Congrats Oracle!

AI News for 9/9/2025-9/10/2025. We checked 12 subreddits, 544 Twitters and 22 Discords (187 channels, and 5382 messages) for you. Estimated reading time saved (at 200wpm): 457 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

We were going to feature the official Anthropic MCP Registry news, ChatGPT Developer Mode, Claude's new VM, or Mistral's huge fundraise, but today's biggest vibe shift is probably Oracle's OCI division, which blew away estimates: revenue bookings grew +359% to $455B, and cloud revenue guidance is $144B by 2030 (for context, OCI is $18B today, AWS is $112B, and Azure is $75B). With the stock gaining >$250B in market cap and almost entering the trillion-dollar club, Larry Ellison is now the world's richest man, capping what is, in retrospect, a run for the ages.

The Wall Street Journal carried the additional story that OpenAI was responsible for a large share of the projected bookings and that the deal, perhaps significantly, is the outcome of months-long tension with Microsoft.


AI Twitter Recap

Fast RL for Tool-Use and Weight Update Infrastructure (Kimi checkpoint-engine, RLFactory, TRL)

Deterministic and Scalable Inference/Training (vLLM determinism, BackendBench, dynamic quant, HierMoE)

Model Releases and Performance

Evals and Post‑Training Platforms

Agents, MCP, and SDKs

Multimodal & Edge Embeddings and Tooling

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Unsloth DeepSeek‑V3.1 Dynamic GGUFs Aider Polyglot Benchmarks & AMA

2. Microsoft VibeVoice long‑form multi‑speaker TTS showcase + GPT‑OSS from‑scratch pretraining release

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Image Gen Releases: SeeDream 4 vs Imagen, Qwen Edit (Nunchaku), and Wan 2.2 I2V

2. LLM Quality Volatility, Hallucinations, and Buggy Outputs

3. AI Job Displacement and Cultural Impact


AI Discord Recap

A summary of Summaries of Summaries by X.ai Grok-4

Theme 1: Models Muscle Up with Speedy Tweaks

Theme 2: Fresh Models Flaunt Features and Flaws

Theme 3: Tools Tackle Bugs and Boost Builds

Theme 4: Hardware Hustles for AI Edge

Theme 5: Community Buzzes on Events and Glitches


Discord: High level Discord summaries

Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


LMArena Discord


LM Studio Discord


OpenRouter Discord


Cursor Community Discord


GPU MODE Discord


Nous Research AI Discord


Latent Space Discord


DSPy Discord


HuggingFace Discord


OpenAI Discord


Moonshot AI (Kimi K-2) Discord


aider (Paul Gauthier) Discord


Eleuther Discord


Modular (Mojo 🔥) Discord


Manus.im Discord


tinygrad (George Hotz) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1177 messages🔥🔥🔥):

Iphone Foldable vs Airpods, Kim Soohyun Grooming Allegations, High Website Bounce Rate, Perplexity Max is 200, AI Models for Studying


Perplexity AI ▷ #sharing (6 messages):

Apple Event, Referrals, Shareable Threads


Perplexity AI ▷ #pplx-api (2 messages):

API Error, Friend Request


Unsloth AI (Daniel Han) ▷ #general (942 messages🔥🔥🔥):

Reddit AMA, llama.cpp compiled on demand kernels, GLM 4.5 Air CPU, K2-Think, Gemma Embedding


Unsloth AI (Daniel Han) ▷ #introduce-yourself (4 messages):

User Introductions, Discord Etiquette


Unsloth AI (Daniel Han) ▷ #announcements (1 message):

Unsloth AMA, Aider Polyglot benchmarks, Memory Efficient RL, Unsloth Flex Attention, DeepSeek-V3.1 GGUF


Unsloth AI (Daniel Han) ▷ #off-topic (83 messages🔥🔥):

Annotating Pauses, Small Model Efficiency, Apple's 'Illusion of Thinking' Paper, 48GB 4090s, Privacy-First OS


Unsloth AI (Daniel Han) ▷ #help (192 messages🔥🔥):

Disable VLLM, Grok-2 GGUF Tokenizer, RL Training Loss Issues, Daniel's AIE Talk, TRL Loss vs Unsloth Loss


Unsloth AI (Daniel Han) ▷ #showcase (19 messages🔥):

GPT Overreactions, GitHub vibe coding, AI Magic


LMArena ▷ #general (865 messages🔥🔥🔥):

Ernie models, Imagen 4 Ultra, Sonoma Sky vs Dusk, VACE 2.1 Workflow, Generate Images Auto-Select


LMArena ▷ #announcements (1 message):

Seedream-4, LMArena Updates


LM Studio ▷ #general (359 messages🔥🔥):

Local LLM privacy, Quantization impact on performance, MoE model, GPU vs CPU for LLM, LLM Tool Usage


LM Studio ▷ #hardware-discussion (83 messages🔥🔥):

AMD MI500, MoE Models, DGX Spark patent lawsuits, 9070 Nitro+ Vulkan, Hard Drive lifespan


OpenRouter ▷ #announcements (1 message):

Nvidia Nemotron Nano, DeepInfra new paid provider


OpenRouter ▷ #app-showcase (2 messages):


OpenRouter ▷ #general (397 messages🔥🔥):

OpenRouter free models, API rate limits, BYOK markups, API keys, Models vs. token limits


OpenRouter ▷ #new-models (1 message):

Readybot.io: OpenRouter - New Models


OpenRouter ▷ #discussion (13 messages🔥):

Nemotron Nano pricing, Agentic tool calling models, LLMs for Swift UI development


Cursor Community ▷ #general (221 messages🔥🔥):

Project-Specific Docs and Memories, Audio Autoplay on Mobile, Disabling Inline Diff, Cursor Crashing Issues, Web Scraping for Engineering Guidelines


Cursor Community ▷ #background-agents (1 message):

Background Agents, Multiple Repositories, Pull Requests


GPU MODE ▷ #general (5 messages):

GPU Architecture Praise, Apple's Dynamic Caching, Neural Accelerators, Local Models and AI Future


GPU MODE ▷ #triton (15 messages🔥):

TLX Extensions, Triton compiler improvements, Simplicial attention kernels, CUDA and PTX, compiler backend optimization


GPU MODE ▷ #torch (47 messages🔥):

torch.compile accuracy issues, Debugging torch.compile, BF16 precision with torch.compile, FlexAttention and FP8 KV-cache in vLLM, CUDA graph warmup


GPU MODE ▷ #algorithms (1 message):

person12341234432: whaddafak is thaat


GPU MODE ▷ #beginner (2 messages):

GPU Programming, College Student Beginner


GPU MODE ▷ #pmpp-book (6 messages):

Career paths for PMPP knowledgeable candidates, Bridging the gap between theory and practice, Cloud vendors for GPU access, GPU kernels for BioML, Modern GPU hardware resources


GPU MODE ▷ #off-topic (3 messages):

Sam Zeloof, Jeri Ellsworth, GPU in M2 SSD form factor


GPU MODE ▷ #rocm (3 messages):

ROCm performance counters, VALUBusy counter issues, Vector multiplication kernel performance, AMD GPU architecture efficiency


GPU MODE ▷ #webgpu (1 message):

WGSL, SPIRV, Corporate Politics


GPU MODE ▷ #self-promotion (2 messages):

r/LocalLlama AMA, Unsloth optimizations, Cohere Labs Event, Kernels, Triton


GPU MODE ▷ #🍿 (7 messages):

Leaderboard, BackendBench, KernelBot, Model Evaluations, LLM Benchmarking


GPU MODE ▷ #submissions (25 messages🔥):

Discord bot submissions, AMD leaderboard issues, Submitting files


GPU MODE ▷ #factorio-learning-env (12 messages🔥):

Docker issues, Factorio meeting attendance, Environment errors


GPU MODE ▷ #amd-competition (10 messages🔥):

Unspecified Issue, Backend Server Busy, Runner Working Again, Solutions to Previous Completions, Ranking Not Updating


GPU MODE ▷ #general (11 messages🔥):

L2 Cache Clearing Updates, Kernel Development, LeetGPU, Kernelbot, GPU Mode Leaderboard


GPU MODE ▷ #multi-gpu (13 messages🔥):

NCCL FP4 support, NCCL CMake, NCCL Makefiles, NCCL includes and libs, AI Code Assistance


GPU MODE ▷ #low-bit-training (7 messages):

FP8 Backward Transposes, Consumer GPU Transpose Support, Blackwell Architecture, Weight Re-quantization


Nous Research AI ▷ #general (108 messages🔥🔥):

llama.cpp Metal Kernels, Qwen3 VL, K2 Models, Agent Building Platform, LLM's Coding Abilities


Nous Research AI ▷ #ask-about-llms (2 messages):

AI model initialization prompts, Nous Chat initialization, Claude's "bad-faith" mode


Nous Research AI ▷ #research-papers (1 message):

wholetoast: https://arxiv.org/pdf/2509.07367v1


Nous Research AI ▷ #interesting-links (1 message):

promptsiren: https://thinkingmachines.ai/blog/defeating-nondeterminism-in-llm-inference/


Latent Space ▷ #ai-general-chat (90 messages🔥🔥):

Strands Agents bug fix, Nitter 404 errors, Model Context Protocol (MCP) Registry, Claude file creation, Codebuff beats Claude Code


Latent Space ▷ #private-agents (8 messages🔥):

Open-Source LLM Hacking, Mech interp and SAEs, Convergence Theory, VLM Replication, Sample Data Quality


Latent Space ▷ #genmedia-creative-ai (5 messages):

Seedream 4, Replicate, Nano-Banana


DSPy ▷ #show-and-tell (1 message):

swair: thanks


DSPy ▷ #papers (2 messages):

REER, DSPy, ruler, training data, trajectories


DSPy ▷ #general (94 messages🔥🔥):

DSPy Modules Structure, DSPy metrics, Production DSPy, DSPy in Rust, Kimi-k2-instruct struggles


HuggingFace ▷ #general (29 messages🔥):

Dataclasses Field Default Factory, Active AI/Agentic AI Dev Servers, Unsloth for LLM Finetuning, smol course, PapersWithCode UI in HuggingFace


HuggingFace ▷ #today-im-learning (1 message):

saadkhan_188: Same situation as ☝🏻


HuggingFace ▷ #cool-finds (1 message):

FFT inference Method, Linear scaling LMs, CountSketch


HuggingFace ▷ #i-made-this (2 messages):

Procrastination in projects, lol


HuggingFace ▷ #reading-group (1 message):

cakiki: <@892052262787096629> Please don't cross-post and keep channels on topic


HuggingFace ▷ #computer-vision (1 message):

arXiv endorsement, cs.CV category


HuggingFace ▷ #NLP (1 message):

LLM, Database, Chat Memory, Session Storage


HuggingFace ▷ #gradio-announcements (2 messages):

Trackio library, Gradio v5.45.0, gr.Walkthrough, Input validation, gr.Navbar


HuggingFace ▷ #smol-course (35 messages🔥):

Colab Notebook Error, SmolLM3 Fine-Tuning Clarification, Study Groups Forming, SFT Configuration


HuggingFace ▷ #agents-course (8 messages🔥):

Course Start Dates, Certificate Deadlines, Coding Exercise Errors, Introductions


OpenAI ▷ #ai-discussions (52 messages🔥):

LLM existential meltdown as community theater, Changing OpenAI account primary email, GPT-5 issues, AI for cognitive architecture, Automated AI Agent to apply to jobs


OpenAI ▷ #gpt-4-discussions (7 messages):

Bug reporting permission issues, GPT side chat vs create side chat, Knowledge base files, ChatGPT login issues


OpenAI ▷ #prompt-engineering (4 messages):

Transparent Optimizations Proposal, Creative Writing Prompts for Claude 4, PDF Hosting Alternatives


OpenAI ▷ #api-discussions (4 messages):

Transparent Optimizations, Optimizer Markers, Prompt Rewrite Previews, Feasibility Checks


Moonshot AI (Kimi K-2) ▷ #general-chat (55 messages🔥🔥):

EmbeddingGemma, Google product lifecycle, K2-Think model, Kimi Researcher report generation, SMS sign up issues


aider (Paul Gauthier) ▷ #general (38 messages🔥):

Aider vs Roo/Cline, GPT-OSS-120B performance, Leaderboard accuracy, GPT-5 chat latest, Aider config options


aider (Paul Gauthier) ▷ #questions-and-tips (14 messages🔥):

Model API URL, Auto-accept Aider Files, Aider vs. No-Code Platforms


Eleuther ▷ #general (13 messages🔥):

AI Automation, Waterloo Students in Open Source AI, GCG adversarial suffix jailbreaks


Eleuther ▷ #research (36 messages🔥):

Sequence packing/padding, GLM 4.5 arguments, Lossless packing strategies, Ordering of the pre-training corpus, RL approach for language models


Eleuther ▷ #scaling-laws (2 messages):

Transformer Model Parameters, Dataset Token Size


Eleuther ▷ #multimodal-general (1 message):

ArXiv Endorsement Request, cs.CV, Undergrad seeking Endorsement


Modular (Mojo 🔥) ▷ #general (5 messages):

Leetcode add two numbers in Mojo, Mojo dev environment docker container


Modular (Mojo 🔥) ▷ #mojo (12 messages🔥):

conditional fields in structs, mojo compiler on roadmap, venv requirements, package management solution, classes vs structs