Person: "sama"

GPT-5.2 (Instant/Thinking/Pro): 74% on GDPVal, 1.4x cost of GPT 5.1, on 10 Year OpenAI Anniversary

OpenAI fires back: GPT-5.1-Codex-Max (API) and GPT 5.1 Pro (ChatGPT)

not much happened today

minor updates to GPT 5.1 and SIMA 2

not much happened today

OpenAI completes Microsoft + For-profit restructuring + announces 2028 AI Researcher timeline + Platform / AI cloud product direction + next $1T of compute

OpenAI Dev Day: Apps SDK, AgentKit, Codex GA, GPT‑5 Pro and Sora 2 APIs

Thinking Machines' Tinker: LoRA based LLM fine-tuning API

Sora 2: new video+audio model and OpenAI's first Social Network

not much happened today

GPT-5 Codex launch and OpenAI's quiet rise in Agentic Coding

OpenAI Realtime API GA and new `gpt-realtime` model, 20% cheaper than 4o

Databricks' $100B Series K

not much happened today

not much happened today

OpenAI's IMO Gold model also wins IOI Gold

not much happened today

OpenAI rolls out GPT-5 and GPT-5 Thinking to >1B users worldwide; -mini and -nano help claim Pareto Frontier

not much happened today

OpenAI's gpt-oss 20B and 120B, Claude Opus 4.1, DeepMind Genie 3

not much happened today

not much happened today

ChatGPT Agent: new o* model + unified Deep Research browser + Operator computer use + Code Interpreter terminal

not much happened today

Not much happened today

minor ai followups: MultiAgents, Meta-SSI-Scale, Karpathy, AI Engineer

Zuck goes Superintelligence Founder Mode: $100M bonuses + $100M+ salaries + NFDG Buyout?

Execuhires Round 2: Scale-Meta, Lamini-AMD, and Instacart-OpenAI

Reasoning Price War 2: Mistral Magistral + o3's 80% price cut + o3-pro

not much happened today

ChatGPT Codex, OpenAI's first cloud SWE agent

codex-1 openai-o3 codex-mini gemma-3 blip3-o qwen-2.5 marigold-iid deepseek-v3 lightlab gemini-2.0 lumina-next openai runway salesforce qwen deepseek google google-deepmind j1 software-engineering parallel-processing multimodality diffusion-models depth-estimation scaling-laws reinforcement-learning fine-tuning model-performance multi-turn-conversation reasoning audio-processing sama kevinweil omarsar0 iscienceluvr akhaliq osanseviero c_valenzuelab mervenoyann arankomatsuzaki jasonwei demishassabis philschmid swyx teortaxestex jaseweston

OpenAI launched Codex, a cloud-based software engineering agent powered by codex-1 (an optimized version of OpenAI o3) available in research preview for Pro, Enterprise, and Team ChatGPT users, featuring parallel task execution like refactoring and bug fixing. The Codex CLI was enhanced with quick sign-in and a new low-latency model, codex-mini. Gemma 3 is highlighted as the best open model runnable on a single GPU. Runway released the Gen-4 References API for style transfer in generation. Salesforce introduced BLIP3-o, a unified multimodal model family using diffusion transformers for CLIP image features. The Qwen 2.5 models (1.5B and 3B versions) were integrated into the PocketPal app with various chat templates. Marigold IID, a new state-of-the-art open-source depth estimation model, was released. In research, DeepSeek shared insights on scaling and hardware for DeepSeek-V3. Google unveiled LightLab, a diffusion-based light source control in images. Google DeepMind's AlphaEvolve uses Gemini 2.0 to discover new math and reduce costs without reinforcement learning. Omni-R1 studied audio's role in fine-tuning audio LLMs. Qwen proposed a parallel scaling law inspired by classifier-free guidance. Salesforce released Lumina-Next on the Qwen base, outperforming Janus-Pro. A study found LLM performance degrades in multi-turn conversations due to unreliability. J1 is incentivizing LLM-as-a-Judge thinking via reinforcement learning. A new Qwen study correlates question and strategy similarity to predict reasoning strategies.

Gemini 2.5 Flash completes the total domination of the Pareto Frontier

OpenAI o3, o4-mini, and Codex CLI

QwQ-32B claims to match DeepSeek R1-671B

GPT 4.1: The New OpenAI Workhorse

not much happened today

not much happened today

not much happened today

>$41B raised today (OpenAI @ 300b, Cursor @ 9.5b, Etched @ 1.5b)

not much happened today

not much happened today

Promptable Prosody, SOTA ASR, and Semantic VAD: OpenAI revamps Voice AI

The new OpenAI Agents Platform

GPT 4.5 — Chonky Orion ships!

small news items

OpenAI takes on Gemini's Deep Research

DeepSeek #1 on US App Store, Nvidia stock tanks -17%

not much happened today

PRIME: Process Reinforcement through Implicit Rewards

o3 solves AIME, GPQA, Codeforces, makes 11 years of progress in ARC-AGI and 25% in FrontierMath

ChatGPT Canvas GA

OpenAI Sora Turbo and Sora.com

Meta Llama 3.3: 405B/Nova Pro performance at 70B price

$200 ChatGPT Pro and o1-full/pro, with vision, without API, and mixed reviews

not much happened today

not much happened this weekend

not much happened today

a calm before the storm

o1 destroys Lmsys Arena, Qwen 2.5, Kyutai Moshi release

Learnings from o1 AMA

not much happened today

Microsoft AgentInstruct + Orca 3

Francois Chollet launches $1m ARC Prize

Chameleon: Meta's (unreleased) GPT4o-like Omnimodal Model

Cursor reaches >1000 tok/s finetuning Llama3-70b for fast file editing

Not much happened today

GPT-4o: the new SOTA-EVERYTHING Frontier model (GPT4O version)

Quis promptum ipso promptiet?

OpenAI's PR Campaign?