Model: "gpt-4o"
OpenAI releases Deep Research API (o3/o4-mini)
o3-deep-research o4-mini-deep-research gemma-3n flux-1-kontext-dev gpt-4o alphagenome openai google black-forest-labs deepmind sakana-ai higgsfield-ai huggingface ollama multimodality model-releases agentic-ai reinforcement-learning instruction-following model-architecture model-optimization image-generation biological-ai multi-agent-systems model-integration demishassabis hardmaru osanseviero clementdelangue
OpenAI has launched the Deep Research API, featuring the o3-deep-research and o4-mini-deep-research models with native support for MCP, Search, and Code Interpreter, enabling advanced agent capabilities including multi-agent setups. Google released Gemma 3n, a multimodal model optimized for edge devices with as little as 3GB of RAM, achieving a top score of 1300 on LMSys Arena and featuring the new MatFormer architecture and broad ecosystem integration. Black Forest Labs introduced FLUX.1 Kontext [dev], a 12B-parameter rectified flow transformer for instruction-based image editing, comparable to GPT-4o. DeepMind unveiled AlphaGenome, an AI model that reads up to 1 million DNA bases to predict gene function, a notable advance for AI in biology. Sakana AI presented Reinforcement-Learned Teachers (RLTs) to enhance LLM reasoning, achieving 86.1% on MiniF2F with efficient compute. Higgsfield AI released Higgsfield Soul, a high-aesthetic photo model with 50+ presets for fashion-grade realism. Additionally, Google launched the Gemini CLI, an open-source AI agent for terminal use with free Gemini 2.5 Pro requests.
Not much happened today
mistral-small-3.2 magenta-realtime afm-4.5b llama-3 openthinker3-7b deepseek-r1-distill-qwen-7b storm qwen2-vl gpt-4o dino-v2 sakana-ai mistral-ai google arcee-ai deepseek-ai openai amazon gdm reinforcement-learning chain-of-thought fine-tuning function-calling quantization music-generation foundation-models reasoning text-video model-compression image-classification evaluation-metrics sama
Sakana AI released Reinforcement-Learned Teachers (RLTs), a novel technique using smaller 7B-parameter models trained via reinforcement learning to teach reasoning through step-by-step explanations, accelerating Chain-of-Thought learning. Mistral AI updated Mistral Small 3.2, improving instruction following and function calling, with experimental FP8 quantization. Google released Magenta RealTime, an 800M-parameter open-weights model for real-time music generation. Arcee AI launched AFM-4.5B, a sub-10B-parameter foundation model extended from Llama 3. OpenThinker3-7B was introduced as a new state-of-the-art 7B reasoning model, a 33% improvement over DeepSeek-R1-Distill-Qwen-7B. The STORM text-video model compresses video input 8x using Mamba layers and outperforms GPT-4o on MVBench with 70.6%. Discussions comparing the PPO and GRPO reinforcement-learning algorithms and insights on DINOv2's ImageNet-1k performance were also highlighted. Overall "a very quiet day" in AI news, with valuable workshops from OpenAI, Amazon, and GDM.
minor ai followups: MultiAgents, Meta-SSI-Scale, Karpathy, AI Engineer
gpt-4o afm-4.5b gemma qwen stt-1b-en_fr stt-2.6b-en hunyuan-3d-2.1 openai meta-ai-fair scale-ai huggingface tencent arcee-ai ai-safety alignment ai-regulation memory-optimization scalable-oversight speech-recognition 3d-generation foundation-models sama polynoamial neelnanda5 teortaxestex yoshua_bengio zachtratar ryanpgreenblatt reach_vb arankomatsuzaki code_star
OpenAI released a paper revealing how training models like GPT-4o on insecure code can cause broad misalignment, drawing reactions from experts like @sama and @polynoamial. California's AI regulation efforts were highlighted by @Yoshua_Bengio emphasizing transparency and whistleblower protections. The term "context rot" was coined to describe LLM conversation degradation, with systems like Embra using CRM-like memory for robustness. Scalable oversight research aiming to improve human control over smarter AIs was discussed by @RyanPGreenblatt. New model releases include Kyutai's speech-to-text models capable of 400 real-time streams on a single H100 GPU, Tencent's Hunyuan 3D 2.1 as the first open-source production-ready PBR 3D generative model, and Arcee's AFM-4.5B foundation model family targeting enterprise use, competitive with Gemma and Qwen.
Execuhires Round 2: Scale-Meta, Lamini-AMD, and Instacart-OpenAI
o3-pro o3 o1-pro gpt-4o gpt-4.1 gpt-4.1-mini gpt-4.1-nano meta-ai-fair scale-ai lamini amd openai gemini google anthropic model-release benchmarking reasoning fine-tuning pricing model-performance direct-preference-optimization complex-problem-solving alexandr_wang sharon_zhou fidji_simo sama jack_rae markchen90 kevinweil gdb gregkamradt lechmazur wesrothmoney paul_cal imjaredz cto_junior johnowhitaker polynoamial scaling01
Meta hires Scale AI's Alexandr Wang to lead its new "Superintelligence" division following a $15 billion investment for a 49% stake in Scale. Lamini's Sharon Zhou joins AMD as VP of AI under Lisa Su, while Instacart's Fidji Simo becomes CEO of Applications at OpenAI under Sam Altman. Meta offers compensation packages of over $10 million/year to top researchers, successfully recruiting Jack Rae from Gemini. OpenAI releases the o3-pro model to ChatGPT Pro users and the API, outperforming o3 and setting new records on benchmarks like Extended NYT Connections and SnakeBench. Despite being slower than o1-pro, o3-pro excels at reasoning and complex problem-solving. OpenAI cuts o3 pricing by 80%, making it cheaper than GPT-4o and pressuring competitors like Google and Anthropic to lower prices. Users can now fine-tune the GPT-4.1 family using direct preference optimization (DPO) for subjective tasks.
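For reference, DPO optimizes a simple logistic loss over preference pairs relative to a frozen reference model. A minimal sketch of the standard DPO objective (the generic formulation from the DPO paper, not OpenAI's fine-tuning internals):

```python
import math

def dpo_loss(policy_chosen, policy_rejected, ref_chosen, ref_rejected, beta=0.1):
    """Standard DPO loss for one preference pair, given sequence log-probs
    under the policy being tuned and under a frozen reference model."""
    # Implicit reward margin: how much more the policy favors the chosen
    # answer over the rejected one, relative to the reference model.
    margin = beta * ((policy_chosen - ref_chosen) - (policy_rejected - ref_rejected))
    # -log(sigmoid(margin)): small when the policy prefers the chosen answer.
    return -math.log(1.0 / (1.0 + math.exp(-margin)))

# Before any preference is learned, the loss starts at log(2) ~= 0.693.
untrained = dpo_loss(-12.0, -12.0, -12.0, -12.0)
# As probability shifts toward the chosen answer, the loss falls.
trained = dpo_loss(-8.0, -12.0, -12.0, -12.0)
```

In practice the log-probs come from summing token log-likelihoods over each completion; the provider only exposes this via a fine-tuning endpoint, so the sketch is purely illustrative of the loss shape.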
not much happened today
kernelllm-8b gpt-4o deepseek-v3 mistral-medium-3 qwen3 blip3-o xgen-small anisora stable-audio-open-small alphaevolve meta-ai-fair mistral-ai qwen deepseek salesforce bilibili stability-ai google benchmarking model-performance multilinguality hardware-optimization multimodality image-generation video-generation text-to-audio model-parallelism chain-of-thought instruction-following reasoning mitigation-strategies reach_vb lmarena_ai theadimeline adcock_brett jxmnop dair_ai omarsar0
Meta released KernelLLM 8B, outperforming GPT-4o and DeepSeek V3 on KernelBench-Triton Level 1. Mistral Medium 3 debuted strongly in multiple benchmarks. Qwen3 models introduced a unified framework with multilingual support. DeepSeek-V3 features hardware-aware co-design. The BLIP3-o family was released for multimodal tasks using diffusion transformers. Salesforce launched xGen-Small models excelling in long-context and math benchmarks. Bilibili released AniSORA for anime video generation. Stability AI open-sourced Stable Audio Open Small, optimized for Arm devices. Google's AlphaEvolve coding agent improved on Strassen's algorithm for the first time since 1969. Research shows chain-of-thought (CoT) reasoning can harm a model's ability to follow instructions; mitigation strategies such as few-shot in-context learning, self-reflection, self-selective reasoning, and classifier-selective reasoning can counteract these reasoning-induced failures, with classifier-selective reasoning the most effective, though all show high variance and limited generalization.
gpt-image-1 - ChatGPT's imagegen model, confusingly NOT 4o, now available in API
gpt-image-1 o3 o4-mini gpt-4.1 eagle-2.5-8b gpt-4o qwen2.5-vl-72b openai nvidia hugging-face x-ai image-generation content-moderation benchmarking long-context multimodality model-performance supercomputing virology video-understanding model-releases kevinweil lmarena_ai _philschmid willdepue arankomatsuzaki epochairesearch danhendrycks reach_vb mervenoyann _akhaliq
OpenAI officially launched the gpt-image-1 API for image generation and editing, supporting features like alpha channel transparency and a "low" content moderation policy. OpenAI's models o3 and o4-mini are leading in benchmarks for style control, math, coding, and hard prompts, with o3 ranking #1 in several categories. A new benchmark called Vending-Bench reveals performance variance in LLMs on extended tasks. GPT-4.1 ranks in the top 5 for hard prompts and math. Nvidia's Eagle 2.5-8B matches GPT-4o and Qwen2.5-VL-72B in long-video understanding. AI supercomputer performance doubles every 9 months, with xAI's Colossus costing an estimated $7 billion and the US dominating 75% of global performance. The Virology Capabilities Test shows OpenAI's o3 outperforms 94% of expert virologists. Nvidia also released the Describe Anything Model (DAM), a multimodal LLM for detailed image and video captioning, now available on Hugging Face.
not much happened today
nemotron-h nvidia-eagle-2.5 gpt-4o qwen2.5-vl-72b gemini-2.5-flash gemini-2.0-pro gemini-exp-1206 gemma-3 qwen2.5-32b deepseek-r1-zero-32b uni3c seedream-3.0 adobe-dragon kimina-prover qwen2.5-72b bitnet-b1.58-2b4t nvidia deepseek hugging-face alibaba bytedance adobe transformers model-optimization multimodality long-context reinforcement-learning torch-compile image-generation diffusion-models distributional-rewards model-efficiency model-training native-quantization sampling-techniques philschmid arankomatsuzaki osanseviero iScienceLuvr akhaliq
Nemotron-H model family introduces hybrid Mamba-Transformer models with up to 3x faster inference and variants including 8B, 56B, and a compressed 47B model. Nvidia Eagle 2.5 is a frontier VLM for long-context multimodal learning, matching GPT-4o and Qwen2.5-VL-72B on long-video understanding. Gemini 2.5 Flash shows improved dynamic thinking and cost-performance, outperforming previous Gemini versions. Gemma 3 now supports torch.compile for about 60% faster inference on consumer GPUs. SRPO using Qwen2.5-32B surpasses DeepSeek-R1-Zero-32B on benchmarks with reinforcement learning only. Alibaba's Uni3C unifies 3D-enhanced camera and human motion controls for video generation. Seedream 3.0 by ByteDance is a bilingual image generation model with high-resolution outputs up to 2K. Adobe DRAGON optimizes diffusion generative models with distributional rewards. Kimina-Prover Preview is an LLM trained with reinforcement learning from Qwen2.5-72B, achieving 80.7% pass@8192 on miniF2F. BitNet b1.58 2B4T is a native 1-bit LLM with 2B parameters trained on 4 trillion tokens, matching full-precision LLM performance with better efficiency. Antidistillation sampling counters unwanted model distillation by modifying reasoning traces from frontier models.
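BitNet b1.58's "1.58 bits" comes from restricting each weight to {-1, 0, 1} (log2(3) ≈ 1.58 bits). A toy sketch of absmean ternary quantization in the spirit of the BitNet b1.58 paper (illustrative only, not the released training code):

```python
def ternary_quantize(weights, eps=1e-8):
    """Absmean ternary quantization, BitNet b1.58-style (simplified):
    scale by the mean absolute weight, then round and clip to {-1, 0, 1}."""
    gamma = sum(abs(w) for w in weights) / len(weights) + eps
    quantized = [max(-1, min(1, round(w / gamma))) for w in weights]
    return quantized, gamma  # dequantize a weight as q * gamma

# Small weights collapse to 0; large ones saturate at +/-1.
q, gamma = ternary_quantize([0.5, -1.2, 0.05, 2.0])
```

The single per-tensor scale `gamma` is all that must be stored in full precision, which is where the memory and inference-efficiency gains come from.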
not much happened today; New email provider for AINews
gpt-4.1 gpt-4o gpt-4o-mini gemini-2.5-flash seaweed-7b claude embed-4 grok smol-ai resend openai google bytedance anthropic cohere x-ai email-deliverability model-releases reasoning video-generation multimodality embedding-models agentic-workflows document-processing function-calling tool-use ai-coding adcock_brett swyx jerryjliu0 alexalbert omarsar0
Smol AI is migrating its AI news email service to Resend to improve deliverability and enable new features like personalizable AI news and a "Hacker News of AI." Recent AI model updates include OpenAI's API-only GPT-4.1, Google Gemini 2.5 Flash reasoning model, ByteDance Seaweed 7B-param video AI, Anthropic Claude's values system, Cohere Embed 4 multimodal embedding model, and xAI Grok updates with Memory and Studio features. Discussions also cover agentic workflows for document automation and AI coding patterns.
SOTA Video Gen: Veo 2 and Kling 2 are GA for developers
veo-2 gemini gpt-4.1 gpt-4o gpt-4.5-preview gpt-4.1-mini gpt-4.1-nano google openai video-generation api coding instruction-following context-window performance benchmarks model-deprecation kevinweil stevenheidel aidan_clark_
Google's Veo 2 video generation model is now available in the Gemini API at 35 cents per second of generated video, a significant step toward accessible video generation. Meanwhile, China's Kling 2 model launched with pricing around $2 for a 10-second clip and a minimum subscription of $700 per month for 3 months, generating excitement despite some usability challenges. OpenAI announced the GPT-4.1 family, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, highlighting improvements in coding, instruction following, and a 1 million token context window. The GPT-4.1 models are 26% cheaper than GPT-4o and will replace the GPT-4.5 Preview API version by July 14. Performance benchmarks show GPT-4.1 achieving 54-55% on SWE-bench Verified and a 60% improvement over GPT-4o in some internal tests, though some critiques note it underperforms models like DeepSeek V3 in coding tasks. The release is API-only, with a prompting guide provided for developers.
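At these prices the cost gap is easy to quantify. A quick back-of-envelope using only the figures quoted above (35¢ per generated second for Veo 2, roughly $2 per 10-second Kling 2 clip):

```python
def clip_cost_usd(seconds, per_second_usd):
    """Cost of one generated clip at a flat per-second rate."""
    return round(seconds * per_second_usd, 2)

veo2_10s = clip_cost_usd(10, 0.35)   # a 10-second Veo 2 clip: $3.50
kling2_10s = 2.00                    # quoted flat price per 10-second Kling 2 clip
# A one-minute Veo 2 generation already runs $21.00.
veo2_60s = clip_cost_usd(60, 0.35)
```

So per 10-second clip Veo 2 costs about 1.75x Kling 2's quoted price, though Kling's $700/month minimum changes the comparison for low-volume users.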
GPT 4.1: The New OpenAI Workhorse
gpt-4.1 gpt-4.1-mini gpt-4.1-nano gpt-4o gemini-2.5-pro openai llama-index perplexity-ai google-deepmind coding instruction-following long-context benchmarks model-pricing model-integration model-deprecation sama kevinweil omarsar0 aidan_mclau danhendrycks polynoamial scaling01 aravsrinivas lmarena_ai
OpenAI released GPT-4.1, including GPT-4.1 mini and GPT-4.1 nano, highlighting improvements in coding, instruction following, and handling long contexts up to 1 million tokens. The model achieves 54% on SWE-bench Verified and shows a 60% improvement over GPT-4o on internal benchmarks. Pricing for GPT-4.1 nano is notably low at $0.10/1M input tokens and $0.40/1M output tokens. GPT-4.5 Preview is being deprecated in favor of GPT-4.1. Integration support includes day-0 support in LlamaIndex. Some negative feedback was noted for GPT-4.1 nano. Additionally, Perplexity's Sonar API ties with Gemini 2.5 Pro for the top spot on the LM Search Arena leaderboard. New benchmarks MRCR and GraphWalks were introduced alongside updated prompting guides and cookbooks.
Google's Agent2Agent Protocol (A2A)
kimi-vl-a3b gpt-4o llama-4-scout llama-4-maverick llama-4-behemoth deepcoder-14b o3-mini o1 llama-3.1-nemotron-ultra-253b deepseek-r1 google google-deepmind moonshot-ai meta-ai-fair uc-berkeley openai nvidia hugging-face togethercompute deepseek agent-interoperability multimodality vision math reinforcement-learning coding model-training open-source model-benchmarking context-windows streaming push-notifications enterprise-authentication model-release reach_vb _akhaliq epochairesearch artificialanlys winglian danielhanchen yuchenj_uw jeremyphoward
Google Cloud Next announcements featured full MCP support from Google and DeepMind and a new Agent2Agent (A2A) protocol designed for agent interoperability with multiple partners. The protocol includes components such as the Agent Card, task communication channels, enterprise auth and observability, and streaming and push-notification support. On the model front, Moonshot AI released Kimi-VL-A3B, a multimodal model with 128K context and strong vision and math benchmark performance, outperforming GPT-4o. Meta AI introduced smaller versions of the Llama 4 family, Llama 4 Scout and Llama 4 Maverick, with a larger Behemoth model still in training. DeepCoder-14B from UC Berkeley is an open-source coding model rivaling OpenAI's o3-mini and o1 models, trained with reinforcement learning on 24K coding problems. Nvidia released Llama-3.1-Nemotron-Ultra-253B on Hugging Face, noted for beating Llama 4 Behemoth and Maverick and competing with DeepSeek-R1.
DeepCoder: A Fully Open-Source 14B Coder at O3-mini Level
deepcoder-14b o3-mini o1 gemini-2.5-pro kimi-vl-a3b gpt-4o llama-4-scout maverick behemoth gen-4-turbo imagen-3 together-ai agentica openai bytedance google-deepmind moonshot-ai meta-ai-fair runway open-source reinforcement-learning code-generation multimodality model-training mixture-of-experts l2-normalization image-generation model-performance context-windows philschmid lepikhin reach_vb akhaliq yuchenj_uw epochairesearch danielhanchen c_valenzuelab
Together AI and Agentica released DeepCoder-14B, an open-source 14B parameter coding model rivaling OpenAI's o3-mini and o1 on coding benchmarks, trained with an open-source RL framework from ByteDance and costing about $26,880. Google DeepMind launched Gemini 2.5 Pro with experimental "Flash" versions available to subscribers. Moonshot AI introduced Kimi-VL-A3B, a multimodal model with 128K context outperforming gpt-4o on vision and math benchmarks. Meta AI released Llama 4 Scout and Maverick, with a larger Behemoth model in training, featuring mixture-of-experts and L2 norm techniques. Runway launched Gen-4 Turbo with 10x better results than Gen-3 at the same cost. Google announced Imagen 3, a high-quality text-to-image model now in Vertex AI, enabling easier object removal. The report highlights open-source contributions, reinforcement learning training optimizations, and significant model performance improvements across coding, multimodal, and image generation domains.
not much happened today
gpt-4o deepseek-v3 claude-3.7-sonnet o3-mini gemini-2.5-pro openai deepseek anthropic google-deepmind togethercompute hypertecgroup coreweave cursor-ai windsurf-ai coding instruction-following image-generation policy-compliance long-context audio-processing video-processing gpu-clusters ai-infrastructure api-access sama kevinweil joannejang nrehiew_ giffmana _philschmid scaling01 saranormous
GPT-4o was praised for its improved coding, instruction following, and creative freedom, becoming the leading non-reasoning coding model, surpassing DeepSeek V3 and Claude 3.7 Sonnet on coding benchmarks, though it still lags behind reasoning models like o3-mini. Concerns about policy compliance in image generation were noted, with efforts underway to improve adherence. Gemini 2.5 Pro was highlighted for its advanced audio and video understanding, long-context capabilities, and integration with platforms like Cursor and Windsurf. AI infrastructure developments include a partnership between Together AI and Hypertec Group to deliver large-scale GPU clusters, and CoreWeave's IPO was celebrated for advancing AI infrastructure. GPU and TPU usage is expected to increase significantly. Key highlights included GPT-4o's transparent-background image generation and Gemini 2.5 Pro scoring above 50% on SimpleBench.
not much happened today
gpt-4o deepseek-v3-0324 gemini-2.5-pro gemma-3 claude-3.7-sonnet openai hugging-face sambanova google-cloud instruction-following image-generation content-filtering model-performance api coding model-deployment benchmarking model-release abacaj nrehiew_ sama joannejang giffmana lmarena_ai _philschmid
OpenAI announced the updated GPT-4o model with enhanced instruction following, complex problem-solving, and native image generation. The model shows improved performance in math, coding, and creativity, with features like transparent-background image generation. Discussions around content filtering and policy for image generation emphasize balancing creative freedom and harm prevention. DeepSeek V3-0324 APIs, available on Hugging Face and powered by SambaNova, outperform models like Gemini 2.0 Pro and Claude 3.7 Sonnet on benchmarks. Gemini 2.5 Pro is recommended for coding, and Gemma 3 can be deployed easily on Google Cloud Vertex AI via the new Model Garden SDK. The Gemma 3 Technical Report has been released on arXiv.
Gemini 2.5 Pro + 4o Native Image Gen
gemini-2.5-pro gpt-4o google-deepmind openai lmarena_ai autoregressive-models multimodality reasoning coding instruction-following model-release leaderboards noam-shazeer allan-jabri gabe-goh
Gemini 2.5 Pro from Google DeepMind has become the new top AI model, surpassing Grok 3 by 40 LMArena points, with contributions from Noam Shazeer integrating Flash Thinking techniques. It is available as a free, rate-limited experimental model. Meanwhile, OpenAI released GPT-4o native image generation, an autoregressive approach to image generation, with detailed insights shared by Allan Jabri and credits to Gabe Goh. Gemini 2.5 Pro excels at reasoning, coding, STEM, multimodal tasks, and instruction following, topping the LMArena leaderboard by a significant margin. It is accessible via Google AI Studio and the Gemini App.
not much happened today
gpt-4.5 claude-3.7-sonnet deepseek-r1 smolagents-codeagent gpt-4o llama-3-8b tinyr1-32b-preview r1-searcher forgetting-transformer nanomoe openai deepseek hugging-face mixture-of-experts reinforcement-learning kv-cache-compression agentic-ai model-distillation attention-mechanisms model-compression minimax model-pretraining andrej-karpathy cwolferesearch aymericroucher teortaxestex jonathanross321 akhaliq
The AI news recap highlights several key developments: nanoMoE, a PyTorch implementation of a mid-sized Mixture-of-Experts (MoE) model inspired by Andrej Karpathy's nanoGPT, enables pretraining on commodity hardware within a week. An agentic leaderboard ranks LLMs powering smolagents CodeAgent, with GPT-4.5 leading, followed by Claude-3.7-Sonnet. Discussions around DeepSeek-R1 emphasize AI model commoditization, with DeepSeek dubbed the "OpenAI of China." Q-Filters offer a training-free method for KV cache compression in autoregressive models, achieving 32x compression with minimal perplexity loss. The PokéChamp minimax language agent, powered by GPT-4o and Llama-3-8b, demonstrates strong performance in Pokémon battles. Other notable models include TinyR1-32B-Preview with Branch-Merge Distillation, R1-Searcher incentivizing search capability via reinforcement learning, and the Forgetting Transformer using a Forget Gate in softmax attention. These advancements reflect ongoing innovation in model architectures, compression, reinforcement learning, and agentic AI.
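The general shape of projection-based KV-cache compression can be sketched with a toy pruner that scores each cached key by its projection onto an estimated query direction and keeps only the strongest entries. This is a simplified stand-in to illustrate the idea; Q-Filters' actual training-free criterion is derived from query-key geometry and differs in detail:

```python
import math

def prune_kv_cache(keys, values, query_dir, keep):
    """Toy KV-cache pruning: rank cached keys by |projection| onto an
    estimated query direction and retain only the top `keep` entries.
    Illustrative sketch only, not the exact Q-Filters method."""
    norm = math.sqrt(sum(c * c for c in query_dir))
    unit = [c / norm for c in query_dir]
    scores = [abs(sum(kc * uc for kc, uc in zip(k, unit))) for k in keys]
    ranked = sorted(range(len(keys)), key=lambda i: scores[i], reverse=True)
    kept = sorted(ranked[:keep])  # preserve original token order
    return [keys[i] for i in kept], [values[i] for i in kept]

# Keys nearly orthogonal to the query direction contribute little to
# attention logits, so they are the first candidates to drop.
keys = [[1.0, 0.0], [0.0, 1.0], [2.0, 0.0]]
values = ["tok0", "tok1", "tok2"]
pruned_k, pruned_v = prune_kv_cache(keys, values, [1.0, 0.0], keep=2)
```

Dropping entries this way shrinks cache memory linearly with the number of pruned tokens, at the cost of discarding context the scoring heuristic judges unimportant.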
not much happened today
jamba-1.6 mistral-ocr qwq-32b o1 o3-mini instella llama-3-2-3b gemma-2-2b qwen-2-5-3b babel-9b babel-83b gpt-4o claude-3-7-sonnet ai21-labs mistral-ai alibaba openai amd anthropic hugging-face multimodality ocr multilinguality structured-output on-prem-deployment reasoning benchmarking api open-source model-training gpu-optimization prompt-engineering function-calling
AI21 Labs launched Jamba 1.6, touted as the best open model for private enterprise deployment, outperforming Cohere, Mistral, and Llama on benchmarks like Arena Hard. Mistral AI released a state-of-the-art multimodal OCR model with multilingual and structured output capabilities, available for on-prem deployment. Alibaba Qwen introduced QwQ-32B, an open-weight reasoning model with 32B parameters and cost-effective usage, showing competitive benchmark scores. OpenAI released o1 and o3-mini models with advanced API features including streaming and function calling. AMD unveiled Instella, open-source 3B parameter language models trained on AMD Instinct MI300X GPUs, competing with Llama-3.2-3B and others. Alibaba also released Babel, open multilingual LLMs performing comparably to GPT-4o. Anthropic launched Claude 3.7 Sonnet, enhancing reasoning and prompt engineering capabilities.
not much happened today
gpt-4.5 gpt-4 gpt-4o o1 claude-3.5-sonnet claude-3.7 claude-3-opus deepseek-v3 grok-3 openai anthropic perplexity-ai deepseek scaling01 model-performance humor emotional-intelligence model-comparison pricing context-windows model-size user-experience andrej-karpathy jeremyphoward abacaj stevenheidel yuchenj_uw aravsrinivas dylan522p random_walker
GPT-4.5 sparked mixed reactions on Twitter, with @karpathy noting that users preferred GPT-4 outputs in a poll despite his personal preference for GPT-4.5's creativity and humor. Critics like @abacaj highlighted GPT-4.5's slowness and questioned its practical value and pricing compared to other models. Performance-wise, GPT-4.5 ranks above GPT-4o but below o1 and Claude 3.5 Sonnet; Claude 3.7 outperforms it on many tasks, yet GPT-4.5 is praised for its humor and "vibes." Speculation about GPT-4.5's size suggests around 5 trillion parameters. Discussions also touched on pricing disparities, with Perplexity Deep Research at $20/month versus ChatGPT Pro at $200/month. The emotional intelligence and humor of models like Claude 3.7 were also noted.
lots of small launches
gpt-4o claude-3.7-sonnet claude-3.7 claude-3.5-sonnet deepseek-r1 deepseek-v3 grok-3 openai anthropic amazon cloudflare perplexity-ai deepseek-ai togethercompute elevenlabs elicitorg inceptionailabs mistral-ai voice model-releases cuda gpu-optimization inference open-source api model-performance token-efficiency context-windows jit-compilation lmarena_ai alexalbert__ aravsrinivas reach_vb
GPT-4o Advanced Voice Preview is now available for free ChatGPT users with enhanced daily limits for Plus and Pro users. Claude 3.7 Sonnet has achieved the top rank in WebDev Arena with improved token efficiency. DeepSeek-R1 with 671B parameters benefits from the Together Inference platform optimizing NVIDIA Blackwell GPU usage, alongside the open-source DeepGEMM CUDA library delivering up to 2.7x speedups on Hopper GPUs. Perplexity launched a new Voice Mode and a Deep Research API. The upcoming Grok 3 API will support a 1M token context window. Several companies including Elicit, Amazon, Anthropic, Cloudflare, FLORA, Elevenlabs, and Inception Labs announced new funding rounds, product launches, and model releases.
not much happened today
claude-3.7-sonnet claude-3.7 deepseek-r1 o3-mini deepseek-v3 gemini-2.0-pro gpt-4o qwen2.5-coder-32b-instruct anthropic perplexity-ai amazon google-cloud deepseek_ai coding reasoning model-benchmarking agentic-workflows context-window model-performance open-source moe model-training communication-libraries fp8 nvlink rdma cli-tools skirano omarsar0 reach_vb artificialanlys terryyuezhuo _akhaliq _philschmid catherineols goodside danielhanchen
Claude 3.7 Sonnet demonstrates exceptional coding and reasoning capabilities, outperforming models like DeepSeek R1, O3-mini, and GPT-4o on benchmarks such as SciCode and LiveCodeBench. It is available on platforms including Perplexity Pro, Anthropic, Amazon Bedrock, and Google Cloud, with pricing at $3/$15 per million tokens. Key features include a 64k token thinking mode, 200k context window, and the CLI-based coding assistant Claude Code. Meanwhile, DeepSeek released DeepEP, an open-source communication library optimized for MoE model training and inference with support for NVLink, RDMA, and FP8. These updates highlight advancements in coding AI and efficient model training infrastructure.
X.ai Grok 3 and Mira Murati's Thinking Machines
grok-3 grok-3-mini gemini-2-pro gpt-4o o3-mini-high o1 deepseek-r1 anthropic openai thinking-machines benchmarking reasoning reinforcement-learning coding multimodality safety alignment research-publishing model-performance creative-ai mira-murati lmarena_ai karpathy omarsar0 ibab arankomatsuzaki iscienceluvr scaling01
Grok 3 has launched with mixed opinions but strong benchmark performance, notably outperforming models like Gemini 2 Pro and GPT-4o. The Grok-3 mini variant shows competitive and sometimes superior capabilities, especially in reasoning and coding, with reinforcement learning playing a key role. Mira Murati has publicly shared her post-OpenAI plan, founding the frontier lab Thinking Machines, focusing on collaborative, personalizable AI, multimodality, and empirical safety and alignment research, reminiscent of Anthropic's approach.
not much happened today
zonos-v0.1 audiobox-aesthetics moshi sonar llama-3-70b gpt-4o-mini claude-3.5-haiku gpt-4o claude-3.5-sonnet deepseek-r1-distilled-qwen-1.5b reasonflux-32b o1-preview zyphra-ai meta-ai-fair kyutai-labs perplexity-ai cerebras uc-berkeley brilliant-labs google-deepmind text-to-speech speech-to-speech benchmarking model-performance reinforcement-learning math real-time-processing open-source cross-platform-integration multilinguality zero-shot-learning danhendrycks
Zyphra AI launched Zonos-v0.1, a leading open-weight text-to-speech model supporting multiple languages and zero-shot voice cloning. Meta FAIR released the open-source Audiobox Aesthetics model trained on 562 hours of audio data. Kyutai Labs introduced Moshi, a real-time speech-to-speech system with low latency. Perplexity AI announced the Sonar model based on Llama 3.3 70b, outperforming top models like GPT-4o and Claude 3.5 Sonnet with 1200 tokens/second speed, powered by Cerebras infrastructure. UC Berkeley open-sourced a 1.5B model trained with reinforcement learning that beats o1-preview on math tasks. ReasonFlux-32B achieved 91.2% on the MATH benchmark, outperforming OpenAI o1-preview. CrossPoster, an AI agent for cross-platform posting, was released using LlamaIndex workflows. Brilliant Labs integrated the Google DeepMind Gemini Live API into smart glasses for real-time translation and object identification.
o3-mini launches, OpenAI on "wrong side of history"
o3-mini o1 gpt-4o mistral-small-3-24b deepseek-r1 openai mistral-ai deepseek togethercompute fireworksai_hq ai-gradio replicate reasoning safety cost-efficiency model-performance benchmarking api open-weight-models model-releases sam-altman
OpenAI released o3-mini, a new reasoning model available to free and paid users, with a "high" reasoning-effort option that outperforms the earlier o1 model on STEM tasks and safety benchmarks while costing 93% less per token. Sam Altman acknowledged OpenAI has been on the "wrong side of history" regarding open source and credited DeepSeek R1 for changing assumptions. Mistral AI launched Mistral Small 3 (24B), an open-weight model with competitive performance and low API costs. DeepSeek R1 is supported by Text-generation-inference v3.1.0 and available via ai-gradio and Replicate. The news highlights advancements in reasoning, cost-efficiency, and safety in AI models.
Bespoke-Stratos + Sky-T1: The Vicuna+Alpaca moment for reasoning
sky-t1-32b-preview qwen-2.5-32b r1 o1-preview gpt-4o claude-3-sonnet bespoke-stratos-32b gemini-2.0-flash-thinking berkeley usc deepseek bespoke-labs google llmsys stanford lm-sys reasoning supervised-finetuning reinforcement-learning multimodality model-distillation context-windows code-execution model-repeatability behavioral-self-awareness rlhf teortaxestex cwolferesearch madiator chakraai philschmid abacaj omarsar0
Reasoning distillation has emerged as a key technique, with Berkeley/USC researchers releasing Sky-T1-32B-Preview, a finetune of Qwen 2.5 32B on 17k reasoning traces for just $450, matching o1-preview on benchmarks. DeepSeek introduced R1, a model surpassing o1-preview and enabling distillation into smaller models, bringing even a 1.5B Qwen up to GPT-4o and Claude 3 Sonnet levels. Bespoke Labs further distilled R1 on Qwen, outperforming o1-preview with fewer samples. This progress suggests that "SFT is all you need" for reasoning, without major architecture changes. Additionally, DeepSeek-R1 combines pure reinforcement learning with supervised finetuning to accelerate convergence and shows strong reasoning and multimodal capabilities. Google's Gemini 2.0 Flash Thinking model boasts a 1 million token context window, code execution, and excels at math, science, and multimodal reasoning. Critiques highlight challenges in model repeatability, behavioral self-awareness, and RLHF limitations in reasoning robustness.
not much happened today
deepseek-v3 llama-3-1-405b gpt-4o gpt-5 minimax-01 claude-3-haiku cosmos-nemotron-34b openai deep-learning-ai meta-ai-fair google-deepmind saama langchain nvidia mixture-of-experts coding math scaling visual-tokenizers diffusion-models inference-time-scaling retrieval-augmented-generation ai-export-restrictions security-vulnerabilities prompt-injection gpu-optimization fine-tuning personalized-medicine clinical-trials ai-agents persistent-memory akhaliq
DeepSeek-V3, a 671 billion parameter mixture-of-experts model, surpasses Llama 3.1 405B and GPT-4o on coding and math benchmarks. An upcoming GPT-5 release from OpenAI was also discussed. MiniMax-01 Coder mode in ai-gradio enables building a chess game in one shot. Meta research highlights trade-offs in scaling visual tokenizers. Google DeepMind improves diffusion-model quality via inference-time scaling. The RA-DIT method fine-tunes LLMs and retrievers jointly for better RAG responses. The U.S. proposed a three-tier export restriction system on AI chips and models, excluding countries like China and Russia. Security vulnerabilities in AI chatbots involving CSRF and prompt injection were revealed, alongside broader concerns about superintelligence and weapons-grade AI models. ai-gradio updates include NVIDIA NIM compatibility and new models like cosmos-nemotron-34b. LangChain integrates with Claude 3 Haiku for AI agents with persistent memory. Triton warp specialization optimizes GPU usage for matrix multiplication. The OpenBioLLM-8B and OpenBioLLM-70B models, fine-tuned from Meta's Llama, target personalized medicine and clinical trials.
not much happened today
oute-tts-0.3-1b oute-tts-0.3-500m olm-1b qwen-2.5-0.5b hover gpt-4o deepseek-v3 harvey meta-ai-fair stability-ai alibaba deepseek hugging-face text-to-speech zero-shot-learning multilinguality emotion-control motor-control reinforcement-learning local-ai distributed-inference pipeline-parallelism mathematical-reasoning process-reward-models legal-ai education-ai ai-security humor reach_vb drjimfan vikhyatk mervenoyann aiatmeta iscienceluvr alibaba_qwen awnihannun ajeya_cotra emollick qtnx_ designerx
Harvey secured a new $300M funding round. OuteTTS 0.3 1B & 500M text-to-speech models were released featuring zero-shot voice cloning, multilingual support (en, jp, ko, zh, fr, de), and emotion control, powered by OLMo-1B and Qwen 2.5 0.5B. The HOVER model, a 1.5M-parameter neural net for agile motor control, was introduced, leveraging human motion capture datasets and massively parallel reinforcement learning. kokoro.js enables running AI models locally in browsers with minimal dependencies. Meta AI awarded $200K LLM evaluation grants for projects on regional language understanding, complex reasoning, and interactive programming environments. Stability AI's Twitter account was hacked, prompting security warnings. Alibaba Qwen improved Process Reward Models (PRMs) for better mathematical reasoning using a consensus filtering mechanism. DeepSeek V3 uses pipeline parallelism to enhance distributed inference and long-context generation efficiency. Discussions on AI policy in legal frameworks and AI's role in democratizing education were highlighted. Lighthearted AI-related humor was also shared.
Titans: Learning to Memorize at Test Time
minimax-01 gpt-4o claude-3.5-sonnet internlm3-8b-instruct transformer2 google meta-ai-fair openai anthropic langchain long-context mixture-of-experts self-adaptive-models prompt-injection agent-authentication diffusion-models zero-trust-architecture continuous-adaptation vision agentic-systems omarsar0 hwchase17 abacaj hardmaru rez0__ bindureddy akhaliq saranormous
Google released a new paper on "Neural Memory" integrating persistent memory directly into transformer architectures at test time, showing promising long-context utilization. MiniMax-01, highlighted by @omarsar0, features a 4 million token context window with 456B parameters and 32 experts, outperforming GPT-4o and Claude-3.5-Sonnet. InternLM3-8B-Instruct is an open-source model trained on 4 trillion tokens with state-of-the-art results. Transformer² introduces self-adaptive LLMs that dynamically adjust weights for continuous adaptation. Advances in AI security highlight the need for agent authentication, prompt injection defenses, and zero-trust architectures. Tools like Micro Diffusion enable budget-friendly diffusion model training, while LeagueGraph and Agent Recipes support open-source social media agents.
PRIME: Process Reinforcement through Implicit Rewards
claude-3.5-sonnet gpt-4o deepseek-v3 gemini-2.0 openai together-ai deepseek langchain lucidrains reinforcement-learning scaling-laws model-performance agent-architecture software-development compute-scaling multi-expert-models sama aidan_mclau omarsar0 akhaliq hwchase17 tom_doerr lmarena_ai cwolferesearch richardmcngo
Implicit Process Reward Models (PRIME) have been highlighted as a significant advancement in online reinforcement learning; trained on a 7B model, the method posts impressive results compared to gpt-4o. The approach builds on the importance of process reward models established by "Let's Verify Step By Step." Additionally, AI Twitter discussions cover topics such as proto-AGI capabilities with claude-3.5-sonnet, the role of compute scaling for Artificial Superintelligence (ASI), and model performance nuances. New AI tools like Gemini 2.0 coder mode and LangGraph Studio enhance agent architecture and software development. Industry events include the LangChain AI Agent Conference and meetups fostering AI community connections. Company updates reveal OpenAI's financial challenges with Pro subscriptions and DeepSeek-V3's integration with Together AI APIs, showcasing an efficient 671B-parameter MoE model. Research discussions focus on scaling laws and compute efficiency in large language models.
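In the implicit-PRM formulation PRIME builds on, per-token process rewards can be read off as scaled log-likelihood ratios between the policy and a reference model, with no step-level labels. A toy sketch (this is not the paper's code; the β value and log-probabilities are invented for illustration):

```python
def implicit_process_rewards(logp_policy, logp_ref, beta=0.05):
    """Per-token implicit process reward: beta * (log pi - log pi_ref).
    Positive where the policy is more confident than the reference,
    negative where it has drifted toward less likely tokens."""
    return [beta * (lp - lr) for lp, lr in zip(logp_policy, logp_ref)]

# Invented per-token log-probs for a 3-token completion.
rewards = implicit_process_rewards([-1.0, -0.5, -2.0], [-1.2, -0.9, -1.5])
```

In an online RL loop, these dense per-token signals would supplement a sparse outcome reward at the end of the completion.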
not much happened today
prime gpt-4o qwen-32b olmo openai qwen cerebras-systems langchain vercel swaggo gin echo reasoning chain-of-thought math coding optimization performance image-processing software-development agent-frameworks version-control security robotics hardware-optimization medical-ai financial-ai architecture akhaliq jason-wei vikhyatk awnihannun arohan tom-doerr hendrikbgr jerryjliu0 adcock-brett shuchaobi stasbekman reach-vb virattt andrew-n-carr
Olmo 2 released a detailed tech report showcasing full pre, mid, and post-training details for a frontier fully open model. PRIME, an open-source reasoning solution, achieved 26.7% pass@1, surpassing GPT-4o in benchmarks. Performance improvements include Qwen 32B (4-bit) generating at >40 tokens/sec on an M4 Max and libvips being 25x faster than Pillow for image resizing. New tools like Swaggo/swag for Swagger 2.0 documentation, Jujutsu (jj) Git-compatible VCS, and Portspoof security tool were introduced. Robotics advances include a weapon detection system with a meters-wide field of view and faster frame rates. Hardware benchmarks compared H100 and MI300x accelerators. Applications span medical error detection using PRIME and a financial AI agent integrating LangChainAI and Vercel AI SDK. Architectural insights suggest the need for breakthroughs similar to SSMs or RNNs.
DeepSeek v3: 671B finegrained MoE trained for $5.5m USD of compute on 15T tokens
deepseek-v3 gpt-4o claude-3.5-sonnet llama-3 deepseek-ai hugging-face openai anthropic mixture-of-experts model-training model-optimization reinforcement-learning chain-of-thought multi-token-prediction synthetic-data model-distillation fine-tuning attention-mechanisms gpu-optimization nrehiew_ denny_zhou
DeepSeek-V3 has launched with 671B MoE parameters and trained on 14.8T tokens, outperforming GPT-4o and Claude-3.5-sonnet in benchmarks. It was trained with only 2.788M H800 GPU hours, significantly less than Llama-3's 30.8M GPU-hours, showcasing major compute efficiency and cost reduction. The model is open-source and deployed via Hugging Face with API support. Innovations include native FP8 mixed precision training, Multi-Head Latent Attention scaling, distillation from synthetic reasoning data, pruning and healing for MoEs with up to 256 experts, and a new multi-token prediction objective enabling lookahead token planning. Research highlights also cover the OREO method and Natural Language Reinforcement Learning (NLRL) for multi-step reasoning and agent control.
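The fine-grained MoE design is why 671B total parameters stay cheap to run: each token activates only a handful of the routed experts. A toy routing sketch (not DeepSeek's implementation — the softmax-then-top-k gating below is the generic MoE recipe, and the logits are made up):

```python
import math

def topk_moe_route(gate_logits, k=8):
    """Toy top-k expert routing: softmax over expert logits, keep the k
    most probable experts, and renormalise their gate weights so the
    selected experts' contributions sum to 1."""
    m = max(gate_logits)                      # subtract max for stability
    exps = [math.exp(g - m) for g in gate_logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return {i: probs[i] / norm for i in top}  # expert index -> gate weight

# 256 routed experts, 8 active per token (invented logits).
weights = topk_moe_route([0.1 * i for i in range(256)], k=8)
```

Only the k selected experts' parameters are touched for this token, which is how active parameters stay far below the total count.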
not much happened today
qwen-o1 qvq claude-3.5-sonnet gpt-4o o3 o3-mini alibaba openai mit idsia llamaindex ollama vision benchmarking llm-calibration intentionality alignment-faking deliberative-alignment artificial-life gdpr-compliance contract-review-agent app-creation synthetic-data post-transformers smol-models agents bret-taylor
The Qwen team launched QVQ, a vision-enabled version of their experimental QwQ o1 clone, benchmarking comparably to Claude 3.5 Sonnet. Discussions include Bret Taylor's insights on autonomous software development distinct from the Copilot era. The Latent Space LIVE! talks cover highlights of 2024 AI startups, vision, open models, post-transformers, synthetic data, smol models, and agents. Twitter recaps by Claude 3.5 Sonnet highlight proposals for benchmarks measuring LLM calibration and falsehood confidence, with QVQ outperforming GPT-4o and Claude Sonnet 3.5. AI alignment debates focus on intentionality and critiques of alignment faking in models like Claude. Updates from OpenAI include new o3 and o3-mini models and a deliberative alignment strategy. The ASAL project is a collaboration between MIT, OpenAI, and Swiss AI Lab IDSIA to automate artificial life discovery. Personal stories reveal frustrations with USCIS green card denials despite high qualifications. New tools like GeminiCoder enable rapid app creation, and a contract review agent using Reflex and Llama Index checks GDPR compliance. Holiday greetings and memes were also shared.
o3 solves AIME, GPQA, Codeforces, makes 11 years of progress in ARC-AGI and 25% in FrontierMath
o3 o3-mini o1-mini gpt-3 gpt-4o o1 openai benchmarking math reasoning model-performance inference-speed cost-efficiency alignment safety-testing sama eric-wallace
OpenAI announced the o3 and o3-mini models with groundbreaking benchmark results, including a jump from 2% to 25% on the FrontierMath benchmark and 87.5% on the ARC-AGI reasoning benchmark, representing about 11 years of progress on the GPT-3 to GPT-4o scaling curve. The o1-mini model shows superior inference efficiency compared to o3-full, promising significant cost reductions on coding tasks. The announcement was accompanied by community discussions, safety testing applications, and detailed analyses. Sama highlighted the unusual cost-performance tradeoff, and Eric Wallace shared insights on the o-series deliberative alignment strategy.
Genesis: Generative Physics Engine for Robotics (o1-mini version)
o1 o1-preview gpt-4o claude-3.5-sonnet gemini-2.0-pro llama-3-3b llama-3-70b openai google-deepmind meta-ai-fair hugging-face function-calling structured-outputs vision performance-benchmarks sdk webrtc reasoning math code-generation transformer-architecture model-training humanoid-robots search model-efficiency dataset-sharing aidan_mclau sundarpichai adcock_brett
OpenAI launched the o1 model API featuring function calling, structured outputs, vision support, and developer messages, achieving 60% fewer reasoning tokens than its preview. The model excels in math and code with a 0.76 LiveBench Coding score, outperforming Sonnet 3.5. Beta SDKs for Go and Java and WebRTC support with 60% lower prices were also released. Google Gemini 2.0 Pro (Gemini Exp 1206) deployment accelerated, showing improved coding, math, and reasoning performance. Meta AI FAIR introduced research on training transformers directly on raw bytes using dynamic entropy-based patching. Commercial humanoid robots were successfully deployed by an industry player. Hugging Face researchers demonstrated that their 3B Llama model can outperform the 70B Llama model on MATH-500 accuracy using search techniques, highlighting efficiency gains with smaller models. Concerns about reproducibility and domain-specific limitations were noted.
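The small-beats-large result follows the general best-of-N, verifier-guided search recipe: sample many candidate solutions from the small model and let a scorer pick the winner. A minimal sketch (the scoring lambda is a stand-in for a learned reward model or verifier):

```python
def best_of_n(candidates, score):
    """Return the candidate the scorer rates highest. In test-time
    search setups, `score` would be a learned process/outcome reward
    model rather than the toy heuristic used below."""
    return max(candidates, key=score)

# Toy usage: "reward" a numeric answer by closeness to a target of 42.
answers = [37, 41, 50]
best = best_of_n(answers, score=lambda a: -abs(a - 42))
```

Spending inference compute on more samples (larger N) plus a good verifier is what lets a 3B model close the gap to a 70B one on a verifiable benchmark like MATH-500.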
OpenAI Voice Mode Can See Now - After Gemini Does
gemini-2.0-flash claude claude-3.5-sonnet llama-3-70b llama-3 mistral-large gpt-4o openai google-deepmind anthropic togethercompute scale-ai meta-ai-fair mistral-ai multimodality real-time-streaming roleplay prompt-handling model-comparison model-training creative-writing model-censorship code-execution developer-ecosystem ai-humor bindureddy
OpenAI launched Realtime Video shortly after Gemini, which led to less impact due to Gemini's earlier arrival with lower cost and fewer rate limits. Google DeepMind released Gemini 2.0 Flash featuring enhanced multimodal capabilities and real-time streaming. Anthropic introduced Clio, a system analyzing real-world usage of Claude models. Together AI acquired CodeSandbox to launch a code interpreter tool. Discussions highlighted Meta's Llama 3.3-70B for its advanced roleplay and prompt handling abilities, outperforming models like Mistral Large and GPT-4o in expressiveness and censorship. The AI community also engaged in humorous takes on AI outages and model competition, with ChatGPT adding a Santa mode for holiday interactions. "Anthropic is capturing the developer ecosystem, Gemini has AI enthusiast mindshare, ChatGPT reigns over AI dabblers" was a noted observation from the community.
Meta BLT: Tokenizer-free, Byte-level LLM
byte-latent-transformer llama-3 phi-4 gpt-4o command-r7b meta-ai-fair llamaindex microsoft deepseek-ai openai cohere anthropic tokenization transformer-architecture model-efficiency benchmarking multimodality vision reinforcement-learning model-scaling jailbreaking model-optimization
Meta AI introduces the Byte Latent Transformer (BLT), a tokenizer-free architecture that dynamically forms byte patches for efficient compute allocation, outperforming Llama 3 on benchmarks including the CUTE benchmark. The model was trained on approximately 1 trillion tokens and features a three-block transformer design with local and global components. This approach challenges traditional tokenization and may enable new multimodal capabilities such as direct file interaction without retrieval-augmented generation. Additionally, Microsoft announced the Phi-4 14B parameter model achieving state-of-the-art results on STEM and reasoning benchmarks, surpassing GPT-4o. DeepSeek AI launched new vision-language models based on their MoE architecture with sizes ranging from 1.0B to 27B parameters. OpenAI released a new Projects feature for ChatGPT, and Cohere introduced their smallest and fastest Command R7B model. Anthropic published research on "Best-of-N Jailbreaking" vulnerabilities across text, vision, and audio models. Industry discussion highlights a trend of decreasing frontier LLM sizes, with GPT-4 at approximately 1.8 trillion parameters compared to newer models.
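The dynamic patching idea can be caricatured in a few lines (a toy rendering, not Meta's implementation; the threshold and entropy values are made up): a small model estimates next-byte entropy, and a new patch opens wherever prediction gets hard, so compute concentrates on difficult regions:

```python
import math

def entropy(probs):
    """Shannon entropy (bits) of a next-byte distribution."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

def patch_boundaries(per_byte_entropy, threshold=2.0):
    """Start a new byte patch wherever next-byte entropy crosses the
    threshold; easy-to-predict stretches stay in one long patch."""
    return [0] + [i for i, h in enumerate(per_byte_entropy)
                  if i > 0 and h > threshold]

# Invented per-position entropies for a 6-byte sequence.
bounds = patch_boundaries([0.1, 0.2, 3.5, 0.4, 2.8, 0.1])
```

The global transformer then runs once per patch rather than once per byte, which is where the efficiency over fixed tokenization comes from.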
Meta Llama 3.3: 405B/Nova Pro performance at 70B price
llama-3-70b llama-3.3-70b gpt-4o gemini-exp-1206 meta-ai-fair openai google-deepmind hugging-face llamacloud reinforcement-learning fine-tuning model-performance document-processing pricing-models alignment online-rl sama steven-heidel aidan_mclau lmarena_ai oriolvinyalsml jerryjliu0
Meta AI released Llama 3.3 70B, matching the performance of the 405B model with improved efficiency using "a new alignment process and progress in online RL techniques". OpenAI announced Reinforcement Fine-Tuning (RFT) for building expert models with limited data, offering alpha access to researchers and enterprises. Google DeepMind's Gemini-Exp-1206 leads benchmarks, tying with GPT-4o in coding performance. LlamaCloud enhanced document processing with table extraction and analytics. Discussions on OpenAI's pricing plans continue in the community.
Olympus has dropped (aka, Amazon Nova Micro|Lite|Pro|Premier|Canvas|Reel)
amazon-nova claude-3 llama-3-70b gemini-1.5-flash gpt-4o amazon anthropic google-deepmind sakana-ai-labs multimodality benchmarking model-merging model-performance model-architecture model-optimization population-based-learning philschmid bindureddy
Amazon announced the Amazon Nova family of multimodal foundation models at AWS Re:Invent, available immediately with no waitlist in configurations like Micro, Lite, Pro, Canvas, and Reel, with Premier and speech-to-speech coming next year. These models offer 2-4x faster token speeds and are 25%-400% cheaper than competitors like Anthropic Claude models, positioning Nova as a serious contender in AI engineering. Pricing undercuts models such as Google DeepMind Gemini Flash 8B, and some Nova models extend context length up to 300k tokens. However, benchmarking controversy exists as some evaluations show Nova scoring below Llama-3 70B in LiveBench AI metrics. Separately, CycleQD was introduced by Sakana AI Labs, using evolutionary computation for population-based model merging to develop niche LLM agents.
Qwen with Questions: 32B open weights reasoning model nears o1 in GPQA/AIME/Math500
deepseek-r1 qwq gpt-4o claude-3.5-sonnet qwen-2.5 llama-cpp deepseek sambanova hugging-face dair-ai model-releases benchmarking fine-tuning sequential-search inference model-deployment agentic-rag external-tools multi-modal-models justin-lin clementdelangue ggerganov vikparuchuri
DeepSeek r1 leads the race for "open o1" models but has yet to release weights, while Justin Lin released QwQ, a 32B open weight model that outperforms GPT-4o and Claude 3.5 Sonnet on benchmarks. QwQ appears to be a fine-tuned version of Qwen 2.5, emphasizing sequential search and reflection for complex problem-solving. SambaNova promotes its RDUs as superior to GPUs for inference tasks, highlighting the shift from training to inference in AI systems. On Twitter, Hugging Face announced CPU deployment for llama.cpp instances, Marker v1 was released as a faster and more accurate document conversion tool, and Agentic RAG developments focus on integrating external tools and advanced LLM chains for improved response accuracy. The open-source AI community sees growing momentum with models like Flux gaining popularity, reflecting a shift towards multi-modal AI models including image, video, audio, and biology.
Stripe lets Agents spend money with StripeAgentToolkit
gpt-4o gemini-exp-1114 stripe openai anthropic meta-ai-fair ai-computer-interfaces agentic-ai model-overfitting benchmarks scaling-laws agi chain-of-thought image-captioning dialogue-systems memory-efficient-fine-tuning diffusion-models mixture-of-experts adaptive-decoding creativity-optimization factuality-optimization pair-programming document-parsing retrieval-augmented-generation abacaj francois-fleuret lmarena_ai goodside jxmnop jaseweston stevenheidel
Stripe has pioneered an AI SDK specifically designed for agents that handle payments, integrating with models like gpt-4o to enable financial transactions and token-based charging. The AI developer tooling trend emphasizes better "AI-Computer Interfaces" for improved agent reliability, with tools like E2B and the llms.txt documentation trend gaining traction, notably adopted by Anthropic. In AI model news, Gemini-Exp-1114 topped the Vision Leaderboard and improved in Math Arena, while discussions continue around model overfitting and the limits of scaling laws for AGI. OpenAI released a ChatGPT desktop app for macOS with integrations for VS Code, Xcode, and Terminal, enhancing developer workflows and pair programming. Anthropic introduced a prompt improver using chain-of-thought reasoning, and Meta AI shared top research from EMNLP 2024 on image captioning, dialogue systems, and memory-efficient fine-tuning. Highlights from ICLR 2025 include diffusion-based illumination harmonization, open mixture-of-experts language models, and hyperbolic vision-language models. A new adaptive decoding method optimizes creativity and factuality per token. Tools like LlamaParse and RAGformation were also introduced for document parsing and retrieval-augmented generation.
BitNet was a lie?
qwen-2.5-coder-32b-instruct gpt-4o llama-3 sambanova alibaba hugging-face quantization scaling-laws model-efficiency fine-tuning model-performance code-generation open-source unit-testing ci-cd tanishq-kumar tim-dettmers
A group led by Chris Ré has extended scaling laws to quantization, analyzing over 465 pretraining runs and finding that precision benefits plateau at FP6. Lead author Tanishq Kumar highlights that longer training and more data increase sensitivity to quantization, explaining challenges with models like Llama-3. Tim Dettmers, author of QLoRA, warns that the era of efficiency gains from low-precision quantization is ending, signaling a shift from scaling to optimizing existing resources. Additionally, Alibaba announced Qwen 2.5-Coder-32B-Instruct, which matches or surpasses GPT-4o on coding benchmarks, and open-source initiatives like DeepEval for LLM testing are gaining traction.
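As a generic illustration of the round-off error such quantization analyses study (a symmetric int8 round-trip, not the paper's FP6 setup; the values are invented):

```python
def int8_roundtrip(xs):
    """Symmetric per-tensor int8 quantize/dequantize. The worst-case
    reconstruction error is half the quantization step (scale / 2) --
    the precision loss whose scaling behaviour the paper studies."""
    scale = max(abs(x) for x in xs) / 127
    quantized = [round(x / scale) for x in xs]   # map to [-127, 127]
    return [q * scale for q in quantized]        # map back to floats

vals = [0.9, -0.31, 0.004, 0.5]
recovered = int8_roundtrip(vals)
max_err = max(abs(a - b) for a, b in zip(vals, recovered))
```

Fewer bits mean a coarser step and larger worst-case error; the finding above is that below roughly FP6 this error starts to cost more than the compute it saves, especially for heavily trained models.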
FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI
o1 claude-3.5-haiku gpt-4o epoch-ai openai microsoft anthropic x-ai langchainai benchmarking math moravecs-paradox mixture-of-experts chain-of-thought agent-framework financial-metrics-api pdf-processing few-shot-learning code-generation karpathy philschmid adcock_brett dylan522p
Epoch AI collaborated with over 60 leading mathematicians to create the FrontierMath benchmark, a fresh set of hundreds of original math problems with easy-to-verify answers, aiming to challenge current AI models. The benchmark reveals that all tested models, including o1, perform poorly, highlighting the difficulty of complex problem-solving and Moravec's paradox in AI. Key AI developments include the introduction of Mixture-of-Transformers (MoT), a sparse multi-modal transformer architecture reducing computational costs, and improvements in Chain-of-Thought (CoT) prompting through incorrect reasoning and explanations. Industry news covers OpenAI acquiring the chat.com domain, Microsoft launching the Magentic-One agent framework, Anthropic releasing Claude 3.5 Haiku outperforming gpt-4o on some benchmarks, and xAI securing 150MW grid power with support from Elon Musk and Trump. LangChain AI introduced new tools including a Financial Metrics API, Document GPT with PDF upload and Q&A, and LangPost AI agent for LinkedIn posts. xAI also demonstrated the Grok Engineer compatible with OpenAI and Anthropic APIs for code generation.
The AI Search Wars Have Begun — SearchGPT, Gemini Grounding, and more
gpt-4o o1-preview claude-3.5-sonnet universal-2 openai google gemini nyt perplexity-ai glean nvidia langchain langgraph weights-biases cohere weaviate fine-tuning synthetic-data distillation hallucinations benchmarking speech-to-text robotics neural-networks ai-agents sam-altman alexalbert__ _jasonwei svpino drjimfan virattt
ChatGPT launched its search functionality across all platforms using a fine-tuned version of GPT-4o with synthetic data generation and distillation from o1-preview. This feature includes a Chrome extension promoted by Sam Altman but has issues with hallucinations. The launch coincides with Gemini introducing Search Grounding after delays. Notably, The New York Times is not a partner due to a lawsuit against OpenAI. The AI search competition intensifies with consumer and B2B players like Perplexity and Glean. Additionally, Claude 3.5 Sonnet achieved a new benchmark record on SWE-bench Verified, and a new hallucination evaluation benchmark, SimpleQA, was introduced. Other highlights include the Universal-2 speech-to-text model with 660M parameters and HOVER, a neural whole-body controller for humanoid robots trained in NVIDIA Isaac simulation. AI hedge fund teams using LangChain and LangGraph were also showcased. The news is sponsored by the RAG++ course featuring experts from Weights & Biases, Cohere, and Weaviate.
DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing
bitnet-b1.58 llama-3.1-nemotron-70b-instruct gpt-4o claude-3.5-sonnet uc-berkeley deepmind openai microsoft nvidia archetype-ai boston-dynamics toyota-research google adobe openai mistral tesla meta-ai-fair model-optimization on-device-ai fine-tuning large-corpus-processing gpu-acceleration frameworks model-benchmarking rohanpaul_ai adcock_brett david-patterson
UC Berkeley's EPIC lab introduces innovative LLM data operators with projects like LOTUS and DocETL, focusing on effective programming and computation over large data corpora. This approach contrasts the GPU-rich approach of big labs like DeepMind and OpenAI with GPU-poor compound AI systems. Microsoft open-sourced BitNet b1.58, a 1-bit ternary parameter LLM enabling 4-20x faster training and on-device inference at human reading speeds. Nvidia released Llama-3.1-Nemotron-70B-Instruct, a fine-tuned open-source model outperforming GPT-4o and Claude-3.5-sonnet. These developments highlight advances in model-optimization, on-device-ai, and fine-tuning.
DeepSeek Janus and Meta SpiRit-LM: Decoupled Image and Expressive Voice Omnimodality
nemotron-70b claude claude-3.5-sonnet gpt-4o deepseek meta-ai-fair wandb nvidia anthropic hugging-face perplexity-ai multimodality image-generation speech-synthesis fine-tuning model-merging benchmarking open-source model-optimization reinforcement-learning bindureddy aravsrinivas danielhanchen clementdelangue cwolferesearch
DeepSeek Janus and Meta SpiRit-LM are two notable multimodality AI models recently released, showcasing advances in image generation and speech synthesis respectively. DeepSeek Janus separates vision encoders for image understanding and generation, achieving better results in both tasks. Meta's SpiRit-LM introduces an expressive speech and writing model generating pitch and style units, improving over standard TTS. Additionally, W&B Weave offers comprehensive LLM observability and multimodality fine-tuning tools. Industry updates include Nvidia's Nemotron 70b model underperforming, Meta open-sourcing Movie Gen Bench for media generation benchmarking, Perplexity launching internal search with multi-step reasoning, and Anthropic updating Claude apps. Open source progress includes Hugging Face's gradient accumulation fix in transformers and advocacy for open source AI to prevent Big Tech dominance. "Model merging for combining skills of multiple models" is also highlighted.
not much happened today
claudette llama-3-1 yi-lightning gpt-4o claude-3.5-sonnet answer-ai tencent notebooklm motherduck perplexity dropbox openai meta-ai-fair yi-ai zyphra-ai anthropic langchain openai synthetic-data fine-tuning sql audio-processing on-device-ai dataset-release transformer llm-reasoning ai-safety code-generation ai-pricing ai-job-market fchollet aravsrinivas svpino swyx
Answer.ai launched fastdata, a synthetic data generation library using claudette and Tencent's Billion Persona paper. NotebookLM became customizable, and Motherduck introduced notable LLMs in SQL implementations. Perplexity and Dropbox announced competitors to Glean. OpenAI unveiled audio chat completions priced at 24 cents per minute. Meta AI released Llama 3.1, powering Lenovo AI Now's on-device agent. Yi-Lightning model ranked #6 globally, surpassing GPT-4o. Zyphra AI released the large Zyda-2 dataset with 5 trillion tokens. François Chollet clarified transformer architecture as set-processing, not sequence-processing. Research suggests memorization aids LLM reasoning. Anthropic updated its Responsible Scaling Policy for AI safety. Tools like Perplexity Finance, Open Canvas by LangChain, and AlphaCodium code generation tool were highlighted. Approximately $500 million was raised for AI agent startups, with ongoing discussions on AI's job market impact. Combining prompt caching with the Batches API can yield a 95% discount on Claude 3.5 Sonnet tokens.
Did Nvidia's Nemotron 70B train on test?
nemotron-70b llama-3.1-70b llama-3.1 ministral-3b ministral-8b gpt-4o claude-3.5-sonnet claude-3.5 nvidia mistral-ai hugging-face zep benchmarking reinforcement-learning reward-models temporal-knowledge-graphs memory-layers context-windows model-releases open-source reach_vb philschmid swyx
NVIDIA's Nemotron-70B model has drawn scrutiny despite strong benchmark performances on Arena Hard, AlpacaEval, and MT-Bench, with some standard benchmarks like GPQA and MMLU Pro showing no improvement over the base Llama-3.1-70B. The new HelpSteer2-Preference dataset improves some benchmarks with minimal losses elsewhere. Meanwhile, Mistral released Ministral 3B and 8B models featuring 128k context length and outperforming Llama-3.1 and GPT-4o on various benchmarks under the Mistral Commercial License. NVIDIA's Nemotron 70B also surpasses GPT-4o and Claude-3.5-Sonnet on key benchmarks using RLHF (REINFORCE) training. Additionally, Zep introduced Graphiti, an open-source temporal knowledge graph memory layer for AI agents, built on Neo4j.
Canvas: OpenAI's answer to Claude Artifacts
gpt-4o claude-artifacts openai cursor_ai daily inline-suggestions collaborative-editing code-editing model-training model-integration feature-detection accuracy-evaluation voice-ai hackathon open-source-libraries marijn-haverbeke karina-nguyen vicente-silveira swyx
OpenAI released Canvas, an enhanced writing and coding tool based on GPT-4o, featuring inline suggestions, seamless editing, and a collaborative environment. Early feedback compares it to Cursor and Claude Artifacts, noting strengths and some execution issues. OpenAI also sponsors Marijn Haverbeke, creator of ProseMirror and CodeMirror, which are used in Canvas. The integration involved training a detector to trigger Canvas appropriately, achieving 83% accuracy in correct triggers. Unlike Claude Artifacts, Canvas currently lacks Mermaid Diagrams and HTML preview support. Additionally, Daily is sponsoring a $20,000 voice AI hackathon in San Francisco, highlighting voice AI as a key emerging skill.
OpenAI Realtime API and other Dev Day Goodies
gpt-4o-realtime-preview gpt-4o openai livekit agora twilio grab automat voice-activity-detection function-calling ephemeral-sessions auto-truncation vision-fine-tuning model-distillation prompt-caching audio-processing
OpenAI launched the gpt-4o-realtime-preview Realtime API featuring text and audio token processing with pricing details and future plans including vision and video support. The API supports voice activity detection modes, function calling, and ephemeral sessions with auto-truncation for context limits. Partnerships with LiveKit, Agora, and Twilio enhance audio components and AI virtual agent voice calls. Additionally, OpenAI introduced vision fine-tuning with only 100 examples improving mapping accuracy for Grab and RPA success for Automat. Model distillation and prompt caching features were also announced, including free eval inference for users opting to share data.
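An illustrative session configuration might look like the following; the field names are assumptions reconstructed from the launch description, not a verified schema, so consult the API reference before relying on them:

```python
# Hypothetical sketch of a Realtime API session-update event. Field
# names/values below are assumptions based on the launch description.
session_update = {
    "type": "session.update",
    "session": {
        "modalities": ["text", "audio"],            # text + audio tokens
        "voice": "alloy",
        "turn_detection": {"type": "server_vad"},   # voice-activity detection mode
        "tools": [],                                # function-calling definitions
    },
}
```

In practice this event would be sent over the API's WebSocket connection at session start; auto-truncation then manages the context window as the conversation grows.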
not much happened today
o1-preview o1-mini qwen-2.5 gpt-4o deepseek-v2.5 gpt-4-turbo-2024-04-09 grin llama-3-1-405b veo kat openai qwen deepseek-ai microsoft kyutai-labs perplexity-ai together-ai meta-ai-fair google-deepmind hugging-face google anthropic benchmarking math coding instruction-following model-merging model-expressiveness moe voice voice-models generative-video competition open-source model-deployment ai-agents hyung-won-chung noam-brown bindureddy akhaliq karpathy aravsrinivas fchollet cwolferesearch philschmid labenz ylecun
OpenAI's o1-preview and o1-mini models lead benchmarks in Math, Hard Prompts, and Coding. Qwen 2.5 72B model shows strong performance close to GPT-4o. DeepSeek-V2.5 tops Chinese LLMs, rivaling GPT-4-Turbo-2024-04-09. Microsoft's GRIN MoE achieves good results with 6.6B active parameters. Moshi voice model from Kyutai Labs runs locally on Apple Silicon Macs. Perplexity app introduces voice mode with push-to-talk. LlamaCoder by Together.ai uses Llama 3.1 405B for app generation. Google DeepMind's Veo is a new generative video model for YouTube Shorts. The 2024 ARC-AGI competition increases prize money and plans a university tour. A survey on model merging covers 50+ papers for LLM alignment. The Kolmogorov–Arnold Transformer (KAT) paper proposes replacing MLP layers with KAN layers for better expressiveness. Hugging Face Hub integrates with Google Cloud Vertex AI Model Garden for easier open-source model deployment. Agent.ai is introduced as a professional network for AI agents. "Touching grass is all you need."
Learnings from o1 AMA
o1-preview o1-mini claude-3.5-sonnet gpt-4o openai weights-biases cohere weaviate reinforcement-learning chain-of-thought reasoning model-performance prompting code-editing rag hybrid-search sama rohanpaul_ai gdb andrew-mayne
OpenAI released the o1 model series, touted as their "most capable and aligned models yet," trained with reinforcement learning to enhance reasoning. The o1-preview model scored 21% on ARC-AGI, ~80% on aider code editing (surpassing Claude 3.5 Sonnet's 77%), and ~52% on Cognition-Golden, showcasing a shift from memorizing answers to memorizing reasoning. The model employs a unique chain-of-thought approach enabling "System II thinking" for better problem-solving. Experts like Andrew Mayne advise framing o1 as a smart friend providing thoughtful explanations. Additionally, an advanced RAG course sponsored by Weights & Biases, Cohere, and Weaviate offers strategies for hybrid search and prompting to optimize AI solutions.
o1: OpenAI's new general reasoning models
o1 o1-preview o1-mini gpt-4o llama openai nvidia test-time-reasoning reasoning-tokens token-limit competitive-programming benchmarking scaling-laws ai-chip-competition inference training model-performance jason-wei jim-fan
OpenAI has released the o1 model family, including o1-preview and o1-mini, focusing on test-time reasoning with extended output token limits of over 30k tokens. The models show strong performance, ranking in the 89th percentile on competitive programming, excelling on USA Math Olympiad qualifiers (AIME), and surpassing PhD-level accuracy on physics, biology, and chemistry benchmarks. Notably, o1-mini performs impressively despite being much smaller than gpt-4o. The release highlights new scaling laws for test-time compute that scale log-linearly. Additionally, Nvidia is reportedly losing AI chip market share to startups, with a shift in developer preference from CUDA to Llama models for web development, though Nvidia remains dominant in training. This news reflects significant advances in reasoning-focused models and shifts in AI hardware competition.
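The claimed log-linear test-time scaling can be pictured with a toy curve: accuracy rises by a fixed increment for every 10x more reasoning tokens. The coefficients below are invented for illustration and are not OpenAI's numbers.

```python
import math

# Purely illustrative log-linear test-time-compute law: accuracy grows
# linearly in log10 of the reasoning-token budget, capped at 1.0.
# a (intercept) and b (slope per 10x) are made-up values.

def accuracy(reasoning_tokens: int, a: float = 0.20, b: float = 0.08) -> float:
    return min(1.0, a + b * math.log10(reasoning_tokens))

# Each 10x increase in the thinking budget adds the same increment b.
curve = [(n, round(accuracy(n), 3)) for n in (1_000, 10_000, 100_000)]
```

The point is the shape, not the values: doubling compute buys much less than a 10x budget jump, which is why test-time scaling plots are drawn on a log axis.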
not much happened today
gpt-4o claude-3.5-sonnet phi-3.5-mini phi-3.5-moe phi-3.5-vision llama-3-1-405b qwen2-math-72b openai anthropic microsoft meta-ai-fair hugging-face langchain box fine-tuning benchmarking model-comparison model-performance diffusion-models reinforcement-learning zero-shot-learning math model-efficiency ai-regulation ai-safety ai-engineering prompt-engineering swyx ylecun
OpenAI launched GPT-4o finetuning with a case study on Cosine. Anthropic released Claude 3.5 Sonnet with 8k token output. Microsoft Phi team introduced Phi-3.5 in three variants: Mini (3.8B), MoE (16x3.8B), and Vision (4.2B), noted for sample efficiency. Meta released Llama 3.1 405B, deployable on Google Cloud Vertex AI, offering GPT-4 level capabilities. Qwen2-Math-72B achieved state-of-the-art math benchmark performance with a Gradio demo. Discussions included model comparisons like ViT vs CNN and Mamba architecture. Tools updates featured DSPy roadmap, Flux Schnell improving diffusion speed on M1 Max, and LangChain community events. Research highlights zero-shot DUP prompting for math reasoning and fine-tuning best practices. AI ethics covered California's AI Safety Bill SB 1047 and regulatory concerns from Yann LeCun. Commentary on AI engineer roles by Swyx. "Chat with PDF" feature now available for Box Enterprise Plus users.
Grok 2! and ChatGPT-4o-latest confuses everybody
gpt-4o grok-2 claude-3.5-sonnet flux-1 stable-diffusion-3 gemini-advanced openai x-ai black-forest-labs google-deepmind benchmarking model-performance tokenization security-vulnerabilities multi-agent-systems research-automation text-to-image conversational-ai model-integration ylecun rohanpaul_ai karpathy
OpenAI quietly released a new GPT-4o model in ChatGPT, distinct from the API version, reclaiming the #1 spot on LMSYS Arena benchmarks across multiple categories including math, coding, and instruction-following. Meanwhile, xAI launched Grok 2, outperforming Claude 3.5 Sonnet and previous GPT-4o versions, with plans for an enterprise API release. Grok 2 integrates Black Forest Labs' Flux.1, an open-source text-to-image model surpassing Stable Diffusion 3. Google DeepMind announced Gemini Advanced with enhanced conversational features and Pixel device integration. AI researcher ylecun highlighted LLM limitations in learning and creativity, while rohanpaul_ai discussed an AI Scientist system generating publishable ML research at low cost. karpathy warned of security risks in LLM tokenizers akin to SQL injection.
not much happened today
qwen2-math-72b gpt-4o claude-3.5-sonnet gemini-1.5-pro llama-3.1-405b idefics3-llama-8b anthropic google mistral-ai llamaindex math fine-tuning synthetic-data reinforcement-learning bug-bounty visual-question-answering open-source retrieval-augmented-generation agentic-ai ai-safety policy rohanpaul_ai anthropicai mervenoyann jeremyphoward omarsar0 ylecun bindureddy
Qwen2-Math-72B outperforms GPT-4o, Claude-3.5-Sonnet, Gemini-1.5-Pro, and Llama-3.1-405B on math benchmarks using synthetic data and advanced optimization techniques. Google AI cuts pricing for Gemini 1.5 Flash by up to 78%. Anthropic expands its bug bounty program targeting universal jailbreaks in next-gen safety systems. Tutorial on QLoRA fine-tuning of IDEFICS3-Llama 8B for visual question answering released. A Chinese open weights model surpasses previous MATH benchmark records. Surveys on Mamba models and LLM-based agents for software engineering highlight advancements and applications. Open-source tools like R2R RAG engine and LlamaIndex Workflows simplify building complex AI applications. Mistral AI introduces customizable AI agents. Concerns raised about California bill SB 1047's focus on existential risk and debates on banning open-source AI. Memes and humor continue in AI communities.
Too Cheap To Meter: AI prices cut 50-70% in last 30 days
gpt-4o gpt-4o-mini llama-3-1-405b mistral-large-2 gemini-1.5-flash deepseek-v2 sonnet-3.5 exaone-3.0 minicpm-v-2.6 claude-3.5 gpt-4o-2024-08-06 llamaindex together-ai deepinfra deepseek-ai mistral-ai google-deepmind lg-ai-research price-cuts context-caching instruction-tuning vision benchmarks pytorch attention-mechanisms reinforcement-learning-from-human-feedback compute-optimal-scaling rohanpaul_ai akhaliq mervenoyann sophiamyang chhillee karpathy
Gemini 1.5 Flash has cut prices by approximately 70%, offering a highly competitive free tier of up to 1 million tokens per minute and paid pricing of $0.075/mtok, intensifying the AI model price war. Other significant price reductions include GPT-4o (~50% cut to $2.50/mtok), GPT-4o mini (70-98.5% cut to $0.15/mtok), Llama 3.1 405b (46% cut to $2.7/mtok), and Mistral Large 2 (62% cut to $3/mtok). Deepseek v2 introduced context caching, reducing input token costs by up to 90% to $0.014/mtok. New model releases include Llama 3.1 405b, Sonnet 3.5, EXAONE-3.0 (7.8B instruction-tuned by LG AI Research), and MiniCPM V 2.6 (vision-language model combining SigLIP 400M and Qwen2-7B). Benchmarks show Mistral Large performing well on ZebraLogic and Claude-3.5 leading LiveBench. FlexAttention, a new PyTorch API, simplifies and optimizes attention mechanisms. Andrej Karpathy analyzed RLHF, highlighting its limitations compared to traditional reinforcement learning. Google DeepMind research on compute-optimal scaling was also summarized.
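As a back-of-the-envelope check on the caching numbers, a short sketch using the digest's figures as stand-ins ($0.14/mtok uncached input, $0.014/mtok for cache hits, i.e. the quoted "up to 90%" cut) shows how cache hits shrink the input bill:

```python
# Illustrative context-caching cost math; the per-mtok prices are the
# digest's quoted figures used as assumptions, not an official price sheet.

def input_cost_usd(total_tokens: int, cached_tokens: int,
                   base_per_mtok: float = 0.14,
                   cached_per_mtok: float = 0.014) -> float:
    """Input cost when `cached_tokens` of the prompt hit the cache."""
    fresh = total_tokens - cached_tokens
    return (fresh * base_per_mtok + cached_tokens * cached_per_mtok) / 1_000_000

# A 100k-token prompt whose first 90k tokens are a cached shared prefix:
with_cache = input_cost_usd(100_000, 90_000)   # ~$0.00266
no_cache = input_cost_usd(100_000, 0)          # $0.014, so ~81% cheaper
```

The full 90% discount is only reached when nearly the entire prompt is a cache hit, which is why caching matters most for long, repeated system prompts and multi-turn chats.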
GPT4o August + 100% Structured Outputs for All (GPT4o August edition)
gpt-4o-2024-08-06 llama-3-1-405b llama-3 claude-3.5-sonnet gemini-1.5-pro gpt-4o yi-large-turbo openai meta-ai-fair google-deepmind yi-large nvidia groq langchain jamai langsmith structured-output context-windows model-pricing benchmarking parameter-efficient-expert-retrieval retrieval-augmented-generation mixture-of-experts model-performance ai-hardware model-deployment filtering multi-lingual vision john-carmack jonathan-ross rohanpaul_ai
OpenAI released the new gpt-4o-2024-08-06 model with a 16k output token limit and 33-50% lower pricing than the previous 4o-May version, featuring a new Structured Output API that improves output quality and reduces retry costs. Meta AI launched Llama 3.1, a 405-billion parameter model surpassing GPT-4 and Claude 3.5 Sonnet on benchmarks, alongside expanding the Llama Impact Grant program. Google DeepMind quietly released Gemini 1.5 Pro, outperforming GPT-4o, Claude-3.5, and Llama 3.1 on LMSYS benchmarks and leading the Vision Leaderboard. Yi-Large Turbo was introduced as a cost-effective upgrade priced at $0.19 per million tokens. In hardware, NVIDIA H100 GPUs were highlighted by John Carmack for their massive AI workload power, and Groq announced plans to deploy 108,000 LPUs by Q1 2025. New AI tools and techniques include RAG (Retrieval-Augmented Generation), the JamAI Base platform for Mixture of Agents systems, and LangSmith's enhanced filtering capabilities. Google DeepMind also introduced PEER (Parameter Efficient Expert Retrieval) architecture.
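The Structured Output API constrains generation to a caller-supplied JSON Schema, which is what cuts retry costs. A minimal sketch of the request body, following OpenAI's published `json_schema` response format; the event schema itself is a made-up example:

```python
# Shape of a Structured Outputs request body (no network call is made here).
# The "event" schema is invented for illustration.
payload = {
    "model": "gpt-4o-2024-08-06",
    "messages": [{"role": "user", "content": "Extract the event details."}],
    "response_format": {
        "type": "json_schema",
        "json_schema": {
            "name": "event",
            "strict": True,  # output is guaranteed to validate against the schema
            "schema": {
                "type": "object",
                "properties": {
                    "title": {"type": "string"},
                    "date": {"type": "string"},
                },
                "required": ["title", "date"],
                "additionalProperties": False,  # required in strict mode
            },
        },
    },
}
```

Sent to the chat completions endpoint, this returns JSON that always parses and validates, so downstream code can drop its retry-and-repair loop.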
Execuhires: Tempting The Wrath of Khan
gemini-1.5-pro gpt-4o claude-3.5 flux-1 llama-3-1-405b character.ai google adept amazon inflection microsoft stability-ai black-forest-labs schelling google-deepmind openai anthropic meta-ai-fair lmsys langchainai execuhire model-benchmarking multilinguality math coding text-to-image agent-ide open-source-models post-training data-driven-performance noam-shazeer emad-mostaque david-friedman robin-rombach alexandr-wang svpino rohanpaul_ai
Character.ai's $2.5b execuhire to Google marks a significant leadership move alongside Adept's $429m execuhire to Amazon and Inflection's $650m execuhire to Microsoft. Despite strong user growth and content momentum, Character.ai's CEO Noam Shazeer returns to Google, signaling shifting vibes in the AI industry. Google DeepMind's Gemini 1.5 Pro tops Chatbot Arena benchmarks, outperforming GPT-4o and Claude-3.5, excelling in multilingual, math, and coding tasks. The launch of Black Forest Labs' FLUX.1 text-to-image model and LangGraph Studio agent IDE highlight ongoing innovation. Llama 3.1 405B is released as the largest open-source model, fostering developer use and competition with closed models. The industry is focusing increasingly on post-training and data as key competitive factors, raising questions about acquisition practices and regulatory scrutiny.
Mistral Large 2 + RIP Mistral 7B, 8x7B, 8x22B
mistral-large-2 mistral-nemo-12b llama-3.1-8b llama-3.1-70b llama-3.1 llama-3-405b yi-34b-200k gpt-4o mistral-ai meta-ai-fair groq togethercompute code-generation math function-calling reasoning context-windows model-deprecation pretraining posttraining benchmarking
Mistral Large 2 introduces 123B parameters with Open Weights under a Research License, focusing on code generation, math performance, and a massive 128k context window, improving over Mistral Large 1's 32k context. It claims better function calling capabilities than GPT-4o and enhanced reasoning. Meanwhile, Meta officially released Llama-3.1 models including Llama-3.1-70B and Llama-3.1-8B with detailed pre-training and post-training insights. The Llama-3.1 8B model's 128k context performance was found underwhelming compared to Mistral Nemo and Yi 34B 200K. Mistral is deprecating older Apache open-source models, focusing on Large 2 and Mistral Nemo 12B. The news also highlights community discussions and benchmarking comparisons.
Llama 3.1 Leaks: big bumps to 8B, minor bumps to 70b, and SOTA OSS 405b model
llama-3-1-405b llama-3-8b llama-3-70b llama-3-1-8b gpt-4o gpt-4o-mini claude-3-5 qwen-2 meta-ai-fair openai alibaba multilinguality code-generation context-windows model-training synthetic-data benchmarking reasoning fine-tuning model-performance dataset-release swyx philschmid jjitsev lewtun teknium1 adcock_brett
Llama 3.1 leaks reveal a 405B dense model with 128k context length, trained on 39.3M GPU hours using H100-80GB GPUs, and fine-tuned with over 25M synthetic examples. The model shows significant benchmark improvements, especially for the 8B and 70B variants, with some evals suggesting the 70B outperforms GPT-4o. GPT-4o Mini launched as a cost-efficient variant with strong performance but some reasoning weaknesses. Synthetic datasets like NuminaMath enable models such as Alibaba Qwen 2 to surpass GPT-4o and Claude 3.5 in math competitions. Discussions include reasoning task benchmarks and dataset building for improved reasoning.
That GPT-4o Demo
gpt-4o gemma-2 meta-code-llama openai google-deepmind meta-ai-fair voice-generation ocr screen-sharing vision code-understanding model-customization efficiency textual-intelligence multimodal-agents sft distillation rlhf model-merging model-optimization safety romain-huet fchollet
Romain Huet demonstrated an unreleased version of GPT-4o on ChatGPT Desktop showcasing capabilities like low latency voice generation, whisper tone moderation, camera mode streaming video to GPT-4o, rapid OCR, screen sharing with ChatGPT for programming help, clipboard reading, and vision-based code conversation. OpenAI's four investment areas highlighted include textual intelligence, efficiency/cost, model customization, and multimodal agents. Google DeepMind released Gemma 2 models in 9B and 27B sizes trained on 8T and 13T tokens respectively, using SFT, distillation, RLHF, and model merging, optimized for TPUv5e with strong performance and safety measures. Meta AI announced the Meta LLM Compiler built on Meta Code Llama with enhanced code optimization and compiler features.
Shall I compare thee to a Sonnet's day?
claude-3.5-sonnet claude-3.5 gpt-4o gemini-1.5-pro anthropic lmsys glif comfyui hard-prompts json json-extraction meme-generation instruction-following app-development fusion-energy nuclear-fission productivity fchollet mustafasuleyman
Claude 3.5 Sonnet from Anthropic achieves top rankings in coding and hard prompt arenas, surpassing GPT-4o and competing with Gemini 1.5 Pro at lower cost. Glif demonstrates a fully automated Wojak meme generator using Claude 3.5 for JSON generation and ComfyUI for images, showcasing new JSON extractor capabilities. Artifacts enables rapid creation of niche apps, exemplified by a dual monitor visualizer made in under 5 minutes. François Chollet highlights that fusion energy is not a near-term solution compared to existing nuclear fission plants. Mustafa Suleyman notes that 75% of desk workers now use AI, marking a shift toward AI-assisted productivity.
Gemini Nano: 50-90% of Gemini Pro, <100ms inference, on device, in Chrome Canary
gemini-nano gemini-pro claude-3.5-sonnet gpt-4o deepseek-coder-v2 glm-0520 nemotron-4-340b gpt-4-turbo-0409 google gemini huggingface anthropic deepseek zhipu-ai tsinghua nvidia model-quantization prompt-api optimization model-weights benchmarking code-generation math synthetic-data automatic-differentiation retrieval-augmented-generation mitigating-memorization tree-search inference-time-algorithms adcock_brett dair_ai lmsysorg
The latest Chrome Canary now includes a feature flag for Gemini Nano, offering a prompt API and on-device optimization guide, with models Nano 1 and 2 at 1.8B and 3.25B parameters respectively, showing decent performance relative to Gemini Pro. The base and instruct-tuned model weights have been extracted and posted to HuggingFace. In AI model releases, Anthropic launched Claude 3.5 Sonnet, which outperforms GPT-4o on some benchmarks, is twice as fast as Opus, and is free to try. DeepSeek-Coder-V2 achieves 90.2% on HumanEval and 75.7% on MATH, surpassing GPT-4-Turbo-0409, with models up to 236B parameters and 128K context length. GLM-0520 from Zhipu AI/Tsinghua ranks highly in coding and overall benchmarks. NVIDIA announced Nemotron-4 340B, an open model family for synthetic data generation. Research highlights include TextGrad, a framework for automatic differentiation on textual feedback; PlanRAG, an iterative plan-then-RAG decision-making technique; a paper on goldfish loss to mitigate memorization in LLMs; and a tree search algorithm for language model agents.
Claude Crushes Code - 92% HumanEval and Claude.ai Artifacts
claude-3.5-sonnet claude-3-opus gpt-4o anthropic openai cognition benchmarking model-performance coding model-optimization fine-tuning instruction-following model-efficiency model-release api performance-optimization alex-albert
Claude 3.5 Sonnet, released by Anthropic, is positioned as a Pareto improvement over Claude 3 Opus, operating at twice the speed and costing one-fifth as much. It achieves state-of-the-art results on benchmarks like GPQA, MMLU, and HumanEval, surpassing even GPT-4o and Claude 3 Opus on vision tasks. The model demonstrates significant advances in coding capabilities, passing 64% of test cases compared to 38% for Claude 3 Opus, and is capable of autonomously fixing pull requests. Anthropic also introduced the Artifacts feature, enabling users to interact with AI-generated content such as code snippets and documents in a dynamic workspace, similar to OpenAI's Code Interpreter. This release highlights improvements in performance, cost-efficiency, and coding proficiency, signaling a growing role for LLMs in software development.
Nemotron-4-340B: NVIDIA's new large open models, built on syndata, great for syndata
nemotron-4-340b mixtral llama-3 gemini-1.5 gpt-4o mamba-2-hybrid-8b samba-3.8b-instruct dolphin-2.9.3 faro-yi-9b-dpo nvidia hugging-face mistral-ai llamaindex cohere gemini mistral synthetic-data model-alignment reward-models fine-tuning long-context model-scaling inference-speed mixture-of-agents open-source-models model-training instruction-following context-windows philipp-schmid bryan-catanzaro oleksii-kuchaiev rohanpaul_ai cognitivecompai _philschmid 01ai_yi
NVIDIA has scaled up its Nemotron-4 model from 15B to a massive 340B dense model, trained on 9T tokens, achieving performance comparable to GPT-4. The model alignment process uses over 98% synthetic data, with only about 20K human-annotated samples for fine-tuning and reward model training. The synthetic data generation pipeline is open-sourced, including synthetic prompts and preference data generation. The base and instruct versions outperform Mixtral and Llama 3, while the reward model ranks better than Gemini 1.5, Cohere, and GPT-4o. Other notable models include Mamba-2-Hybrid 8B, which is up to 8x faster than Transformers and excels on long-context tasks, Samba-3.8B-instruct for infinite context length with linear complexity, Dolphin-2.9.3 tiny models optimized for low-resource devices, and Faro Yi 9B DPO with a 200K context window running efficiently on 16GB VRAM. The Mixture-of-Agents technique boosts open-source LLMs beyond GPT-4 Omni on AlpacaEval 2.0.
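The Mixture-of-Agents technique works by letting several "proposer" models answer independently and then prompting a final "aggregator" model with all of their answers to synthesize one reply. A minimal sketch of that aggregation step; the prompt wording is invented for illustration:

```python
# Toy sketch of the Mixture-of-Agents aggregation prompt. In a real MoA
# stack the proposals come from LLM API calls and the returned prompt is
# sent to an aggregator LLM; here we only build the prompt.

def aggregate_prompt(question: str, proposals: list[str]) -> str:
    """Build the aggregator prompt that exposes every proposer's answer."""
    numbered = "\n".join(f"{i + 1}. {p}" for i, p in enumerate(proposals))
    return (
        "You are given candidate answers from several models.\n"
        f"Question: {question}\n"
        f"Candidate answers:\n{numbered}\n"
        "Critically evaluate the candidates and synthesize a single, best answer."
    )

prompt = aggregate_prompt(
    "What is the capital of France?",
    ["Paris.", "The capital of France is Paris."],
)
```

The claimed benefit is that the aggregator can reconcile and improve on its inputs, which is how ensembles of open models were reported to beat GPT-4 Omni on AlpacaEval 2.0.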
The Last Hurrah of Stable Diffusion?
llama-3-8b llama-3 qwen-2 gpt-4 gpt-4o stability-ai togethercompute model-architecture fine-tuning benchmarks dataset-release model-evaluation reasoning model-training retrieval-augmented-generation multimodality emad-mostaque rohanpaul_ai fchollet mikeknoop micahgoldblum teknium1 rasbt percyliang
Stability AI launched Stable Diffusion 3 Medium with models ranging from 450M to 8B parameters, featuring the MMDiT architecture and T5 text encoder for image text rendering. The community has shown mixed reactions following the departure of key researchers like Emad Mostaque. On AI models, Llama 3 8B Instruct shows strong evaluation correlation with GPT-4, while Qwen 2 Instruct surpasses Llama 3 on MMLU benchmarks. The Mixture of Agents (MoA) framework outperforms GPT-4o on AlpacaEval 2.0. Techniques like Spectrum and QLoRA enable efficient fine-tuning with less VRAM. Research on grokking reveals transformers can transition from memorization to generalization through extended training. Benchmark initiatives include the $1M ARC Prize Challenge for AGI progress and LiveBench, a live LLM benchmark to prevent dataset contamination. The Character Codex Dataset offers open data on over 15,000 characters for RAG and synthetic data. The MLX 0.2 tool enhances LLM experience on Apple Silicon Macs with improved UI and faster retrieval-augmented generation.
Ten Commandments for Deploying Fine-Tuned Models
claude-3-opus claude-3 gpt-4o anthropic google openai fine-tuning prompt-engineering model-evaluation feature-alteration benchmarking model-performance open-source-models kyle-corbitt bindureddy alexalbert__
Gemini-in-Google-Slides is highlighted as a useful tool for summarizing presentations. Kyle Corbitt's talk on deploying fine-tuned models in production emphasizes avoiding fine-tuning unless necessary, focusing on prompting, data quality, appropriate model choice, and thorough evaluation. Anthropic showcased feature alteration in Claude AI, demonstrating control over model behavior and increased understanding of large language models. Open-source models are approaching the performance of closed-source models like GPT-4o on benchmarks such as MMLU for simple tasks, though advanced models remain necessary for complex automation.
Chameleon: Meta's (unreleased) GPT4o-like Omnimodal Model
chameleon gpt-4o gemini-1.5-flash claude-3 meta-ai-fair openai google-deepmind anthropic reddit multimodality early-fusion benchmarking model-training tokenization streaming tool-use vision coding hallucination-detection model-performance armen-aghajanyan sama alexandr-wang abacaj alexalbert__
Meta AI FAIR introduced Chameleon, a new multimodal model family with 7B and 34B parameter versions trained on 10T tokens of interleaved text and image data enabling "early fusion" multimodality that can natively output any modality. While reasoning benchmarks are modest, its "omnimodality" approach competes well with pre-GPT-4o multimodal models. OpenAI launched GPT-4o, a model excelling in benchmarks like MMLU and coding tasks, with strong multimodal capabilities but some regression in ELO scores and hallucination issues. Google DeepMind announced Gemini 1.5 Flash, a small model with 1M context window and flash performance, highlighting convergence trends between OpenAI and Google models. Anthropic updated Claude 3 with streaming support, forced tool use, and vision tool integration for multimodal knowledge extraction. OpenAI also partnered with Reddit, raising industry attention.
Cursor reaches >1000 tok/s finetuning Llama3-70b for fast file editing
gpt-4 gpt-4o gpt-4-turbo gpt-4o-mini llama bloom stable-diffusion cursor openai anthropic google-deepmind huggingface speculative-decoding code-edits multimodality image-generation streaming tool-use fine-tuning benchmarking mmlu model-performance evaluation synthetic-data context-windows sama abacaj imjaredz erhartford alexalbert svpino maximelabonne _philschmid
Cursor, an AI-native IDE, announced a speculative edits algorithm for code editing that surpasses GPT-4 and GPT-4o in accuracy and latency, achieving speeds of over 1000 tokens/s on a 70b model. OpenAI released GPT-4o with multimodal capabilities including audio, vision, and text, noted to be 2x faster and 50% cheaper than GPT-4 turbo, though with mixed coding performance. Anthropic introduced streaming, forced tool use, and vision features for developers. Google DeepMind unveiled Imagen Video and Gemini 1.5 Flash, a small model with a 1M-context window. HuggingFace is distributing $10M in free GPUs for open-source AI models like Llama, BLOOM, and Stable Diffusion. Evaluation insights highlight challenges with LLMs on novel problems and benchmark saturation, with new benchmarks like MMLU-Pro showing significant drops in top model performance.
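Cursor's speculative edits belong to the broader speculative-decoding family: a cheap draft model proposes a run of tokens and the expensive target model only verifies them, accepting the longest agreeing prefix, so the final output always matches what the target alone would produce. A toy sketch with lookup tables standing in for both models:

```python
# Toy speculative decoding. TARGET and DRAFT are stand-in "models": greedy
# next-token lookup tables, not real LLMs. The draft is mostly right, which
# is the regime where speculation pays off.

TARGET = {"the": "cat", "cat": "sat", "sat": "down"}   # expensive, authoritative
DRAFT  = {"the": "cat", "cat": "sat", "sat": "up"}     # cheap, mostly agrees

def speculative_step(last_token: str, k: int = 3) -> list[str]:
    proposal, tok = [], last_token
    for _ in range(k):                 # 1) draft proposes up to k tokens
        if tok not in DRAFT:
            break
        tok = DRAFT[tok]
        proposal.append(tok)
    accepted, tok = [], last_token
    for p in proposal:                 # 2) target verifies the whole run
        if TARGET.get(tok) != p:
            break
        accepted.append(p)
        tok = p
    if len(accepted) < len(proposal) and tok in TARGET:
        accepted.append(TARGET[tok])   # 3) target's token replaces the miss
    return accepted

print(speculative_step("the"))  # ['cat', 'sat', 'down']
```

One verification pass here yields three accepted tokens instead of one, which is where the reported >1000 tok/s figures come from: the big model checks runs in parallel rather than generating one token at a time.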
Not much happened today
gpt-4o gemini-1.5-pro gemini-1.5-flash imagen-3 veo reka-core qwen-1.5-110b openai google-deepmind anthropic rekailabs alibaba salesforce multimodality long-context model-releases reinforcement-learning model-benchmarking text-to-image video-generation ai-assistants ilya-sutskever jakub-pachocki mike-krieger sama
Ilya Sutskever steps down as Chief Scientist at OpenAI after nearly a decade, with Jakub Pachocki named as his successor. Google DeepMind announces Gemini 1.5 Pro and Gemini 1.5 Flash models featuring 2 million token context and improved multimodal capabilities, alongside demos of Project Astra AI assistant, Imagen 3 text-to-image model, and Veo generative video model. GPT-4o tops the VHELM leaderboard and outperforms competitors on LMSYS Chatbot Arena. Reka Core multimodal model with 128K context and Alibaba's Qwen1.5-110B open-source model are released. Salesforce shares an online RLHF recipe.
GPT-4o: the new SOTA-EVERYTHING Frontier model (GPT4T version)
gpt-4o gpt-3.5 llama-3 openai hugging-face nous-research eleutherai hazyresearch real-time-reasoning coding-capabilities fine-tuning knowledge-distillation hardware-optimization quantization multimodality mixture-of-experts efficient-attention model-scaling depth-upscaling transformer-architecture gpu-optimization prompt-engineering
OpenAI launched GPT-4o, a frontier model supporting real-time reasoning across audio, vision, and text, now free for all ChatGPT users with enhanced coding capabilities and upcoming advanced voice and video features. Discussions cover open-source LLMs like Llama 3, fine-tuning techniques including knowledge distillation for GPT-3.5, and hardware optimization strategies such as quantization. Emerging architectures include multimodal integrations with ChatGPT voice and Open Interpreter API, Mixture of Experts models combining autoregressive and diffusion approaches, and novel designs like the YOCO architecture and ThunderKittens DSL for efficient GPU use. Research advances in efficient attention methods like Conv-Basis using FFT and model scaling techniques such as depth upscaling were also highlighted.
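Knowledge distillation, mentioned above as a fine-tuning technique, trains a student to match a teacher's temperature-softened output distribution. A minimal pure-Python sketch with hand-picked toy logits:

```python
import math

# Minimal knowledge-distillation loss: KL(teacher || student) on
# temperature-softened softmax distributions. Logits are toy values.

def softmax(logits: list[float], T: float = 1.0) -> list[float]:
    exps = [math.exp(l / T) for l in logits]
    s = sum(exps)
    return [e / s for e in exps]

def distill_kl(teacher_logits: list[float], student_logits: list[float],
               T: float = 2.0) -> float:
    """KL divergence the student minimizes during distillation."""
    p = softmax(teacher_logits, T)   # soft targets from the teacher
    q = softmax(student_logits, T)   # student's current distribution
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))

# Identical logits give zero loss; diverging logits give a positive loss.
zero = distill_kl([2.0, 0.5], [2.0, 0.5])
gap = distill_kl([2.0, 0.5, -1.0], [1.0, 1.0, 0.0])
```

Raising the temperature T flattens both distributions, exposing the teacher's "dark knowledge" about relative probabilities of wrong answers, which is the usual motivation for distilling rather than training on hard labels alone.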
GPT-4o: the new SOTA-EVERYTHING Frontier model (GPT4O version)
gpt-4o gpt-4-turbo openai lmsys multion adept multimodality vision speech-recognition tokenization real-time-processing coding model-performance model-optimization desktop-agents sama gdb
OpenAI has released GPT-4o, a new multimodal model capable of reasoning across text, audio, and video in real time with low latency (~300ms). It features voice and vision capabilities, improved non-English language performance with an expanded 200k vocabulary tokenizer, and is available to all ChatGPT users including free plans. GPT-4o is half the price and twice as fast as GPT-4-turbo with 5x rate limits. The model supports real-time voice and video input/output and shows strong coding capabilities. The release includes a new desktop app that can read screen and clipboard history, challenging existing desktop agent startups. The announcement was accompanied by demos including image generation and 3D object handling, with OpenAI achieving state-of-the-art performance in ASR and vision tasks. The update was widely discussed on social media, with comparisons to GPT-4T highlighting GPT-4o's speed and versatility. "GPT-4o is smart, fast, natively multimodal, and a step towards more natural human-computer interaction" and "extremely versatile and fun to play with".
World_sim.exe
gpt-4 gpt-4o grok-1 llama-cpp claude-3-opus claude-3 gpt-5 nvidia nous-research stability-ai hugging-face langchain anthropic openai multimodality foundation-models hardware-optimization model-quantization float4 float6 retrieval-augmented-generation text-to-video prompt-engineering long-form-rag gpu-optimization philosophy-of-ai agi-predictions jensen-huang yann-lecun sam-altman
NVIDIA announced Project GR00T, a foundation model for humanoid robot learning using multimodal instructions, built on their tech stack including Isaac Lab, OSMO, and Jetson Thor. They revealed the DGX Grace-Blackwell GB200 with over 1 exaflop of compute, capable of training a GPT-4-scale 1.8T-parameter model in 90 days on 2,000 Blackwell GPUs. Jensen Huang confirmed GPT-4 has 1.8 trillion parameters. The new GB200 GPU supports float4/float6 precision with ~3 bits per parameter and achieves 40,000 TFLOPs on fp4 with 2x sparsity.
Open source highlights include the release of Grok-1, a 314B parameter mixture-of-experts model, and Stability AI's SV3D, an open-source image-to-3D-video generation model. Nous Research collaborated on implementing Steering Vectors in llama.cpp.
In Retrieval Augmented Generation (RAG), a new 5.5-hour tutorial builds a pipeline using open-source HF models, and LangChain released a video on query routing and announced integration with NVIDIA NIM for GPU-optimized LLM inference.
Prominent opinions include Yann LeCun distinguishing language from other cognitive abilities, Sam Altman predicting AGI arrival in 6 years with a leap from GPT-4 to GPT-5 comparable to GPT-3 to GPT-4, and discussions on the philosophical status of LLMs like Claude. There is also advice against training models from scratch for most companies.