Company: "google-deepmind"
Reasoning Price War 2: Mistral Magistral + o3's 80% price cut + o3-pro
o3 o3-pro gpt-4.1 claude-4-sonnet gemini-2.5-pro magistral-small magistral-medium mistral-small-3.1 openai anthropic google-deepmind mistral-ai perplexity-ai reasoning token-efficiency price-cut benchmarking open-source model-releases context-windows gpu-optimization swyx sama scaling01 polynoamial nrehiew_ kevinweil gdb flavioad stevenheidel aravsrinivas
OpenAI announced an 80% price cut for its o3 model, making it competitively priced with GPT-4.1 and rivaling Anthropic's Claude 4 Sonnet and Google's Gemini 2.5 Pro. Alongside the cut, o3-pro was released as a more powerful and reliable variant, though early benchmarks showed mixed performance relative to cost. Mistral AI launched its Magistral reasoning models, including an open-source 24B parameter version optimized for efficient deployment on consumer GPUs. The price reduction and new model releases signal intensified competition in reasoning-focused large language models, with notable improvements in token efficiency and cost-effectiveness.
AI Engineer World's Fair Talks Day 1
gemini-2.5 gemma claude-code mistral cursor anthropic openai aie google-deepmind meta-ai-fair agent-based-architecture open-source model-memorization scaling-laws quantization mixture-of-experts language-model-memorization model-generalization langgraph model-architecture
Mistral launched Mistral Code, and Cursor released version 1.0. Anthropic improved Claude Code plans, while OpenAI announced expanded ChatGPT connections. The day was dominated by AIE keynotes and tracks including GraphRAG, RecSys, and Tiny Teams. On Reddit, Google open-sourced the DeepSearch stack for building AI agents with Gemini 2.5 and LangGraph, enabling flexible agent architectures and integration with local LLMs like Gemma. A new Meta paper analyzed language model memorization, showing GPT-style transformers store about 3.5–4 bits/parameter and exploring the transition from memorization to generalization, with implications for Mixture-of-Experts models and quantization effects.
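A back-of-the-envelope reading of the bits-per-parameter figure (the model size below is a hypothetical chosen for illustration, not from the paper):

```python
bits_per_param = 3.6   # mid-range of the reported 3.5-4 bits/parameter
n_params = 8e9         # hypothetical 8B-parameter model

capacity_gb = bits_per_param * n_params / 8 / 1e9  # bits -> gigabytes
print(f"~{capacity_gb:.1f} GB of memorizable content")  # ~3.6 GB
# Once a training set's unique information exceeds this budget, the model
# cannot memorize it all and is pushed toward generalization.
```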
DeepSeek-R1-0528 - Gemini 2.5 Pro-level model, SOTA Open Weights release
deepseek-r1-0528 gemini-2.5-pro qwen-3-8b qwen-3-235b deepseek-ai anthropic meta-ai-fair nvidia alibaba google-deepmind reinforcement-learning benchmarking model-performance open-weights reasoning quantization post-training model-comparison artificialanlys scaling01 cline reach_vb zizhpan andrewyng teortaxestex teknim1 lateinteraction abacaj cognitivecompai awnihannun
DeepSeek R1-0528 marks a significant upgrade, closing the gap with proprietary models like Gemini 2.5 Pro and surpassing models from Anthropic, Meta, NVIDIA, and Alibaba on benchmarks. This Chinese open-weights model leads in several AI benchmarks, driven by reinforcement learning post-training rather than architecture changes, and demonstrates increased reasoning token usage (23K tokens per question). The China-US AI race intensifies as Chinese labs accelerate innovation through transparency and open research culture. Key benchmarks include AIME 2024, LiveCodeBench, and GPQA Diamond.
not much happened today
claude-4 claude-4-opus claude-4-sonnet gemini-2.5-pro gemma-3n imagen-4-ultra anthropic google-deepmind openai codebase-understanding coding agentic-performance multimodality text-to-speech video-generation model-integration benchmarking memory-optimization cline amanrsanger ryanpgreenblatt johnschulman2 alexalbert__ nearcyan mickeyxfriedman jeremyphoward gneubig teortaxesTex scaling01 artificialanlys philschmid
Anthropic's Claude 4 models (Opus 4, Sonnet 4) demonstrate strong coding abilities, with Sonnet 4 achieving 72.7% on SWE-bench and Opus 4 at 72.5%. Claude Sonnet 4 excels in codebase understanding and is considered SOTA on large codebases. Criticism arose over Anthropic's handling of ASL-3 security requirements. Demand for Claude 4 is high, with integration into IDEs and support from Cherry Studio and FastHTML. Google DeepMind introduced Gemini 2.5 Pro Deep Think and Gemma 3n, a mobile multimodal model reducing RAM usage by nearly 3x. Google's Imagen 4 Ultra ranks third in the Artificial Analysis Image Arena, available on Vertex AI Studio. Google also promoted Google Beam, an AI video model for immersive 3D experiences, and new text-to-speech models with multi-speaker support. The GAIA benchmark shows Claude 4 Opus and Sonnet leading in agentic performance.
OpenAI buys Jony Ive's io for $6.5b, LMArena lands $100m seed from a16z
gemini-2.5-pro gemini-diffusion openai lmarena a16z mistral-ai google google-deepmind multimodality reasoning code-generation math model-fine-tuning ai-assistants voice memory-optimization sundar_pichai
OpenAI confirmed a partnership with Jony Ive to develop consumer hardware. LMArena secured a $100 million seed round from a16z. Mistral launched a new code model fine-tune. Google DeepMind announced multiple updates at Google I/O 2025, including over a dozen new models and 20 AI products. Key highlights include the release of Gemini 2.5 Pro and Gemini Diffusion, featuring advanced multimodal reasoning, coding, and math capabilities, and integration of Gemini in Google Chrome as an AI browsing assistant. The Deep Think enhanced reasoning mode and Project Astra improvements were also introduced, focusing on voice output, memory, and computer control for a universal AI assistant.
Google I/O: new Gemini native voice, Flash, DeepThink, AI Mode (DeepSearch+Mariner+Astra)
gemini-2.5-pro gemini-2.5 google google-deepmind ai-assistants reasoning generative-ai developer-tools ai-integration model-optimization ai-application model-updates ai-deployment model-performance demishassabis philschmid jack_w_rae
Google I/O 2025 showcased significant advancements with Gemini 2.5 Pro and the Deep Think reasoning mode from Google DeepMind, emphasizing AI-driven transformations and developer opportunities. The Gemini App aims to become a universal AI assistant on the path to AGI, with new features like AI Mode in Google Search expanding generative AI access. The event included multiple keynotes and updates on over a dozen models and 20+ AI products, highlighting Google's leadership in AI innovation. Influential voices like demishassabis and philschmid provided insights and recaps, while the launch of Jules as a competitor to Codex/Devin was noted.
ChatGPT Codex, OpenAI's first cloud SWE agent
codex-1 openai-o3 codex-mini gemma-3 blip3-o qwen-2.5 marigold-iid deepseek-v3 lightlab gemini-2.0 lumina-next openai runway salesforce qwen deepseek google google-deepmind j1 software-engineering parallel-processing multimodality diffusion-models depth-estimation scaling-laws reinforcement-learning fine-tuning model-performance multi-turn-conversation reasoning audio-processing sama kevinweil omarsar0 iscienceluvr akhaliq osanseviero c_valenzuelab mervenoyann arankomatsuzaki jasonwei demishassabis philschmid swyx teortaxestex jaseweston
OpenAI launched Codex, a cloud-based software engineering agent powered by codex-1 (an optimized version of OpenAI o3) available in research preview for Pro, Enterprise, and Team ChatGPT users, featuring parallel task execution like refactoring and bug fixing. The Codex CLI was enhanced with quick sign-in and a new low-latency model, codex-mini. Gemma 3 is highlighted as the best open model runnable on a single GPU. Runway released the Gen-4 References API for style transfer in generation. Salesforce introduced BLIP3-o, a unified multimodal model family using diffusion transformers for CLIP image features. The Qwen 2.5 models (1.5B and 3B versions) were integrated into the PocketPal app with various chat templates. Marigold IID, a new state-of-the-art open-source depth estimation model, was released.
In research, DeepSeek shared insights on scaling and hardware for DeepSeek-V3. Google unveiled LightLab, a diffusion-based method for controlling light sources in images. Google DeepMind's AlphaEvolve uses Gemini 2.0 to discover new math and reduce costs without reinforcement learning. Omni-R1 studied audio's role in fine-tuning audio LLMs. Qwen proposed a parallel scaling law inspired by classifier-free guidance. Salesforce released Lumina-Next on the Qwen base, outperforming Janus-Pro. A study found LLM performance degrades in multi-turn conversations due to unreliability. J1 incentivizes LLM-as-a-Judge thinking via reinforcement learning. A new Qwen study correlates question and strategy similarity to predict reasoning strategies.
Gemini's AlphaEvolve agent uses Gemini 2.0 to find new Math and cuts Gemini cost 1% — without RL
gemini gpt-4.1 gpt-4o-mini o3 o4-mini google-deepmind openai algorithm-discovery coding-agents matrix-multiplication optimization reinforcement-learning model-weights training-efficiency safety-evaluations instruction-following coding-tasks model-releases _philschmid scott_swingle alex_dimakis henry jason_wei kevinweil michpokrass scaling01 gdb
DeepMind's AlphaEvolve, a 2025 update to AlphaTensor and FunSearch, is a Gemini-powered coding agent for algorithm discovery that designs faster matrix multiplication algorithms, solves open math problems, and improves data center and AI training efficiency. It achieves a 23% kernel speedup in Gemini training and surpasses state-of-the-art on 20% of applied problems, including improvements on the Minimum Overlap Problem and the kissing number problem. Unlike deep RL, it optimizes pieces of code rather than model weights. Meanwhile, OpenAI released GPT-4.1 in ChatGPT, specializing in coding and instruction following, with a faster alternative, GPT-4.1 mini, replacing GPT-4o mini for all users. OpenAI also launched the Safety Evaluations Hub and the OpenAI to Z Challenge using o3/o4-mini and GPT-4.1 models to discover archaeological sites. "Maybe midtrain + good search is all you need for AI for scientific innovation" - Jason Wei.
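A toy sketch of the evolve-propose-evaluate loop AlphaEvolve is built around. This is not DeepMind's code: `llm_propose` would be a Gemini call that rewrites a code block, and `evaluate` a task-specific scorer such as measured kernel runtime; here a string-matching stand-in keeps the sketch runnable.

```python
import random
import string

TARGET = "fast_kernel"  # toy objective standing in for "a faster program"

def llm_propose(parent: str) -> str:
    """Mutate a candidate; AlphaEvolve prompts Gemini to rewrite code blocks."""
    i = random.randrange(len(parent))
    return parent[:i] + random.choice(string.ascii_lowercase + "_") + parent[i + 1:]

def evaluate(candidate: str) -> float:
    """Automatic scorer; the real system measures e.g. kernel speed."""
    return float(sum(a == b for a, b in zip(candidate, TARGET)))

def evolve(seed: str, generations: int = 3000, pop_size: int = 20) -> str:
    population = [(evaluate(seed), seed)]
    for _ in range(generations):
        # Tournament selection: the best of a small random sample reproduces.
        parent = max(random.sample(population, min(3, len(population))))[1]
        child = llm_propose(parent)
        population.append((evaluate(child), child))
        population = sorted(population, reverse=True)[:pop_size]  # keep elites
    return max(population)[1]

print(evolve("x" * len(TARGET)))  # converges toward "fast_kernel"
```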
not much happened today
gemini-2.5-flash gemini-2.0-flash mistral-medium-3 llama-4-maverick claude-3.7-sonnet qwen3 pangu-ultra-moe deepseek-r1 o4-mini x-reasoner google-deepmind mistral-ai alibaba huawei openai microsoft deepseek model-performance reasoning cost-analysis reinforcement-learning chain-of-thought multilinguality code-search model-training vision model-integration giffmana artificialanlys teortaxestex akhaliq john__allard
Gemini 2.5 Flash shows a 12 point increase in the Artificial Analysis Intelligence Index but costs 150x more than Gemini 2.0 Flash due to 9x more expensive output tokens and 17x higher token usage during reasoning. Mistral Medium 3 competes with Llama 4 Maverick, Gemini 2.0 Flash, and Claude 3.7 Sonnet with better coding and math reasoning at a significantly lower price. Alibaba's Qwen3 family supports reasoning and multilingual tasks across 119 languages and includes a Web Dev tool for app building. Huawei's Pangu Ultra MoE matches DeepSeek R1 performance on Ascend NPUs, with new compute and upcoming V4 training. OpenAI's o4-mini now supports Reinforcement Fine-Tuning (RFT) using chain-of-thought reasoning. Microsoft's X-REASONER enables generalizable reasoning across modalities post-trained on general-domain text. Deep research integration with GitHub repos in ChatGPT enhances codebase search and reporting. The AI Engineer World's Fair offers an Early Bird discount for upcoming tickets.
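The headline multiple is just the two factors compounding, since cost scales with price per output token times tokens emitted (a sketch of the arithmetic, not an exact billing model):

```python
price_multiplier = 9    # output tokens ~9x pricier than Gemini 2.0 Flash
usage_multiplier = 17   # ~17x more tokens emitted while reasoning
print(f"effective cost multiple ~ {price_multiplier * usage_multiplier}x")  # 153x
```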
AI Engineer World's Fair: Second Run, Twice The Fun
gemini-2.5-pro google-deepmind waymo tesla anthropic braintrust retrieval-augmentation graph-databases recommendation-systems software-engineering-agents agent-reliability reinforcement-learning voice image-generation video-generation infrastructure security evaluation ai-leadership enterprise-ai mcp tiny-teams product-management design-engineering robotics foundation-models coding web-development demishassabis
The 2025 AI Engineer World's Fair is expanding with 18 tracks covering topics like Retrieval + Search, GraphRAG, RecSys, SWE-Agents, Agent Reliability, Reasoning + RL, Voice AI, Generative Media, Infrastructure, Security, and Evals. New focuses include MCP, Tiny Teams, Product Management, Design Engineering, and Robotics and Autonomy featuring foundation models from Waymo, Tesla, and Google. The event highlights the growing importance of AI Architects and enterprise AI leadership. Additionally, Demis Hassabis announced the Gemini 2.5 Pro Preview 'I/O edition', which leads coding and web development benchmarks on LMArena.
Gemini 2.5 Pro Preview 05-06 (I/O edition) - the SOTA vision+coding model
gemini-2.5-pro claude-3.7-sonnet llama-nemotron qwen3 google-deepmind nvidia alibaba hugging-face multimodality coding reasoning model-release speech-recognition recommender-systems benchmarking demishassabis _philschmid lmarena_ai scaling01 fchollet
Gemini 2.5 Pro has been updated with enhanced multimodal image-to-code capabilities and dominates the WebDev Arena Leaderboard, surpassing Claude 3.7 Sonnet in coding and other tasks. Nvidia released the Llama-Nemotron model family on Hugging Face, noted for efficient reasoning and inference. Alibaba's Qwen3 models range from 0.6B to 235B parameters, including dense and MoE variants. KerasRS was released by François Chollet as a new recommender system library compatible with JAX, PyTorch, and TensorFlow, optimized for TPUs. These updates highlight advancements in coding, reasoning, and speech recognition models.
Qwen 3: 0.6B to 235B MoE full+base models that beat R1 and o1
qwen-3 qwen3-235b-a22b qwen3-30b-a3b deepseek-r1 o1 o3-mini grok-3 gemini-2.5-pro alibaba google-deepmind deepseek mistral-ai mixture-of-experts reinforcement-learning benchmarking model-release model-architecture long-context multi-agent-systems inference dataset-release awnihannun prince_canuma actuallyisaak oriolvinyalsml iscienceluvr reach_vb teortaxestex omarsar0
Qwen 3 has been released by Alibaba featuring a range of models including two MoE variants, Qwen3-235B-A22B and Qwen3-30B-A3B, which demonstrate competitive performance against top models like DeepSeek-R1, o1, o3-mini, Grok-3, and Gemini-2.5-Pro. The models introduce an "enable_thinking=True" mode with advanced soft switching for inference scaling. The release is notable for its Apache 2.0 license and broad inference platform support including MCP. The dataset improvements and multi-stage RL post-training contribute to performance gains. Meanwhile, Gemini 2.5 Pro from Google DeepMind shows strong coding and long-context reasoning capabilities, and DeepSeek R2 is anticipated soon. Twitter discussions highlight Qwen3's finegrained MoE architecture, large context window, and multi-agent system applications.
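A minimal sketch of toggling the thinking mode via the chat template, as documented on the Qwen3 model cards (the checkpoint size is chosen for illustration):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

name = "Qwen/Qwen3-0.6B"  # illustrative; the same switch applies to larger sizes
tok = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(name)

messages = [{"role": "user", "content": "Is 9.11 larger than 9.9?"}]
text = tok.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
    enable_thinking=True,  # False switches off the reasoning trace
)
out = model.generate(**tok(text, return_tensors="pt"), max_new_tokens=512)
print(tok.decode(out[0], skip_special_tokens=True))
```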
Grok 3 & 3-mini now API Available
grok-3 grok-3-mini gemini-2.5-flash o3 o4-mini llama-4-maverick gemma-3-27b openai llamaindex google-deepmind epochairesearch goodfireai mechanize agent-development agent-communication cli-tools reinforcement-learning model-evaluation quantization-aware-training model-compression training-compute hybrid-reasoning model-benchmarking
Grok 3 API is now available, including a smaller version called Grok 3 mini, which offers competitive pricing and full reasoning traces. OpenAI released a practical guide for building AI agents, while LlamaIndex supports the Agent2Agent protocol for multi-agent communication. Codex CLI is gaining traction with new features and competition from Aider and Claude Code. Google DeepMind launched Gemini 2.5 Flash, a hybrid reasoning model topping the Chatbot Arena leaderboard. OpenAI's o3 and o4-mini models show emergent behaviors from large-scale reinforcement learning. Epoch AI updated its methodology, removing Llama 4 Maverick from its list of high-FLOP models after revising its training compute estimate downward. Goodfire AI announced a $50M Series A for its Ember neural programming platform. Mechanize was founded to build virtual work environments and automation benchmarks. Google DeepMind's Quantization Aware Training for Gemma 3 models reduces model size significantly, with open-source checkpoints available.
GPT 4.1: The New OpenAI Workhorse
gpt-4.1 gpt-4.1-mini gpt-4.1-nano gpt-4o gemini-2.5-pro openai llama-index perplexity-ai google-deepmind coding instruction-following long-context benchmarks model-pricing model-integration model-deprecation sama kevinweil omarsar0 aidan_mclau danhendrycks polynoamial scaling01 aravsrinivas lmarena_ai
OpenAI released GPT-4.1, including GPT-4.1 mini and GPT-4.1 nano, highlighting improvements in coding, instruction following, and handling long contexts up to 1 million tokens. The model achieves a 54% score on SWE-bench Verified and shows a 60% improvement over GPT-4o on internal benchmarks. Pricing for GPT-4.1 nano is notably low at $0.10/1M input and $0.40/1M output tokens. GPT-4.5 Preview is being deprecated in favor of GPT-4.1. LlamaIndex shipped day-0 integration support. Some negative feedback was noted for GPT-4.1 nano. Additionally, Perplexity's Sonar API ties with Gemini-2.5 Pro for the top spot in the LM Search Arena leaderboard. New benchmarks like MRCR and GraphWalks were introduced alongside updated prompting guides and cookbooks.
Google's Agent2Agent Protocol (A2A)
kimi-vl-a3b gpt-4o llama-4-scout llama-4-maverick llama-4-behemoth deepcoder-14b o3-mini o1 llama-3.1-nemotron-ultra-253b deepseek-r1 google google-deepmind moonshot-ai meta-ai-fair uc-berkeley openai nvidia hugging-face togethercompute deepseek agent-interoperability multimodality vision math reinforcement-learning coding model-training open-source model-benchmarking context-windows streaming push-notifications enterprise-authentication model-release reach_vb _akhaliq epochairesearch artificialanlys winglian danielhanchen yuchenj_uw jeremyphoward
Google Cloud Next announcements featured full MCP support from Google and DeepMind and a new Agent2Agent (A2A) protocol designed for agent interoperability with multiple partners. The protocol includes components like the Agent Card, Task communication channels, Enterprise Auth and Observability, and Streaming and Push Notification support. On the model front, Moonshot AI released Kimi-VL-A3B, a multimodal model with 128K context and strong vision and math benchmark performance, outperforming gpt-4o. Meta AI introduced smaller versions of llama-4 family models: llama-4-scout and llama-4-maverick, with a larger Behemoth model still in training. DeepCoder 14B from UC Berkeley is an open-source coding model rivaling openai's o3-mini and o1 models, trained with reinforcement learning on 24K coding problems. Nvidia released llama-3.1-nemotron-ultra-253b on Hugging Face, noted for beating llama-4-behemoth and maverick and competing with deepseek-r1.
DeepCoder: A Fully Open-Source 14B Coder at O3-mini Level
deepcoder-14b o3-mini o1 gemini-2.5-pro kimi-vl-a3b gpt-4o llama-4-scout maverick behemoth gen-4-turbo imagen-3 together-ai agentica openai bytedance google-deepmind moonshot-ai meta-ai-fair runway open-source reinforcement-learning code-generation multimodality model-training mixture-of-experts l2-normalization image-generation model-performance context-windows philschmid lepikhin reach_vb akhaliq yuchenj_uw epochairesearch danielhanchen c_valenzuelab
Together AI and Agentica released DeepCoder-14B, an open-source 14B parameter coding model rivaling OpenAI's o3-mini and o1 on coding benchmarks, trained with an open-source RL framework from ByteDance and costing about $26,880. Google DeepMind launched Gemini 2.5 Pro with experimental "Flash" versions available to subscribers. Moonshot AI introduced Kimi-VL-A3B, a multimodal model with 128K context outperforming gpt-4o on vision and math benchmarks. Meta AI released Llama 4 Scout and Maverick, with a larger Behemoth model in training, featuring mixture-of-experts and L2 norm techniques. Runway launched Gen-4 Turbo with 10x better results than Gen-3 at the same cost. Google announced Imagen 3, a high-quality text-to-image model now in Vertex AI, enabling easier object removal. The report highlights open-source contributions, reinforcement learning training optimizations, and significant model performance improvements across coding, multimodal, and image generation domains.
not much happened today
gpt-4o deepseek-v3 claude-3.7-sonnet o3-mini gemini-2.5-pro openai deepseek anthropic google-deepmind togethercompute hypertecgroup coreweave cursor-ai windsurf-ai coding instruction-following image-generation policy-compliance long-context audio-processing video-processing gpu-clusters ai-infrastructure api-access sama kevinweil joannejang nrehiew_ giffmana _philschmid scaling01 saranormous
GPT-4o was praised for its improved coding, instruction following, and freedom, becoming the leading non-reasoning coding model surpassing DeepSeek V3 and Claude 3.7 Sonnet in coding benchmarks, though it still lags behind reasoning models like o3-mini. Concerns about policy compliance in image generation were noted, with efforts to improve adherence. Gemini 2.5 Pro was highlighted for its advanced audio and video understanding, long context capabilities, and integration with platforms like Cursor AI and Windsurf AI. AI infrastructure developments include a partnership between Together AI and Hypertec Group to deliver large-scale GPU clusters, and CoreWeave's IPO was celebrated for advancing AI infrastructure. GPU and TPU usage is expected to increase significantly. "GPT-4o's transparency and background generation feature" and "Gemini 2.5 Pro scored above 50% on Simple-Bench AI Explanation" were key highlights.
OpenAI adopts MCP
gemini-2.5-pro gemini-1.5-pro gemini-2.0-flash qwen-2.5-omni-7b deepseek-v3-0324 deepseek-r1 openai google-deepmind alibaba togethercompute model-benchmarking multimodality reasoning scaling-laws model-quantization synthetic-data model-performance context-windows speech-recognition translation audio-processing video-processing swyx
OpenAI announced support for MCP, a significant technical update. Google's Gemini 2.5 Pro leads benchmarks with top scores in MMLU-Pro (86%), GPQA Diamond (83%), and AIME 2024 (88%), featuring a 1 million token context window and multimodal inputs. Alibaba's Qwen 2.5 Omni 7B was released as a fully multimodal, interactive, open-source model with a novel "thinker-talker" architecture supporting voice and video chat. DeepSeek V3-0324 outperforms its predecessor on multiple benchmarks. Research on reasoning features in large language models using sparse autoencoders was highlighted, alongside a study on scaling laws of synthetic data showing performance plateaus near 300B tokens. Discussions also covered the fastest output speeds of Gemini models and concerns about over-reliance on benchmarks for intelligence measurement. Swyx will curate the Data Council AI Engineering Track in April.
Gemini 2.5 Pro + 4o Native Image Gen
gemini-2.5-pro gpt-4o google-deepmind openai lmarena_ai autoregressive-models multimodality reasoning coding instruction-following model-release leaderboards noam-shazeer allan-jabri gabe-goh
Gemini 2.5 Pro from Google DeepMind has become the new top AI model, surpassing Grok 3 by 40 LMArena points, with contributions from Noam Shazeer integrating Flash Thinking techniques. It is available as a free, rate-limited experimental model. Meanwhile, OpenAI released GPT-4o native image generation, an autoregressive image model, with detailed insights shared by Allan Jabri and credits to Gabe Goh. Gemini 2.5 Pro excels in reasoning, coding, STEM, multimodal tasks, and instruction following, topping the LMArena leaderboard significantly. It is accessible via Google AI Studio and the Gemini App.
not much happened today
gemini-2.0-flash-thinking command-a qwq-32b gemma-3-27b gemma-3 shieldgemma-2 llama-3-70b deepseek-r1 o1-mini deepseek-v3 google-deepmind cohere meta-ai-fair alibaba hugging-face model-updates model-performance benchmarking reinforcement-learning transformers normalization-layers image-generation vision memory-efficiency context-windows fine-tuning yann-lecun
Google DeepMind announced updates to Gemini 2.0, including an upgraded Flash Thinking model with stronger reasoning and native image generation capabilities. Cohere launched Command A, a 111B parameter dense model with a 256K context window and competitive pricing, available on Hugging Face. Meta AI proposed Dynamic Tanh (DyT) as a replacement for normalization layers in Transformers, supported by Yann LeCun. Alibaba released QwQ-32B, a 32.5B parameter model excelling in math and coding, fine-tuned with reinforcement learning and freely available under Apache 2.0 license. Google DeepMind also released Gemma 3 models ranging from 1B to 27B parameters with a 128K token context window and over 140 language support, plus ShieldGemma 2, an image safety checker. Benchmarking shows Gemma 3 27B has strong vision and memory efficiency but is outperformed by larger models like Llama 3.3 70B and DeepSeek V3 671B. The Hugging Face LLM leaderboard history was shared by @_lewtun.
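A minimal PyTorch sketch of the proposed DyT layer, y = weight * tanh(alpha * x) + bias with a learnable scalar alpha, as a drop-in for a LayerNorm over the last dimension (the init value follows the paper's reported default; treat details as illustrative):

```python
import torch
import torch.nn as nn

class DyT(nn.Module):
    """Dynamic Tanh: replaces normalization with a learnable squashing."""
    def __init__(self, dim: int, alpha_init: float = 0.5):
        super().__init__()
        self.alpha = nn.Parameter(torch.full((1,), alpha_init))  # learnable scalar
        self.weight = nn.Parameter(torch.ones(dim))
        self.bias = nn.Parameter(torch.zeros(dim))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.weight * torch.tanh(self.alpha * x) + self.bias

layer = DyT(dim=768)
print(layer(torch.randn(2, 16, 768)).shape)  # torch.Size([2, 16, 768])
```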
Gemma 3 beats DeepSeek V3 in Elo, 2.0 Flash beats GPT4o with Native Image Gen
gemma-3 gemini-1.5-pro gemini-2 o1-preview o3-mini-high deepseek-v3 claude-3.7-sonnet qwen-2.5-max google-deepmind openai multimodality multilinguality context-window quantization image-generation model-benchmarking model-performance vision reach_vb _philschmid danielhanchen lmarena_ai osanseviero
Google DeepMind launched the Gemma 3 family of models featuring a 128k context window, multimodal input (image and video), and multilingual support for 140+ languages. The Gemma 3-27B model ranks among the top open models on LMArena benchmarks, outperforming several competitors and matching Gemini-1.5-Pro on benchmarks. Additionally, Gemini 2 introduced Flash Native Image Generation with advanced image editing capabilities, a feature teased by OpenAI but not launched. The updates highlight significant advances in context length, multimodality, and model efficiency via quantization.
DeepSeek's Open Source Stack
qwen-qwq-32b start character-3 gemini gemini-2.0 mercury-coder gpt-4.5 jamba-mini-1.6 gemini-2.0-flash gpt-4o-mini mistral-small-3 mistral-ocr deepseek pyspur hugging-face togethercompute hedra-labs google-deepmind deeplearningai openai ai21-labs mistral-ai fine-tuning benchmarking multimodality code-generation diffusion-models model-performance model-optimization ocr embedding-models context-windows runtime-limits _akhaliq lmarena_ai reach_vb danielhanchen _philschmid aidan_mclau vikhyatk jerryjliu0
DeepSeek's Open Source Week was summarized by PySpur, highlighting multiple interesting releases. The Qwen QwQ-32B model was fine-tuned into START, excelling in PhD-level science QA and math benchmarks. Character-3, an omnimodal AI video generation model by Hedra Labs and Together AI, enables realistic animated content creation. Google DeepMind introduced the Gemini embedding model with an 8k context window, ranking #1 on MMTEB, alongside the Gemini 2.0 Code Executor supporting Python libraries and auto-fix features. Inception Labs' Mercury Coder is a diffusion-based code generation model offering faster token processing. OpenAI released GPT-4.5, their largest model yet but with less reasoning ability than some competitors. AI21 Labs launched Jamba Mini 1.6, noted for superior output speed compared to Gemini 2.0 Flash, GPT-4o mini, and Mistral Small 3. A new dataset of 1.9M scanned pages was released for OCR benchmarking, with Mistral OCR showing competitive but not top-tier document parsing performance compared to LLM/LVM-powered methods. "Cracked engineers are all you need."
not much happened today
grok-3 deepseek-r1 siglip-2 o3-mini-high r1-1776 llamba-1b llamba-3b llamba-8b llama-3 alphamaze audiobox-aesthetics xai nvidia google-deepmind anthropic openai bytedance ollama meta-ai-fair benchmarking model-releases performance reasoning multimodality semantic-understanding ocr multilinguality model-distillation recurrent-neural-networks visual-reasoning audio-processing scaling01 iscienceluvr philschmid arankomatsuzaki reach_vb mervenoyann wightmanr lmarena_ai ollama akhaliq
Grok-3, a new family of LLMs from xAI trained on 200,000 Nvidia H100 GPUs for advanced reasoning, outperforms models from Google, Anthropic, and OpenAI on math, science, and coding benchmarks. DeepSeek-R1 achieves top accuracy on ByteDance Research's challenging SuperGPQA benchmark. SigLIP 2 from Google DeepMind improves semantic understanding and OCR with flexible resolutions and multilingual capabilities, available on Hugging Face. OpenAI's o3-mini-high ranks #1 in coding and math prompts. Perplexity's R1 1776, a post-trained version of DeepSeek R1, is available on Ollama. The Llamba family distills Llama-3.x into efficient recurrent models with higher throughput. AlphaMaze combines DeepSeek R1 with GRPO for visual reasoning on ARC-AGI puzzles. Audiobox Aesthetics from Meta AI offers unified quality assessment for audio. The community notes that Grok 3's compute increase yields only modest performance gains.
The Ultra-Scale Playbook: Training LLMs on GPU Clusters
deepseek-native-sparse-attention r1-1776 paligemma-2-mix muse baichuan-m1-14b stripedhyena-2 huggingface deepseek perplexity-ai google-deepmind microsoft baichuan stripedhyena gpu-training scaling multimodality vision model-training foundation-models medical-llm genome-modeling robotic-manipulation interactive-content eliebakouch nouamanetazi lvwerra thom-wolf proftomyeh alex-wang aravsrinivas _akhaliq _philschmid mervenoyann reach_vb arankomatsuzaki maximelabonne
Hugging Face released "The Ultra-Scale Playbook: Training LLMs on GPU Clusters," an interactive blogpost based on 4000 scaling experiments on up to 512 GPUs, providing detailed insights into modern GPU training strategies. DeepSeek introduced Native Sparse Attention (NSA), gaining significant community attention, while Perplexity AI launched R1-1776, an uncensored and unbiased version of DeepSeek's R1 model. Google DeepMind unveiled PaliGemma 2 Mix, a multi-task vision-language model available in 3B, 10B, and 28B sizes. Microsoft introduced Muse, a generative AI model trained on the game Bleeding Edge, and presented Magma, a foundation model for multimodal AI agents excelling in UI navigation and robotic manipulation. Baichuan-M1-14B was announced as a state-of-the-art medical LLM trained on 20T tokens, and a fully open-source 40B genome modeling model using the StripedHyena 2 architecture was also released. "Making your own gaming experience is coming sooner than you'd think," was noted in relation to Muse.
not much happened today
zonos-v0.1 audiobox-aesthetics moshi sonar llama-3-70b gpt-4o-mini claude-3.5-haiku gpt-4o claude-3.5-sonnet deepseek-r1-distilled-qwen-1.5b reasonflux-32b o1-preview zyphra-ai meta-ai-fair kyutai-labs perplexity-ai cerebras uc-berkeley brilliant-labs google-deepmind text-to-speech speech-to-speech benchmarking model-performance reinforcement-learning math real-time-processing open-source cross-platform-integration multilinguality zero-shot-learning danhendrycks
Zyphra AI launched Zonos-v0.1, a leading open-weight text-to-speech model supporting multiple languages and zero-shot voice cloning. Meta FAIR released the open-source Audiobox Aesthetics model trained on 562 hours of audio data. Kyutai Labs introduced Moshi, a real-time speech-to-speech system with low latency. Perplexity AI announced the Sonar model based on Llama 3.3 70b, outperforming top models like GPT-4o and Claude 3.5 Sonnet with 1200 tokens/second speed, powered by Cerebras infrastructure. UC Berkeley open-sourced a 1.5B model trained with reinforcement learning that beats o1-preview on math tasks. ReasonFlux-32B achieved 91.2% on the MATH benchmark, outperforming OpenAI o1-preview. CrossPoster, an AI agent for cross-platform posting, was released using LlamaIndex workflows. Brilliant Labs integrated the Google DeepMind Gemini Live API into smart glasses for real-time translation and object identification.
not much happened today
deepseek-r1 alphageometry-2 claude deepseek openai google-deepmind anthropic langchain adyen open-source reasoning agentic-ai javascript model-release memes ai-development benchmarking akhaliq lmthang aymericroucher vikhyatk swyx
DeepSeek-R1 surpasses OpenAI in GitHub stars, marking a milestone in open-source AI with rapid growth in community interest. AlphaGeometry2 achieves gold-medalist level performance with an 84% solving rate on IMO geometry problems, showcasing significant advancements in AI reasoning. LangChain releases a tutorial for building AI agents in JavaScript, enhancing developer capabilities in agent deployment. Reflections on Anthropic's Claude model reveal early access and influence on AI development timelines. Lighthearted AI humor includes calls to ban second-order optimizers and challenges in web development longevity. The AI Engineer Summit 2025 workshops were announced, continuing community engagement and education.
s1: Simple test-time scaling (and Kyutai Hibiki)
qwen-2.5-32b gemini-2.0-flash smollm2 granite-vision-3.1-2b google-deepmind qwen gemini hugging-face ibm deepseek reasoning fine-tuning scaling-laws open-source-models data-centric-training vision multilingual-models language-model-reasoning niklas-muennighoff
"Wait" is all you need introduces a novel reasoning model finetuned from Qwen 2.5 32B using just 1000 questions with reasoning traces distilled from Gemini 2.0 Flash Thinking, enabling controllable test-time compute by appending "Wait" to extend reasoning. Lead author Niklas Muennighoff, known for work on Bloom, StarCoder, and BIG-bench, highlights this method's efficiency and its reproduction of the famous o1 scaling chart. Additionally, Kyutai Moshi's Hibiki project demonstrates impressive offline French-English live translation on iPhone. Recent AI model releases include DeepSeek R1 and R3 open source models, potentially marking a major open-source milestone, Hugging Face's SmolLM2 emphasizing data-centric training for small LMs, and IBM's Granite-Vision-3.1-2B, a small vision-language model with strong performance. Key research papers spotlight LIMO for minimal demonstration reasoning achieving high accuracy on AIME and MATH benchmarks, and Token-Assisted Reasoning mixing latent and text tokens to improve language model reasoning.
Gemini 2.0 Flash GA, with new Flash Lite, 2.0 Pro, and Flash Thinking
gemini-2.0-flash gemini-2.0-flash-lite gemini-2.0-pro-experimental gemini-1.5-pro deepseek-r1 gpt-2 llama-3-1 google-deepmind hugging-face anthropic multimodality context-windows cost-efficiency pretraining fine-tuning reinforcement-learning transformer tokenization embeddings mixture-of-experts andrej-karpathy jayalammar maartengr andrewyng nearcyan
Google DeepMind officially launched Gemini 2.0 models including Flash, Flash-Lite, and Pro Experimental, with Gemini 2.0 Flash outperforming Gemini 1.5 Pro while being 12x cheaper and supporting multimodal input and a 1 million token context window. Andrej Karpathy released a 3h31m video deep dive into large language models, covering pretraining, fine-tuning, and reinforcement learning with examples like GPT-2 and Llama 3.1. A free course on Transformer architecture was introduced by Jay Alammar, Maarten Grootendorst, and Andrew Ng, focusing on tokenizers, embeddings, and mixture-of-experts models. DeepSeek-R1 reached 1.2 million downloads on Hugging Face with a detailed 36-page technical report. Anthropic increased rewards to $10K and $20K for their jailbreak challenge, while the BlueRaven extension was updated to hide Twitter metrics for unbiased engagement.
How To Scale Your Model, by DeepMind
qwen-0.5 google-deepmind deepseek hugging-face transformers inference high-performance-computing robotics sim2real mixture-of-experts reinforcement-learning bias-mitigation rust text-generation open-source omarsar0 drjimfan tairanhe99 guanyashi lioronai _philschmid awnihannun clementdelangue
Researchers at Google DeepMind (GDM) released a comprehensive "little textbook" titled "How To Scale Your Model" covering modern Transformer architectures, inference optimizations beyond O(N^2) attention, and high-performance computing concepts like rooflines. The resource includes practical problems and real-time comment engagement. On AI Twitter, several key updates include the open-sourced humanoid robotics model ASAP inspired by athletes like Cristiano Ronaldo, LeBron James, and Kobe Bryant; a new paper on Mixture-of-Agents proposing the Self-MoA method for improved LLM output aggregation; training of reasoning LLMs using the GRPO algorithm from DeepSeek demonstrated on Qwen 0.5; findings on bias in LLMs used as judges highlighting the need for multiple independent evaluations; and the release of mlx-rs, a Rust library for machine learning with examples including Mistral text generation. Additionally, Hugging Face launched an AI app store featuring over 400,000 apps with 2,000 new daily additions and 2.5 million weekly visits, enabling AI-powered app search and categorization.
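A minimal sketch of that GRPO recipe using TRL's GRPOTrainer (assuming a TRL release that includes GRPO; the dataset and toy length-based reward below are illustrative, not the demo's actual setup):

```python
from datasets import load_dataset
from trl import GRPOConfig, GRPOTrainer

def reward_len(completions, **kwargs):
    # Toy reward: prefer completions near 50 characters. A real reasoning
    # setup would verify answers (e.g. check the final boxed math result).
    return [-abs(50 - len(c)) for c in completions]

dataset = load_dataset("trl-lib/tldr", split="train")
args = GRPOConfig(output_dir="qwen-grpo", logging_steps=10)

trainer = GRPOTrainer(
    model="Qwen/Qwen2.5-0.5B-Instruct",  # small Qwen model, as in the demo
    reward_funcs=reward_len,
    args=args,
    train_dataset=dataset,
)
trainer.train()
```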
OpenAI takes on Gemini's Deep Research
o3 o3-mini-high o3-deep-research-mini openai google-deepmind nyu uc-berkeley hku reinforcement-learning benchmarking inference-speed model-performance reasoning test-time-scaling agent-design sama danhendrycks ethan-mollick dan-shipper
OpenAI released the full version of the o3 agent, with a new Deep Research variant showing significant improvements on the HLE benchmark and achieving SOTA results on GAIA. The release includes an "inference time scaling" chart demonstrating rigorous research, though some criticism arose over public test set results. The agent is noted as "extremely simple" and currently limited to 100 queries/month, with plans for a higher-rate version. Reception has been mostly positive, with some skepticism. Additionally, advances in reinforcement learning were highlighted, including a simple test-time scaling technique called budget forcing that improved reasoning on math competitions by 27%. Researchers from Google DeepMind, NYU, UC Berkeley, and HKU contributed to these findings. The original Gemini Deep Research team will participate in the upcoming AI Engineer NYC event.
OpenAI launches Operator, its first Agent
operator deepseek-r1 videollama-3 llama-4 o1 claude openai anthropic deepseek-ai google-deepmind perplexity-ai computer-using-agent reasoning multimodality performance-benchmarks open-source ai-safety benchmarking video-generation model-evaluation sam-altman swyx
OpenAI launched Operator, a premium computer-using agent for web tasks like booking and ordering, available now for Pro users in the US with an API promised. It features long-horizon remote VMs up to 20 minutes and video export, showing state-of-the-art agent performance but not yet human-level. Anthropic had launched a similar agent 3 months earlier as an open-source demo. DeepSeek AI unveiled DeepSeek R1, an open-source reasoning model excelling on the Humanity's Last Exam dataset, outperforming models like LLaMA 4 and OpenAI's o1. Alibaba's DAMO Academy open-sourced VideoLLaMA 3, a multimodal foundation model for image and video understanding. Perplexity AI released Perplexity Assistant for Android with reasoning and search capabilities. The Humanity's Last Exam dataset contains 3,000 questions testing AI reasoning, with current models scoring below 10% accuracy, indicating room for improvement. OpenAI's Computer-Using Agent (CUA) shows improved performance on OSWorld and WebArena benchmarks but still lags behind humans. Anthropic AI introduced Citations for safer AI responses. Sam Altman and Swyx commented on Operator's launch and capabilities.
not much happened today
deepseek-v3 llama-3-1-405b gpt-4o gpt-5 minimax-01 claude-3-haiku cosmos-nemotron-34b openai deep-learning-ai meta-ai-fair google-deepmind saama langchain nvidia mixture-of-experts coding math scaling visual-tokenizers diffusion-models inference-time-scaling retrieval-augmented-generation ai-export-restrictions security-vulnerabilities prompt-injection gpu-optimization fine-tuning personalized-medicine clinical-trials ai-agents persistent-memory akhaliq
DeepSeek-V3, a 671 billion parameter mixture-of-experts model, surpasses Llama 3.1 405B and GPT-4o in coding and math benchmarks. Rumors circulated about an upcoming GPT-5 release. MiniMax-01 Coder mode in ai-gradio enables building a chess game in one shot. Meta research highlights trade-offs in scaling visual tokenizers. Google DeepMind improves diffusion model quality via inference-time scaling. The RA-DIT method fine-tunes LLMs and retrievers for better RAG responses. The U.S. proposes a three-tier export restriction system on AI chips and models, excluding countries like China and Russia. Security vulnerabilities in AI chatbots involving CSRF and prompt injection were revealed. Concerns about superintelligence and weapons-grade AI models were expressed. ai-gradio updates include NVIDIA NIM compatibility and new models like cosmos-nemotron-34b. LangChain integrates with Claude-3-haiku for AI agents with persistent memory. Triton Warp specialization optimizes GPU usage for matrix multiplication. Saama's fine-tuned Llama models, OpenBioLLM-8B and OpenBioLLM-70B, target personalized medicine and clinical trials.
ModernBert: small new Retriever/Classifier workhorse, 8k context, 2T tokens,
modernbert gemini-2.0-flash-thinking o1 llama answerdotai lightonio hugging-face google-deepmind openai meta-ai-fair figure encoder-only-models long-context alternating-attention natural-language-understanding reasoning robotics-simulation physics-engine humanoid-robots model-performance model-releases jeremyphoward alec-radford philschmid drjimfan bindureddy
Answer.ai/LightOn released ModernBERT, an updated encoder-only model with 8k token context, trained on 2 trillion tokens including code, with 139M/395M parameters and state-of-the-art performance on retrieval, NLU, and code tasks. It features Alternating Attention layers mixing global and local attention. Gemini 2.0 Flash Thinking debuted as #1 in Chatbot Arena, and the O1 model scored top in reasoning benchmarks. Llama downloads surpassed 650 million, doubling in 3 months. OpenAI launched desktop app integrations with voice capabilities. Figure delivered its first humanoid robots commercially. Advances in robotics simulation and a new physics engine Genesis claiming 430,000x faster than real-time were highlighted.
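A quick usage sketch, assuming the answerdotai/ModernBERT-base checkpoint on Hugging Face and a transformers version that includes the architecture:

```python
from transformers import pipeline

fill = pipeline("fill-mask", model="answerdotai/ModernBERT-base")
for pred in fill("The capital of France is [MASK]."):
    print(f"{pred['token_str']!r}  score={pred['score']:.3f}")
```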
Genesis: Generative Physics Engine for Robotics (o1-mini version)
o1 o1-preview gpt-4o claude-3.5-sonnet gemini-2.0-pro llama-3-3b llama-3-70b openai google-deepmind meta-ai-fair hugging-face function-calling structured-outputs vision performance-benchmarks sdk webrtc reasoning math code-generation transformer-architecture model-training humanoid-robots search model-efficiency dataset-sharing aidan_mclau sundarpichai adcock_brett
OpenAI launched the o1 model API featuring function calling, structured outputs, vision support, and developer messages, achieving 60% fewer reasoning tokens than its preview. The model excels in math and code with a 0.76 LiveBench Coding score, outperforming Sonnet 3.5. Beta SDKs for Go and Java and WebRTC support with 60% lower prices were also released. Google Gemini 2.0 Pro (Gemini Exp 1206) deployment accelerated, showing improved coding, math, and reasoning performance. Meta AI FAIR introduced research on training transformers directly on raw bytes using dynamic entropy-based patching. Commercial humanoid robots were successfully deployed by an industry player. Hugging Face researchers demonstrated that their 3B Llama model can outperform the 70B Llama model on MATH-500 accuracy using search techniques, highlighting efficiency gains with smaller models. Concerns about reproducibility and domain-specific limitations were noted.
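A minimal best-of-N sketch of the search idea: sample several candidate solutions from a small model and keep the one a scorer ranks highest. The HF study scored candidates with a process reward model; the scorer below is a placeholder, and the checkpoint name is illustrative.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

name = "meta-llama/Llama-3.2-3B-Instruct"  # illustrative small model
tok = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(name)

def score(solution: str) -> float:
    # Placeholder: a real setup scores each reasoning step with a PRM.
    return float(len(set(solution.split())))

prompt = "What is the sum of the first 100 positive integers? Show your work."
ids = tok(prompt, return_tensors="pt").input_ids

candidates = []
for _ in range(8):  # N = 8 sampled solutions
    out = model.generate(ids, max_new_tokens=256, do_sample=True, temperature=0.8)
    candidates.append(tok.decode(out[0][ids.shape[1]:], skip_special_tokens=True))

print(max(candidates, key=score))  # keep the highest-scoring candidate
```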
OpenAI Voice Mode Can See Now - After Gemini Does
gemini-2.0-flash claude claude-3.5-sonnet llama-3-70b llama-3 mistral-large gpt-4o openai google-deepmind anthropic togethercompute scale-ai meta-ai-fair mistral-ai multimodality real-time-streaming roleplay prompt-handling model-comparison model-training creative-writing model-censorship code-execution developer-ecosystem ai-humor bindureddy
OpenAI launched Realtime Video shortly after Gemini, landing with less impact because Gemini arrived earlier, at lower cost and with fewer rate limits. Google DeepMind released Gemini 2.0 Flash featuring enhanced multimodal capabilities and real-time streaming. Anthropic introduced Clio, a system analyzing real-world usage of Claude models. Together AI acquired CodeSandbox to launch a code interpreter tool. Discussions highlighted Meta's Llama 3.3-70B for its advanced roleplay and prompt handling abilities, outperforming models like Mistral Large and GPT-4o in expressiveness and lighter censorship. The AI community also engaged in humorous takes on AI outages and model competition, with ChatGPT adding a Santa mode for holiday interactions. "Anthropic is capturing the developer ecosystem, Gemini has AI enthusiast mindshare, ChatGPT reigns over AI dabblers" was a noted observation from the community.
o1 API, 4o/4o-mini in Realtime API + WebRTC, DPO Finetuning
o1-2024-12-17 o1 o1-pro 4o 4o-mini gemini-2-0-flash claude-3.5-sonnet claude-3.5 openai google google-deepmind function-calling structured-outputs vision reasoning webrtc realtime-api preference-tuning fine-tuning api model-performance aidan_mclau kevinweil simonw michpokrass morgymcg juberti
OpenAI launched the o1 API with enhanced features including vision inputs, function calling, structured outputs, and a new reasoning_effort parameter, achieving 60% fewer reasoning tokens on average. The o1 pro variant is confirmed as a distinct implementation coming soon. Improvements to the Realtime API with WebRTC integration offer easier usage, longer sessions (up to 30 minutes), and significantly reduced pricing (up to 10x cheaper with mini models). DPO Preference Tuning for fine-tuning is introduced, currently available for the 4o model. Additional updates include official Go and Java SDKs and OpenAI DevDay videos. The news also highlights discussions of the Google Gemini 2.0 Flash model's performance reaching 83.6% accuracy.
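A sketch of calling the new parameter with the Python openai SDK (the model ID and prompt are illustrative):

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

resp = client.chat.completions.create(
    model="o1-2024-12-17",
    reasoning_effort="low",  # "low" | "medium" | "high"
    messages=[{"role": "user", "content": "Prove that sqrt(2) is irrational."}],
)
print(resp.choices[0].message.content)
```

Meta Apollo - Video Understanding up to 1 hour, SOTA Open Weights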
apollo-1b apollo-3b apollo-7b veo-2 imagen-3 llama-3-70b llama-3b command-r7b llama-1b llama-8b chatgpt meta-ai-fair hugging-face google-deepmind openai figure-ai klarna cohere notion video-understanding scaling-consistency benchmarking temporal-ocr egocentric-perception spatial-perception reasoning video-generation physics-simulation voice-features map-integration language-expansion test-time-compute-scaling humanoid-robots ai-integration search-optimization self-recognition self-preference-bias akhaliq _lewtun clementdelangue adcock_brett rohanpaul_ai swyx shaneguML
Meta released Apollo, a new family of state-of-the-art video-language models available in 1B, 3B, and 7B sizes, featuring "Scaling Consistency" for efficient scaling and introducing ApolloBench, which speeds up video understanding evaluation by 41× across five temporal perception categories. Google Deepmind launched Veo 2, a 4K video generation model with improved physics and camera control, alongside an enhanced Imagen 3 image model. OpenAI globally rolled out ChatGPT search with advanced voice and map features and discussed a potential $2,000/month "ChatGPT Max" tier. Research highlights include achieving Llama 70B performance using Llama 3B via test-time compute scaling and expanding Command R7B language support from 10 to 23 languages. Industry updates feature Figure AI delivering humanoid robots commercially and Klarna reducing workforce through AI. Notion integrated Cohere Rerank for better search. Studies reveal LLMs can recognize their own writing style and show self-preference bias. Discussions note video processing progress outpacing text due to better signal-per-compute and data evaluation.
Google wakes up: Gemini 2.0 et al
gemini-2.0-flash gemini-1.5-pro gemini-exp-1206 claude-3.5-sonnet opus google-deepmind openai apple multimodality agent-development multilinguality benchmarking model-releases demis-hassabis sundar-pichai paige-bailey bindureddy
Google DeepMind launched Gemini 2.0 Flash, a new multimodal model outperforming Gemini 1.5 Pro and o1-preview, featuring vision and voice APIs, multilingual capabilities, and native tool use. It powers new AI agents like Project Astra and Project Mariner, with Project Mariner achieving state-of-the-art 83.5% on the WebVoyager benchmark. OpenAI announced ChatGPT integration with Apple devices, enabling Siri access and visual intelligence features. Claude 3.5 Sonnet is speculated to be a distilled version of Opus. The AI community's response at NeurIPS 2024 has been overwhelmingly positive, signaling a strong comeback for Google in AI innovation. Key topics include multimodality, agent development, multilinguality, benchmarking, and model releases.
ChatGPT Canvas GA
llama-3-70b llama-3-1-8b tgi-v3 deepseek-v2.5-1210 coconut openai deepseek-ai meta-ai-fair huggingface cognition-labs hyperbolic google-deepmind code-execution gpt-integration model-finetuning gradient-checkpointing context-length latent-space-reasoning performance-optimization gpu-memory-optimization kubernetes gpu-marketplace ai-capabilities employment-impact neurips-2024 ai-scaling humor arav_srinivas sama jonathan-frankle dylan
OpenAI launched ChatGPT Canvas to all users, featuring code execution and GPT integration, effectively replacing Code Interpreter with a Google Docs-like interface. DeepSeek AI announced their V2.5-1210 update, improving performance on MATH-500 (82.8%) and LiveCodeBench. Meta AI FAIR introduced COCONUT, a new continuous latent space reasoning paradigm. Hugging Face released TGI v3, processing 3x more tokens and running 13x faster than vLLM on long prompts. Cognition Labs released Devin, an AI developer building Kubernetes operators. Hyperbolic raised a $12M Series A to build an open AI platform with an H100 GPU marketplace. Discussions included AI capabilities and employment impact, and NeurIPS 2024 announcements with Google DeepMind demos and a debate on AI scaling. On Reddit, Llama 3.3-70B supports 90K context length finetuning using Unsloth with gradient checkpointing and Apple's Cut Cross Entropy (CCE) algorithm, fitting on 41GB VRAM. Llama 3.1-8B reaches 342K context lengths with Unsloth, surpassing native limits.
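A minimal sketch of that long-context Unsloth setup (the checkpoint name and LoRA hyperparameters are illustrative, not the Reddit poster's exact configuration):

```python
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/Llama-3.3-70B-Instruct-bnb-4bit",  # illustrative
    max_seq_length=90_000,   # the 90K context reported above
    load_in_4bit=True,
)
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    use_gradient_checkpointing="unsloth",  # offloads activations to fit VRAM
)
```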
Meta Llama 3.3: 405B/Nova Pro performance at 70B price
llama-3-70b llama-3.3-70b gpt-4o gemini-exp-1206 meta-ai-fair openai google-deepmind hugging-face llamacloud reinforcement-learning fine-tuning model-performance document-processing pricing-models alignment online-rl sama steven-heidel aidan_mclau lmarena_ai oriolvinyalsml jerryjliu0
Meta AI released Llama 3.3 70B, matching the performance of the 405B model with improved efficiency using "a new alignment process and progress in online RL techniques". OpenAI announced Reinforcement Fine-Tuning (RFT) for building expert models with limited data, offering alpha access to researchers and enterprises. Google DeepMind's Gemini-Exp-1206 leads benchmarks, tying with GPT-4o in coding performance. LlamaCloud enhanced document processing with table extraction and analytics. Discussions on OpenAI's pricing plans continue in the community.
not much happened today
o1-full sora gpt-4.5 gpt-4 claude-3.5-sonnet llama-3-1-nemotron-51b llama-3-1 llama-3 nemotron-51b openai google-deepmind anthropic nvidia huggingface vision model-performance neural-architecture-search model-optimization multimodality model-release model-training reinforcement-learning image-generation lucas-beyer alexander-kolesnikov xiaohua-zhai aidan_mclau giffmana joannejang sama
OpenAI announced their "12 Days of OpenAI" event with daily livestreams and potential releases including the O1 full model, Sora video model, and GPT-4.5. Google DeepMind released the GenCast weather model capable of 15-day forecasts in 8 minutes using TPU chips, and launched Genie 2, a model generating playable 3D worlds from single images. Leading vision researchers Lucas Beyer, Alexander Kolesnikov, and Xiaohua Zhai moved from DeepMind to OpenAI, which is opening a Zürich office. Criticism arose over OpenAI's strategy and model quality compared to Anthropic and Claude 3.5 Sonnet. On Reddit, a modified llama.cpp supports Nvidia's Llama-3_1-Nemotron-51B, matching performance of larger 70B models via NAS optimization.
Olympus has dropped (aka, Amazon Nova Micro|Lite|Pro|Premier|Canvas|Reel)
amazon-nova claude-3 llama-3-70b gemini-1.5-flash gpt-4o amazon anthropic google-deepmind sakana-ai-labs multimodality benchmarking model-merging model-performance model-architecture model-optimization population-based-learning philschmid bindureddy
Amazon announced the Amazon Nova family of multimodal foundation models at AWS Re:Invent, available immediately with no waitlist in configurations like Micro, Lite, Pro, Canvas, and Reel, with Premier and speech-to-speech coming next year. These models offer 2-4x faster token speeds and are 25%-400% cheaper than competitors like Anthropic Claude models, positioning Nova as a serious contender in AI engineering. Pricing undercuts models such as Google DeepMind Gemini Flash 8B, and some Nova models extend context length up to 300k tokens. However, benchmarking controversy exists as some evaluations show Nova scoring below Llama-3 70B in LiveBench AI metrics. Separately, CycleQD was introduced by Sakana AI Labs, using evolutionary computation for population-based model merging to develop niche LLM agents.
not much happened to end the week
gemini deepseek-r1 o1 chatgpt gpt-4 claude-3.5-sonnet o1-preview o1-mini gpt4o qwq-32b google-deepmind deeplearningai amazon tesla x-ai alibaba ollama multimodality benchmarking quantization reinforcement-learning ai-safety translation reasoning interpretability model-comparison humor yoshua-bengio kevinweil ylecun
AI News for 11/29/2024-11/30/2024 covers key updates including the Gemini multimodal model advancing in musical structure understanding, a new quantized SWE-Bench for benchmarking at 1.3 bits per task, and the launch of the DeepSeek-R1 model focusing on transparent reasoning as an alternative to o1. The establishment of the 1st International Network of AI Safety Institutes highlights global collaboration on AI safety. Industry updates feature Amazon's Olympus AI model, Tesla's Optimus, and experiments with ChatGPT as a universal translator. Community reflections emphasize the impact of large language models on daily life and medical AI applications. Discussions include scaling sparse autoencoders to gpt-4 and the need for transparency in reasoning LLMs. The report also notes humor around ChatGPT's French nickname.
LMSys killed Model Versioning (gpt 4o 1120, gemini exp 1121)
gpt-4o-2024-11-20 gemini-exp-1121 deepseek-r1 openai google-deepmind anthropic deepseek mistral-ai model-release model-ranking open-source vision coding reasoning market-competition
AI News for 11/21/2024-11/22/2024 highlights the intense frontier lab race with OpenAI's gpt-4o-2024-11-20 and Google DeepMind's gemini-exp-1121 trading top spots on the Lmsys leaderboard. The trend of using date-based model identifiers instead of traditional versioning is noted across leading labs including Anthropic. DeepSeek R1 is gaining attention as a potent open-source alternative, especially in the context of the AI competition between China and the US. Gemini-Exp-1121 is praised for improvements in vision, coding, and reasoning, while MistralAI expands with a new Palo Alto office, signaling growth and hiring.
DeepSeek-R1 claims to beat o1-preview AND will be open sourced
deepseek-r1-lite-preview o1-preview hopper blackwell alphaqubit deepseek nvidia google-deepmind reasoning benchmarking quantum-error-correction quantum-computing model-performance model-release yann-lecun
DeepSeek has released DeepSeek-R1-Lite-Preview, an open-source reasoning model achieving o1-preview-level performance on math benchmarks with transparent thought processes, showing promise in real-time problem-solving. NVIDIA reported a record $35.1 billion revenue in Q3 with 112% year-on-year data center growth, driven by Hopper and Blackwell architectures, the latter offering 2.2x performance improvement. Google DeepMind introduced AlphaQubit, a quantum computing system improving error correction and outperforming leading decoders, though challenges remain in scaling and speed. The AI community continues to focus on reasoning models, benchmarking, and quantum error correction advancements.
GitHub Copilot Strikes Back
claude-3-5-sonnet gemini-1.5-pro o1-preview gemini-flash-8b github anthropic google-deepmind openai weights-biases model-picker-ui multi-model-integration natural-language-applications deployment-free-hosting model-prompting multimodal-observability audio-tracing codebase-optimization price-performance-ratio cassidy-williams fchollet rohanpaul_ai jxmnop
GitHub's tenth annual Universe conference introduced the Multi-model Copilot featuring Anthropic's Claude 3.5 Sonnet, Google's Gemini 1.5 Pro, and OpenAI's o1-preview models in a new picker UI, allowing developers to choose from multiple companies' models. The event also showcased GitHub Spark, an AI-native tool for building natural language applications with deployment-free hosting and integrated model prompting. Additionally, GitHub updated its Copilot Workspace with new agents and security Autofix features. Weights & Biases launched Weave with multimodal observability supporting audio, text, and images, integrating the OpenAI Realtime API. Twitter recaps highlighted tinygrad's codebase optimization and discussions on GenAI adoption and Gemini Flash-8B's cost efficiency at $0.0375 per million tokens.
not much happened this weekend
claude-3.5-sonnet llama-3 llama-3-8b notebookllama min-omni-2 moondream openai anthropic hugging-face mistral-ai google-deepmind langchain deepmind microsoft pattern-recognition reinforcement-learning prompt-optimization text-to-speech model-optimization tensor-parallelism hyperparameters multimodal modal-alignment multimodal-fine-tuning ai-productivity privacy generative-ai rag retrieval-augmentation enterprise-text-to-sql amanda-askell philschmid stasbekman francois-fleuret mervenoyann reach_vb dzhng aravsrinivas sama lateinteraction andrew-y-ng bindureddy jerryjliu0
Moondream, a 1.6b vision language model, secured seed funding, highlighting a trend of moon-themed tiny models alongside Moonshine (a 27-61m ASR model). Claude 3.5 Sonnet was used for AI Twitter recaps. Discussions included pattern recognition vs. intelligence in LLMs, reinforcement learning for prompt optimization, and NotebookLlama, an open-source NotebookLM variant using LLaMA models for tasks like text-to-speech. Advances in model optimization were noted, including async-TP in PyTorch for tensor parallelism and hyperparameter tuning. Mini-Omni 2 demonstrated multimodal capabilities across image, audio, and text for voice conversations, with emphasis on modal alignment and multimodal fine-tuning. AI productivity tools such as an AI email writer and LlamaCloud-based research assistants were introduced. Practical skill development and privacy-conscious AI tool usage with Llama3-8B were emphasized. Generative AI resources such as #AIPythonforBeginners and GenAI Agents with LangGraph were shared. Business insights covered rapid execution in AI product development and emerging AI-related job roles. Challenges in enterprise-grade text-to-SQL and advanced retrieval methods were discussed, with tutorials on RAG applications using LangChain and MongoDB.
Not much (in AI) happened this weekend
llama-3.1-8b llama-3.2 chatgpt movie-gen openai meta-ai-fair google-deepmind microsoft x-ai spacex harvard nvidia long-context feature-prediction-loss ai-agents privacy text-to-video text-to-image humanoid-robots gpu-deployment media-foundation-models ai-research-labs sam-altman yann-lecun rasbt bindureddy andrej-karpathy soumithchintala svpino adcock_brett rohanpaul_ai
OpenAI introduced an "edit this area" feature for image generation, praised by Sam Altman. Yann LeCun highlighted a NYU paper improving pixel generation with a feature prediction loss using pre-trained visual encoders like DINOv2. Long-context LLMs such as llama-3.1-8b and llama-3.2 variants now support up to 131k tokens, offering alternatives to RAG systems. Bindu Reddy announced AI agents capable of building and deploying code from English instructions, signaling AI's replacement of SQL and potential impact on Python. SpaceX's successful Starship rocket catch was celebrated by Andrej Karpathy and others, with Soumith Chintala praising SpaceX's efficient, low-bureaucracy research approach. Privacy concerns arose from Harvard students' AI glasses, I-XRAY, which can reveal personal information. Meta AI FAIR's Movie Gen model advances media foundation models with high-quality text-to-image and video generation, including synced audio. Humanoid robots like Ameca and Azi now engage in expressive conversations using ChatGPT. xAI rapidly deployed 100K Nvidia H100 GPUs in 19 days, with NVIDIA CEO Jensen Huang commending Elon Musk. Leading AI research labs compared include Meta-FAIR, Google DeepMind, and Microsoft Research. Skepticism about LLM intelligence was voiced by svpino, emphasizing limitations in novel problem-solving despite strong memorization.
Contextual Document Embeddings: `cde-small-v1`
llama-3 cde-small-v1 gemini-1.5-flash-8b chatgpt meta-ai-fair openai google-deepmind weights-biases togethercompute contextual-embeddings contextual-batching video-generation synthetic-data model-efficiency training-techniques rag algorithmic-efficiency jack-morris sasha-rush tim-brooks demis-hassabis karina-nguyen
Meta announced a new text-to-video model, Movie Gen, claiming it adapts Llama 3 to video generation better than the Diffusion Transformers behind OpenAI's Sora, though no release is available yet. Researchers Jack Morris and Sasha Rush introduced the cde-small-v1 model with a novel contextual batching training technique and contextual embeddings, achieving strong performance with only 143M parameters. OpenAI launched Canvas, a collaborative interface for ChatGPT trained with synthetic data. Google DeepMind welcomed Tim Brooks to work on video generation and world simulators. Google released Gemini 1.5 Flash-8B, improving cost and rate limits through algorithmic efficiency.
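The contextual-embedding idea is easiest to see in a sketch. Below is a minimal, hypothetical PyTorch illustration of the two-stage scheme (not the cde-small-v1 implementation; all module names are invented): a document's embedding is conditioned on a summary of sampled corpus neighbors, so the same text can embed differently in different corpora.

```python
import torch
import torch.nn as nn

class ContextualEncoder(nn.Module):
    """Toy two-stage contextual embedder (illustrative only)."""
    def __init__(self, dim: int = 256):
        super().__init__()
        self.doc_proj = nn.Linear(dim, dim)   # embeds the target document
        self.ctx_proj = nn.Linear(dim, dim)   # embeds sampled corpus neighbors
        self.mix = nn.Linear(2 * dim, dim)    # fuses document and corpus context

    def forward(self, doc: torch.Tensor, corpus: torch.Tensor) -> torch.Tensor:
        # Stage 1: summarize a sample of corpus documents into one context vector.
        ctx = self.ctx_proj(corpus).mean(dim=0)
        # Stage 2: condition the document embedding on that corpus context.
        fused = torch.cat([self.doc_proj(doc), ctx], dim=-1)
        return torch.nn.functional.normalize(self.mix(fused), dim=-1)

enc = ContextualEncoder()
doc_vec = torch.randn(256)           # stand-in for a pooled text encoding
corpus_vecs = torch.randn(32, 256)   # stand-in for sampled corpus neighbors
print(enc(doc_vec, corpus_vecs).shape)  # torch.Size([256])
```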
Liquid Foundation Models: A New Transformers alternative + AINews Pod 2
llama-3-2 gemini-1.5-pro-002 gemini-1.5-flash-002 liquid-ai meta-ai-fair google-deepmind openai reinforcement-learning multimodality model-efficiency foundation-models audio-processing model-deployment open-source ylecun svpino
Liquid.ai emerged from stealth with three subquadratic foundation models demonstrating superior efficiency compared to state space models and Apple's on-device and server models, backed by a $37M seed round. Meta AI announced Llama 3.2 with multimodal vision-enabled models and lightweight text-only variants for mobile. Google DeepMind introduced production-ready Gemini-1.5-Pro-002 and Gemini-1.5-Flash-002 models with improved pricing and rate limits, alongside AlphaChip, an AI-driven chip design system using reinforcement learning for rapid superhuman layouts. OpenAI enhanced ChatGPT Plus and Teams with Advanced Voice Mode featuring Custom Instructions, Memory, and new nature-inspired voices. California's Governor vetoed the SB-1047 AI regulation bill, celebrated by AI community figures like ylecun and svpino as a win for open-source AI. Google upgraded NotebookLM with audio overviews supporting YouTube and audio files, turning documents into AI-generated podcasts. "Open source in AI is thriving," noted ylecun, highlighting 1 million models on GitHub and HuggingFace.
not much happened today
llama-3-2 llama-3 molmo meta-ai-fair google-deepmind hugging-face on-device-ai multimodality chip-design retrieval-augmented-generation rag benchmarking reliability ai-regulation free-speech pytorch-optimization demis-hassabis clementdelangue svpino awnihannun osanseviero omarsar0 sarahookr ylecun
Meta released Llama 3.2, including lightweight 1B and 3B models for on-device AI with capabilities like summarization and retrieval-augmented generation. Molmo, a new multimodal model, was introduced with a large dense captioning dataset. Google DeepMind announced AlphaChip, an AI-driven chip design method improving TPU and CPU designs. Hugging Face surpassed 1 million free public models, highlighting the value of smaller specialized models. Discussions covered challenges in scaling RAG applications, the future of on-device AI running ChatGPT-level models, reliability issues in larger LLMs, and new Elo benchmarking accepted at NeurIPS 2024. AI ethics and regulation topics included free speech responsibilities and California's SB-1047 bill potentially affecting open-source AI. Notable quotes: "AlphaChip transformed computer chip design," and a prediction of ChatGPT-level AI on mobile devices within a year.
not much happened today
llama-3-2 llama-3 gemma-2 phi-3-5-mini claude-3-haiku gpt-4o-mini molmo gemini-1.5 gemini meta-ai-fair openai allenai google-deepmind multimodality model-optimization benchmarks ai-safety model-distillation pruning adapter-layers open-source-models performance context-windows mira-murati demis-hassabis ylecun sama
Meta AI released Llama 3.2 models including 1B, 3B text-only and 11B, 90B vision variants with 128K token context length and adapter layers for image-text integration. These models outperform competitors like Gemma 2 and Phi 3.5-mini, and are supported on major platforms including AWS, Azure, and Google Cloud. OpenAI CTO Mira Murati announced her departure. Allen AI released Molmo, an open-source multimodal model family outperforming proprietary systems. Google improved Gemini 1.5 with Flash and Pro models. Meta showcased Project Orion AR glasses and hinted at a Quest 3S priced at $300. Discussions covered new benchmarks for multimodal models, model optimization, and AI safety and alignment.
not much happened today
o1-preview o1-mini qwen-2.5 gpt-4o deepseek-v2.5 gpt-4-turbo-2024-04-09 grin llama-3-1-405b veo kat openai qwen deepseek-ai microsoft kyutai-labs perplexity-ai together-ai meta-ai-fair google-deepmind hugging-face google anthropic benchmarking math coding instruction-following model-merging model-expressiveness moe voice voice-models generative-video competition open-source model-deployment ai-agents hyung-won-chung noam-brown bindureddy akhaliq karpathy aravsrinivas fchollet cwolferesearch philschmid labenz ylecun
OpenAI's o1-preview and o1-mini models lead benchmarks in Math, Hard Prompts, and Coding. Qwen 2.5 72B model shows strong performance close to GPT-4o. DeepSeek-V2.5 tops Chinese LLMs, rivaling GPT-4-Turbo-2024-04-09. Microsoft's GRIN MoE achieves good results with 6.6B active parameters. Moshi voice model from Kyutai Labs runs locally on Apple Silicon Macs. Perplexity app introduces voice mode with push-to-talk. LlamaCoder by Together.ai uses Llama 3.1 405B for app generation. Google DeepMind's Veo is a new generative video model for YouTube Shorts. The 2024 ARC-AGI competition increases prize money and plans a university tour. A survey on model merging covers 50+ papers for LLM alignment. The Kolmogorov–Arnold Transformer (KAT) paper proposes replacing MLP layers with KAN layers for better expressiveness. Hugging Face Hub integrates with Google Cloud Vertex AI Model Garden for easier open-source model deployment. Agent.ai is introduced as a professional network for AI agents. "Touching grass is all you need."
a quiet weekend
o1 datagemma aloha demostart firefly-ai-video-model pixtral-12b gamegen-o openai google-deepmind adobe mistral-ai tencent supermaven 11x cohere anthropic latent-space-university stanford microsoft mila notre-dame reinforcement-learning chain-of-thought reasoning robotics diffusion-models multimodality video-generation model-training reflection-tuning mathematical-reasoning model-benchmarking fine-tuning george-hotz terence-tao adcock_brett rohanpaul_ai bindureddy fchollet philschmid
OpenAI released the new o1 model, leveraging reinforcement learning and chain-of-thought prompting to excel in reasoning benchmarks, achieving an IQ-like score of 120. Google DeepMind introduced DataGemma to reduce hallucinations by connecting LLMs with real-world data, and unveiled ALOHA and DemoStart for robot dexterity using diffusion methods. Adobe previewed its Firefly AI Video Model with text-to-video and generative extend features. Mistral launched the multimodal Pixtral 12B model, and Tencent presented the GameGen-O open-world video game generation model. Several research papers from Stanford, OpenAI, Microsoft, Mila, and Notre Dame focus on advanced reasoning, self-verification, and reflection tuning techniques. Experts like Terence Tao and George Hotz have shared mixed but optimistic views on o1's capabilities. Seed funding rounds include Supermaven ($12M) and 11x ($24M).
Summer of Code AI: $1.6b raised, 1 usable product
ltm-2 llama-3-1-405b gemini-advanced cognition poolside codeium magic google-deepmind nvidia google-cloud long-context model-efficiency custom-hardware cuda training-stack gpu-scaling neural-world-models diffusion-models quantization nat-friedman ben-chess rohan-paul
Code + AI is emphasized as a key modality in AI engineering, highlighting productivity and verifiability benefits. Recent major funding rounds include Cognition AI raising $175M, Poolside raising $400M, Codeium AI raising $150M, and Magic raising $320M. Magic announced their LTM-2 model with a 100 million token context window; its sequence-dimension algorithm is roughly 1000x cheaper than Llama 3.1 405B's attention at that context length, with drastically lower memory requirements. Magic's stack is built from scratch with custom CUDA and no open-source foundations, partnered with Google Cloud and powered by NVIDIA H100 and GB200 GPUs, aiming to scale to tens of thousands of GPUs. Google DeepMind revealed updates to Gemini Advanced with customizable expert "Gems." Neural Game Engines like GameNGen can run DOOM in a diffusion model trained on 0.9B frames. The content also references LLM quantization research by Rohan Paul.
Cerebras Inference: Faster, Better, AND Cheaper
llama-3.1-8b llama-3.1-70b gemini-1.5-flash gemini-1.5-pro cogvideox-5b mamba-2 rene-1.3b llama-3.1 gemini-1.5 claude groq cerebras cursor google-deepmind anthropic inference-speed wafer-scale-chips prompt-caching model-merging benchmarking open-source-models code-editing model-optimization jeremyphoward sam-altman nat-friedman daniel-gross swyx
Groq led early 2024 with superfast LLM inference speeds, achieving ~450 tokens/sec for Mixtral 8x7B and 240 tokens/sec for Llama 2 70B. Cursor introduced a specialized code edit model hitting 1000 tokens/sec. Now, Cerebras claims the fastest inference with their wafer-scale chips, running Llama3.1-8b at 1800 tokens/sec and Llama3.1-70B at 450 tokens/sec at full precision, with competitive pricing and a generous free tier. Google's Gemini 1.5 models showed significant benchmark improvements, especially Gemini-1.5-Flash and Gemini-1.5-Pro. New open-source models like CogVideoX-5B and Mamba-2 (Rene 1.3B) were released, optimized for consumer hardware. Anthropic's Claude now supports prompt caching, improving speed and cost efficiency. "Cerebras Inference runs Llama3.1 20x faster than GPU solutions at 1/5 the price."
not much happened today
grok-2 claude-3.5-sonnet claude-3.5 gpt-4 chatgpt-4o-latest anthropic x-ai google-deepmind openai mistral-ai meta-ai-fair salesforce box prompt-caching model-performance vision fine-tuning multilinguality ai-safety design-automation document-processing ai-agents ai-integration ai-job-market ai-acceleration humor demis-hassabis francois-chollet
Anthropic rolled out prompt caching in its API, reducing input costs by up to 90% and latency by 80%, letting long, stable prompts behave like instant fine-tuning. xAI released Grok-2, a new model competing with frontier models from Google DeepMind, OpenAI, Anthropic, Mistral AI, and Meta AI FAIR, supporting vision and text inputs and integrating external image generation models. Claude 3.5 Sonnet is reported to outperform GPT-4 in coding and reasoning, while ChatGPT-4o-latest shows reasoning improvements. François Chollet proposed a theory defining intelligence as the efficiency of operationalizing past information for future tasks. The Aya project involves 3000 collaborators building multilingual AI datasets. Demis Hassabis discussed AI hype and safe AI development in a podcast. Tools like Dora AI for Figma and Box's AI API enhance design automation and document processing. Salesforce released DEI, an open framework of AI software-engineering agents with a 55% resolve rate on SWE-Bench Lite. Industry trends highlight rapid AI integration, the importance of networking in the AI job market, and potential OpenAI GPT-4 expansion in response to competitors. Memes include humor about Apple Vision Pro.
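For concreteness, here is roughly how the prompt-caching beta looked in the Anthropic Python SDK at launch (a hedged sketch; the beta header, model string, and minimum-cacheable-prefix rules may have changed since): a large, stable prefix is marked with `cache_control` so repeat calls reuse it instead of re-billing its input tokens.

```python
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment
LONG_REFERENCE_DOC = open("manual.txt").read()  # stand-in for a big, stable prefix

response = client.messages.create(
    model="claude-3-5-sonnet-20240620",
    max_tokens=1024,
    extra_headers={"anthropic-beta": "prompt-caching-2024-07-31"},
    system=[
        {
            "type": "text",
            "text": LONG_REFERENCE_DOC,          # reused from cache on later calls
            "cache_control": {"type": "ephemeral"},
        }
    ],
    messages=[{"role": "user", "content": "Summarize section 3."}],
)
print(response.content[0].text)
```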
not much happened today
llama-3 llama-3-1 grok-2 claude-3.5-sonnet gpt-4-turbo nous-research nvidia salesforce goodfire-ai anthropic x-ai google-deepmind box langchain fine-tuning prompt-caching mechanistic-interpretability model-performance multimodality agent-frameworks software-engineering-agents api document-processing text-generation model-releases vision image-generation efficiency scientific-discovery fchollet demis-hassabis
GPT-5 was delayed again amid a quiet news day. Nous Research released Hermes 3, a finetune of Llama 3 base models that rivals FAIR's instruct tunes but sparked debate over emergent existential-crisis behavior linked to its 6% roleplay data. Nvidia introduced a Minitron finetune of Llama 3.1. Salesforce launched a DEI agent scoring 55% on SWE-Bench Lite. Goodfire AI secured $7M seed funding for mechanistic interpretability work. Anthropic rolled out prompt caching in their API, cutting input costs by up to 90% and latency by 80%, aiding coding assistants and large-document processing. xAI released Grok-2, matching Claude 3.5 Sonnet and GPT-4 Turbo on the LMSYS leaderboard with vision+text inputs and image generation integration. Claude 3.5 Sonnet reportedly outperforms GPT-4 in coding and reasoning. François Chollet defined intelligence as efficient operationalization of past info for future tasks. Salesforce's DEI framework surpasses individual agent performance. Google DeepMind's Demis Hassabis discussed AGI's role in scientific discovery and safe AI development. The Dora AI plugin generates landing pages in under 60 seconds, boosting web team efficiency. The Box AI API beta enables document chat, data extraction, and content summarization. LangChain updated its Python & JavaScript integration docs.
Grok 2! and ChatGPT-4o-latest confuses everybody
gpt-4o grok-2 claude-3.5-sonnet flux-1 stable-diffusion-3 gemini-advanced openai x-ai black-forest-labs google-deepmind benchmarking model-performance tokenization security-vulnerabilities multi-agent-systems research-automation text-to-image conversational-ai model-integration ylecun rohanpaul_ai karpathy
OpenAI quietly released a new GPT-4o model in ChatGPT, distinct from the API version, reclaiming the #1 spot on LMSys arena benchmarks across multiple categories including math, coding, and instruction-following. Meanwhile, xAI launched Grok 2, outperforming Claude 3.5 Sonnet and previous GPT-4o versions, with plans for an enterprise API release. Grok 2 integrates Black Forest Labs' FLUX.1, an open-source text-to-image model surpassing Stable Diffusion 3. Google DeepMind announced Gemini Advanced with enhanced conversational features and Pixel device integration. AI researcher ylecun highlighted LLM limitations in learning and creativity, while rohanpaul_ai discussed an AI Scientist system generating publishable ML research at low cost. karpathy warned of security risks in LLM tokenizers akin to SQL injection.
Too Cheap To Meter: AI prices cut 50-70% in last 30 days
gpt-4o gpt-4o-mini llama-3-1-405b mistral-large-2 gemini-1.5-flash deepseek-v2 sonnet-3.5 exaone-3.0 minicpm-v-2.6 claude-3.5 gpt-4o-2024-08-06 llamaindex together-ai deepinfra deepseek-ai mistral-ai google-deepmind lg-ai-research price-cuts context-caching instruction-tuning vision benchmarks pytorch attention-mechanisms reinforcement-learning-from-human-feedback compute-optimal-scaling rohanpaul_ai akhaliq mervenoyann sophiamyang chhillee karpathy
Gemini 1.5 Flash has cut prices by approximately 70%, offering a generous free tier, rate limits of 1 million tokens per minute, and paid pricing of $0.075/mtok, intensifying the AI model price war. Other significant price reductions include GPT-4o (~50% cut to $2.50/mtok), GPT-4o mini (70-98.5% cut to $0.15/mtok), Llama 3.1 405b (46% cut to $2.7/mtok), and Mistral Large 2 (62% cut to $3/mtok). Deepseek v2 introduced context caching, reducing input token costs by up to 90% to $0.014/mtok. New model releases include Llama 3.1 405b, Sonnet 3.5, EXAONE-3.0 (7.8B, instruction-tuned by LG AI Research), and MiniCPM V 2.6 (a vision-language model combining SigLIP 400M and Qwen2-7B). Benchmarks show Mistral Large performing well on ZebraLogic and Claude-3.5 leading LiveBench. FlexAttention, a new PyTorch API, simplifies and optimizes attention mechanisms. Andrej Karpathy analyzed RLHF, highlighting its limitations compared to traditional reinforcement learning. Google DeepMind research on compute-optimal scaling was also summarized.
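FlexAttention's pitch is that attention variants become a few lines of Python rather than hand-written kernels: a `score_mod` callback rewrites individual attention logits. A minimal sketch (assumes PyTorch 2.5+; the bias slope is illustrative, and `torch.compile` is needed for real performance):

```python
import torch
from torch.nn.attention.flex_attention import flex_attention

def causal_alibi(score, b, h, q_idx, kv_idx):
    # Causal mask plus a simple ALiBi-style distance penalty.
    bias = -0.1 * (q_idx - kv_idx)
    return torch.where(q_idx >= kv_idx, score + bias, float("-inf"))

B, H, S, D = 2, 4, 128, 64
q, k, v = (torch.randn(B, H, S, D) for _ in range(3))
out = flex_attention(q, k, v, score_mod=causal_alibi)
print(out.shape)  # torch.Size([2, 4, 128, 64])
```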
GPT4o August + 100% Structured Outputs for All (GPT4o August edition)
gpt-4o-2024-08-06 llama-3-1-405b llama-3 claude-3.5-sonnet gemini-1.5-pro gpt-4o yi-large-turbo openai meta-ai-fair google-deepmind yi-large nvidia groq langchain jamai langsmith structured-output context-windows model-pricing benchmarking parameter-efficient-expert-retrieval retrieval-augmented-generation mixture-of-experts model-performance ai-hardware model-deployment filtering multi-lingual vision john-carmack jonathan-ross rohanpaul_ai
OpenAI released the new gpt-4o-2024-08-06 model with a 16k output-token limit and 33-50% lower pricing than the previous 4o-May version, featuring a new Structured Output API that improves output quality and reduces retry costs. Meta AI launched Llama 3.1, a 405-billion parameter model surpassing GPT-4 and Claude 3.5 Sonnet on benchmarks, alongside expanding the Llama Impact Grant program. Google DeepMind quietly released Gemini 1.5 Pro, outperforming GPT-4o, Claude-3.5, and Llama 3.1 on LMSYS benchmarks and leading the Vision Leaderboard. Yi-Large Turbo was introduced as a cost-effective upgrade priced at $0.19 per million tokens. In hardware, NVIDIA H100 GPUs were highlighted by John Carmack for their massive AI workload power, and Groq announced plans to deploy 108,000 LPUs by Q1 2025. New AI tools and techniques include RAG (Retrieval-Augmented Generation), the JamAI Base platform for Mixture of Agents systems, and LangSmith's enhanced filtering capabilities. Google DeepMind also introduced the PEER (Parameter Efficient Expert Retrieval) architecture.
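The Structured Output API is easiest to see through the SDK's Pydantic integration. A hedged sketch of the pattern as it shipped alongside gpt-4o-2024-08-06 (the schema and prompt are illustrative): the model is constrained to emit JSON matching the class, so replies parse without retry loops.

```python
from pydantic import BaseModel
from openai import OpenAI

class CalendarEvent(BaseModel):
    name: str
    date: str
    participants: list[str]

client = OpenAI()  # reads OPENAI_API_KEY from the environment
completion = client.beta.chat.completions.parse(
    model="gpt-4o-2024-08-06",
    messages=[
        {"role": "system", "content": "Extract the event information."},
        {"role": "user", "content": "Alice and Bob meet for lunch on Friday."},
    ],
    response_format=CalendarEvent,  # output is guaranteed to match this schema
)
event = completion.choices[0].message.parsed  # a CalendarEvent instance
print(event.name, event.participants)
```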
Execuhires: Tempting The Wrath of Khan
gemini-1.5-pro gpt-4o claude-3.5 flux-1 llama-3-1-405b character.ai google adept amazon inflection microsoft stability-ai black-forest-labs schelling google-deepmind openai anthropic meta-ai-fair lmsys langchainai execuhire model-benchmarking multilinguality math coding text-to-image agent-ide open-source-models post-training data-driven-performance noam-shazeer emad-mostaque david-friedman robin-rombach alexandr-wang svpino rohanpaul_ai
Character.ai's $2.5b execuhire to Google marks a significant leadership move alongside Adept's $429m execuhire to Amazon and Inflection's $650m execuhire to Microsoft. Despite strong user growth and content momentum, Character.ai's CEO Noam Shazeer returns to Google, signaling shifting vibes in the AI industry. Google DeepMind's Gemini 1.5 Pro tops Chatbot Arena benchmarks, outperforming GPT-4o and Claude-3.5, excelling in multilingual, math, and coding tasks. The launch of Black Forest Labs' FLUX.1 text-to-image model and LangGraph Studio agent IDE highlight ongoing innovation. Llama 3.1 405B is released as the largest open-source model, fostering developer use and competition with closed models. The industry is focusing increasingly on post-training and data as key competitive factors, raising questions about acquisition practices and regulatory scrutiny.
Rombach et al: FLUX.1 [pro|dev|schnell], $31m seed for Black Forest Labs
gemma-2-2b gpt-3.5-turbo-0613 mixtral-8x7b flux-1 stability-ai google-deepmind nvidia text-to-image text-to-video model-benchmarking open-weight-models model-distillation safety-classifiers sparse-autoencoders ai-coding-tools rohanpaul_ai fchollet bindureddy clementdelangue ylecun svpino
Stability AI co-founder Robin Rombach launched FLUX.1, a new text-to-image model with three variants: pro (API only), dev (open-weight, non-commercial), and schnell (Apache 2.0). FLUX.1 outperforms Midjourney and Ideogram based on Black Forest Labs' ELO scores, and the lab plans to expand into text-to-video. Google DeepMind released Gemma-2 2B, a 2 billion parameter open-source model that outperforms larger models like GPT-3.5-Turbo-0613 and Mixtral-8x7b on Chatbot Arena, optimized with NVIDIA TensorRT-LLM. The release includes safety classifiers (ShieldGemma) and sparse autoencoder analysis (Gemma Scope). Discussions highlight benchmarking discrepancies and US government support for open-weight AI models. Critiques of AI coding tools' productivity gains were also noted.
Gemma 2 2B + Scope + Shield
gemma-2b gemma-2-9b gemma-2-27b llama-3-1-405b sam-2 gpt-3.5 vicuna alpacaeval g-eval google-deepmind anthropic meta-ai-fair openai perplexity-ai nvidia lmsys knowledge-distillation leaderboards model-interpretability finetuning harm-detection video-segmentation voice publishers-program robotics-data-scaling quantization llm-evaluation prompt-engineering
Gemma 2 2B, a 2 billion parameter model trained on 2 trillion tokens and distilled from a larger unnamed LLM, has been released by Google DeepMind and shows strong leaderboard performance despite weaknesses in math. The Gemma 2 series, including the 9B and 27B models, has gained popularity since its June release. The team also released 400 SAEs for interpretability, inspired by Anthropic's research. A finetuned classifier called ShieldGemma outperforms Meta's LlamaGuard in harm detection. Meanwhile, Meta AI announced Llama-3.1-405B reaching #3 on the Overall Arena leaderboard, and released SAM 2, a video and image segmentation model with significant speed improvements. OpenAI is rolling out an advanced Voice Mode to Plus users. Perplexity AI launched a Publishers Program with major media partners and a status page. NVIDIA introduced Project GR00T for scaling robot data using Apple Vision Pro and generative simulation. Interest in quantization for compressing LLMs is growing, and LLM-as-a-Judge implementations from Vicuna, AlpacaEval, and G-Eval highlight the effectiveness of simple prompts and domain-specific evaluation.
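The LLM-as-a-Judge point is worth making concrete: most working setups are little more than a careful prompt and a strict output format. A minimal pairwise-judge sketch in that spirit (prompt wording and model choice are illustrative, not taken from Vicuna or AlpacaEval):

```python
from openai import OpenAI

JUDGE_PROMPT = """You are an impartial judge. Compare the two answers to the
question below and reply with exactly one token: "A", "B", or "TIE".

Question: {question}
Answer A: {answer_a}
Answer B: {answer_b}"""

def judge(question: str, answer_a: str, answer_b: str) -> str:
    client = OpenAI()
    resp = client.chat.completions.create(
        model="gpt-4o",
        temperature=0,  # deterministic verdicts keep evals reproducible
        messages=[{"role": "user", "content": JUDGE_PROMPT.format(
            question=question, answer_a=answer_a, answer_b=answer_b)}],
    )
    return resp.choices[0].message.content.strip()

print(judge("What is 2+2?", "4", "5"))  # expected: "A"
```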
not much happened today
sam-2 gemini-1.5-pro chatgpt midjourney-v6.1 meta-ai-fair google-deepmind scale-ai apple canva hugging-face object-segmentation quantization web-development-framework adversarial-robustness on-device-ai open-source robotics voice vision jeremyphoward demis-hassabis ylecun maartengrootendorst jimfan
Meta released SAM 2, a unified model for real-time object segmentation, with a new dataset 4.5x larger and with 53x more annotations than previous ones. FastHTML, a new Python web framework by Jeremy Howard, enables easy creation and deployment of interactive web apps. Scale AI launched the SEAL Leaderboard on adversarial robustness, topped by Gemini 1.5 Pro from Google DeepMind. Apple published a technical report on its Intelligence Foundation Language Models for on-device and server use. Yann LeCun emphasized the importance of open-source AI in an article co-authored with Martin Casado and Ion Stoica. Maarten Grootendorst's "Visual Guide to Quantization" on efficient LLM inference went viral. ChatGPT started rolling out advanced voice and vision-enabled modes to select users. Leonardo AI was acquired by Canva. Jim Fan shared insights on Project GR00T augmenting human demonstration data for robotics. Midjourney v6.1 was released.
AlphaProof + AlphaGeometry2 reach 1 point short of IMO Gold
gemini alphageometry-2 alphaproof llama-3-1-405b llama-3-70b llama-3-8b mistral-large-2 google-deepmind meta-ai-fair mistral-ai neurosymbolic-ai mathematical-reasoning synthetic-data knowledge-sharing model-fine-tuning alpha-zero multilinguality context-windows model-scaling benchmarking performance-comparison tim-gowers guillaume-lample osanseviero
Search+Verifier highlights advances in neurosymbolic AI at the 2024 International Mathematical Olympiad. Google DeepMind's combination of AlphaProof and AlphaGeometry 2 solved four of six IMO problems, with AlphaProof being a finetuned Gemini model using an AlphaZero-style approach, and AlphaGeometry 2 trained on significantly more synthetic data with a novel knowledge-sharing mechanism. Despite the impressive results, human judges noted the AI required far more time than human competitors. Meanwhile, Meta AI released Llama 3.1 with a 405B parameter model and smaller variants, and Mistral AI launched Mistral Large 2 with 123B parameters and a 128k context window, outperforming Llama 3.1 on coding tasks and multilingual benchmarks. This marks significant progress in AI mathematical reasoning, model scaling, and multilingual capabilities.
That GPT-4o Demo
gpt-4o gemma-2 meta-code-llama openai google-deepmind meta-ai-fair voice-generation ocr screen-sharing vision code-understanding model-customization efficiency textual-intelligence multimodal-agents sft distillation rlhf model-merging model-optimization safety romain-huet fchollet
Romain Huet demonstrated an unreleased version of GPT-4o on ChatGPT Desktop, showcasing capabilities like low-latency voice generation, voice modulation down to a whisper, camera mode streaming video to GPT-4o, rapid OCR, screen sharing with ChatGPT for programming help, clipboard reading, and vision-based conversation about code. OpenAI's four highlighted investment areas are textual intelligence, efficiency/cost, model customization, and multimodal agents. Google DeepMind released Gemma 2 models in 9B and 27B sizes trained on 8T and 13T tokens respectively, using SFT, distillation, RLHF, and model merging, optimized for TPUv5e with strong performance and safety measures. Meta AI announced the Meta LLM Compiler, built on Meta Code Llama, with enhanced code optimization and compiler features.
Gemma 2: The Open Model for Everyone
gemma-2 qwen-72b mixtral-8x22b-instruct claude-3.5-sonnet google-deepmind alibaba mistral-ai anthropic knowledge-distillation attention-mechanisms multilingual-models multimodality model-training model-optimization memory-optimization fine-tuning kathleen-kenealy daniel-han
Gemma 2, a 27B parameter model from google-deepmind, was released with innovations like 1:1 local-global attention alternation and logit soft-capping, leveraging knowledge distillation to train smaller models on over 50× the compute-optimal token quantity. The model supports multilingual and multimodal capabilities, with fine-tuning success on over 200 Indic language variants. The Open LLM Leaderboard highlights alibaba's Qwen 72B as the top model, with mistral-ai's Mixtral-8x22B-Instruct also ranking highly. Anthropic launched Claude 3.5 Sonnet, improving intelligence at mid-tier cost and speed. Research on eliminating matrix multiplication in LLMs promises significant memory savings without performance loss. Kathleen Kenealy and Daniel Han provided insights on Gemma 2's tokenizer and attention scaling respectively.
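Logit soft-capping, one of the Gemma 2 tricks above, is a one-liner: logits pass through a scaled tanh so they stay within ±cap, which stabilizes training. A tiny sketch (the caps of 50 for attention logits and 30 for final logits are as reported for Gemma 2):

```python
import torch

def soft_cap(logits: torch.Tensor, cap: float) -> torch.Tensor:
    # Smoothly bounds logits to (-cap, cap) while staying ~linear near zero.
    return cap * torch.tanh(logits / cap)

x = torch.tensor([-200.0, -10.0, 0.0, 10.0, 200.0])
print(soft_cap(x, 50.0))   # attention-logit cap
print(soft_cap(x, 30.0))   # final-logit cap
```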
There's Ilya!
chameleon-7b chameleon-34b deepseek-coder-v2 gpt-4-turbo claude-3-opus voco-llama safe-superintelligence-inc openai anthropic meta deepseek google-deepmind parallel-decoding code-generation quantization training-dynamics vision benchmarks datasets image-captioning reasoning memory-optimization ilya-sutskever jan-leike ylecun akhaliq philschmid rohanpaul_ai mervenoyann fchollet
Ilya Sutskever has co-founded Safe Superintelligence Inc shortly after leaving OpenAI, while Jan Leike moved to Anthropic. Meta released new models including Chameleon 7B and 34B, which quantize mixed-modal inputs into a unified token space. DeepSeek-Coder-V2 shows code capabilities comparable to GPT-4 Turbo, supporting 338 programming languages and 128K context length. Consistency Large Language Models (CLLMs) enable parallel decoding, generating multiple tokens per step. Grokked Transformers demonstrate reasoning emerging from training dynamics that shape memory formation and generalization. VoCo-LLaMA compresses vision tokens with LLMs, improving video temporal correlation understanding. The BigCodeBench benchmark evaluates LLMs on 1,140 coding tasks across 139 Python libraries, topped by DeepSeek-Coder-V2 and Claude 3 Opus. PixelProse is a large 16M image-caption dataset with reduced toxicity.
Contextual Position Encoding (CoPE)
cope gemini-1.5-flash gemini-1.5-pro claude gpt-3 meta-ai-fair google-deepmind anthropic perplexity-ai langchain openai positional-encoding transformers counting copying language-modeling coding external-memory tool-use model-evaluation inference-speed model-benchmarking scaling research-synthesis jason-weston alexandr-wang karpathy arav-srinivas
Meta AI researcher Jason Weston introduced CoPE, a novel positional encoding method for transformers that incorporates context through learnable gates, improving handling of counting and copying tasks and performance on language modeling and coding. The approach can potentially be extended with external memory for gate calculation. Google DeepMind released Gemini 1.5 Flash and Pro models optimized for fast inference. Anthropic announced general availability of tool use for Claude, enhancing its ability to orchestrate tools for complex tasks. Alexandr Wang launched SEAL Leaderboards for private, expert evaluations of frontier models. Karpathy reflected on the 4th anniversary of GPT-3, emphasizing scaling and practical improvements. Perplexity AI launched Perplexity Pages to convert research into visually appealing articles, described as an "AI Wikipedia" by Aravind Srinivas.
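CoPE's core mechanism fits in a few lines. A toy sketch of the paper's idea (illustrative, not the reference code): each query computes sigmoid gates over preceding tokens, and a token's "position" is the running sum of those gates, so the model can learn to count tokens, words, or sentences depending on context.

```python
import torch

def cope_positions(q: torch.Tensor, k: torch.Tensor) -> torch.Tensor:
    # q, k: (seq, dim). gates[i, j] in (0, 1) says whether token j "counts"
    # from the perspective of query i.
    gates = torch.sigmoid(q @ k.T)
    gates = gates * torch.tril(torch.ones_like(gates))  # causal positions only
    # p[i, j] = sum of gates[i, j..i]: a contextual, possibly fractional
    # distance. Fractional values are handled in the paper by interpolating
    # between learned integer position embeddings (omitted here).
    return gates.flip(-1).cumsum(-1).flip(-1)

q, k = torch.randn(6, 16), torch.randn(6, 16)
print(cope_positions(q, k))  # lower-triangular matrix of contextual positions
```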
ALL of AI Engineering in One Place
claude-3-sonnet claude-3 openai google-deepmind anthropic mistral-ai cohere hugging-face adept midjourney character-ai microsoft amazon nvidia salesforce mastercard palo-alto-networks axa novartis discord twilio tinder khan-academy sourcegraph mongodb neo4j hasura modular cognition anysphere perplexity-ai groq mozilla nous-research galileo unsloth langchain llamaindex instructor weights-biases lambda-labs neptune datastax crusoe covalent qdrant baseten e2b octo-ai gradient-ai lancedb log10 deepgram outlines crew-ai factory-ai interpretability feature-steering safety multilinguality multimodality rag evals-ops open-models code-generation gpus agents ai-leadership
The upcoming AI Engineer World's Fair in San Francisco from June 25-27 will feature a significantly expanded format with booths, talks, and workshops from top model labs like OpenAI, DeepMind, Anthropic, Mistral, Cohere, HuggingFace, and Character.ai. It includes participation from Microsoft Azure, Amazon AWS, Google Vertex, and major companies such as Nvidia, Salesforce, Mastercard, Palo Alto Networks, and more. The event covers 9 tracks including RAG, multimodality, evals/ops, open models, code generation, GPUs, agents, AI in Fortune 500, and a new AI leadership track. Additionally, Anthropic shared interpretability research on Claude 3 Sonnet, revealing millions of interpretable features that can be steered to modify model behavior, including safety-relevant features related to bias and unsafe content, though more research is needed for practical applications. The event offers a discount code for AI News readers.
Skyfall
gemini-1.5-pro gemini-1.5-flash yi-1.5 kosmos-2.5 paligemma falcon-2 deepseek-v2 hunyuan-dit gemini-1.5 google-deepmind yi-ai microsoft hugging-face langchain maven multimodality mixture-of-experts transformer model-optimization long-context model-performance model-inference fine-tuning local-ai scaling-laws causal-models hallucination-detection model-distillation model-efficiency hamel-husain dan-becker clement-delangue philschmid osanseviero arankomatsuzaki jason-wei rohanpaul_ai
Between 5/17 and 5/20/2024, key AI updates include Google DeepMind's Gemini 1.5 Pro and Flash models, featuring sparse multimodal MoE architecture with up to 10M context and a dense Transformer decoder that is 3x faster and 10x cheaper. Yi AI released Yi-1.5 models with extended context windows of 32K and 16K tokens. Other notable releases include Kosmos 2.5 (Microsoft), PaliGemma (Google), Falcon 2, DeepSeek v2 lite, and HunyuanDiT diffusion model. Research highlights feature an Observational Scaling Laws paper predicting model performance across families, a Layer-Condensed KV Cache technique boosting inference throughput by up to 26×, and the SUPRA method converting LLMs into RNNs for reduced compute costs. Hugging Face expanded local AI capabilities enabling on-device AI without cloud dependency. LangChain updated its v0.2 release with improved documentation. The community also welcomed a new LLM Finetuning Discord by Hamel Husain and Dan Becker for Maven course users. "Hugging Face is profitable, or close to profitable," enabling $10 million in free shared GPUs for developers.
Chameleon: Meta's (unreleased) GPT4o-like Omnimodal Model
chameleon gpt-4o gemini-1.5-flash claude-3 meta-ai-fair openai google-deepmind anthropic reddit multimodality early-fusion benchmarking model-training tokenization streaming tool-use vision coding hallucination-detection model-performance armen-aghajanyan sama alexandr-wang abacaj alexalbert__
Meta AI FAIR introduced Chameleon, a new multimodal model family with 7B and 34B parameter versions trained on 10T tokens of interleaved text and image data enabling "early fusion" multimodality that can natively output any modality. While reasoning benchmarks are modest, its "omnimodality" approach competes well with pre-GPT4o multimodal models. OpenAI launched GPT-4o, a model excelling in benchmarks like MMLU and coding tasks, with strong multimodal capabilities but some regression in ELO scores and hallucination issues. Google DeepMind announced Gemini 1.5 Flash, a small model with 1M context window and flash performance, highlighting convergence trends between OpenAI and Google models. Anthropic updated Claude 3 with streaming support, forced tool use, and vision tool integration for multimodal knowledge extraction. OpenAI also partnered with Reddit, raising industry attention.
Cursor reaches >1000 tok/s finetuning Llama3-70b for fast file editing
gpt-4 gpt-4o gpt-4-turbo gpt-4o-mini llama bloom stable-diffusion cursor openai anthropic google-deepmind huggingface speculative-decoding code-edits multimodality image-generation streaming tool-use fine-tuning benchmarking mmlu model-performance evaluation synthetic-data context-windows sama abacaj imjaredz erhartford alexalbert svpino maximelabonne _philschmid
Cursor, an AI-native IDE, announced a speculative edits algorithm for code editing that surpasses GPT-4 and GPT-4o in accuracy and latency, achieving speeds of over 1000 tokens/s on a 70b model. OpenAI released GPT-4o with multimodal capabilities including audio, vision, and text, noted to be 2x faster and 50% cheaper than GPT-4 turbo, though with mixed coding performance. Anthropic introduced streaming, forced tool use, and vision features for developers. Google DeepMind unveiled Imagen Video and Gemini 1.5 Flash, a small model with a 1M-context window. HuggingFace is distributing $10M in free GPUs for open-source AI models like Llama, BLOOM, and Stable Diffusion. Evaluation insights highlight challenges with LLMs on novel problems and benchmark saturation, with new benchmarks like MMLU-Pro showing significant drops in top model performance.
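Speculative edits generalize speculative decoding to editing: since most of an edited file is unchanged, the original text serves as the draft, long unchanged stretches are verified in one batched pass, and real decoding happens only where the edit diverges. A toy, self-contained sketch of that idea (illustrating the general technique, not Cursor's algorithm; it assumes the edit preserves alignment, so insertions or deletions would need realignment):

```python
class MockModel:
    """Stands in for an LLM that deterministically wants `target`."""
    def __init__(self, target):
        self.target = target

    def verify(self, out, draft):
        # In a real system this is ONE batched forward pass scoring every
        # draft position; here we just compare against the target text.
        pos, n = len(out), 0
        while (n < len(draft) and pos + n < len(self.target)
               and draft[n] == self.target[pos + n]):
            n += 1
        return n

    def decode_one(self, out):
        return self.target[len(out)]

def speculative_edit(model, original, chunk=8):
    out = []
    while len(out) < len(model.target):
        draft = original[len(out) : len(out) + chunk]
        accepted = model.verify(out, draft)
        out.extend(draft[:accepted])           # cheap: tokens taken from the draft
        if accepted == len(draft) and draft:
            continue                           # whole chunk accepted, keep going
        if len(out) < len(model.target):
            out.append(model.decode_one(out))  # expensive: real decoding step
    return out

original = list("def add(a, b):\n    return a + b\n")
edited   = list("def add(a, b):\n    return a - b\n")
print("".join(speculative_edit(MockModel(edited), original)))
```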
Not much happened today
gpt-4o gemini-1.5-pro gemini-1.5-flash imagen-3 veo reka-core qwen-1.5-110b openai google-deepmind anthropic rekailabs alibaba salesforce multimodality long-context model-releases reinforcement-learning model-benchmarking text-to-image video-generation ai-assistants ilya-sutskever jakub-pachocki mike-krieger sama
Ilya Sutskever steps down as Chief Scientist at OpenAI after nearly a decade, with Jakub Pachocki named as his successor. Google DeepMind announces Gemini 1.5 Pro and Gemini 1.5 Flash models featuring 2 million token context and improved multimodal capabilities, alongside demos of Project Astra AI assistant, Imagen 3 text-to-image model, and Veo generative video model. GPT-4o tops the VHELM leaderboard and outperforms competitors on LMSYS Chatbot Arena. Reka Core multimodal model with 128K context and Alibaba's Qwen1.5-110B open-source model are released. Salesforce shares an online RLHF recipe.
Google I/O in 60 seconds
gemini-1.5-pro gemini-flash gemini-ultra gemini-pro gemini-nano gemma-2 llama-3-70b paligemma imagen-3 veo google google-deepmind youtube tokenization model-performance fine-tuning vision multimodality model-release model-training model-optimization ai-integration image-generation watermarking hardware-optimization voice video-understanding
Google announced updates to the Gemini model family, including Gemini 1.5 Pro with 2 million token support, and the new Gemini Flash model optimized for speed with 1 million token capacity. The Gemini suite now includes Ultra, Pro, Flash, and Nano models, with Gemini Nano integrated into Chrome 126. Additional Gemini features include Gemini Gems (custom GPTs), Gemini Live for voice conversations, and Project Astra, a live video understanding assistant. The Gemma model family was updated with Gemma 2 at 27B parameters, offering near-llama-3-70b performance at half the size, plus PaliGemma, a vision-language open model inspired by PaLI-3. Other launches include DeepMind's Veo, Imagen 3 for photorealistic image generation, and a Music AI Sandbox collaboration with YouTube. SynthID watermarking now extends to text, images, audio, and video. The Trillium TPUv6 codename was revealed. Google also integrated AI across its product suite including Workspace, Email, Docs, Sheets, Photos, Search, and Lens. "The world awaits Apple's answer."
LMSys advances Llama 3 eval analysis
llama-3-70b llama-3 claude-3-sonnet alphafold-3 lmsys openai google-deepmind isomorphic-labs benchmarking model-behavior prompt-complexity model-specification molecular-structure-prediction performance-analysis leaderboards demis-hassabis sam-altman mira-murati karina-nguyen joanne-jang john-schulman
LMSys is enhancing LLM evaluation by categorizing performance across 8 query subcategories and 7 prompt complexity levels, revealing uneven strengths in models like Llama-3-70b. DeepMind released AlphaFold 3, advancing molecular structure prediction with holistic modeling of protein-DNA-RNA complexes, impacting biology and genetics research. OpenAI introduced the Model Spec, a public standard to clarify model behavior and tuning, inviting community feedback and aiming for models to learn directly from it. Llama 3 has reached top leaderboard positions on LMSys, nearly matching Claude-3-sonnet in performance, with notable variations on complex prompts. The analysis highlights the evolving landscape of model benchmarking and behavior shaping.
OpenAI's PR Campaign?
alphafold-3 xlstm gpt-4 openai microsoft google-deepmind memory-management model-spec scaling multimodality performance transformers dynamic-memory model-architecture demis-hassabis sama joanne-jang omarsar0 arankomatsuzaki drjimfan
OpenAI faces user data deletion backlash over its new partnership with StackOverflow amid GDPR complaints and US newspaper lawsuits, while addressing election year concerns with efforts like the Media Manager tool for content opt-in/out by 2025 and source link attribution. Microsoft develops a top-secret airgapped GPT-4 AI service for US intelligence agencies. OpenAI releases the Model Spec outlining responsible AI content generation policies, including NSFW content handling and profanity use, emphasizing clear distinctions between bugs and design decisions. Google DeepMind announces AlphaFold 3, a state-of-the-art model predicting molecular structures with high accuracy, showcasing cross-domain AI techniques. New research on xLSTM proposes scaling LSTMs to billions of parameters, competing with transformers in performance and scaling. Microsoft introduces vAttention, a dynamic memory management method for efficient large language model serving without PagedAttention.
DeepSeek-V2 beats Mixtral 8x22B with >160 experts at HALF the cost
deepseek-v2 llama-3-120b llama-3-400b gpt-4 mistral phi claude gemini mai-1 med-gemini deepseek-ai mistral-ai microsoft openai scale-ai tesla nvidia google-deepmind mixture-of-experts multi-head-attention model-inference benchmarking overfitting robotics teleoperation open-source multimodality hallucination-detection fine-tuning medical-ai model-training erhartford maximelabonne bindureddy adcock_brett drjimfan clementdelangue omarsar0 rohanpaul_ai
DeepSeek V2 introduces a new state-of-the-art MoE model with 236B parameters and a novel Multi-Head Latent Attention mechanism, achieving faster inference and surpassing GPT-4 on AlignBench. Llama 3 120B shows strong creative writing skills, while Microsoft is reportedly developing a 500B parameter LLM called MAI-1. Research from Scale AI highlights overfitting issues in models like Mistral and Phi, whereas GPT-4, Claude, Gemini, and Llama maintain benchmark robustness. In robotics, Tesla Optimus advances with superior data collection and teleoperation, LeRobot marks a move toward open-source robotics AI, and Nvidia's DrEureka automates robot skill training. Multimodal LLM hallucinations are surveyed with new mitigation strategies, and Google's Med-Gemini achieves SOTA on medical benchmarks with fine-tuned multimodal models.
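Multi-Head Latent Attention's key trick can be sketched schematically (dimensions are illustrative, and this mirrors the public description of DeepSeek-V2 rather than its code): keys and values are jointly down-projected into one small latent per token, only that latent is cached, and full-size K/V are re-expanded on the fly, which is what shrinks the KV cache and speeds inference.

```python
import torch
import torch.nn as nn

d_model, d_latent, n_heads, d_head = 1024, 128, 8, 128

down = nn.Linear(d_model, d_latent, bias=False)           # joint KV compression
up_k = nn.Linear(d_latent, n_heads * d_head, bias=False)  # decompress to keys
up_v = nn.Linear(d_latent, n_heads * d_head, bias=False)  # decompress to values

h = torch.randn(1, 16, d_model)     # hidden states for 16 tokens
latent_cache = down(h)              # (1, 16, 128): this is all that is cached
k = up_k(latent_cache).view(1, 16, n_heads, d_head)
v = up_v(latent_cache).view(1, 16, n_heads, d_head)
print(latent_cache.shape, k.shape)  # tiny cache, full-size K/V rebuilt on the fly
```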
Cohere Command R+, Anthropic Claude Tool Use, OpenAI Finetuning
c4ai-command-r-plus claude-3 gpt-3.5-turbo gemini mistral-7b gemma-2 claude-3-5 llama-3 vicuna cohere anthropic openai microsoft stability-ai opera-software meta-ai-fair google-deepmind mistral-ai tool-use multilingual-models rag fine-tuning quantum-computing audio-generation local-inference context-windows model-size-analysis model-comparison
Cohere launched Command R+, a 104B dense model with 128k context length focusing on RAG, tool-use, and multilingual capabilities across 10 key languages. It supports Multi-Step Tool use and offers open weights for research. Anthropic introduced tool use in beta for Claude, supporting over 250 tools with new cookbooks for practical applications. OpenAI enhanced its fine-tuning API with new upgrades and case studies from Indeed, SK Telecom, and Harvey, promoting DIY fine-tuning and custom model training. Microsoft achieved a quantum computing breakthrough with an 800x error rate improvement and the most usable qubits to date. Stability AI released Stable Audio 2.0, improving audio generation quality and control. The Opera browser added local inference support for large language models like Meta's Llama, Google's Gemma, and Vicuna. Discussions on Reddit highlighted Gemini's large context window, analysis of GPT-3.5-Turbo model size, and a battle simulation between Claude 3 and ChatGPT using local 7B models like Mistral and Gemma.
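Tool use in the Anthropic API follows the shape below (a hedged sketch; the beta originally required an `anthropic-beta: tools-2024-04-04` header, and the weather tool here is invented): tools are declared with JSON schemas, and Claude returns a `tool_use` content block when it wants one called.

```python
import anthropic

client = anthropic.Anthropic()
tools = [{
    "name": "get_weather",
    "description": "Get the current weather for a city.",
    "input_schema": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}]

resp = client.messages.create(
    model="claude-3-opus-20240229",
    max_tokens=512,
    tools=tools,
    messages=[{"role": "user", "content": "What's the weather in Paris?"}],
)
for block in resp.content:
    if block.type == "tool_use":
        # The caller runs the tool and sends a tool_result message back.
        print(block.name, block.input)  # e.g. get_weather {'city': 'Paris'}
```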
Shipping and Dipping: Inflection + Stability edition
inflection-ai-2.5 stable-diffusion-3 claude-3-haiku claude-3-sonnet claude-3-opus tacticai inflection-ai stability-ai microsoft nvidia google-deepmind anthropic executive-departures gpu-acceleration ai-assistants geometric-deep-learning ai-integration ai-cost-reduction ai-job-displacement ai-healthcare model-release mustafa-suleyman
Inflection AI and Stability AI recently shipped major updates (Inflection AI 2.5 and Stable Diffusion 3) but are now experiencing significant executive departures, signaling potential consolidation in the GPU-rich startup space. Mustafa Suleyman has joined Microsoft AI as CEO, overseeing consumer AI products like Copilot, Bing, and Edge. Microsoft Azure is collaborating with NVIDIA on the Grace Blackwell 200 Superchip. Google DeepMind announced TacticAI, an AI assistant for football tactics developed with Liverpool FC, using geometric deep learning and achieving 90% expert approval in blind tests. Anthropic released Claude 3 Haiku and Claude 3 Sonnet on Google Cloud's Vertex AI, with Claude 3 Opus coming soon. Concerns about AI job displacement arise as NVIDIA introduces AI nurses that outperform humans at bedside manner at 90% lower cost.
One Year of Latent Space
gemini-1.5 gemma-7b mistral-next opus-v1 orca-2-13b nous-hermes-2-dpo-7b google-deepmind nous-research mistral-ai hugging-face nvidia langchain jetbrains ai-ethics bias-mitigation fine-tuning performance-optimization model-merging knowledge-transfer text-to-3d ai-hallucination hardware-optimization application-development vulnerability-research jim-keller richard-socher
Latent Space podcast celebrated its first anniversary, reaching #1 among AI Engineering podcasts and 1 million unique readers on Substack. The Gemini 1.5 image generator by Google DeepMind sparked controversy over bias and inaccurate representation, leading to community debates on AI ethics. Discussions in the TheBloke and LM Studio Discords highlighted AI's growing role in creative industries, especially game development and text-to-3D tools. Fine-tuning and performance optimization of models like Gemma 7B and Mistral-next were explored in the Nous Research AI and Mistral Discords, with shared solutions including learning rates and open-source tools. Emerging trends in AI hardware and application development were discussed in the CUDA MODE and LangChain AI Discords, including critiques of Nvidia's CUDA by Jim Keller and advances in reducing AI hallucinations hinted at by Richard Socher.
Sora pushes SOTA
gemini-1.5 sora h20-gpt mistral-7b llama-13b mistralcasualml mixtral-instruct yi-models openai google-deepmind nvidia mistral-ai h2oai multimodality gpu-power-management long-context model-merging fine-tuning retrieval-augmented-generation role-play-model-optimization cross-language-integration training-loss synthetic-data-generation coding-support
Analysis of over 20 Discord guilds, 312 channels, and 10,550 messages reveals intense discussion of AI developments. Key highlights include the Dungeon Master AI assistant for Dungeons and Dragons using models like H20 GPT, GPU power-supply debates involving 3090 and 3060 GPUs, and excitement around Google's Gemini 1.5 with its 1 million token context window and OpenAI's Sora model. Challenges with large world models (LWM) multimodality, GPT-assisted coding, and role-play model optimization with Yi models and Mixtral Instruct were discussed. Technical issues like model merging errors with MistralCasualML, fine-tuning scripts like AutoFineTune, and cross-language engineering via JSPyBridge were also prominent. NVIDIA's Chat with RTX feature leveraging retrieval-augmented generation (RAG) on 30+ series GPUs was compared to LM Studio's support for Mistral 7b and Llama 13b models. The community is cautiously optimistic about these frontier models' applications in media and coding.
12/26/2023: not much happened today
llava exllama2 meta-ai-fair google-deepmind gpu-offloading vram-utilization model-conversion moe-models multimodality model-performance hardware-configuration model-saving chatml installation-issues music-generation
LM Studio users extensively discussed its performance, installation issues on macOS, and upcoming features like Exllama2 support and multimodality with the Llava model. Conversations covered GPU offloading, vRAM utilization, MoE model expert selection, and model conversion compatibility. The community also addressed inefficient help requests referencing the blog 'Don't Ask to Ask, Just Ask'. Technical challenges with ChromaDB Plugin, server vs desktop hardware performance, and saving model states with Autogen were highlighted. Discussions included comparisons with other chatbots and mentions of AudioCraft from meta-ai-fair and MusicLM from google-deepmind for music generation.
12/18/2023: Gaslighting Mistral for fun and profit
gpt-4-turbo gpt-3.5-turbo claude-2.1 claude-instant-1 gemini-pro gpt-4.5 dalle-3 openai anthropic google-deepmind prompt-engineering api model-performance ethics role-play user-experience ai-impact-on-jobs ai-translation technical-issues sam-altman
OpenAI Discord discussions reveal comparisons among language models including GPT-4 Turbo, GPT-3.5 Turbo, Claude 2.1, Claude Instant 1, and Gemini Pro, with GPT-4 Turbo noted for user-centric explanations. Rumors about GPT-4.5 remain unconfirmed, with skepticism prevailing until official announcements. Users discuss technical challenges like slow responses and API issues, and explore role-play prompt techniques to enhance model performance. Ethical concerns about AI's impact on academia and employment are debated. Future features for DALL-E 3 and a proposed new GPT model are speculated upon, while a school project seeks help using the OpenAI API. The community also touches on AI glasses and the job-market implications of AI adoption.
12/16/2023: ByteDance suspended by OpenAI
claude-2.1 gpt-4-turbo gemini-1.5-pro gpt-5 gpt-4.5 gpt-4 openai google-deepmind anthropic hardware gpu api-costs coding model-comparison subscription-issues payment-processing feature-confidentiality ai-art-generation organizational-productivity model-speculation
The OpenAI Discord community discussed hardware options like Mac racks and the A6000 GPU, highlighting their value for AI workloads. They compared Claude 2.1 and GPT-4 Turbo on coding tasks, with GPT-4 Turbo outperforming Claude 2.1. The benefits of the Bard API for Gemini Pro were noted, including a free quota of 60 queries per minute. Users shared experiences with ChatGPT Plus membership issues, payment problems, and speculated about the upcoming GPT-5 and the rumored GPT-4.5. Discussions also covered the confidentiality of the Alpha feature, AI art generation policies, and improvements in organizational work features. The community expressed mixed feelings about GPT-4's performance and awaited future model updates.