Company: "google"

OpenRouter's State of AI - An Empirical 100 Trillion Token Study

not much happened today

Claude Opus 4.5: 3rd new SOTA coding model in past week, 1/3 the price of Opus

Nano Banana Pro (Gemini Image Pro) solves text-in-images, infographic generation, 2-4k resolution, and Google Search grounding

OpenAI fires back: GPT-5.1-Codex-Max (API) and GPT 5.1 Pro (ChatGPT)

Gemini 3 Pro — new GDM frontier model 6, Gemini 3 Deep Think, and Antigravity IDE

Kimi K2 Thinking: 1T-A32B params, SOTA HLE, BrowseComp, TauBench && Soumith leaves Pytorch

not much happened today

ChatGPT Atlas: OpenAI's AI Browser

The Karpathy-Dwarkesh Interview delays AGI timelines

Claude Haiku 4.5

not much happened today

not much happened today

not much happened today

GDPVal finding: Claude Opus 4.1 within 95% of AGI (human experts in top 44 white collar jobs)

not much happened today

OpenAI Realtime API GA and new `gpt-realtime` model, 20% cheaper than 4o

not much happened today

not much happened today

Western Open Models get Funding: Cohere $500m @ 6.8B, AI2 gets $152m NSF+NVIDIA grants

not much happened today

not much happened today

not much happened today

not much happened today

OpenAI releases Deep Research API (o3/o4-mini)

Not much happened today

Gemini 2.5 Pro/Flash GA, 2.5 Flash-Lite in Preview

Execuhires Round 2: Scale-Meta, Lamini-AMD, and Instacart-OpenAI

Gemini 2.5 Pro (06-05) launched at AI Engineer World's Fair

not much happened today

not much happened today

OpenAI buys Jony Ive's io for $6.5b, LMArena lands $100m seed from a16z

Google I/O: new Gemini native voice, Flash, DeepThink, AI Mode (DeepSearch+Mariner+Astra)

not much happened today

ChatGPT Codex, OpenAI's first cloud SWE agent

codex-1 openai-o3 codex-mini gemma-3 blip3-o qwen-2.5 marigold-iid deepseek-v3 lightlab gemini-2.0 lumina-next openai runway salesforce qwen deepseek google google-deepmind j1 software-engineering parallel-processing multimodality diffusion-models depth-estimation scaling-laws reinforcement-learning fine-tuning model-performance multi-turn-conversation reasoning audio-processing sama kevinweil omarsar0 iscienceluvr akhaliq osanseviero c_valenzuelab mervenoyann arankomatsuzaki jasonwei demishassabis philschmid swyx teortaxestex jaseweston

OpenAI launched Codex, a cloud-based software engineering agent powered by codex-1 (an optimized version of OpenAI o3) available in research preview for Pro, Enterprise, and Team ChatGPT users, featuring parallel task execution like refactoring and bug fixing. The Codex CLI was enhanced with quick sign-in and a new low-latency model, codex-mini. Gemma 3 is highlighted as the best open model runnable on a single GPU. Runway released the Gen-4 References API for style transfer in generation. Salesforce introduced BLIP3-o, a unified multimodal model family using diffusion transformers for CLIP image features. The Qwen 2.5 models (1.5B and 3B versions) were integrated into the PocketPal app with various chat templates. Marigold IID, a new state-of-the-art open-source depth estimation model, was released. In research, DeepSeek shared insights on scaling and hardware for DeepSeek-V3. Google unveiled LightLab, a diffusion-based light source control in images. Google DeepMind's AlphaEvolve uses Gemini 2.0 to discover new math and reduce costs without reinforcement learning. Omni-R1 studied audio's role in fine-tuning audio LLMs. Qwen proposed a parallel scaling law inspired by classifier-free guidance. Salesforce released Lumina-Next on the Qwen base, outperforming Janus-Pro. A study found LLM performance degrades in multi-turn conversations due to unreliability. J1 is incentivizing LLM-as-a-Judge thinking via reinforcement learning. A new Qwen study correlates question and strategy similarity to predict reasoning strategies.

Prime Intellect's INTELLECT-2 and PRIME-RL advance distributed reinforcement learning

not much happened today

not much happened today

not much happened today

not much happened today; New email provider for AINews

Gemini 2.5 Flash completes the total domination of the Pareto Frontier

SOTA Video Gen: Veo 2 and Kling 2 are GA for developers

not much happened today

Google's Agent2Agent Protocol (A2A)

not much happened today

not much happened today

not much happened today

not much happened today

not much happened today

Bespoke-Stratos + Sky-T1: The Vicuna+Alpaca moment for reasoning

Titans: Learning to Memorize at Test Time

not much happened today

Genesis: Generative Physics Engine for Robotics (o1-2024-12-17)

o1 API, 4o/4o-mini in Realtime API + WebRTC, DPO Finetuning

OpenAI Sora Turbo and Sora.com

$200 ChatGPT Pro and o1-full/pro, with vision, without API, and mixed reviews

not much happened today

Perplexity starts Shopping for you

not much happened today

The AI Search Wars Have Begun — SearchGPT, Gemini Grounding, and more

DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing

not much happened today

not much happened today

o1 destroys Lmsys Arena, Qwen 2.5, Kyutai Moshi release

not much happened today + AINews Podcast?

Everybody shipped small things this holiday weekend

CogVideoX: Zhipu's Open Source Sora

The DSPy Roadmap

not much happened today

GPT4o August + 100% Structured Outputs for All (GPT4o mini edition)

How Carlini Uses AI

Execuhires: Tempting The Wrath of Khan

FlashAttention 3, PaliGemma, OpenAI's 5 Levels to Superintelligence

Gemini Nano: 50-90% of Gemini Pro, <100ms inference, on device, in Chrome Canary

Gemini launches context caching... or does it?

Talaria: Apple's new MLOps Superweapon

Ways to use Anthropic's Tool Use GA

Ten Commandments for Deploying Fine-Tuned Models

Google I/O in 60 seconds

Apple's OpenELM beats OLMo with 50% of its dataset, using DeLighT

Mixtral 8x22B Instruct sparks efficiency memes

Multi-modal, Multi-Aspect, Multi-Form-Factor AI

Mergestral, Meta MTIAv2, Cohere Rerank 3, Google Infini-Attention

Music's Dall-E moment

Gemini Pro and GPT4T Vision go GA on the same day by complete coincidence

Not much happened today

Claude 3 just destroyed GPT 4 (see for yourself)

... and welcome AI Twitter!

Ring Attention for >1M Context

Google AI: Win some (Gemma, 1.5 Pro), Lose some (Image gen)

Gemini Ultra is out, to mixed reviews

MetaVoice & RIP Bard

1/4/2024: Jeff Bezos backs Perplexity's $520m Series B.

1/3/2024: RIP Coqui

12/24/2023: Dolphin Mixtral 8x7b is wild

12/19/2023: Everybody Loves OpenRouter

12/8/2023 - Mamba v Mistral v Hyena

12/7/2023: Anthropic says "skill issue"

Is Google's Gemini... legit?