Topic: "model-release"

not much happened today

NVIDIA Nemotron 3: hybrid Mamba-Transformer completely open source models from 30B to 500B

not much happened today

Gemini 3 Pro — new GDM frontier model 6, Gemini 3 Deep Think, and Antigravity IDE

not much happened today

OpenAI Dev Day: Apps SDK, AgentKit, Codex GA, GPT‑5 Pro and Sora 2 APIs

Alibaba Yunqi: 7 models released in 4 days (Qwen3-Max, Qwen3-Omni, Qwen3-VL) and $52B roadmap

not much happened today

DeepSeek V3.1: 840B token continued pretrain, beating Claude 4 Sonnet at 11% of its cost

Databricks' $100B Series K

Western Open Models get Funding: Cohere $500m @ 6.8B, AI2 gets $152m NSF+NVIDIA grants

Kimi K2 - SOTA Open MoE proves that Muon can scale to 15T tokens/1T params

not much happened today

not much happened today

not much happened today

Bartz v. Anthropic PBC — "Training use is Fair Use"

Zuck goes Superintelligence Founder Mode: $100M bonuses + $100M+ salaries + NFDG Buyout?

Execuhires Round 2: Scale-Meta, Lamini-AMD, and Instacart-OpenAI

Gemini 2.5 Pro Preview 05-06 (I/O edition) - the SOTA vision+coding model

not much happened today

LlamaCon: Meta AI gets into the Llama API platform business

Qwen 3: 0.6B to 235B MoE full+base models that beat R1 and o1

QwQ-32B claims to match DeepSeek R1-671B

not much happened today

Google's Agent2Agent Protocol (A2A)

Llama 4's Controversial Weekend Release

not much happened today

Gemini 2.5 Pro + 4o Native Image Gen

Promptable Prosody, SOTA ASR, and Semantic VAD: OpenAI revamps Voice AI

Every 7 Months: The Moore's Law for Agent Autonomy

not much happened today

$200 ChatGPT Pro and o1-full/pro, with vision, without API, and mixed reviews

not much happened today

LMSys killed Model Versioning (gpt 4o 1120, gemini exp 1121)

DeepSeek-R1 claims to beat o1-preview AND will be open sourced

State of AI 2024

Llama 3.2: On-device 1B/3B, and Multimodal 11B/90B (with AI2 Molmo kicker)

Pixtral 12B: Mistral beats Llama to Multimodality

Mini, Nemo, Turbo, Lite - Smol models go brrr (GPT4o version)

Claude Crushes Code - 92% HumanEval and Claude.ai Artifacts

Gemini launches context caching... or does it?

Qwen 2 beats Llama 3 (and we don't know how)

Google I/O in 60 seconds

Snowflake Arctic: Fully Open 10B+128x4B Dense-MoE Hybrid LLM

Meta Llama 3 (8B, 70B)

Mixtral 8x22B Instruct sparks efficiency memes

Shipping and Dipping: Inflection + Stability edition

1/10/2024: All the best papers for AI Engineers