All tags

Model: "deepseek-r1"

    not much happened today
    Qwen 3: 0.6B to 235B MoE full+base models that beat R1 and o1
    QwQ-32B claims to match DeepSeek R1-671B
    Google's Agent2Agent Protocol (A2A)
    OpenAI adopts MCP
    not much happened today
    not much happened today
    not much happened today
    Anthropic's $61.5B Series E
    lots of small launches
    not much happened today
    AI Engineer Summit Day 1
    not much happened today
    X.ai Grok 3 and Mira Murati's Thinking Machines
    not much happened today
    Reasoning Models are Near-Superhuman Coders (OpenAI IOI, Nvidia Kernels)
    not much happened today
    not much happened today
    Gemini 2.0 Flash GA, with new Flash Lite, 2.0 Pro, and Flash Thinking
    o3-mini launches, OpenAI on "wrong side of history"
    not much happened today
    not much happened today
    DeepSeek #1 on US App Store, Nvidia stock tanks -17%
    TinyZero: Reproduce DeepSeek R1-Zero for $30
    OpenAI launches Operator, its first Agent
    Project Stargate: $500b datacenter (1.7% of US GDP) and Gemini 2 Flash Thinking 2
    DeepSeek R1: o1-level open weights model and a simple recipe for upgrading 1.5B models to Sonnet/4o level
    not much happened to end the week
    Qwen with Questions: 32B open weights reasoning model nears o1 in GPQA/AIME/Math500
    LMSys killed Model Versioning (gpt 4o 1120, gemini exp 1121)