All tags

Model: "deepseek-v3"

    not much happened today
    ChatGPT Codex, OpenAI's first cloud SWE agent
    not much happened today
    not much happened today
    not much happened today
    Gemma 3 beats DeepSeek V3 in Elo, 2.0 Flash beats GPT4o with Native Image Gen
    not much happened today
    lots of small launches
    not much happened today
    Mistral Small 3 24B and Tulu 3 405B
    not much happened today
    not much happened today
    DeepSeek #1 on US App Store, Nvidia stock tanks -17%
    DeepSeek R1: o1-level open weights model and a simple recipe for upgrading 1.5B models to Sonnet/4o level
    not much happened today
    not much happened today
    PRIME: Process Reinforcement through Implicit Rewards
    not much happened to end the year
    not much happened today
    not much happened today
    DeepSeek v3: 671B finegrained MoE trained for $5.5m USD of compute on 15T tokens