All tags

Model: "o1"

    OAI and GDM announce IMO Gold-level results with natural language reasoning, no specialized training or tools, under human time limits
    Qwen 3: 0.6B to 235B MoE full+base models that beat R1 and o1
    not much happened today
    Google's Agent2Agent Protocol (A2A)
    DeepCoder: A Fully Open-Source 14B Coder at O3-mini Level
    not much happened today
    not much happened today
    X.ai Grok 3 and Mira Murati's Thinking Machines
    Reasoning Models are Near-Superhuman Coders (OpenAI IOI, Nvidia Kernels)
    not much happened today
    o3-mini launches, OpenAI on "wrong side of history"
    DeepSeek #1 on US App Store, Nvidia stock tanks -17%
    TinyZero: Reproduce DeepSeek R1-Zero for $30
    OpenAI launches Operator, its first Agent
    not much happened today
    Moondream 2025.1.9: Structured Text, Enhanced OCR, Gaze Detection in a 2B Model
    not much happened to end the year
    not much happened this weekend
    o3 solves AIME, GPQA, Codeforces, makes 11 years of progress in ARC-AGI and 25% in FrontierMath
    ModernBert: small new Retriever/Classifier workhorse, 8k context, 2T tokens,
    Genesis: Generative Physics Engine for Robotics (o1-mini version)
    Genesis: Generative Physics Engine for Robotics (o1-2024-12-17)
    o1 API, 4o/4o-mini in Realtime API + WebRTC, DPO Finetuning
    OpenAI Sora Turbo and Sora.com
    $200 ChatGPT Pro and o1-full/pro, with vision, without API, and mixed reviews
    not much happened to end the week
    FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI
    a calm before the storm
    not much happened today
    nothing much happened today
    a quiet weekend
    o1: OpenAI's new general reasoning models