All tags

Topic: "reasoning"

    Execuhires Round 2: Scale-Meta, Lamini-AMD, and Instacart-OpenAI
    Reasoning Price War 2: Mistral Magistral + o3's 80% price cut + o3-pro
    Apple exposes Foundation Models API and... no new Siri
    Gemini 2.5 Pro (06-05) launched at AI Engineer World's Fair
    not much happened today
    DeepSeek-R1-0528 - Gemini 2.5 Pro-level model, SOTA Open Weights release
    not much happened today
    OpenAI buys Jony Ive's io for $6.5b, LMArena lands $100m seed from a16z
    Google I/O: new Gemini native voice, Flash, DeepThink, AI Mode (DeepSearch+Mariner+Astra)
    not much happened today
    ChatGPT Codex, OpenAI's first cloud SWE agent
    Granola launches team notes, while Notion launches meeting transcription
    not much happened today
    not much happened today
    not much happened today
    Gemini 2.5 Pro Preview 05-06 (I/O edition) - the SOTA vision+coding model
    Cursor @ $9b, OpenAI Buys Windsurf @ $3b
    not much happened today
    not much happened today; New email provider for AINews
    Gemini 2.5 Flash completes the total domination of the Pareto Frontier
    QwQ-32B claims to match DeepSeek R1-671B
    not much happened today
    OpenAI adopts MCP
    Gemini 2.5 Pro + 4o Native Image Gen
    lots of little things happened this week
    not much happened today
    not much happened today
    not much happened today
    X.ai Grok 3 and Mira Murati's Thinking Machines
    not much happened today
    not much happened today
    not much happened today
    s1: Simple test-time scaling (and Kyutai Hibiki)
    OpenAI takes on Gemini's Deep Research
    o3-mini launches, OpenAI on "wrong side of history"
    OpenAI launches Operator, its first Agent
    Bespoke-Stratos + Sky-T1: The Vicuna+Alpaca moment for reasoning
    DeepSeek R1: o1-level open weights model and a simple recipe for upgrading 1.5B models to Sonnet/4o level
    not much happened today
    Moondream 2025.1.9: Structured Text, Enhanced OCR, Gaze Detection in a 2B Model
    not much happened today
    not much happened today
    not much happened to end the year
    not much happened today
    o3 solves AIME, GPQA, Codeforces, makes 11 years of progress in ARC-AGI and 25% in FrontierMath
    ModernBert: small new Retriever/Classifier workhorse, 8k context, 2T tokens,
    Genesis: Generative Physics Engine for Robotics (o1-mini version)
    o1 API, 4o/4o-mini in Realtime API + WebRTC, DPO Finetuning
    Meta Apollo - Video Understanding up to 1 hour, SOTA Open Weights
    not much happened today
    not much happened to end the week
    LMSys killed Model Versioning (gpt 4o 1120, gemini exp 1121)
    DeepSeek-R1 claims to beat o1-preview AND will be open sourced
    not much happened today
    a quiet weekend
    Learnings from o1 AMA
    Pixtral 12B: Mistral beats Llama to Multimodality
    Apple Intelligence Beta + Segment Anything Model 2
    Mistral Large 2 + RIP Mistral 7B, 8x7B, 8x22B
    Llama 3.1 Leaks: big bumps to 8B, minor bumps to 70b, and SOTA OSS 405b model
    Problems with MMLU-Pro
    Mozilla's AI Second Act
    There's Ilya!
    The Last Hurrah of Stable Diffusion?
    FineWeb: 15T Tokens, 12 years of CommonCrawl (deduped and filtered, you're welcome)
    Gemini Pro and GPT4T Vision go GA on the same day by complete coincidence