All tags

Topic: "model-release"

    Execuhires Round 2: Scale-Meta, Lamini-AMD, and Instacart-OpenAI
    Gemini 2.5 Pro Preview 05-06 (I/O edition) - the SOTA vision+coding model
    not much happened today
    LlamaCon: Meta AI gets into the Llama API platform business
    Qwen 3: 0.6B to 235B MoE full+base models that beat R1 and o1
    QwQ-32B claims to match DeepSeek R1-671B
    not much happened today
    Google's Agent2Agent Protocol (A2A)
    Llama 4's Controversial Weekend Release
    not much happened today
    Gemini 2.5 Pro + 4o Native Image Gen
    Promptable Prosody, SOTA ASR, and Semantic VAD: OpenAI revamps Voice AI
    Every 7 Months: The Moore's Law for Agent Autonomy
    not much happened today
    $200 ChatGPT Pro and o1-full/pro, with vision, without API, and mixed reviews
    not much happened today
    LMSys killed Model Versioning (gpt 4o 1120, gemini exp 1121)
    DeepSeek-R1 claims to beat o1-preview AND will be open sourced
    State of AI 2024
    Llama 3.2: On-device 1B/3B, and Multimodal 11B/90B (with AI2 Molmo kicker)
    Pixtral 12B: Mistral beats Llama to Multimodality
    Mini, Nemo, Turbo, Lite - Smol models go brrr (GPT4o version)
    Claude Crushes Code - 92% HumanEval and Claude.ai Artifacts
    Gemini launches context caching... or does it?
    Qwen 2 beats Llama 3 (and we don't know how)
    Google I/O in 60 seconds
    Snowflake Arctic: Fully Open 10B+128x4B Dense-MoE Hybrid LLM
    Meta Llama 3 (8B, 70B)
    Mixtral 8x22B Instruct sparks efficiency memes
    Shipping and Dipping: Inflection + Stability edition
    Grok-1 in Bio
    1/10/2024: All the best papers for AI Engineers