All tags

Topic: "inference-speed"

    not much happened today
    AI Engineer Summit Day 1
    OpenAI takes on Gemini's Deep Research
    o3 solves AIME, GPQA, Codeforces, makes 11 years of progress in ARC-AGI and 25% in FrontierMath
    not much happened today
    s{imple|table|calable} Consistency Models
    not much happened today + AINews Podcast?
    Cerebras Inference: Faster, Better, AND Cheaper
    Llama 3.1: The Synthetic Data Model
    Mozilla's AI Second Act
    Nemotron-4-340B: NVIDIA's new large open models, built on syndata, great for syndata
    Qwen 2 beats Llama 3 (and we don't know how)
    Contextual Position Encoding (CoPE)
    Snowflake Arctic: Fully Open 10B+128x4B Dense-MoE Hybrid LLM
    Music's Dall-E moment
    Gemini Pro and GPT4T Vision go GA on the same day by complete coincidence
    Andrew likes Agents
    12/27/2023: NYT vs OpenAI