All tags

Topic: "benchmarks"

    SOTA Video Gen: Veo 2 and Kling 2 are GA for developers
    GPT 4.1: The New OpenAI Workhorse
    not much happened today
    lots of little things happened this week
    Stripe lets Agents spend money with StripeAgentToolkit
    Claude 3.5 Sonnet (New) gets Computer Use
    not much happened today
    Too Cheap To Meter: AI prices cut 50-70% in last 30 days
    SciCode: HumanEval gets a STEM PhD upgrade
    There's Ilya!
    The Last Hurrah of Stable Diffusion?