All tags

Topic: "agent-evaluation"

    GPT 5.1 in ChatGPT: No evals, but adaptive thinking and instruction following
    Claude Haiku 4.5
    not much happened today
    not much happened today
    >$41B raised today (OpenAI @ 300b, Cursor @ 9.5b, Etched @ 1.5b)