Topic: "alignment"

    QwQ-32B claims to match DeepSeek R1-671B
    X.ai Grok 3 and Mira Murati's Thinking Machines
    o3 solves AIME, GPQA, Codeforces, makes 11 years of progress in ARC-AGI and 25% in FrontierMath
    Meta Llama 3.3: 405B/Nova Pro performance at 70B price
    Gemini (Experimental-1114) retakes #1 LLM rank with 1344 Elo
    Not much technical happened today
    OpenAI's Instruction Hierarchy for the LLM OS
    Claude 3 is officially America's Next Top Model
    CodeLLama 70B beats GPT4 on HumanEval
    12/8/2023 - Mamba v Mistral v Hyena