All tags

Topic: "post-training"

    not much happened today
    DeepSeek-R1-0528 - Gemini 2.5 Pro-level model, SOTA Open Weights release
    Execuhires: Tempting The Wrath of Khan
    Apple Intelligence Beta + Segment Anything Model 2
    Qwen 2 beats Llama 3 (and we don't know how)