All tags

Topic: "direct-preference-optimization"

    Execuhires Round 2: Scale-Meta, Lamini-AMD, and Instacart-OpenAI
    Life after DPO (RewardBench)
    AI gets Memory
    MetaVoice & RIP Bard
    Qwen 1.5 Released
    Adept Fuyu-Heavy: Multimodal model for Agents