All tags

Model: "llama-3-1"

    Gemini 2.0 Flash GA, with new Flash Lite, 2.0 Pro, and Flash Thinking
    not much happened today
    Vision Everywhere: Apple AIMv2 and Jina CLIP v2
    Tencent's Hunyuan-Large claims to beat DeepSeek-V2 and Llama3-405B with LESS Data
    s{imple|table|calable} Consistency Models
    not much happened today
    Llama 3.2: On-device 1B/3B, and Multimodal 11B/90B (with AI2 Molmo kicker)
    o1 destroys Lmsys Arena, Qwen 2.5, Kyutai Moshi release
    Pixtral 12B: Mistral beats Llama to Multimodality
    not much happened today
    CogVideoX: Zhipu's Open Source Sora
    Nvidia Minitron: LLM Pruning and Distillation updated for Llama 3.1
    super quiet day
    not much happened today
    Llama 3.1: The Synthetic Data Model