All tags

Model: "llama-3.1"

    DeepSeek R1: o1-level open weights model and a simple recipe for upgrading 1.5B models to Sonnet/4o level
    Claude 3.5 Sonnet (New) gets Computer Use
    Did Nvidia's Nemotron 70B train on test?
    Cerebras Inference: Faster, Better, AND Cheaper
    Mistral Large 2 + RIP Mistral 7B, 8x7B, 8x22B