Topic: "token-efficiency"

    DeepSeek V3.1: 840B token continued pretrain, beating Claude 4 Sonnet at 11% of its cost
    GLM-4.5: Deeper, Headier, & better than Kimi/Qwen/DeepSeek (SOTA China LLM?)
    Reasoning Price War 2: Mistral Magistral + o3's 80% price cut + o3-pro
    lots of small launches