All tags

Model: "computerrl"

    DeepSeek V3.1: 840B token continued pretrain, beating Claude 4 Sonnet at 11% of its cost