All tags

Model: "r1-1776"

    not much happened today
    The Ultra-Scale Playbook: Training LLMs on GPU Clusters