All tags

Person: "thom-wolf"

    The Ultra-Scale Playbook: Training LLMs on GPU Clusters