All tags

Model: "baichuan-m1-14b"

    The Ultra-Scale Playbook: Training LLMs on GPU Clusters