All tags

Model: "stripedhyena-2"

    The Ultra-Scale Playbook: Training LLMs on GPU Clusters