All tags

Topic: "model-sharding"

    FSDP+QLoRA: the Answer to 70b-scale AI for desktop class GPUs