All tags

Topic: "rmsnorm"

    Qwen3-Next-80B-A3B-Base: Towards Ultimate Training & Inference Efficiency
    12/23/2023: NeurIPS Best Papers of 2023