All tags

Topic: "int8-precision"

    Shazeer et al (2024): you are overpaying for inference >13x