All tags

Topic: "kv-cache"

    Pixtral Large (124B) beats Llama 3.2 90B with updated Mistral Large 24.11
    Shazeer et al (2024): you are overpaying for inference >13x