All tags

Model: "llama-3-120b"

    Quis promptum ipso promptiet?
    DeepSeek-V2 beats Mixtral 8x22B with >160 experts at HALF the cost