Model: "octopus-v2"

    Mixture-of-Depths: Dynamically allocating compute in transformer-based language models