All tags
Model: "llama-3-400b"
Mini, Nemo, Turbo, Lite - Smol models go brrr (GPT4o version)
gpt-4o-mini mistral-nemo llama-3 llama-3-400b deepseek-v2 openai nvidia mistral-ai togethercompute deepseek-ai lmsys model-quantization context-windows instruction-following model-performance cost-efficiency multimodality benchmarking open-source model-release sam-altman
GPT-4o mini launches at a 99% price reduction relative to text-davinci-003 and roughly 3.5% of GPT-4o's price, while matching Opus-level benchmark scores. It supports 16k output tokens, runs faster than previous models, and will soon support text, image, video, and audio inputs and outputs. Mistral NeMo, a 12B-parameter model built with Nvidia, features a 128k-token context window, an FP8 checkpoint, and strong benchmark performance. Together's Lite and Turbo endpoints offer fp8/int4 quantizations of Llama 3 with up to 4x throughput at significantly reduced cost. DeepSeek V2 is now open-sourced. Upcoming releases include at least five unreleased models, and Llama 4 leaks are circulating ahead of ICML 2024.
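Together has not published its exact quantization recipe here, but as a rough illustration of why int4 weights cut serving costs, below is a minimal symmetric per-tensor int4 quantizer. All names and numbers are illustrative, not Together's implementation:

```python
import numpy as np

def quantize_int4(w: np.ndarray):
    """Symmetric per-tensor int4: map floats to integers in [-8, 7] plus one scale."""
    scale = np.abs(w).max() / 7.0                             # keep +max representable
    q = np.clip(np.round(w / scale), -8, 7).astype(np.int8)   # int4 values stored in int8
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    return q.astype(np.float32) * scale

w = (np.random.randn(4096, 4096) * 0.02).astype(np.float32)   # a fake weight matrix
q, scale = quantize_int4(w)
err = np.abs(w - dequantize(q, scale)).mean()
print(f"mean abs error: {err:.6f}")
print(f"memory: fp32 {w.nbytes / 2**20:.0f} MB -> int4 {w.size * 0.5 / 2**20:.0f} MB")
```

Production stacks typically quantize per-channel or per-group rather than per-tensor and keep activations at higher precision; the memory printout shows the 8x shrink versus fp32 that drives the throughput and cost gains.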
DeepSeek-V2 beats Mixtral 8x22B with >160 experts at HALF the cost
deepseek-v2 llama-3-120b llama-3-400b gpt-4 mistral phi claude gemini mai-1 med-gemini deepseek-ai mistral-ai microsoft openai scale-ai tesla nvidia google-deepmind mixture-of-experts multi-head-attention model-inference benchmarking overfitting robotics teleoperation open-source multimodality hallucination-detection fine-tuning medical-ai model-training erhartford maximelabonne bindureddy adcock_brett drjimfan clementdelangue omarsar0 rohanpaul_ai
DeepSeek-V2 introduces a new state-of-the-art MoE model with 236B total parameters (21B active per token) and a novel Multi-head Latent Attention (MLA) mechanism, achieving faster inference and surpassing GPT-4 on AlignBench. Llama 3 120B shows strong creative-writing ability, while Microsoft is reportedly developing a 500B-parameter LLM called MAI-1. Research from Scale AI finds benchmark overfitting in models like Mistral and Phi, whereas GPT-4, Claude, Gemini, and Llama remain robust. In robotics, Tesla's Optimus advances with superior data collection and teleoperation, Hugging Face's LeRobot marks a move toward open-source robotics AI, and Nvidia's DrEureka uses LLMs to automate robot skill training. A new survey catalogs multimodal LLM hallucinations and mitigation strategies, and Google's Med-Gemini achieves SOTA on medical benchmarks with fine-tuned multimodal models.
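The core idea of MLA is to cache one small low-rank latent per token and reconstruct keys and values from it on the fly, instead of caching full per-head K/V. A minimal PyTorch sketch of that caching idea follows; dimensions are illustrative, and the paper's RoPE decoupling and other details are omitted:

```python
import torch
import torch.nn as nn

class MLASketch(nn.Module):
    """Simplified Multi-head Latent Attention: cache one small latent per token
    instead of full per-head K/V (RoPE decoupling from the paper is omitted)."""
    def __init__(self, d_model=512, n_heads=8, d_latent=64):
        super().__init__()
        self.n_heads, self.d_head = n_heads, d_model // n_heads
        self.w_q = nn.Linear(d_model, d_model, bias=False)
        self.w_down_kv = nn.Linear(d_model, d_latent, bias=False)  # compress -> cached
        self.w_up_k = nn.Linear(d_latent, d_model, bias=False)     # expand at attention time
        self.w_up_v = nn.Linear(d_latent, d_model, bias=False)
        self.w_o = nn.Linear(d_model, d_model, bias=False)

    def forward(self, x, latent_cache=None):
        b, t, d = x.shape
        latent = self.w_down_kv(x)                    # (b, t, d_latent)
        if latent_cache is not None:                  # decode step: append to cache
            latent = torch.cat([latent_cache, latent], dim=1)
        split = lambda z: z.view(b, -1, self.n_heads, self.d_head).transpose(1, 2)
        q = split(self.w_q(x))
        k, v = split(self.w_up_k(latent)), split(self.w_up_v(latent))
        scores = q @ k.transpose(-2, -1) / self.d_head ** 0.5
        # causal mask: query i may attend to all cached positions plus new keys up to i
        total = k.size(-2)
        mask = torch.ones(t, total, dtype=torch.bool).tril(diagonal=total - t)
        scores = scores.masked_fill(~mask, float("-inf"))
        out = (scores.softmax(-1) @ v).transpose(1, 2).reshape(b, t, d)
        return self.w_o(out), latent                  # the latent is the KV cache

attn = MLASketch()
y, cache = attn(torch.randn(1, 16, 512))              # prefill 16 tokens
y2, cache = attn(torch.randn(1, 1, 512), cache)       # one decode step reuses the cache
```

The cache holds t x d_latent values instead of 2 x n_heads x d_head x t, which is where the claimed inference speedup comes from.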
Meta Llama 3 (8B, 70B)
llama-3-8b llama-3-70b llama-3-400b stable-diffusion-3 mixtral-8x22b-instruct-v0.1 vasa-1 meta-ai-fair stability-ai boston-dynamics microsoft mistral-ai hugging-face transformer tokenization model-training benchmarking robotics natural-language-processing real-time-processing synthetic-data dataset-cleaning behavior-trees ai-safety model-accuracy api model-release humor helen-toner
Meta released the Llama 3 8B and 70B models, with a 400B variant still in training that is touted as potentially the first GPT-4-level open-source model. Stability AI launched the Stable Diffusion 3 API, with model weights to follow, showing realism competitive with Midjourney V6. Boston Dynamics unveiled its electric Atlas humanoid robot, and Microsoft introduced VASA-1, a model that generates lifelike talking faces at 40fps on a single RTX 4090. Mistral AI, a European OpenAI rival, is reportedly seeking funding at a $5B valuation, and its Mixtral-8x22B-Instruct-v0.1 model achieves 100% accuracy on 64K-context benchmarks. AI safety discussions include calls from former OpenAI board member Helen Toner for audits of top AI companies, and the Mormon Church released principles for AI use. New AI development tools include Ctrl-Adapter for diffusion models, Distilabel 1.0.0 for synthetic dataset pipelines, Data Bonsai for data cleaning with LLMs, and Dendron for building LLM agents with behavior trees. Memes round out the issue with AI-development humor and cultural references. Technically, Llama 3 brings improved reasoning, a 128K-token tokenizer vocabulary, training on 8K-token sequences, and grouped query attention (see the sketch below).
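Grouped query attention, listed among the Llama 3 changes, lets several query heads share one key/value head, shrinking the KV cache by the group factor. A minimal PyTorch sketch; the 32-query/8-KV split mirrors the commonly cited Llama 3 8B ratio but is illustrative here:

```python
import torch

def grouped_query_attention(q, k, v, n_kv_heads):
    """q: (b, n_q_heads, t, d); k, v: (b, n_kv_heads, t, d).
    Each KV head serves n_q_heads // n_kv_heads query heads."""
    b, n_q_heads, t, d = q.shape
    group = n_q_heads // n_kv_heads
    # repeat each KV head `group` times so it lines up with its query heads
    k = k.repeat_interleave(group, dim=1)
    v = v.repeat_interleave(group, dim=1)
    scores = q @ k.transpose(-2, -1) / d ** 0.5
    mask = torch.ones(t, t, dtype=torch.bool).tril()   # causal mask
    scores = scores.masked_fill(~mask, float("-inf"))
    return scores.softmax(-1) @ v

# 32 query heads sharing 8 KV heads -> the KV cache is 4x smaller than full MHA
q = torch.randn(1, 32, 16, 128)
k = torch.randn(1, 8, 16, 128)
v = torch.randn(1, 8, 16, 128)
print(grouped_query_attention(q, k, v, n_kv_heads=8).shape)  # (1, 32, 16, 128)
```

Only k and v are stored during generation, so the group factor translates directly into longer feasible contexts or larger batch sizes at the same memory budget.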