Topic: "sparse-moe"

Mistral 3: Mistral Large 3 + Ministral 3B/8B/14B open weights models

MiniMax M2 230BA10B — 8% of Claude Sonnet's price, ~2x faster, new SOTA open model

Qwen 1.5 Released