All tags
Model: "command-a+"
not much happened today
command-a+ claude-3.7-sonnet openai cohere reinforcement-learning reasoning multimodality model-architecture model-optimization model-releases benchmarking long-context model-efficiency transformers wtgowers hongxunwu aidangomez nickfrosst clementdelangue eliebakouch rasbt sama
OpenAI achieved a major math breakthrough by disproving a long-standing ErdΕs unit distance problem using a general-purpose reasoning model, marking a milestone in AI-driven formal science and long-horizon reasoning. The result was validated by prominent mathematicians like Timothy Gowers and OpenAI researcher Hongxun Wu, highlighting the model's advanced reasoning capabilities beyond prior AI math achievements. Meanwhile, Cohere released Command A+ as an open-source Apache 2.0 licensed model, featuring a 218B MoE / 25B active multimodal architecture supporting 48 languages and optimized for low hardware requirements, runnable on as little as 2Γ H100 GPUs. Benchmarks place Command A+ near Claude 4.5 Haiku in intelligence with strong non-hallucination but weaker scientific reasoning and coding. The architecture includes novel elements like a parallel transformer block, shared experts, and LayerNorm over RMSNorm.