All tags
Model: "deepseek-r1t2"
not much happened today
veo-3 deepseek-r1t2 deepseek-tng-r1t2-chimera o3-deep-research o4-mini-deep-research deepswe-agent safe-superintelligence-inc perplexity-ai meta-ai-fair midjourney sakana-ai cohere google-deepmind deepseek openai together-ai video-generation assembly-of-experts model-licenses api-pricing research-roles product-expansion corporate-leadership model-release team-expansion ilya_sutskever daniel_levy daniel_gross aravsrinivas zeyuanallenzhu nat_friedman davidsholz fp_champagne demishassabis reach_vb
Ilya Sutskever confirmed his role as CEO of Safe Superintelligence Inc. (SSI) with Daniel Levy as President, dismissing acquisition rumors and emphasizing their strong team and compute resources. Perplexity AI expanded its data integrations by adding Morningstar's financial research and hinted at new product features for Pro users. Meta AI FAIR clarified its research structure, distinguishing its small lab from larger model training groups, and welcomed Nat Friedman to enhance AI product development. Midjourney and Sakana AI announced hiring for research and applied engineering roles. Cohere expanded its presence in Montréal, receiving praise from Canadian officials. On the model front, Google DeepMind's Gemini Pro released the Veo 3 video generation model globally. DeepSeek launched the faster DeepSeek R1T2 model using an Assembly of Experts approach, available under an MIT license. Kling AI showcased cinematic video generation capabilities. OpenAI introduced a high-cost Deep Research API with pricing up to $30 per call. Together AI announced the release of the DeepSWE agent.
not much happened today
gemma-3n glm-4.1v-thinking deepseek-r1t2 mini-max-m1 o3 claude-4-opus claude-sonnet moe-72b meta scale-ai unslothai zhipu-ai deepseek huawei minimax-ai allenai sakana-ai-labs openai model-performance vision conv2d float16 training-loss open-source model-benchmarks moe load-balancing scientific-literature-evaluation code-generation adaptive-tree-search synthesis-benchmarks alexandr_wang natfriedman steph_palazzolo thegregyang teortaxes_tex denny_zhou agihippo danielhanchen osanseviero reach_vb scaling01 ndea
Meta has hired Scale AI CEO Alexandr Wang as its new Chief AI Officer, acquiring a 49% non-voting stake in Scale AI for $14.3 billion, doubling its valuation to ~$28 billion. This move is part of a major talent shuffle involving Meta, OpenAI, and Scale AI. Discussions include the impact on Yann LeCun's influence at Meta and potential responses from OpenAI. In model news, Gemma 3N faces technical issues like vision NaNs and FP16 overflows, with fixes from UnslothAI. Chinese open-source models like GLM-4.1V-Thinking by Zhipu AI and DeepSeek R1T2 show strong performance and speed improvements. Huawei open-sourced a 72B MoE model with a novel load balancing solution. The MiniMax-M1 hybrid MoE model leads math benchmarks on the Text Arena leaderboard. AllenAI launched SciArena for scientific literature evaluation, where o3 outperforms others. Research from Sakana AI Labs introduces AB-MCTS for code generation, improving synthesis benchmarks.