All tags
Person: "martin_casado"
not much happened today
fable-5 mythos anthropic model-performance trust data-retention benchmarking agentic-ai coding policy darioamodei natolambert martin_casado drfeifei antirez clementdelangue deanwball hlntnr _arohan_ dbahdanau gergelyorosz scaling01 dbreunig omarsar0 yacinemtb mchlhess jasonbotterill lvwerra lechmazur kimmonismus walden_yan hrishioa
Anthropic faced backlash for silently degrading AI research capabilities in its Fable/Mythos models without clear disclosure, raising concerns about trust, reproducibility, and enterprise data retention policies. Despite controversy, Fable 5 demonstrated strong benchmark performance, leading in agentic and coding tasks with high scores on Agent Arena, SimpleBench, CADGenBench, and PACT. Dario Amodei published a policy advocating stronger frontier AI oversight amid these tensions.
not much happened today
gpt-5 grok-code-fast-1 claude-sonnet glm-4.5 longcat-flash-chat fastvlm mobileclip2 internvl3.5 openai x-ai zhipu-ai meituan apple model-architecture moe adaptive-compute inference-speed model-training cost-efficiency coding developer-tools open-inference on-device-ai vision gdb martin_casado yanndubs elonmusk cline vikhyatk dzhng quixiai tim_dettmers casper_hansen_ reach_vb eliebakouch teortaxestex youjiacheng
OpenAI integrates GPT-5 into Xcode 26 with improved coding latency, though some UX trade-offs are noted. xAI's Grok Code Fast 1 gains momentum, surpassing Claude Sonnet in usage and praised for fast debugging. Zhipu's GLM-4.5 offers a cost-effective coding plan with strong performance against Claude Sonnet 4. Meituan releases the LongCat-Flash-Chat, a 560B parameter MoE model with adaptive compute and detailed technical insights. Apple debuts on-device vision-language models FastVLM and MobileCLIP2 alongside InternVL3.5.