All tags
Topic: "open-sourcing"
not much happened today
claude-fable-5 nanogpt anthropic recursive-si nvidia model-governance model-transparency benchmarking automated-research optimization open-sourcing model-behavior cost-efficiency richard_socher
Anthropic reversed its covert degradation policy on Claude Fable 5 after public backlash, sparking debates on governance, transparency, and access to frontier AI models. The model shows strong capabilities with mixed benchmark results, including 87.8% on WeirdML and top ranking on FrontierSWE, but practical usage highlights cost and inconsistent behavior. Separately, Recursive SI, led by Richard Socher, released an automated open-ended discovery system achieving state-of-the-art results on NVIDIA SOL-ExecBench, NanoGPT Speedrun, and NanoChat autoresearch, with open-sourced discoveries and improved efficiency metrics.
Anthropic releases Claude 4 Sonnet and Opus: Memory, Agent Capabilities, Claude Code, Redteam Drama
claude-4 claude-4-opus claude-4-sonnet claude-3.5-sonnet anthropic instruction-following token-accounting pricing-models sliding-window-attention inference-techniques open-sourcing model-accessibility agent-capabilities-api extended-context model-deployment
Anthropic has officially released Claude 4 with two variants: Claude Opus 4, a high-capability model for complex tasks priced at $15/$75 per million tokens, and Claude Sonnet 4, optimized for efficient everyday use. The release emphasizes instruction following and extended work sessions up to 7 hours. Community discussions highlight concerns about token pricing, token accounting transparency, and calls for open-sourcing Claude 3.5 Sonnet weights to support local model development. The news also covers Claude Code GA, new Agent Capabilities API, and various livestreams and reports detailing these updates. There is notable debate around sliding window attention and advanced inference techniques for local deployment.