Person: "ml_angelopoulos"
not much happened today
brain2qwerty-v2 glm-5.2 qwen dspark deepseek-v4-flash deepseek-v4-pro meta-ai-fair cursor deepseek cognition arena brain-computer-interfaces non-invasive-bci real-time-decoding speculative-decoding agent-assisted-research inference-systems cost-efficiency remote-agents training-data model-access infrastructure-strategy jeanremiking kimmonismus ml_angelopoulos
Meta announced Brain2Qwerty v2, a real-time, non-invasive brain-to-text decoder that achieves up to 78% word accuracy, and released its training code and dataset. Cursor launched Cursor for iOS with remote AI agents and live activity features. Open-weight model access is being commercialized through a $9.99/mo pass covering models like GLM 5.2 and Qwen, while Cognition introduced Devin Fusion for cost-efficient coding. Arena reached a $100M ARR run rate eight months after launch, with a focus on agent evaluation. Infrastructure challenges, especially in China, remain critical. DeepSeek's DSpark advances speculative decoding with significant gains over prior methods and is deployed in DeepSeek-V4-Flash and V4-Pro.
not much happened today
kimi-k2 qwen3-next nemotron-nano-2 granite-4.0 gpt-4.5 copilot codex vllm perplexity-ai ibm anthropic graphiti claude cursor-ai microsoft mixture-of-experts model-integration cloud-computing hybrid-models benchmarking agent-systems memory-persistence semantic-search code-retrieval context-length-optimization tool-use evaluation-frameworks software-development scaling01 cedric_chee aravsrinivas omarsar0 _avichawla pierceboggan jo_parkhurst jyangballin ofirpress ml_angelopoulos
Kimi-K2 Reasoner, a 1.2-trillion-parameter MoE model, has been integrated into vLLM and will soon be supported by SGLang. Perplexity AI released research on cloud-portable trillion-parameter MoE kernels optimized for AWS EFA, with potential integration into vLLM. IBM's vLLM team formalized support for hybrid dense and sparse expert models, including Qwen3-Next, Nemotron Nano 2, and Granite 4.0. Kimi-K2 reportedly scores 77% on GPQA Diamond versus 71.4% for GPT-4.5, though this result is unverified.
Anthropic published a guide to building efficient tool-heavy agent systems with MCP patterns, reporting a reduction in context tokens of ~98.7%. Graphiti MCP demonstrated persistent agent memory shared across apps like Claude Desktop and Cursor. VS Code introduced an "Agent sessions" feature that unifies agent management, including Copilot and Codex. Cursor AI improved coding accuracy via semantic search and code retrieval embeddings. New evaluation frameworks like CodeClash and LMArena assess agent and coding model performance through realistic multi-round tasks and occupation-tagged leaderboards.