Company: "microsoft"

not much happened today

xAI Grok 4.1: #1 in Text Arena, #1 in EQ-bench, and better Creative Writing

minor updates to GPT 5.1 and SIMA 2

not much happened today

not much happened today

OpenAI completes Microsoft + For-profit restructuring + announces 2028 AI Researcher timeline + Platform / AI cloud product direction + next $1T of compute

not much happened today

not much happened today

Claude Agent Skills - glorified AGENTS.md? or MCP killer?

not much happened today

Gemini 2.5 Computer Use preview beats Sonnet 4.5 and OAI CUA

not much happened today

not much happened today

Oracle jumps +36% in a day after winning $300B OpenAI contract

not much happened today

OpenAI Realtime API GA and new `gpt-realtime` model, 20% cheaper than 4o

not much happened today

DeepSeek V3.1: 840B token continued pretrain, beating Claude 4 Sonnet at 11% of its cost

not much happened today

OpenAI rolls out GPT-5 and GPT-5 Thinking to >1B users worldwide; -mini and -nano help claim Pareto Frontier

not much happened today

not much happened today

not much happened today

Chinese Models Launch - MiniMax-M1, Hailuo 2 "Kangaroo", Moonshot Kimi-Dev-72B

Cognition vs Anthropic: Don't Build Multi-Agents/How to Build Multi-Agents

not much happened today

Cursor @ $9b, OpenAI Buys Windsurf @ $3b

not much happened today

not much happened today

Every 7 Months: The Moore's Law for Agent Autonomy

not much happened today

GPT 4.5 — Chonky Orion ships!

The Ultra-Scale Playbook: Training LLMs on GPU Clusters

Project Stargate: $500b datacenter (1.7% of US GDP) and Gemini 2 Flash Thinking 2

not much happened today

Meta BLT: Tokenizer-free, Byte-level LLM

FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI

not much happened today

OpenAI beats Anthropic to releasing Speculative Decoding

not much happened this weekend

not much happened today

DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing

Not much (in AI) happened this weekend

a calm before the storm

not much happened today

a quiet weekend

Ideogram 2 + Berkeley Function Calling Leaderboard V2

not much happened today

Execuhires: Tempting The Wrath of Khan

Nothing much happened today

Not much happened today.

Talaria: Apple's new MLOps Superweapon

ALL of AI Engineering in One Place

Anthropic's "LLM Genome Project": learning & clamping 34m features on Claude Sonnet

OpenAI's PR Campaign?

Kolmogorov-Arnold Networks: MLP killers or just spicy MLPs?

DeepSeek-V2 beats Mixtral 8x22B with >160 experts at HALF the cost

Evals: The Next Generation

Not much happened today

A quiet weekend

OpenAI's Instruction Hierarchy for the LLM OS

Llama-3-70b is GPT-4-level Open Model

Meta Llama 3 (8B, 70B)

Mixtral 8x22B Instruct sparks efficiency memes

Multi-modal, Multi-Aspect, Multi-Form-Factor AI

Cohere Command R+, Anthropic Claude Tool Use, OpenAI Finetuning

not much happened today

Shipping and Dipping: Inflection + Stability edition

Stable Diffusion 3 — Rombach & Esser did it again!

GPT4Turbo A/B Test: gpt-4-1106-preview

1/17/2024: Help crowdsource function calling datasets

1/1/2024: How to start with Open Source AI

12/22/2023: Anyscale's Benchmark Criticisms

12/21/2023: The State of AI (according to LangChain)

12/13/2023 SOLAR10.7B upstages Mistral7B?

12/12/2023: Towards LangChain 0.1