Company: "huggingface"

not much happened today

not much happened today

not much happened today

NVIDIA Nemotron 3: hybrid Mamba-Transformer completely open source models from 30B to 500B

not much happened today

Black Forest Labs FLUX.2 [pro|flex|dev|klein]: near-Nano Banana quality but Open Weights

Cursor 2.0 & Composer-1: Fast Models and New Agents UI

MiniMax M2 230BA10B — 8% of Claude Sonnet's price, ~2x faster, new SOTA open model

not much happened today

The Karpathy-Dwarkesh Interview delays AGI timelines

Claude Agent Skills - glorified AGENTS.md? or MCP killer?

not much happened today

not much happened today

not much happened today

not much happened today

not much happened today

Cognition's $10b Series C; Smol AI updates

Kimi K2‑0905 and Qwen3‑Max preview: two 1T open weights models launched

nano-banana is Gemini‑2.5‑Flash‑Image, beating Flux Kontext by 170 Elo with SOTA Consistency, Editing, and Multi-Image Fusion

not much happened today

Cohere Command A Reasoning beats GPT-OSS-120B and DeepSeek R1 0528

DeepSeek V3.1: 840B token continued pretrain, beating Claude 4 Sonnet at 11% of its cost

not much happened today

Figma's $50+b IPO

GLM-4.5: Deeper, Headier, & better than Kimi/Qwen/DeepSeek (SOTA China LLM?)

Voxtral - Mistral's SOTA ASR model in 3B (mini) and 24B ("small") sizes beats OpenAI Whisper large-v3

not much happened today

not much happened today

SmolLM3: the SOTA 3B reasoning open source LLM

OpenAI releases Deep Research API (o3/o4-mini)

minor ai followups: MultiAgents, Meta-SSI-Scale, Karpathy, AI Engineer

Cognition vs Anthropic: Don't Build Multi-Agents/How to Build Multi-Agents

not much happened today

not much happened today

not much happened today

Granola launches team notes, while Notion launches meeting transcription

not much happened today

The Ultra-Scale Playbook: Training LLMs on GPU Clusters

Project Stargate: $500b datacenter (1.7% of US GDP) and Gemini 2 Flash Thinking 2

not much happened today

ChatGPT Canvas GA

not much happened today

OLMo 2 - new SOTA Fully Open LLM

Common Corpus: 2T Open Tokens with Provenance

Nothing much happened today

Problems with MMLU-Pro

Gemini Nano: 50-90% of Gemini Pro, <100ms inference, on device, in Chrome Canary

Clémentine Fourrier on LLM evals

Cursor reaches >1000 tok/s finetuning Llama3-70b for fast file editing

FineWeb: 15T Tokens, 12 years of CommonCrawl (deduped and filtered, you're welcome)

Anime pfp anon eclipses $10k A::B prompting challenge

Claude 3 is officially America's Next Top Model

Companies liable for AI hallucination is Good Actually for AI Engineers

GPT4Turbo A/B Test: gpt-4-1106-preview

Google Solves Text to Video

12/25/2023: Nous Hermes 2 Yi 34B for Christmas

12/15/2023: Mixtral-Instruct beats Gemini Pro (and matches GPT3.5)

12/11/2023: Mixtral beats GPT3.5 and Llama2-70B