Topic: "long-context"

not much happened today

not much happened today

not much happened today

not much happened today

not much happened today

not much happened today

not much happened today

GLM 5.2: the top Frontend Coding model in the world, IndexShare reduces costs

not much happened today

not much happened today

Anthropic raises $65B in Series H at a $965B post-money valuation, releases Opus 4.8 and Dynamic Workflows

not much happened today

not much happened today

not much happened today

not much happened today

not much happened today

Anthropic's Claude Opus 4.7

not much happened today

GPT 5.4: SOTA Knowledge Work -and- Coding -and- CUA Model, OpenAI is so very back

not much happened today

Claude Sonnet 4.6: clean upgrade of 4.5, mostly better with some caveats

Qwen3.5-397B-A17B: the smallest Open-Opus class, very efficient model

Z.ai GLM-5: New SOTA Open Weights LLM

not much happened today

OpenAI and Anthropic go to war: Claude Opus 4.6 vs GPT 5.3 Codex

Context Graphs: Hype or actually Trillion-dollar opportunity?

OpenEvidence, the ‘ChatGPT for doctors,’ raises $250m at $12B valuation, 12x from $1b last Feb

not much happened today

Apple picks Google's Gemini to power Siri's next generation

not much happened today

Claude Skills grows: Open Standard, Directory, Org Admin

OpenAI GPT Image-1.5 claims to beat Nano Banana Pro, #1 across all Arenas, but completely fails Vibe Checks

NVIDIA Nemotron 3: hybrid Mamba-Transformer completely open source models from 30B to 500B

not much happened today

GPT-5.2 (Instant/Thinking/Pro): 74% on GDPVal, 1.4x cost of GPT 5.1, on 10 Year OpenAI Anniversary

not much happened today

OpenRouter's State of AI - An Empirical 100 Trillion Token Study

Mistral 3: Mistral Large 3 + Ministral 3B/8B/14B open weights models

not much happened today

not much happened today

not much happened today

not much happened today

DeepSeek-OCR finds vision models can decode 10x more efficiently with ~97% accuracy of text-only, 33/200k pages/day/A100

The Karpathy-Dwarkesh Interview delays AGI timelines

Claude Agent Skills - glorified AGENTS.md? or MCP killer?

not much happened today

Anthropic Claude Sonnet 4.5, Claude Code 2.0, new VS Code Extensions

GDPVal finding: Claude Opus 4.1 within 95% of AGI (human experts in top 44 white collar jobs)

not much happened today

GPT-5 Codex launch and OpenAI's quiet rise in Agentic Coding

not much happened today

Oracle jumps +36% in a day after winning $300B OpenAI contract

Kimi K2‑0905 and Qwen3‑Max preview: two 1T open weights models launched

Cohere Command A Reasoning beats GPT-OSS-120B and DeepSeek R1 0528

DeepSeek V3.1: 840B token continued pretrain, beating Claude 4 Sonnet at 11% of its cost

not much happened today

OpenAI rolls out GPT-5 and GPT-5 Thinking to >1B users worldwide; -mini and -nano help claim Pareto Frontier

not much happened today

ChatGPT Agent: new o* model + unified Deep Research browser + Operator computer use + Code Interpreter terminal

Voxtral - Mistral's SOTA ASR model in 3B (mini) and 24B ("small") sizes beats OpenAI Whisper large-v3

Kimi K2 - SOTA Open MoE proves that Muon can scale to 15T tokens/1T params

Grok 4: xAI succeeds in going from 0 to new SOTA LLM in 2 years

not much happened today

Zuck goes Superintelligence Founder Mode: $100M bonuses + $100M+ salaries + NFDG Buyout?

Gemini 2.5 Pro/Flash GA, 2.5 Flash-Lite in Preview

not much happened today

not much happened today

Qwen 3: 0.6B to 235B MoE full+base models that beat R1 and o1

gpt-image-1 - ChatGPT's imagegen model, confusingly NOT 4o, now available in API

not much happened today

GPT 4.1: The New OpenAI Workhorse

not much happened today

LLaDA: Large Language Diffusion Models

Project Stargate: $500b datacenter (1.7% of US GDP) and Gemini 2 Flash Thinking 2

Titans: Learning to Memorize at Test Time

ModernBert: small new Retriever/Classifier workhorse, 8k context, 2T tokens,

not much happened today

Not much (in AI) happened this weekend

not much happened today

a calm before the storm

not much happened today

Everybody shipped small things this holiday weekend

not much happened today

Summer of Code AI: $1.6b raised, 1 usable product

CogVideoX: Zhipu's Open Source Sora

not much happened this weekend

Nvidia Minitron: LLM Pruning and Distillation updated for Llama 3.1

super quiet day

Llama 3.1: The Synthetic Data Model

Mini, Nemo, Turbo, Lite - Smol models go brrr (GPT4o-mini version)

Not much happened today.

Nemotron-4-340B: NVIDIA's new large open models, built on syndata, great for syndata

5 small news items

1 TRILLION token context, real time, on device?

Not much happened today

Evals: The Next Generation

Mergestral, Meta MTIAv2, Cohere Rerank 3, Google Infini-Attention

Claude 3 is officially America's Next Top Model

Claude 3 just destroyed GPT 4 (see for yourself)

Ring Attention for >1M Context

Google AI: Win some (Gemma, 1.5 Pro), Lose some (Image gen)

Sora pushes SOTA

1/8/2024: The Four Wars of the AI Stack

12/8/2023 - Mamba v Mistral v Hyena