Topic: "model-optimization"

not much happened today

Moonshot Kimi K2.5 - Beats Sonnet 4.5 at half the cost, SOTA Open Model, first Native Image+Video, 100 parallel Agent Swarm manager

not much happened today

Apple picks Google's Gemini to power Siri's next generation

not much happened today

OpenAI GPT Image-1.5 claims to beat Nano Banana Pro, #1 across all Arenas, but completely fails Vibe Checks

NVIDIA Nemotron 3: hybrid Mamba-Transformer completely open source models from 30B to 500B

not much happened today

MCP -> Agentic AI Foundation, Mistral Devstral 2

not much happened today

not much happened today

not much happened today

not much happened today

Terminal-Bench 2.0 and Harbor

Cursor 2.0 & Composer-1: Fast Models and New Agents UI

not much happened today

not much happened today

Thinking Machines' Tinker: LoRA based LLM fine-tuning API

not much happened today

not much happened today

Grok 4 Fast: Xai's distilled, 40% more token efficient, 2m context, 344 tok/s frontier model

Softbank, NVIDIA and US Govt take 2%, 5% and 10% of Intel, will develop Intel x86 RTX SOCs for consumer & datacenters

not much happened today

GPT-5 Codex launch and OpenAI's quiet rise in Agentic Coding

Oracle jumps +36% in a day after winning $300B OpenAI contract

not much happened today

not much happened today

Western Open Models get Funding: Cohere $500m @ 6.8B, AI2 gets $152m NSF+NVIDIA grants

not much happened today

Gemini 2.5 Deep Think finally ships

GLM-4.5: Deeper, Headier, & better than Kimi/Qwen/DeepSeek (SOTA China LLM?)

not much happened today

not much happened today

Kimi K2 - SOTA Open MoE proves that Muon can scale to 15T tokens/1T params

not much happened today

OpenAI releases Deep Research API (o3/o4-mini)

not much happened today

Mary Meeker is so back: BOND Capital AI Trends report

not much happened today

Google I/O: new Gemini native voice, Flash, DeepThink, AI Mode (DeepSearch+Mariner+Astra)

not much happened today

Prime Intellect's INTELLECT-2 and PRIME-RL advance distributed reinforcement learning

not much happened today

ChatGPT responds to GlazeGate + LMArena responds to Cohere

LlamaCon: Meta AI gets into the Llama API platform business

not much happened today

Cohere's Command A claims #3 open model spot (after DeepSeek and Gemma)

DeepSeek's Open Source Stack

not much happened today

Mistral Small 3 24B and Tulu 3 405B

DeepSeek R1: o1-level open weights model and a simple recipe for upgrading 1.5B models to Sonnet/4o level

not much happened today

not much happened today

DeepSeek v3: 671B finegrained MoE trained for $5.5m USD of compute on 15T tokens

Meta BLT: Tokenizer-free, Byte-level LLM

not much happened today

Olympus has dropped (aka, Amazon Nova Micro|Lite|Pro|Premier|Canvas|Reel)

Vision Everywhere: Apple AIMv2 and Jina CLIP v2

Tencent's Hunyuan-Large claims to beat DeepSeek-V2 and Llama3-405B with LESS Data

not much happened today

not much happened this weekend

not much happened today

not much happened today

DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing

DeepSeek Janus and Meta SpiRit-LM: Decoupled Image and Expressive Voice Omnimodality

not much happened today

Not much technical happened today

not much happened today

Llama 3.2: On-device 1B/3B, and Multimodal 11B/90B (with AI2 Molmo kicker)

a calm before the storm

Cerebras Inference: Faster, Better, AND Cheaper

Ideogram 2 + Berkeley Function Calling Leaderboard V2

The DSPy Roadmap

DataComp-LM: the best open-data 7B model/benchmark/dataset

Mini, Nemo, Turbo, Lite - Smol models go brrr (GPT4o-mini version)

Qdrant's BM42: "Please don't trust us"

RouteLLM: RIP Martian? (Plus: AINews Structured Summaries update)

That GPT-4o Demo

Gemma 2: The Open Model for Everyone

Claude Crushes Code - 92% HumanEval and Claude.ai Artifacts

Talaria: Apple's new MLOps Superweapon

Not much happened today

Google I/O in 60 seconds

GPT-4o: the new SOTA-EVERYTHING Frontier model (GPT4O version)

A quiet weekend

Anime pfp anon eclipses $10k A::B prompting challenge

Evals-based AI Engineering

Jamba: Mixture of Architectures dethrones Mixtral

The Era of 1-bit LLMs

Welcome Interconnects and OpenRouter

Mistral Large disappoints

Karpathy emerges from stealth?

AI2 releases OLMo - the 4th open-everything LLM

Miqu confirmed to be an early Mistral-medium checkpoint

RWKV "Eagle" v5: Your move, Mamba

GPT4Turbo A/B Test: gpt-4-0125-preview

12/25/2023: Nous Hermes 2 Yi 34B for Christmas