Topic: "model-deployment"

not much happened today

Meta Superintelligence Labs acquires Manus AI for over $2B, at $100M ARR, 9months after launch

not much happened today

not much happened today

Gemini 3.0 Flash Preview: 1/4 cost of Pro, but ~as smart, retakes Pareto Frontier

DeepSeek V3.2 & 3.2-Speciale: GPT5-High Open Weights, Context Management, Plans for Compute Scaling

Terminal-Bench 2.0 and Harbor

Kimi K2 Thinking: 1T-A32B params, SOTA HLE, BrowseComp, TauBench && Soumith leaves Pytorch

MiniMax M2 230BA10B — 8% of Claude Sonnet's price, ~2x faster, new SOTA open model

OpenAI Dev Day: Apps SDK, AgentKit, Codex GA, GPT‑5 Pro and Sora 2 APIs

Grok 4 Fast: Xai's distilled, 40% more token efficient, 2m context, 344 tok/s frontier model

Qwen3-Next-80B-A3B-Base: Towards Ultimate Training & Inference Efficiency

ChatGPT Agent: new o* model + unified Deep Research browser + Operator computer use + Code Interpreter terminal

Anthropic releases Claude 4 Sonnet and Opus: Memory, Agent Capabilities, Claude Code, Redteam Drama

LlamaCon: Meta AI gets into the Llama API platform business

>$41B raised today (OpenAI @ 300b, Cursor @ 9.5b, Etched @ 1.5b)

not much happened today

not much happened today

not much happened today

not much happened today

Qwen with Questions: 32B open weights reasoning model nears o1 in GPQA/AIME/Math500

Tencent's Hunyuan-Large claims to beat DeepSeek-V2 and Llama3-405B with LESS Data

Liquid Foundation Models: A New Transformers alternative + AINews Pod 2

not much happened today

Pixtral 12B: Mistral beats Llama to Multimodality

GPT4o August + 100% Structured Outputs for All (GPT4o August edition)

Llama 3.1: The Synthetic Data Model

A quiet weekend

OpenAI's Instruction Hierarchy for the LLM OS

Not much happened today

The Core Skills of AI Engineering

12/31/2023: Happy New Year

12/29/2023: TinyLlama on the way

12/11/2023: Mixtral beats GPT3.5 and Llama2-70B

12/8/2023 - Mamba v Mistral v Hyena