Company: "nvidia"

not much happened today

not much happened today

not much happened today

not much happened today

not much happened today

not much happened today

not much happened today

not much happened today

not much happened today

not much happened today

not much happened today

not much happened today

not much happened today

not much happened today

OpenAI closes $110B raise from Amazon, NVIDIA, SoftBank in largest startup fundraise in history @ $840B post-money

OpenAI and Anthropic go to war: Claude Opus 4.6 vs GPT 5.3 Codex

Anthropic launches the MCP Apps open spec, in Claude.ai

xAI raises $20B Series E at ~$230B valuation

Nvidia buys (most of) Groq for $20B cash; largest execuhire ever

NVIDIA Nemotron 3: hybrid Mamba-Transformer completely open source models from 30B to 500B

not much happened today

not much happened today

not much happened today

not much happened today

not much happened today

OpenAI Titan XPU: 10GW of self-designed chips with Broadcom

not much happened today

GDPVal finding: Claude Opus 4.1 within 95% of AGI (human experts in top 44 white collar jobs)

NVIDIA to invest $100B in OpenAI for 10GW of Vera Rubin rollout

Softbank, NVIDIA and US Govt take 2%, 5% and 10% of Intel, will develop Intel x86 RTX SOCs for consumer & datacenters

GPT-5 Codex launch and OpenAI's quiet rise in Agentic Coding

Qwen3-Next-80B-A3B-Base: Towards Ultimate Training & Inference Efficiency

OpenAI updates Codex, VSCode Extension that can sync tasks with Codex Cloud

nano-banana is Gemini‑2.5‑Flash‑Image, beating Flux Kontext by 170 Elo with SOTA Consistency, Editing, and Multi-Image Fusion

not much happened today

Western Open Models get Funding: Cohere $500m @ 6.8B, AI2 gets $152m NSF+NVIDIA grants

not much happened today

Gemini 2.5 Pro (06-05) launched at AI Engineer World's Fair

DeepSeek-R1-0528 - Gemini 2.5 Pro-level model, SOTA Open Weights release

not much happened today

not much happened today

not much happened today

Gemini 2.5 Pro Preview 05-06 (I/O edition) - the SOTA vision+coding model

Cursor @ $9b, OpenAI Buys Windsurf @ $3b

gpt-image-1 - ChatGPT's imagegen model, confusingly NOT 4o, now available in API

not much happened today

not much happened today

Google's Agent2Agent Protocol (A2A)

not much happened today

lots of little things happened this week

Every 7 Months: The Moore's Law for Agent Autonomy

not much happened today

not much happened today

not much happened today

Reasoning Models are Near-Superhuman Coders (OpenAI IOI, Nvidia Kernels)

not much happened today

DeepSeek #1 on US App Store, Nvidia stock tanks -17%

Project Stargate: $500b datacenter (1.7% of US GDP) and Gemini 2 Flash Thinking 2

not much happened today

not much happened today

not much happened this weekend

OpenAI Sora Turbo and Sora.com

not much happened today

not much happened today

DeepSeek-R1 claims to beat o1-preview AND will be open sourced

Pixtral Large (124B) beats Llama 3.2 90B with updated Mistral Large 24.11

OpenAI beats Anthropic to releasing Speculative Decoding

The AI Search Wars Have Begun — SearchGPT, Gemini Grounding, and more

not much happened today

Claude 3.5 Sonnet (New) gets Computer Use

DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing

DeepSeek Janus and Meta SpiRit-LM: Decoupled Image and Expressive Voice Omnimodality

Did Nvidia's Nemotron 70B train on test?

Not much (in AI) happened this weekend

o1: OpenAI's new general reasoning models

Everybody shipped small things this holiday weekend

Summer of Code AI: $1.6b raised, 1 usable product

CogVideoX: Zhipu's Open Source Sora

not much happened this weekend

Nvidia Minitron: LLM Pruning and Distillation updated for Llama 3.1

not much happened today

GPT4o August + 100% Structured Outputs for All (GPT4o August edition)

How Carlini Uses AI

Rombach et al: FLUX.1 [pro|dev|schnell], $31m seed for Black Forest Labs

Gemma 2 2B + Scope + Shield

DataComp-LM: the best open-data 7B model/benchmark/dataset

Mini, Nemo, Turbo, Lite - Smol models go brrr (GPT4o-mini version)

Mini, Nemo, Turbo, Lite - Smol models go brrr (GPT4o version)

SciCode: HumanEval gets a STEM PhD upgrade

We Solved Hallucinations

GraphRAG: The Marriage of Knowledge Graphs and RAG

Gemini Nano: 50-90% of Gemini Pro, <100ms inference, on device, in Chrome Canary

Gemini launches context caching... or does it?

Is this... OpenQ*?

Nemotron-4-340B: NVIDIA's new large open models, built on syndata, great for syndata

Hybrid SSM/Transformers > Pure SSMs/Pure Transformers

5 small news items

Not much happened today

Somebody give Andrej some H100s already

Life after DPO (RewardBench)

ALL of AI Engineering in One Place

DeepSeek-V2 beats Mixtral 8x22B with >160 experts at HALF the cost

$100k to predict LMSYS human preferences in a Kaggle contest

Snowflake Arctic: Fully Open 10B+128x4B Dense-MoE Hybrid LLM

Llama-3-70b is GPT-4-level Open Model

Mixtral 8x22B Instruct sparks efficiency memes

Not much happened today

Welcome /r/LocalLlama!

Shipping and Dipping: Inflection + Stability edition

FSDP+QLoRA: the Answer to 70b-scale AI for desktop class GPUs

One Year of Latent Space

Ring Attention for >1M Context

Google AI: Win some (Gemma, 1.5 Pro), Lose some (Image gen)

Sora pushes SOTA

The Dissection of Smaug (72B)

1/16/2024: ArtificialAnalysis - a new model/host benchmark site