All tags

Company: "anthropic"

    Execuhires Round 2: Scale-Meta, Lamini-AMD, and Instacart-OpenAI
    Reasoning Price War 2: Mistral Magistral + o3's 80% price cut + o3-pro
    AI Engineer World's Fair Talks Day 1
    not much happened today
    not much happened today
    Mary Meeker is so back: BOND Capital AI Trends report
    DeepSeek-R1-0528 - Gemini 2.5 Pro-level model, SOTA Open Weights release
    not much happened today
    not much happened today
    Anthropic releases Claude 4 Sonnet and Opus: Memory, Agent Capabilities, Claude Code, Redteam Drama
    Granola launches team notes, while Notion launches meeting transcription
    AI Engineer World's Fair: Second Run, Twice The Fun
    not much happened today
    not much happened today
    not much happened today; New email provider for AINews
    Gemini 2.5 Flash completes the total domination of the Pareto Frontier
    not much happened today
    not much happened today
    not much happened today
    lots of little things happened this week
    not much happened today
    not much happened today
    Anthropic's $61.5B Series E
    not much happened today
    lots of small launches
    not much happened today
    Claude 3.7 Sonnet
    AI Engineer Summit Day 1
    not much happened today
    X.ai Grok 3 and Mira Murati's Thinking Machines
    not much happened today
    not much happened today
    Gemini 2.0 Flash GA, with new Flash Lite, 2.0 Pro, and Flash Thinking
    not much happened today
    OpenAI launches Operator, its first Agent
    Titans: Learning to Memorize at Test Time
    not much happened today
    DeepSeek v3: 671B finegrained MoE trained for $5.5m USD of compute on 15T tokens
    OpenAI Voice Mode Can See Now - After Gemini Does
    Meta BLT: Tokenizer-free, Byte-level LLM
    not much happened today
    Olympus has dropped (aka, Amazon Nova Micro|Lite|Pro|Premier|Canvas|Reel)
    not much happened today
    Anthropic launches the Model Context Protocol
    LMSys killed Model Versioning (gpt 4o 1120, gemini exp 1121)
    Perplexity starts Shopping for you
    Stripe lets Agents spend money with StripeAgentToolkit
    Gemini (Experimental-1114) retakes #1 LLM rank with 1344 Elo
    Common Corpus: 2T Open Tokens with Provenance
    FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI
    not much happened today
    Not much happened today
    Tencent's Hunyuan-Large claims to beat DeepSeek-V2 and Llama3-405B with LESS Data
    OpenAI beats Anthropic to releasing Speculative Decoding
    not much happened today
    Creating a LLM-as-a-Judge
    GitHub Copilot Strikes Back
    not much happened this weekend
    not much happened today
    not much happened today
    Claude 3.5 Sonnet (New) gets Computer Use
    DeepSeek Janus and Meta SpiRit-LM: Decoupled Image and Expressive Voice Omnimodality
    not much happened today
    not much happened today
    State of AI 2024
    not much happened today
    The AI Nobel Prize
    ChatGPT Advanced Voice Mode
    a calm before the storm
    not much happened today
    not much happened today
    o1 destroys Lmsys Arena, Qwen 2.5, Kyutai Moshi release
    a quiet weekend
    Pixtral 12B: Mistral beats Llama to Multimodality
    Replit Agent - How did everybody beat Devin to market?
    $1150m for SSI, Sakana, You.com + Claude 500m context
    Everybody shipped small things this holiday weekend
    Cerebras Inference: Faster, Better, AND Cheaper
    Nvidia Minitron: LLM Pruning and Distillation updated for Llama 3.1
    super quiet day
    not much happened today
    not much happened today
    not much happened today
    Gemini Live
    not much happened today
    Execuhires: Tempting The Wrath of Khan
    Gemma 2 2B + Scope + Shield
    SciCode: HumanEval gets a STEM PhD upgrade
    Qdrant's BM42: "Please don't trust us"
    GraphRAG: The Marriage of Knowledge Graphs and RAG
    Gemma 2: The Open Model for Everyone
    Mozilla's AI Second Act
    Shall I compare thee to a Sonnet's day?
    Gemini Nano: 50-90% of Gemini Pro, <100ms inference, on device, in Chrome Canary
    Shazeer et al (2024): you are overpaying for inference >13x
    Claude Crushes Code - 92% HumanEval and Claude.ai Artifacts
    There's Ilya!
    Is this... OpenQ*?
    Ways to use Anthropic's Tool Use GA
    Contextual Position Encoding (CoPE)
    Life after DPO (RewardBench)
    Ten Commandments for Deploying Fine-Tuned Models
    ALL of AI Engineering in One Place
    Anthropic's "LLM Genome Project": learning & clamping 34m features on Claude Sonnet
    Chameleon: Meta's (unreleased) GPT4o-like Omnimodal Model
    Cursor reaches >1000 tok/s finetuning Llama3-70b for fast file editing
    Not much happened today
    Quis promptum ipso promptiet?
    Not much happened today
    Zero to GPT in 1 Year
    Cohere Command R+, Anthropic Claude Tool Use, OpenAI Finetuning
    Claude 3 is officially America's Next Top Model
    Shipping and Dipping: Inflection + Stability edition
    World_sim.exe
    Grok-1 in Bio
    MM1: Apple's first Large Multimodal Model
    Not much happened piday
    DeepMind SIMA: one AI, 9 games, 600 tasks, vision+language ONLY
    Fixing Gemma
    Inflection-2.5 at 94% of GPT4, and Pi at 6m MAU
    Not much happened today
    Stable Diffusion 3 — Rombach & Esser did it again!
    Claude 3 just destroyed GPT 4 (see for yourself)
    1/12/2024: Anthropic coins Sleeper Agents
    1/4/2024: Jeff Bezos backs Perplexity's $520m Series B.
    12/18/2023: Gaslighting Mistral for fun and profit
    12/16/2023: ByteDance suspended by OpenAI
    12/12/2023: Towards LangChain 0.1
    12/8/2023 - Mamba v Mistral v Hyena
    12/7/2023: Anthropic says "skill issue"