All tags

Company: "openai"

    Execuhires Round 2: Scale-Meta, Lamini-AMD, and Instacart-OpenAI
    Reasoning Price War 2: Mistral Magistral + o3's 80% price cut + o3-pro
    Apple exposes Foundation Models API and... no new Siri
    Gemini 2.5 Pro (06-05) launched at AI Engineer World's Fair
    AI Engineer World's Fair Talks Day 1
    not much happened today
    not much happened today
    Mistral's Agents API and the 2025 LLM OS
    not much happened today
    not much happened today
    OpenAI buys Jony Ive's io for $6.5b, LMArena lands $100m seed from a16z
    ChatGPT Codex, OpenAI's first cloud SWE agent
    Gemini's AlphaEvolve agent uses Gemini 2.0 to find new Math and cuts Gemini cost 1% — without RL
    Granola launches team notes, while Notion launches meeting transcription
    not much happened today
    not much happened today
    not much happened today
    Cursor @ $9b, OpenAI Buys Windsurf @ $3b
    not much happened today
    ChatGPT responds to GlazeGate + LMArena responds to Cohere
    Cognition's DeepWiki, a free encyclopedia of all GitHub repos
    not much happened today
    gpt-image-1 - ChatGPT's imagegen model, confusingly NOT 4o, now available in API
    not much happened today; New email provider for AINews
    Grok 3 & 3-mini now API Available
    Gemini 2.5 Flash completes the total domination of the Pareto Frontier
    OpenAI o3, o4-mini, and Codex CLI
    QwQ-32B claims to match DeepSeek R1-671B
    SOTA Video Gen: Veo 2 and Kling 2 are GA for developers
    GPT 4.1: The New OpenAI Workhorse
    not much happened today
    not much happened today
    Google's Agent2Agent Protocol (A2A)
    not much happened today
    not much happened today
    not much happened today
    >$41B raised today (OpenAI @ 300b, Cursor @ 9.5b, Etched @ 1.5b)
    not much happened today
    not much happened today
    OpenAI adopts MCP
    Gemini 2.5 Pro + 4o Native Image Gen
    Promptable Prosody, SOTA ASR, and Semantic VAD: OpenAI revamps Voice AI
    not much happened today
    Gemma 3 beats DeepSeek V3 in Elo, 2.0 Flash beats GPT4o with Native Image Gen
    The new OpenAI Agents Platform
    not much happened today
    DeepSeek's Open Source Stack
    not much happened today
    Anthropic's $61.5B Series E
    not much happened today
    GPT 4.5 — Chonky Orion ships!
    lots of small launches
    AI Engineer Summit Day 1
    not much happened today
    X.ai Grok 3 and Mira Murati's Thinking Machines
    not much happened today
    Reasoning Models are Near-Superhuman Coders (OpenAI IOI, Nvidia Kernels)
    small news items
    not much happened today
    not much happened today
    OpenAI takes on Gemini's Deep Research
    o3-mini launches, OpenAI on "wrong side of history"
    not much happened today
    not much happened today
    DeepSeek #1 on US App Store, Nvidia stock tanks -17%
    TinyZero: Reproduce DeepSeek R1-Zero for $30
    OpenAI launches Operator, its first Agent
    Project Stargate: $500b datacenter (1.7% of US GDP) and Gemini 2 Flash Thinking 2
    not much happened today
    Titans: Learning to Memorize at Test Time
    small little news items
    Moondream 2025.1.9: Structured Text, Enhanced OCR, Gaze Detection in a 2B Model
    not much happened today
    not much happened today
    PRIME: Process Reinforcement through Implicit Rewards
    not much happened today
    not much happened today
    not much happened today
    DeepSeek v3: 671B finegrained MoE trained for $5.5m USD of compute on 15T tokens
    not much happened today
    not much happened this weekend
    o3 solves AIME, GPQA, Codeforces, makes 11 years of progress in ARC-AGI and 25% in FrontierMath
    ModernBert: small new Retriever/Classifier workhorse, 8k context, 2T tokens,
    Genesis: Generative Physics Engine for Robotics (o1-mini version)
    Genesis: Generative Physics Engine for Robotics (o1-2024-12-17)
    OpenAI Voice Mode Can See Now - After Gemini Does
    o1 API, 4o/4o-mini in Realtime API + WebRTC, DPO Finetuning
    Meta Apollo - Video Understanding up to 1 hour, SOTA Open Weights
    Meta BLT: Tokenizer-free, Byte-level LLM
    Google wakes up: Gemini 2.0 et al
    ChatGPT Canvas GA
    OpenAI Sora Turbo and Sora.com
    Meta Llama 3.3: 405B/Nova Pro performance at 70B price
    $200 ChatGPT Pro and o1-full/pro, with vision, without API, and mixed reviews
    not much happened today
    LMSys killed Model Versioning (gpt 4o 1120, gemini exp 1121)
    Stripe lets Agents spend money with StripeAgentToolkit
    Gemini (Experimental-1114) retakes #1 LLM rank with 1344 Elo
    FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI
    not much happened today
    OpenAI beats Anthropic to releasing Speculative Decoding
    not much happened today
    The AI Search Wars Have Begun — SearchGPT, Gemini Grounding, and more
    Creating a LLM-as-a-Judge
    GitHub Copilot Strikes Back
    not much happened this weekend
    not much happened today
    not much happened today
    DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing
    not much happened today
    not much happened today
    Not much (in AI) happened this weekend
    not much happened today
    The AI Nobel Prize
    not much happened this weekend
    Contextual Document Embeddings: `cde-small-v1`
    Canvas: OpenAI's answer to Claude Artifacts
    Not much technical happened today
    OpenAI Realtime API and other Dev Day Goodies
    Liquid Foundation Models: A New Transformers alternative + AINews Pod 2
    not much happened today
    ChatGPT Advanced Voice Mode
    a calm before the storm
    not much happened today
    not much happened today
    o1 destroys Lmsys Arena, Qwen 2.5, Kyutai Moshi release
    nothing much happened today
    a quiet weekend
    Learnings from o1 AMA
    o1: OpenAI's new general reasoning models
    Pixtral 12B: Mistral beats Llama to Multimodality
    AIPhone 16: the Visual Intelligence Phone
    Everybody shipped small things this holiday weekend
    not much happened today
    Ideogram 2 + Berkeley Function Calling Leaderboard V2
    not much happened today
    The DSPy Roadmap
    not much happened today
    Grok 2! and ChatGPT-4o-latest confuses everybody
    Gemini Live
    not much happened today
    GPT4o August + 100% Structured Outputs for All (GPT4o August edition)
    How Carlini Uses AI
    Execuhires: Tempting The Wrath of Khan
    Gemma 2 2B + Scope + Shield
    Llama 3.1 Leaks: big bumps to 8B, minor bumps to 70b, and SOTA OSS 405b model
    DataComp-LM: the best open-data 7B model/benchmark/dataset
    Mini, Nemo, Turbo, Lite - Smol models go brrr (GPT4o-mini version)
    Mini, Nemo, Turbo, Lite - Smol models go brrr (GPT4o version)
    We Solved Hallucinations
    FlashAttention 3, PaliGemma, OpenAI's 5 Levels to Superintelligence
    Nothing much happened today
    RouteLLM: RIP Martian? (Plus: AINews Structured Summaries update)
    That GPT-4o Demo
    Mozilla's AI Second Act
    Claude Crushes Code - 92% HumanEval and Claude.ai Artifacts
    There's Ilya!
    Is this... OpenQ*?
    Francois Chollet launches $1m ARC Prize
    HippoRAG: First, do know(ledge) Graph
    5 small news items
    Not much happened today
    Contextual Position Encoding (CoPE)
    Somebody give Andrej some H100s already
    Life after DPO (RewardBench)
    Ten Commandments for Deploying Fine-Tuned Models
    ALL of AI Engineering in One Place
    Chameleon: Meta's (unreleased) GPT4o-like Omnimodal Model
    Cursor reaches >1000 tok/s finetuning Llama3-70b for fast file editing
    Not much happened today
    GPT-4o: the new SOTA-EVERYTHING Frontier model (GPT4T version)
    GPT-4o: the new SOTA-EVERYTHING Frontier model (GPT4O version)
    Quis promptum ipso promptiet?
    LMSys advances Llama 3 eval analysis
    OpenAI's PR Campaign?
    Kolmogorov-Arnold Networks: MLP killers or just spicy MLPs?
    DeepSeek-V2 beats Mixtral 8x22B with >160 experts at HALF the cost
    $100k to predict LMSYS human preferences in a Kaggle contest
    Evals: The Next Generation
    Not much happened today
    LLMs-as-Juries
    Snowflake Arctic: Fully Open 10B+128x4B Dense-MoE Hybrid LLM
    OpenAI's Instruction Hierarchy for the LLM OS
    FineWeb: 15T Tokens, 12 years of CommonCrawl (deduped and filtered, you're welcome)
    Lilian Weng on Video Diffusion
    Zero to GPT in 1 Year
    Gemini Pro and GPT4T Vision go GA on the same day by complete coincidence
    Anime pfp anon eclipses $10k A::B prompting challenge
    Cohere Command R+, Anthropic Claude Tool Use, OpenAI Finetuning
    ReALM: Reference Resolution As Language Modeling
    Not much happened today
    AdamW -> AaronD?
    Evals-based AI Engineering
    DBRX: Best open model (just not most efficient)
    Andrew likes Agents
    World_sim.exe
    Grok-1 in Bio
    The world's first fully autonomous AI Engineer
    ... and welcome AI Twitter!
    Welcome Interconnects and OpenRouter
    Mistral Large disappoints
    Sora pushes SOTA
    AI gets Memory
    Gemini Ultra is out, to mixed reviews
    MetaVoice & RIP Bard
    Less Lazy AI
    Trust in GPTs at all time low
    GPT4Turbo A/B Test: gpt-4-0125-preview
    GPT4Turbo A/B Test: gpt-4-1106-preview
    RIP Latent Diffusion, Hello Hourglass Diffusion
    Sama says: GPT-5 soon
    1/13-14/2024: Don't sleep on #prompt-engineering
    1/12/2024: Anthropic coins Sleeper Agents
    1/10/2024: All the best papers for AI Engineers
    1/9/2024: Nous Research lands $5m for Open Source AI
    1/8/2024: The Four Wars of the AI Stack
    1/6-7/2024: LlaMA Pro - an alternative to PEFT/RAG??
    1/2/2024: Smol tweaks to Smol Talk
    1/1/2024: How to start with Open Source AI
    12/29/2023: TinyLlama on the way
    12/24/2023: Dolphin Mixtral 8x7b is wild
    12/22/2023: Anyscale's Benchmark Criticisms
    12/21/2023: The State of AI (according to LangChain)
    12/20/2023: Project Obsidian - Multimodal Mistral 7B from Nous
    12/19/2023: Everybody Loves OpenRouter
    12/18/2023: Gaslighting Mistral for fun and profit
    12/16/2023: ByteDance suspended by OpenAI
    12/15/2023: Mixtral-Instruct beats Gemini Pro (and matches GPT3.5)
    12/14/2023: $1e7 for Superalignment
    12/13/2023 SOLAR10.7B upstages Mistral7B?
    12/12/2023: Towards LangChain 0.1
    12/11/2023: Mixtral beats GPT3.5 and Llama2-70B
    12/10/2023: not much happened today
    12/7/2023: Anthropic says "skill issue"
    Is Google's Gemini... legit?