All tags

Company: "nvidia"

    Gemini 2.5 Pro (06-05) launched at AI Engineer World's Fair
    DeepSeek-R1-0528 - Gemini 2.5 Pro-level model, SOTA Open Weights release
    not much happened today
    not much happened today
    not much happened today
    Gemini 2.5 Pro Preview 05-06 (I/O edition) - the SOTA vision+coding model
    Cursor @ $9b, OpenAI Buys Windsurf @ $3b
    gpt-image-1 - ChatGPT's imagegen model, confusingly NOT 4o, now available in API
    not much happened today
    not much happened today
    Google's Agent2Agent Protocol (A2A)
    not much happened today
    lots of little things happened this week
    Every 7 Months: The Moore's Law for Agent Autonomy
    not much happened today
    not much happened today
    not much happened today
    Reasoning Models are Near-Superhuman Coders (OpenAI IOI, Nvidia Kernels)
    not much happened today
    DeepSeek #1 on US App Store, Nvidia stock tanks -17%
    Project Stargate: $500b datacenter (1.7% of US GDP) and Gemini 2 Flash Thinking 2
    not much happened today
    not much happened today
    not much happened this weekend
    OpenAI Sora Turbo and Sora.com
    not much happened today
    not much happened today
    DeepSeek-R1 claims to beat o1-preview AND will be open sourced
    Pixtral Large (124B) beats Llama 3.2 90B with updated Mistral Large 24.11
    OpenAI beats Anthropic to releasing Speculative Decoding
    The AI Search Wars Have Begun — SearchGPT, Gemini Grounding, and more
    not much happened today
    Claude 3.5 Sonnet (New) gets Computer Use
    DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing
    DeepSeek Janus and Meta SpiRit-LM: Decoupled Image and Expressive Voice Omnimodality
    Did Nvidia's Nemotron 70B train on test?
    Not much (in AI) happened this weekend
    o1: OpenAI's new general reasoning models
    Everybody shipped small things this holiday weekend
    Summer of Code AI: $1.6b raised, 1 usable product
    CogVideoX: Zhipu's Open Source Sora
    not much happened this weekend
    Nvidia Minitron: LLM Pruning and Distillation updated for Llama 3.1
    not much happened today
    GPT4o August + 100% Structured Outputs for All (GPT4o August edition)
    How Carlini Uses AI
    Rombach et al: FLUX.1 [pro|dev|schnell], $31m seed for Black Forest Labs
    Gemma 2 2B + Scope + Shield
    DataComp-LM: the best open-data 7B model/benchmark/dataset
    Mini, Nemo, Turbo, Lite - Smol models go brrr (GPT4o-mini version)
    Mini, Nemo, Turbo, Lite - Smol models go brrr (GPT4o version)
    SciCode: HumanEval gets a STEM PhD upgrade
    We Solved Hallucinations
    GraphRAG: The Marriage of Knowledge Graphs and RAG
    Gemini Nano: 50-90% of Gemini Pro, <100ms inference, on device, in Chrome Canary
    Gemini launches context caching... or does it?
    Is this... OpenQ*?
    Nemotron-4-340B: NVIDIA's new large open models, built on syndata, great for syndata
    Hybrid SSM/Transformers > Pure SSMs/Pure Transformers
    5 small news items
    Not much happened today
    Somebody give Andrej some H100s already
    Life after DPO (RewardBench)
    ALL of AI Engineering in One Place
    DeepSeek-V2 beats Mixtral 8x22B with >160 experts at HALF the cost
    $100k to predict LMSYS human preferences in a Kaggle contest
    Snowflake Arctic: Fully Open 10B+128x4B Dense-MoE Hybrid LLM
    Llama-3-70b is GPT-4-level Open Model
    Mixtral 8x22B Instruct sparks efficiency memes
    Not much happened today
    Welcome /r/LocalLlama!
    Shipping and Dipping: Inflection + Stability edition
    World_sim.exe
    FSDP+QLoRA: the Answer to 70b-scale AI for desktop class GPUs
    One Year of Latent Space
    Ring Attention for >1M Context
    Google AI: Win some (Gemma, 1.5 Pro), Lose some (Image gen)
    Sora pushes SOTA
    The Dissection of Smaug (72B)
    1/16/2024: ArtificialAnalysis - a new model/host benchmark site