All tags

Topic: "vision"

    not much happened today
    not much happened today
    not much happened today
    Cognition's DeepWiki, a free encyclopedia of all GitHub repos
    OpenAI o3, o4-mini, and Codex CLI
    not much happened today
    Google's Agent2Agent Protocol (A2A)
    not much happened today
    Gemma 3 beats DeepSeek V3 in Elo, 2.0 Flash beats GPT4o with Native Image Gen
    not much happened today
    The Ultra-Scale Playbook: Training LLMs on GPU Clusters
    not much happened today
    s1: Simple test-time scaling (and Kyutai Hibiki)
    DeepSeek #1 on US App Store, Nvidia stock tanks -17%
    Titans: Learning to Memorize at Test Time
    Moondream 2025.1.9: Structured Text, Enhanced OCR, Gaze Detection in a 2B Model
    not much happened today
    not much happened today
    not much happened today
    Genesis: Generative Physics Engine for Robotics (o1-mini version)
    Genesis: Generative Physics Engine for Robotics (o1-2024-12-17)
    o1 API, 4o/4o-mini in Realtime API + WebRTC, DPO Finetuning
    Meta BLT: Tokenizer-free, Byte-level LLM
    $200 ChatGPT Pro and o1-full/pro, with vision, without API, and mixed reviews
    not much happened today
    OLMo 2 - new SOTA Fully Open LLM
    Vision Everywhere: Apple AIMv2 and Jina CLIP v2
    LMSys killed Model Versioning (gpt 4o 1120, gemini exp 1121)
    Pixtral Large (124B) beats Llama 3.2 90B with updated Mistral Large 24.11
    Claude 3.5 Sonnet (New) gets Computer Use
    The AI Nobel Prize
    Llama 3.2: On-device 1B/3B, and Multimodal 11B/90B (with AI2 Molmo kicker)
    Pixtral 12B: Mistral beats Llama to Multimodality
    AIPhone 16: the Visual Intelligence Phone
    CogVideoX: Zhipu's Open Source Sora
    Ideogram 2 + Berkeley Function Calling Leaderboard V2
    not much happened today
    not much happened today
    Too Cheap To Meter: AI prices cut 50-70% in last 30 days
    not much happened today
    GPT4o August + 100% Structured Outputs for All (GPT4o August edition)
    not much happened today
    We Solved Hallucinations
    FlashAttention 3, PaliGemma, OpenAI's 5 Levels to Superintelligence
    Qdrant's BM42: "Please don't trust us"
    That GPT-4o Demo
    There's Ilya!
    Ways to use Anthropic's Tool Use GA
    Chameleon: Meta's (unreleased) GPT4o-like Omnimodal Model
    Google I/O in 60 seconds
    GPT-4o: the new SOTA-EVERYTHING Frontier model (GPT4O version)
    AdamW -> AaronD?
    Welcome /r/LocalLlama!
    Claude 3 just destroyed GPT 4 (see for yourself)
    CodeLLama 70B beats GPT4 on HumanEval
    12/28/2023: Smol Talk updates
    12/20/2023: Project Obsidian - Multimodal Mistral 7B from Nous
    12/13/2023 SOLAR10.7B upstages Mistral7B?