Company: "mistral-ai"

    Reasoning Price War 2: Mistral Magistral + o3's 80% price cut + o3-pro
    Mistral's Agents API and the 2025 LLM OS
    OpenAI buys Jony Ive's io for $6.5b, LMArena lands $100m seed from a16z
    not much happened today
    Prime Intellect's INTELLECT-2 and PRIME-RL advance distributed reinforcement learning
    not much happened today
    not much happened today
    Qwen 3: 0.6B to 235B MoE full+base models that beat R1 and o1
    not much happened today
    Cohere's Command A claims #3 open model spot (after DeepSeek and Gemma)
    DeepSeek's Open Source Stack
    not much happened today
    lots of small launches
    o3-mini launches, OpenAI on "wrong side of history"
    Mistral Small 3 24B and Tulu 3 405B
    OpenAI Voice Mode Can See Now - After Gemini Does
    OpenAI Sora Turbo and Sora.com
    LMSys killed Model Versioning (gpt 4o 1120, gemini exp 1121)
    Perplexity starts Shopping for you
    Pixtral Large (124B) beats Llama 3.2 90B with updated Mistral Large 24.11
    not much happened this weekend
    Did Nvidia's Nemotron 70B train on test?
    o1 destroys Lmsys Arena, Qwen 2.5, Kyutai Moshi release
    a quiet weekend
    Pixtral 12B: Mistral beats Llama to Multimodality
    not much happened this weekend
    not much happened today
    not much happened today
    Too Cheap To Meter: AI prices cut 50-70% in last 30 days
    not much happened today
    AlphaProof + AlphaGeometry2 reach 1 point short of IMO Gold
    Mistral Large 2 + RIP Mistral 7B, 8x7B, 8x22B
    DataComp-LM: the best open-data 7B model/benchmark/dataset
    Mini, Nemo, Turbo, Lite - Smol models go brrr (GPT4o-mini version)
    Mini, Nemo, Turbo, Lite - Smol models go brrr (GPT4o version)
    Gemma 2 tops /r/LocalLlama vibe check
    Gemma 2: The Open Model for Everyone
    Nemotron-4-340B: NVIDIA's new large open models, built on syndata, great for syndata
    Talaria: Apple's new MLOps Superweapon
    5 small news items
    Not much happened today
    1 TRILLION token context, real time, on device?
    Life after DPO (RewardBench)
    ALL of AI Engineering in One Place
    DeepSeek-V2 beats Mixtral 8x22B with >160 experts at HALF the cost
    Evals: The Next Generation
    A quiet weekend
    OpenAI's Instruction Hierarchy for the LLM OS
    FineWeb: 15T Tokens, 12 years of CommonCrawl (deduped and filtered, you're welcome)
    Meta Llama 3 (8B, 70B)
    Mixtral 8x22B Instruct sparks efficiency memes
    Multi-modal, Multi-Aspect, Multi-Form-Factor AI
    Zero to GPT in 1 Year
    Mergestral, Meta MTIAv2, Cohere Rerank 3, Google Infini-Attention
    Music's Dall-E moment
    Cohere Command R+, Anthropic Claude Tool Use, OpenAI Finetuning
    Not much happened today
    Evals-based AI Engineering
    DBRX: Best open model (just not most efficient)
    Claude 3 is officially America's Next Top Model
    not much happened today
    Welcome /r/LocalLlama!
    Grok-1 in Bio
    Fixing Gemma
    Inflection-2.5 at 94% of GPT4, and Pi at 6m MAU
    Not much happened today
    Welcome Interconnects and OpenRouter
    Mistral Large disappoints
    One Year of Latent Space
    Karpathy emerges from stealth?
    Companies liable for AI hallucination is Good Actually for AI Engineers
    Sora pushes SOTA
    AI gets Memory
    Gemini Ultra is out, to mixed reviews
    Qwen 1.5 Released
    AI2 releases OLMo - the 4th open-everything LLM
    Trust in GPTs at all time low
    Miqu confirmed to be an early Mistral-medium checkpoint
    CodeLlama 70B beats GPT4 on HumanEval
    RWKV "Eagle" v5: Your move, Mamba
    GPT4Turbo A/B Test: gpt-4-1106-preview
    Adept Fuyu-Heavy: Multimodal model for Agents
    Google Solves Text to Video
    Nightshade poisons AI art... kinda?
    1/17/2024: Help crowdsource function calling datasets
    1/8/2024: The Four Wars of the AI Stack
    1/6-7/2024: LLaMA Pro - an alternative to PEFT/RAG??
    1/4/2024: Jeff Bezos backs Perplexity's $520m Series B.
    12/31/2023: Happy New Year
    12/30/2023: Mega List of all LLMs
    12/27/2023: NYT vs OpenAI
    12/24/2023: Dolphin Mixtral 8x7b is wild
    12/19/2023: Everybody Loves OpenRouter
    12/13/2023: SOLAR10.7B upstages Mistral7B?
    12/12/2023: Towards LangChain 0.1
    12/11/2023: Mixtral beats GPT3.5 and Llama2-70B
    12/10/2023: not much happened today
    12/9/2023: The Mixtral Rush
    12/8/2023: Mamba v Mistral v Hyena