Company: "mistral-ai"

not much happened today

Qwen-Image 2.0 and Seedance 2.0

not much happened today

not much happened today

MCP -> Agentic AI Foundation, Mistral Devstral 2

not much happened today

Mistral 3: Mistral Large 3 + Ministral 3B/8B/14B open weights models

not much happened today

Grok 4 Fast: Xai's distilled, 40% more token efficient, 2m context, 344 tok/s frontier model

Softbank, NVIDIA and US Govt take 2%, 5% and 10% of Intel, will develop Intel x86 RTX SOCs for consumer & datacenters

Qwen3-Next-80B-A3B-Base: Towards Ultimate Training & Inference Efficiency

Anthropic raises $13B at $183B Series F

Figma's $50+b IPO

not much happened today

Voxtral - Mistral's SOTA ASR model in 3B (mini) and 24B ("small") sizes beats OpenAI Whisper large-v3

Kimi K2 - SOTA Open MoE proves that Muon can scale to 15T tokens/1T params

SmolLM3: the SOTA 3B reasoning open source LLM

Not much happened today

The Quiet Rise of Claude Code vs Codex

Reasoning Price War 2: Mistral Magistral + o3's 80% price cut + o3-pro

Mistral's Agents API and the 2025 LLM OS

OpenAI buys Jony Ive's io for $6.5b, LMArena lands $100m seed from a16z

not much happened today

Prime Intellect's INTELLECT-2 and PRIME-RL advance distributed reinforcement learning

not much happened today

not much happened today

Qwen 3: 0.6B to 235B MoE full+base models that beat R1 and o1

not much happened today

Cohere's Command A claims #3 open model spot (after DeepSeek and Gemma)

DeepSeek's Open Source Stack

not much happened today

lots of small launches

o3-mini launches, OpenAI on "wrong side of history"

Mistral Small 3 24B and Tulu 3 405B

OpenAI Voice Mode Can See Now - After Gemini Does

OpenAI Sora Turbo and Sora.com

LMSys killed Model Versioning (gpt 4o 1120, gemini exp 1121)

Perplexity starts Shopping for you

Pixtral Large (124B) beats Llama 3.2 90B with updated Mistral Large 24.11

not much happened this weekend

Did Nvidia's Nemotron 70B train on test?

o1 destroys Lmsys Arena, Qwen 2.5, Kyutai Moshi release

a quiet weekend

Pixtral 12B: Mistral beats Llama to Multimodality

not much happened this weekend

not much happened today

not much happened today

Too Cheap To Meter: AI prices cut 50-70% in last 30 days

not much happened today

AlphaProof + AlphaGeometry2 reach 1 point short of IMO Gold

Mistral Large 2 + RIP Mistral 7B, 8x7B, 8x22B

DataComp-LM: the best open-data 7B model/benchmark/dataset

Mini, Nemo, Turbo, Lite - Smol models go brrr (GPT4o-mini version)

Mini, Nemo, Turbo, Lite - Smol models go brrr (GPT4o version)

Gemma 2 tops /r/LocalLlama vibe check

Gemma 2: The Open Model for Everyone

Nemotron-4-340B: NVIDIA's new large open models, built on syndata, great for syndata

Talaria: Apple's new MLOps Superweapon

5 small news items

Not much happened today

1 TRILLION token context, real time, on device?

Life after DPO (RewardBench)

ALL of AI Engineering in One Place

DeepSeek-V2 beats Mixtral 8x22B with >160 experts at HALF the cost

Evals: The Next Generation

A quiet weekend

OpenAI's Instruction Hierarchy for the LLM OS

FineWeb: 15T Tokens, 12 years of CommonCrawl (deduped and filtered, you're welcome)

Meta Llama 3 (8B, 70B)

Mixtral 8x22B Instruct sparks efficiency memes

Multi-modal, Multi-Aspect, Multi-Form-Factor AI

Zero to GPT in 1 Year

Mergestral, Meta MTIAv2, Cohere Rerank 3, Google Infini-Attention

Music's Dall-E moment

Cohere Command R+, Anthropic Claude Tool Use, OpenAI Finetuning

Not much happened today

Evals-based AI Engineering

DBRX: Best open model (just not most efficient)

Claude 3 is officially America's Next Top Model

not much happened today

Welcome /r/LocalLlama!

Inflection-2.5 at 94% of GPT4, and Pi at 6m MAU

Not much happened today

Welcome Interconnects and OpenRouter

Mistral Large disappoints

One Year of Latent Space

Karpathy emerges from stealth?

Companies liable for AI hallucination is Good Actually for AI Engineers

Sora pushes SOTA

Gemini Ultra is out, to mixed reviews

Qwen 1.5 Released

AI2 releases OLMo - the 4th open-everything LLM

Trust in GPTs at all time low

Miqu confirmed to be an early Mistral-medium checkpoint

CodeLLama 70B beats GPT4 on HumanEval

codellama miqu mistral-medium llama-2-70b aphrodite-engine mixtral flatdolphinmaid noromaid rpcal chatml mistral-7b activation-beacon eagle-7b rwkv-v5 openhermes2.5 nous-hermes-2-mixtral-8x7b-dpo imp-v1-3b bakllava moondream qwen-vl meta-ai-fair ollama nous-research mistral-ai hugging-face ai-ethics alignment gpu-optimization direct-prompt-optimization fine-tuning cuda-programming optimizer-technology quantization multimodality context-length dense-retrieval retrieval-augmented-generation multilinguality model-performance open-source code-generation classification vision

Meta AI surprised the community with the release of CodeLlama, an open-source model now available on platforms like Ollama and MLX for local use. The Miqu model sparked debate over its origins, possibly linked to Mistral Medium or a fine-tuned Llama-2-70b, alongside discussions on AI ethics and alignment risks. The Aphrodite engine showed strong performance on A6000 GPUs with specific configurations. Role-playing AI models such as Mixtral and Flatdolphinmaid faced challenges with repetitiveness, while Noromaid and Rpcal performed better, with ChatML and DPO recommended for improved responses. Learning resources like fast.ai's course were highlighted for ML/DL beginners, and fine-tuning techniques with optimizers like Paged 8bit lion and adafactor were discussed. At Nous Research AI, the Activation Beacon project introduced a method for unlimited context length in LLMs using "global state" tokens, potentially transforming retrieval-augmented models. The Eagle-7B model, based on RWKV-v5, outperformed Mistral in benchmarks with efficiency and multilingual capabilities. OpenHermes2.5 was recommended for consumer hardware due to its quantization methods. Multimodal and domain-specific models like IMP v1-3b, Bakllava, Moondream, and Qwen-vl were explored for classification and vision-language tasks. The community emphasized centralizing AI resources for collaborative research.

RWKV "Eagle" v5: Your move, Mamba

GPT4Turbo A/B Test: gpt-4-1106-preview

Adept Fuyu-Heavy: Multimodal model for Agents

Google Solves Text to Video

Nightshade poisons AI art... kinda?

1/17/2024: Help crowdsource function calling datasets

1/8/2024: The Four Wars of the AI Stack

1/6-7/2024: LlaMA Pro - an alternative to PEFT/RAG??

1/4/2024: Jeff Bezos backs Perplexity's $520m Series B.

12/31/2023: Happy New Year

12/30/2023: Mega List of all LLMs

12/27/2023: NYT vs OpenAI

12/24/2023: Dolphin Mixtral 8x7b is wild

12/19/2023: Everybody Loves OpenRouter

12/13/2023 SOLAR10.7B upstages Mistral7B?

12/12/2023: Towards LangChain 0.1

12/11/2023: Mixtral beats GPT3.5 and Llama2-70B

12/10/2023: not much happened today

12/9/2023: The Mixtral Rush

12/8/2023 - Mamba v Mistral v Hyena