All tags
Topic: "open-source"
Reasoning Price War 2: Mistral Magistral + o3's 80% price cut + o3-pro
o3 o3-pro gpt-4.1 claude-4-sonnet gemini-2.5-pro magistral-small magistral-medium mistral-small-3.1 openai anthropic google-deepmind mistral-ai perplexity-ai reasoning token-efficiency price-cut benchmarking open-source model-releases context-windows gpu-optimization swyx sama scaling01 polynoamial nrehiew_ kevinweil gdb flavioad stevenheidel aravsrinivas
OpenAI announced an 80% price cut for its o3 model, making it competitively priced with GPT-4.1 and rivaling Anthropic's Claude 4 Sonnet and Google's Gemini 2.5 Pro. Alongside, o3-pro was released as a more powerful and reliable variant, though early benchmarks showed mixed performance relative to cost. Mistral AI launched its Magistral reasoning models, including an open-source 24B parameter version optimized for efficient deployment on consumer GPUs. The price reduction and new model releases signal intensified competition in reasoning-focused large language models, with notable improvements in token efficiency and cost-effectiveness.
not much happened today
dots-llm1 qwen3-235b xiaohongshu rednote-hilab deepseek huggingface mixture-of-experts open-source model-benchmarking fine-tuning inference context-windows training-data model-architecture model-performance model-optimization
China's Xiaohongshu (Rednote) released dots.llm1, a 142B parameter open-source Mixture-of-Experts (MoE) language model with 14B active parameters and a 32K context window, pretrained on 11.2 trillion high-quality, non-synthetic tokens. The model ships with Docker images and supports efficient inference via Hugging Face and vLLM, and provides intermediate checkpoints every 1 trillion tokens, enabling flexible fine-tuning. Benchmarking claims it slightly surpasses Qwen3 235B on MMLU, though some concerns exist about benchmark selection and synthetic data verification. The release is notable for its truly open-source licensing and avoidance of synthetic training data, sparking community optimism for support in frameworks such as llama.cpp and mlx.
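For readers new to MoE, the gap between 142B total and 14B active parameters falls out of top-k expert routing: each token only runs through the few experts its router selects. A minimal numpy sketch of that mechanism (illustrative dimensions and router only, not dots.llm1's actual configuration):

```python
import numpy as np

rng = np.random.default_rng(0)
d, n_experts, top_k = 64, 8, 2

gate_w = rng.standard_normal((d, n_experts))                  # router weights
expert_ws = [rng.standard_normal((d, d)) / np.sqrt(d) for _ in range(n_experts)]

def moe_forward(x):
    """Route a token through its top-k experts; the unselected experts never
    run, which is how total parameters can far exceed active parameters."""
    scores = x @ gate_w                        # (n_experts,) router logits
    top = np.argsort(scores)[-top_k:]          # indices of the top_k experts
    w = np.exp(scores[top] - scores[top].max())
    w /= w.sum()                               # softmax over selected experts
    return sum(wi * (x @ expert_ws[i]) for wi, i in zip(w, top))

token = rng.standard_normal(d)
print(moe_forward(token).shape)                # (64,)
```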
AI Engineer World's Fair Talks Day 1
gemini-2.5 gemma claude-code mistral cursor anthropic openai aie google-deepmind meta-ai-fair agent-based-architecture open-source model-memorization scaling-laws quantization mixture-of-experts language-model-memorization model-generalization langgraph model-architecture
Mistral launched Mistral Code, and Cursor released version 1.0. Anthropic improved Claude Code plans, while OpenAI announced expanded ChatGPT connectors. The day was dominated by AIE keynotes and tracks including GraphRAG, RecSys, and Tiny Teams. On Reddit, Google open-sourced the DeepSearch stack for building AI agents with Gemini 2.5 and LangGraph, enabling flexible agent architectures and integration with local LLMs like Gemma. A new Meta paper analyzed language model memorization, showing GPT-style transformers store about 3.5–4 bits per parameter and exploring the transition from memorization to generalization, with implications for Mixture-of-Experts models and quantization effects.
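The bits-per-parameter figure invites a quick back-of-envelope capacity estimate; a sketch assuming the paper's roughly 3.6 bits/parameter extrapolates linearly with model size:

```python
# Rough memorization capacity from the ~3.6 bits/parameter estimate.
# Assumption: the per-parameter figure holds across the sizes shown.
BITS_PER_PARAM = 3.6

for n_params in (125e6, 1.3e9, 8e9):
    capacity_gb = n_params * BITS_PER_PARAM / 8 / 1e9
    print(f"{n_params/1e9:5.2f}B params -> at most ~{capacity_gb:.2f} GB memorized")
```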
not much happened today
deepseek-r1-0528 pali-gemma-2 gemma-3 shieldgemma-2 txgemma gemma-3-qat gemma-3n-preview medgemma dolphingemma signgemma claude-4 opus-4 claude-sonnet-4 codestral-embed bagel qwen nemotron-cortexa gemini-2.5-pro deepseek-ai huggingface gemma claude bytedance qwen nemotron sakana-ai-labs benchmarking model-releases multimodality code-generation model-performance long-context reinforcement-learning model-optimization open-source yuchenj_uw _akhaliq clementdelangue osanseviero alexalbert__ guillaumelample theturingpost lmarena_ai epochairesearch scaling01 nrehiew_ ctnzr
DeepSeek R1-0528 was released, with availability on Hugging Face and inference partners. The Gemma model family continues prolific development, including PaliGemma 2, Gemma 3, and others. Claude 4 and its variants Opus 4 and Claude Sonnet 4 show top benchmark performance, including a new SOTA on ARC-AGI-2 and WebDev Arena. Codestral Embed introduces a 3072-dimensional code embedder. BAGEL, an open-source multimodal model by ByteDance, supports reading, reasoning, drawing, and editing with long mixed contexts. Benchmarking highlights include Nemotron-CORTEXA topping SWE-Bench and Gemini 2.5 Pro's showing on VideoGameBench. Discussions also focus on the effectiveness of random rewards when training Qwen models. "Opus 4 NEW SOTA ON ARC-AGI-2. It's happening - I was right" and "Claude 4 launch has dev moving at a different pace" reflect excitement in the community.
Mistral's Agents API and the 2025 LLM OS
qwen claude-4 chatgpt o3 o4 mistral-ai langchain-ai openai meta-ai-fair agent-frameworks multi-agent-systems tool-use code-execution web-search model-context-protocol persistent-memory function-calling open-source no-code reinforcement-learning model-performance agent-orchestration omarsar0 simonw swyx scaling01
The LLM OS concept has evolved since 2023, with Mistral AI releasing a new Agents API that includes code execution, web search, persistent memory, and agent orchestration. LangChainAI introduced the Open Agent Platform (OAP), an open-source no-code platform for intelligent agents. OpenAI plans to develop ChatGPT into a super-assistant by H1 2025, competing with Meta. Discussions around Qwen models focus on reinforcement learning effects, while Claude 4 performance is also noted. The AI Engineer World's Fair is calling for volunteers.
not much happened today
phi-4 phi-4-mini-reasoning qwen3-235b qwen3-moe-235b qwen3-moe-30b qwen3-dense-32b qwen3-dense-14b qwen3-dense-8b qwen3-dense-4b qwen3-dense-0.6b qwen2.5-omni-3b deepseek-prover-v2 llama llama-guard-4 prompt-guard-2 mimo-7b microsoft anthropic cursor alibaba togethercompute deepseek meta-ai-fair xiaomi openrouterai cohere reasoning model-fine-tuning model-evaluation benchmarking model-popularity open-source math model-scaling model-filtering jailbreak-prevention cline reach_vb vipulved akhaliq omarsar0 zhs05232838 huajian_xin mervenoyann karpathy random_walker sarahookr blancheminerva clefourrier
Microsoft released Phi-4-reasoning, a finetuned 14B reasoning model slightly behind QwQ but limited by data-transparency and token-efficiency issues. Anthropic introduced remote MCP server support and a 45-minute Research mode in Claude. Cursor published a model popularity list. Alibaba launched Qwen3-235B and other Qwen3 variants, highlighting budget-friendly coding and reasoning capabilities, with availability on the Together AI API. Microsoft also released Phi-4-Mini-Reasoning, benchmarked on AIME 2025 and OmniMath. DeepSeek announced DeepSeek-Prover V2 with state-of-the-art math problem solving, scaling to 671B parameters. Meta AI's Llama models hit 1.2 billion downloads, with new Llama Guard 4 and Prompt Guard 2 for input/output filtering and jailbreak prevention. Xiaomi released the open-source reasoning model MiMo-7B, trained on 25 trillion tokens. Discussions on AI model evaluation highlighted issues with the LMArena leaderboard, data-access biases favoring proprietary models, and challenges in maintaining fair benchmarking, with suggestions for alternatives like OpenRouter rankings. "LMArena slop and biased" and "61.3% of all data going to proprietary model providers" were noted concerns.
ChatGPT responds to GlazeGate + LMArena responds to Cohere
qwen3-235b-a22b qwen3 qwen3-moe llama-4 openai cohere lm-arena deepmind x-ai meta-ai-fair alibaba vllm llamaindex model-releases model-benchmarking performance-evaluation open-source multilinguality model-integration fine-tuning model-optimization joannejang arankomatsuzaki karpathy sarahookr reach_vb
OpenAI faced backlash after a controversial ChatGPT update, leading to an official retraction admitting they "focused too much on short-term feedback." Researchers from Cohere published a paper criticizing LMArena for unfair practices favoring incumbents like OpenAI, DeepMind, X.ai, and Meta AI Fair. The Qwen3 family by Alibaba was released, featuring models up to 235B MoE, supporting 119 languages and trained on 36 trillion tokens, with integration into vLLM and support in tools like llama.cpp. Meta announced the second round of Llama Impact Grants to promote open-source AI innovation. Discussions on AI Twitter highlighted concerns about leaderboard overfitting and fairness in model benchmarking, with notable commentary from karpathy and others.
Cognition's DeepWiki, a free encyclopedia of all GitHub repos
o4-mini perception-encoder qwen-2.5-vl dia-1.6b grok-3 gemini-2.5-pro claude-3.7 gpt-4.1 cognition meta-ai-fair alibaba hugging-face openai perplexity-ai vllm vision text-to-speech reinforcement-learning ocr model-releases model-integration open-source frameworks chatbots model-selector silas-alberti mervenoyann reach_vb aravsrinivas vikparuchuri lioronai
Silas Alberti of Cognition announced DeepWiki, a free encyclopedia of all GitHub repos providing Wikipedia-like descriptions and Devin-backed chatbots for public repos. Meta released Perception Encoders (PE) under an Apache 2.0 license, outperforming InternVL3 and Qwen2.5-VL on vision tasks. Alibaba launched the Qwen Chat App for iOS and Android. Hugging Face integrated the Dia 1.6B SoTA text-to-speech model via FAL. OpenAI expanded deep research usage with a lightweight version powered by the o4-mini model, now available to free users. Perplexity AI updated their model selector with Grok 3 Beta, o4-mini, and support for models like Gemini 2.5 Pro, Claude 3.7, and GPT-4.1. The vLLM project introduced the OpenRLHF framework for reinforcement learning from human feedback. The Surya OCR alpha model supports 90+ languages and LaTeX. MegaParse, an open-source library for LLM-ready data formats, was also introduced.
Gemini 2.5 Flash completes the total domination of the Pareto Frontier
gemini-2.5-flash o3 o4-mini google openai anthropic tool-use multimodality benchmarking reasoning reinforcement-learning open-source model-releases chain-of-thought coding-agent sama kevinweil markchen90 alexandr_wang polynoamial scaling01 aidan_mclau cwolferesearch
Gemini 2.5 Flash is introduced with a new "thinking budget" feature offering more control compared to Anthropic and OpenAI models, marking a significant update in the Gemini series. OpenAI launched o3 and o4-mini models, emphasizing advanced tool use capabilities and multimodal understanding, with o3 dominating several leaderboards but receiving mixed benchmark reviews. The importance of tool use in AI research and development is highlighted, with OpenAI Codex CLI announced as a lightweight open-source coding agent. The news reflects ongoing trends in AI model releases, benchmarking, and tool integration.
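In practice the thinking budget is a per-request cap on reasoning tokens. A minimal sketch using the google-genai Python SDK, assuming an API key in the environment; the parameter names follow the launch-era docs and should be verified against current ones:

```python
from google import genai
from google.genai import types

client = genai.Client()  # reads the Gemini API key from the environment
response = client.models.generate_content(
    model="gemini-2.5-flash",
    contents="How many primes are there below 100?",
    config=types.GenerateContentConfig(
        # Cap the tokens spent on internal reasoning; setting the budget to 0
        # disables thinking entirely for latency-sensitive calls.
        thinking_config=types.ThinkingConfig(thinking_budget=1024),
    ),
)
print(response.text)
```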
OpenAI o3, o4-mini, and Codex CLI
o3 o4-mini gemini-2.5-pro claude-3-sonnet chatgpt openai reinforcement-learning performance vision tool-use open-source coding-agents model-benchmarking multimodality scaling inference sama aidan_mclau markchen90 gdb aidan_clark_ kevinweil swyx polynoamial scaling01
OpenAI launched the o3 and o4-mini models, emphasizing improvements in reinforcement-learning scaling and overall efficiency, making o4-mini cheaper and better across prioritized metrics. These models showcase enhanced vision and tool use capabilities, though API access for these features is pending. The release includes Codex CLI, an open-source coding agent that integrates with these models to convert natural language into working code. Accessibility extends to ChatGPT Plus, Pro, and Team users, with o3 being notably more expensive than Gemini 2.5 Pro. Performance benchmarks highlight the intelligence gains from scaling inference, with comparisons against models like Sonnet and Gemini. The launch has been well received despite some less favorable evaluation results.
Google's Agent2Agent Protocol (A2A)
kimi-vl-a3b gpt-4o llama-4-scout llama-4-maverick llama-4-behemoth deepcoder-14b o3-mini o1 llama-3.1-nemotron-ultra-253b deepseek-r1 google google-deepmind moonshot-ai meta-ai-fair uc-berkeley openai nvidia hugging-face togethercompute deepseek agent-interoperability multimodality vision math reinforcement-learning coding model-training open-source model-benchmarking context-windows streaming push-notifications enterprise-authentication model-release reach_vb _akhaliq epochairesearch artificialanlys winglian danielhanchen yuchenj_uw jeremyphoward
Google Cloud Next announcements featured full MCP support from Google and DeepMind and a new Agent2Agent (A2A) protocol designed for agent interoperability with multiple partners. The protocol includes components like the Agent Card, Task communication channels, Enterprise Auth and Observability, and Streaming and Push Notification support. On the model front, Moonshot AI released Kimi-VL-A3B, a multimodal model with 128K context and strong vision and math benchmark performance, outperforming GPT-4o. Meta AI introduced smaller versions of the Llama 4 family: Llama 4 Scout and Llama 4 Maverick, with a larger Behemoth model still in training. DeepCoder 14B from UC Berkeley is an open-source coding model rivaling OpenAI's o3-mini and o1 models, trained with reinforcement learning on 24K coding problems. Nvidia released Llama-3.1-Nemotron-Ultra-253B on Hugging Face, noted for beating Llama 4 Behemoth and Maverick and competing with DeepSeek-R1.
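The Agent Card is the discovery half of A2A: a small JSON document an agent serves (conventionally at /.well-known/agent.json) so peers can find its endpoint and skills. An illustrative card as a Python dict — field names follow the launch announcement, and the exact schema should be checked against the spec:

```python
# Hypothetical A2A Agent Card for a made-up expense agent.
agent_card = {
    "name": "expense-reimbursement-agent",
    "description": "Files and tracks expense reimbursements.",
    "url": "https://agents.example.com/expenses",   # hypothetical endpoint
    "version": "1.0.0",
    "capabilities": {
        "streaming": True,            # supports streamed task updates
        "pushNotifications": True,    # can call back when long tasks finish
    },
    "skills": [
        {
            "id": "submit-expense",
            "name": "Submit expense",
            "description": "Create a reimbursement task from a receipt.",
        }
    ],
}
```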
DeepCoder: A Fully Open-Source 14B Coder at O3-mini Level
deepcoder-14b o3-mini o1 gemini-2.5-pro kimi-vl-a3b gpt-4o llama-4-scout maverick behemoth gen-4-turbo imagen-3 together-ai agentica openai bytedance google-deepmind moonshot-ai meta-ai-fair runway open-source reinforcement-learning code-generation multimodality model-training mixture-of-experts l2-normalization image-generation model-performance context-windows philschmid lepikhin reach_vb akhaliq yuchenj_uw epochairesearch danielhanchen c_valenzuelab
Together AI and Agentica released DeepCoder-14B, an open-source 14B parameter coding model rivaling OpenAI's o3-mini and o1 on coding benchmarks, trained with an open-source RL framework from ByteDance and costing about $26,880. Google DeepMind launched Gemini 2.5 Pro with experimental "Flash" versions available to subscribers. Moonshot AI introduced Kimi-VL-A3B, a multimodal model with 128K context outperforming gpt-4o on vision and math benchmarks. Meta AI released Llama 4 Scout and Maverick, with a larger Behemoth model in training, featuring mixture-of-experts and L2 norm techniques. Runway launched Gen-4 Turbo with 10x better results than Gen-3 at the same cost. Google announced Imagen 3, a high-quality text-to-image model now in Vertex AI, enabling easier object removal. The report highlights open-source contributions, reinforcement learning training optimizations, and significant model performance improvements across coding, multimodal, and image generation domains.
not much happened today
gpt-2 r1 gemma-3 gemmacoder3-12b qwen2.5-omni openai deepseek berkeley alibaba togethercompute nvidia azure runway langchain bmw amazon open-source function-calling benchmarking code-reasoning multimodality inference-speed image-generation voice-generation animation robotics realtime-transcription webrtc sama clémentdelangue lioronai scaling01 cognitivecompai osanseviero jack_w_rae ben_burtenshaw theturingpost vipulved kevinweil tomlikesrobots adcock_brett juberti
OpenAI plans to release its first open-weight language model since GPT-2 in the coming months, signaling a move towards more open AI development. DeepSeek launched its open-source R1 model earlier this year, challenging perceptions of China's AI progress. Gemma 3 has achieved function calling capabilities and ranks on the Berkeley Function-Calling Leaderboard, while GemmaCoder3-12b improves code reasoning performance on LiveCodeBench. Alibaba's Qwen2.5-Omni introduces a novel Thinker-Talker system and TMRoPE for multimodal input understanding. The TogetherCompute team achieved 140 TPS on a 671B parameter model, outperforming Azure and the DeepSeek API on Nvidia GPUs. OpenAI also expanded ChatGPT features with image generation for all free users and a new voice release. Runway Gen-4 enhances animation for miniature dioramas, and LangChain launched a chat-based generative UI agent. Commercial deployment of Figure 03 humanoid robots at BMW highlights advances in autonomy and manufacturing scaling. New tools include OpenAI's realtime transcription API with WebRTC support and Amazon's Nova Act AI browser agent.
not much happened today
deepseek-r1 gemma-3 gemma-3-27b openai nvidia deepseek hugging-face fp8 model-efficiency hardware-requirements quantization benchmarking model-deployment open-source sam-altman
DeepSeek R1 demonstrates significant efficiency using FP8 precision, outperforming Gemma 3 27B in benchmarks with a Chatbot Arena Elo score of 1363 vs. 1338, though it requires substantial hardware, such as 32 H100 GPUs with 2,560GB of combined VRAM. OpenAI labels DeepSeek as "state-controlled" and calls for bans on "PRC-produced" models, sparking community backlash accusing OpenAI and Sam Altman of anti-competitive behavior. Discussions emphasize DeepSeek's openness and affordability compared to OpenAI, with users highlighting its local and Hugging Face deployment options. Meanwhile, Gemma 3 receives mixed community feedback on creativity and worldbuilding.
The new OpenAI Agents Platform
reka-flash-3 o1-mini claude-3-7-sonnet llama-3-3-70b sonic-2 qwen-chat olympiccoder openai reka-ai hugging-face deepseek togethercompute alibaba ai-agents api model-releases fine-tuning reinforcement-learning model-training model-inference multimodality voice-synthesis gpu-clusters model-distillation performance-optimization open-source sama reach_vb
OpenAI introduced a comprehensive suite of new tools for AI agents, including the Responses API, Web Search Tool, Computer Use Tool, File Search Tool, and an open-source Agents SDK with integrated observability tools, marking a significant step towards the "Year of Agents." Meanwhile, Reka AI open-sourced Reka Flash 3, a 21B parameter reasoning model that outperforms o1-mini and powers their Nexus platform, with weights available on Hugging Face. The OlympicCoder series surpassed Claude 3.7 Sonnet and much larger models on competitive coding benchmarks. DeepSeek built a 32K GPU cluster capable of training V3-level models in under a week and is exploring AI distillation. Hugging Face announced Cerebras inference support, achieving over 2,000 tokens/s on Llama 3.3 70B, 70x faster than leading GPUs. Reka's Sonic-2 voice AI model delivers 40ms latency via the Together API. Alibaba's Qwen Chat enhanced its multimodal interface with video understanding up to 500MB, voice-to-text, guest mode, and expanded file uploads. Sam Altman praised OpenAI's new API as "one of the most well-designed and useful APIs ever."
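A minimal sketch of the Responses API with the hosted web-search tool, assuming the official openai Python SDK and an OPENAI_API_KEY in the environment; the tool type string follows the launch docs and may have changed since:

```python
from openai import OpenAI

client = OpenAI()
response = client.responses.create(
    model="gpt-4o",
    tools=[{"type": "web_search_preview"}],   # hosted tool, no local plumbing
    input="What changed in the latest vLLM release?",
)
print(response.output_text)  # SDK helper that concatenates the text output
```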
not much happened today
jamba-1.6 mistral-ocr qwq-32b o1 o3-mini instella llama-3-2-3b gemma-2-2b qwen-2-5-3b babel-9b babel-83b gpt-4o claude-3-7-sonnet ai21-labs mistral-ai alibaba openai amd anthropic hugging-face multimodality ocr multilinguality structured-output on-prem-deployment reasoning benchmarking api open-source model-training gpu-optimization prompt-engineering function-calling
AI21 Labs launched Jamba 1.6, touted as the best open model for private enterprise deployment, outperforming Cohere, Mistral, and Llama on benchmarks like Arena Hard. Mistral AI released a state-of-the-art multimodal OCR model with multilingual and structured output capabilities, available for on-prem deployment. Alibaba Qwen introduced QwQ-32B, an open-weight reasoning model with 32B parameters and cost-effective usage, showing competitive benchmark scores. OpenAI released o1 and o3-mini models with advanced API features including streaming and function calling. AMD unveiled Instella, open-source 3B parameter language models trained on AMD Instinct MI300X GPUs, competing with Llama-3.2-3B and others. Alibaba also released Babel, open multilingual LLMs performing comparably to GPT-4o. Anthropic launched Claude 3.7 Sonnet, enhancing reasoning and prompt engineering capabilities.
GPT 4.5 — Chonky Orion ships!
gpt-4.5 phi-4-multimodal phi-4-mini command-r7b-arabic openai microsoft cohere creative-writing natural-language-processing multimodality math coding context-windows model-releases open-source arabic-language sama kevinweil aidan_mclau omarsar0 rasbt reach_vb
OpenAI released GPT-4.5 as a research preview, highlighting its deep world knowledge, improved understanding of user intent, and a 128,000 token context window. It is noted for excelling in writing, creative tasks, image understanding, and data extraction but is not a reasoning model. Microsoft unveiled Phi-4 Multimodal and Phi-4 Mini, open-source models integrating text, vision, and speech/audio, with strong performance in math and coding tasks. Cohere released Command R7B Arabic, an open-weights model optimized for Arabic language capabilities targeting enterprises in the MENA region. The community is exploring the impact of larger models on creative writing, intent understanding, and world knowledge, with GPT-4.5 expected to be a basis for GPT-5.
lots of small launches
gpt-4o claude-3.7-sonnet claude-3.7 claude-3.5-sonnet deepseek-r1 deepseek-v3 grok-3 openai anthropic amazon cloudflare perplexity-ai deepseek-ai togethercompute elevenlabs elicitorg inceptionailabs mistral-ai voice model-releases cuda gpu-optimization inference open-source api model-performance token-efficiency context-windows jit-compilation lmarena_ai alexalbert__ aravsrinivas reach_vb
GPT-4o Advanced Voice Preview is now available for free ChatGPT users with enhanced daily limits for Plus and Pro users. Claude 3.7 Sonnet has achieved the top rank in WebDev Arena with improved token efficiency. DeepSeek-R1 with 671B parameters benefits from the Together Inference platform optimizing NVIDIA Blackwell GPU usage, alongside the open-source DeepGEMM CUDA library delivering up to 2.7x speedups on Hopper GPUs. Perplexity launched a new Voice Mode and a Deep Research API. The upcoming Grok 3 API will support a 1M token context window. Several companies including Elicit, Amazon, Anthropic, Cloudflare, FLORA, Elevenlabs, and Inception Labs announced new funding rounds, product launches, and model releases.
not much happened today
claude-3.7-sonnet claude-3.7 deepseek-r1 o3-mini deepseek-v3 gemini-2.0-pro gpt-4o qwen2.5-coder-32b-instruct anthropic perplexity-ai amazon google-cloud deepseek_ai coding reasoning model-benchmarking agentic-workflows context-window model-performance open-source moe model-training communication-libraries fp8 nvlink rdma cli-tools skirano omarsar0 reach_vb artificialanlys terryyuezhuo _akhaliq _philschmid catherineols goodside danielhanchen
Claude 3.7 Sonnet demonstrates exceptional coding and reasoning capabilities, outperforming models like DeepSeek R1, O3-mini, and GPT-4o on benchmarks such as SciCode and LiveCodeBench. It is available on platforms including Perplexity Pro, Anthropic, Amazon Bedrock, and Google Cloud, with pricing at $3/$15 per million tokens. Key features include a 64k token thinking mode, 200k context window, and the CLI-based coding assistant Claude Code. Meanwhile, DeepSeek released DeepEP, an open-source communication library optimized for MoE model training and inference with support for NVLink, RDMA, and FP8. These updates highlight advancements in coding AI and efficient model training infrastructure.
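The 64k thinking mode is exposed as an extended-thinking block in the Messages API. A sketch assuming the anthropic Python SDK, with field names from Anthropic's launch docs (verify the model string before use); note max_tokens must exceed the thinking budget:

```python
import anthropic

client = anthropic.Anthropic()
message = client.messages.create(
    model="claude-3-7-sonnet-20250219",
    max_tokens=8192,  # must be larger than the thinking budget below
    thinking={"type": "enabled", "budget_tokens": 4096},
    messages=[{"role": "user", "content": "Prove that sqrt(2) is irrational."}],
)
# The response interleaves "thinking" and "text" content blocks.
for block in message.content:
    if block.type == "text":
        print(block.text)
```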
AI Engineer Summit Day 1
grok-3 o3-mini deepseek-r1 qwen-2.5-vl openai anthropic xai togethercompute alibaba sakana-ai benchmarking model-performance cuda model-training open-source debugging inference-speed batch-size reinforcement-learning aidan_mclau giffmana nrehiew_ teortaxestex epochairesearch andrew_n_carr borismpower yuhu_ai_
The AIE Summit in NYC highlighted key talks including Grace Isford's Trends Keynote, Neo4j/Pfizer's presentation, and OpenAI's first definition of Agents. Speakers announced $930 million in funding. On AI Twitter, discussions focused on Grok-3 and o3-mini models, with debates on performance and benchmarking, including Grok-3's record compute scale of 4e26 to 5e26 FLOP. The o3-mini model uncovered a critical CUDA kernel bug in Sakana AI's code. DeepSeek-R1 was promoted as an open-source alternative with notable training batch sizes. Additionally, Alibaba announced the Qwen 2.5-VL model release.
not much happened today
zonos-v0.1 audiobox-aesthetics moshi sonar llama-3-70b gpt-4o-mini claude-3.5-haiku gpt-4o claude-3.5-sonnet deepseek-r1-distilled-qwen-1.5b reasonflux-32b o1-preview zyphra-ai meta-ai-fair kyutai-labs perplexity-ai cerebras uc-berkeley brilliant-labs google-deepmind text-to-speech speech-to-speech benchmarking model-performance reinforcement-learning math real-time-processing open-source cross-platform-integration multilinguality zero-shot-learning danhendrycks
Zyphra AI launched Zonos-v0.1, a leading open-weight text-to-speech model supporting multiple languages and zero-shot voice cloning. Meta FAIR released the open-source Audiobox Aesthetics model trained on 562 hours of audio data. Kyutai Labs introduced Moshi, a real-time speech-to-speech system with low latency. Perplexity AI announced the Sonar model based on Llama 3.3 70B, outperforming top models like GPT-4o and Claude 3.5 Sonnet at 1,200 tokens/second, powered by Cerebras infrastructure. UC Berkeley open-sourced a 1.5B model trained with reinforcement learning that beats o1-preview on math tasks. ReasonFlux-32B achieved 91.2% on the MATH benchmark, outperforming OpenAI o1-preview. CrossPoster, an AI agent for cross-platform posting, was released using LlamaIndex workflows. Brilliant Labs integrated the Google DeepMind Gemini Live API into smart glasses for real-time translation and object identification.
not much happened today
deepseek-r1 alphageometry-2 claude deepseek openai google-deepmind anthropic langchain adyen open-source reasoning agentic-ai javascript model-release memes ai-development benchmarking akhaliq lmthang aymericroucher vikhyatk swyx
DeepSeek-R1 surpasses OpenAI in GitHub stars, marking a milestone in open-source AI with rapid growth in community interest. AlphaGeometry2 achieves gold-medalist level performance with an 84% solving rate on IMO geometry problems, showcasing significant advancements in AI reasoning. LangChain releases a tutorial for building AI agents in JavaScript, enhancing developer capabilities in agent deployment. Reflections on Anthropic's Claude model reveal early access and influence on AI development timelines. Lighthearted AI humor includes calls to ban second-order optimizers and challenges in web development longevity. The AI Engineer Summit 2025 workshops were announced, continuing community engagement and education.
How To Scale Your Model, by DeepMind
qwen-0.5 google-deepmind deepseek hugging-face transformers inference high-performance-computing robotics sim2real mixture-of-experts reinforcement-learning bias-mitigation rust text-generation open-source omarsar0 drjimfan tairanhe99 guanyashi lioronai _philschmid awnihannun clementdelangue
Researchers at Google DeepMind (GDM) released a comprehensive "little textbook" titled "How To Scale Your Model" covering modern Transformer architectures, inference optimizations beyond O(N^2) attention, and high-performance computing concepts like rooflines. The resource includes practical problems and real-time comment engagement. On AI Twitter, key updates include the open-sourced humanoid robotics model ASAP, which reproduces motions of athletes like Cristiano Ronaldo, LeBron James, and Kobe Bryant; a new paper on Mixture-of-Agents proposing the Self-MoA method for improved LLM output aggregation; training of reasoning LLMs using the GRPO algorithm from DeepSeek, demonstrated on Qwen 0.5B; findings on bias in LLMs used as judges, highlighting the need for multiple independent evaluations; and the release of mlx-rs, a Rust library for machine learning with examples including Mistral text generation. Additionally, Hugging Face launched an AI app store featuring over 400,000 apps with 2,000 new daily additions and 2.5 million weekly visits, enabling AI-powered app search and categorization.
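The roofline idea reduces to one ratio: a kernel is compute-bound only when its arithmetic intensity (FLOPs per byte moved) exceeds the hardware's FLOPs-to-bandwidth ratio. A back-of-envelope in the spirit of the textbook, using approximate public H100 specs as assumptions:

```python
# Approximate H100 SXM specs (assumptions, not exact vendor numbers).
PEAK_FLOPS = 989e12    # ~989 TFLOP/s dense bf16
PEAK_BYTES = 3.35e12   # ~3.35 TB/s HBM bandwidth
ridge = PEAK_FLOPS / PEAK_BYTES   # ~295 FLOPs/byte

def matmul_intensity(m, k, n, bytes_per_elt=2):
    flops = 2 * m * k * n                                  # multiply-accumulates
    bytes_moved = bytes_per_elt * (m * k + k * n + m * n)  # read A, B; write C
    return flops / bytes_moved

for batch in (1, 64, 1024):
    ai = matmul_intensity(batch, 8192, 8192)
    bound = "compute" if ai > ridge else "memory"
    print(f"batch={batch:5d}: {ai:7.1f} FLOPs/byte -> {bound}-bound")
```

Small-batch decoding lands far below the ridge point, which is why inference is usually bandwidth-bound.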
not much happened today
deepseek-r1 deepseek-v3 coder-v2 prover deepseek hugging-face dell openai instruction-tuning performance-benchmarks model-deployment training-costs hardware-scalability ai-safety risk-mitigation ethical-ai open-source gpu-utilization yann-lecun yoshua-bengio francois-chollet giffman
DeepSeek-R1 and DeepSeek-V3 models have made significant advancements, trained on an instruction-tuning dataset of 1.5M samples, including 600,000 reasoning and 200,000 non-reasoning SFT examples. The models demonstrate strong performance benchmarks and are deployed on-premise via collaborations with Dell and Hugging Face. Training costs are estimated around $5.5M to $6M, with efficient hardware utilization on 8xH100 servers. The International AI Safety Report highlights risks such as malicious use, malfunctions, and systemic risks including AI-driven cyberattacks. Industry leaders like Yann LeCun and Yoshua Bengio provide insights on market reactions, AI safety, and ethical considerations, with emphasis on AI's role in creativity and economic incentives.
TinyZero: Reproduce DeepSeek R1-Zero for $30
deepseek-r1 qwen o1 claude-3-sonnet claude-3 prime ppo grpo llama-stack deepseek berkeley hugging-face meta-ai-fair openai deeplearningai reinforcement-learning fine-tuning chain-of-thought multi-modal-benchmark memory-management model-training open-source agentic-workflow-automation model-performance jiayi-pan saranormous reach_vb lmarena_ai nearcyan omarsar0 philschmid hardmaru awnihannun winglian
DeepSeek Mania continues to reshape the frontier model landscape, with Jiayi Pan from Berkeley reproducing the OTHER result from the DeepSeek R1 paper, R1-Zero, with a cost-effective fine-tune of a Qwen model on two math tasks. A key finding is a lower bound on the distillation effect at 1.5B parameters, with RLCoT reasoning emerging as an intrinsic property. Various RL techniques (PPO, DeepSeek's GRPO, PRIME) show similar outcomes, and starting from an Instruct model speeds convergence. The Humanity’s Last Exam (HLE) benchmark introduces a challenging multi-modal test with 3,000 expert-level questions across 100+ subjects, where models perform below 10%, with DeepSeek-R1 achieving 9.4%. DeepSeek-R1 excels in chain-of-thought reasoning, outperforming models like o1 while being 20x cheaper and MIT licensed. The WebDev Arena Leaderboard ranks DeepSeek-R1 #2 in technical domains and #1 under Style Control, closing in on Claude 3.5 Sonnet. OpenAI's Operator is deployed to 100% of Pro users in the US, enabling tasks like ordering meals and booking reservations, and functions as a research assistant for AI paper searches and summaries. Hugging Face announces a leadership change after significant growth, while Meta AI releases the first stable version of Llama Stack with streamlined upgrades and automated verification. DeepSeek-R1's open-source success is celebrated, and technical challenges like memory management on macOS 15+ are addressed with residency sets in MLX for stability.
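The core trick shared by these RL recipes is cheap advantage estimation; GRPO in particular scores each sampled completion against its own group rather than a learned value model. A minimal sketch of that advantage computation (per the DeepSeekMath description; the full algorithm adds a clipped PPO-style objective and a KL penalty):

```python
import numpy as np

def grpo_advantages(rewards):
    """Group-relative advantages: normalize each completion's reward by the
    mean and std of its sampling group, with no critic network."""
    r = np.asarray(rewards, dtype=float)
    return (r - r.mean()) / (r.std() + 1e-8)

# Eight sampled answers to one math prompt; reward 1 if the final answer is correct.
print(grpo_advantages([1, 0, 0, 1, 1, 0, 0, 0]))
```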
OpenAI launches Operator, its first Agent
operator deepseek-r1 videollama-3 llama-4 o1 claude openai anthropic deepseek-ai google-deepmind perplexity-ai computer-using-agent reasoning multimodality performance-benchmarks open-source ai-safety benchmarking video-generation model-evaluation sam-altman swyx
OpenAI launched Operator, a premium computer-using agent for web tasks like booking and ordering, available now for Pro users in the US with an API promised. It runs long-horizon tasks on remote VMs for up to 20 minutes and supports video export, showing state-of-the-art agent performance that is not yet human-level. Anthropic had launched a similar agent as an open-source demo 3 months earlier. DeepSeek AI unveiled DeepSeek R1, an open-source reasoning model excelling on the Humanity's Last Exam dataset, outperforming models like LLaMA 4 and OpenAI's o1. Alibaba's DAMO Academy open-sourced VideoLLaMA 3, a multimodal foundation model for image and video understanding. Perplexity AI released Perplexity Assistant for Android with reasoning and search capabilities. The Humanity's Last Exam dataset contains 3,000 questions testing AI reasoning, with current models scoring below 10% accuracy, indicating room for improvement. OpenAI's Computer-Using Agent (CUA) shows improved performance on OSWorld and WebArena benchmarks but still lags behind humans. Anthropic introduced Citations for safer AI responses. Sam Altman and swyx commented on Operator's launch and capabilities.
Project Stargate: $500b datacenter (1.7% of US GDP) and Gemini 2 Flash Thinking 2
gemini-2.0-flash deepseek-r1 qwen-32b openai softbank oracle arm microsoft nvidia huggingface deepseek-ai long-context quantization code-interpretation model-distillation open-source agi-research model-performance memory-optimization noam-shazeer liang-wenfeng
Project Stargate, a US "AI Manhattan Project" led by OpenAI and SoftBank and supported by Oracle, Arm, Microsoft, and NVIDIA, was announced at a scale dwarfing the original Manhattan Project's roughly $35B inflation-adjusted cost. Despite Microsoft's reduced role as exclusive compute partner, the project is serious but not immediately practical. Meanwhile, Noam Shazeer revealed a second major update to Gemini 2.0 Flash Thinking, enabling a 1M-token long context, usable immediately. Additionally, AI Studio introduced a new code interpreter feature. On Reddit, DeepSeek R1, a distillation into Qwen 32B, was released for free on HuggingChat, sparking discussions on self-hosting, performance issues, and quantization techniques. DeepSeek's CEO Liang Wenfeng highlighted their focus on fundamental AGI research, efficient MLA architecture, and commitment to open-source development despite export restrictions, positioning DeepSeek as a potential alternative to closed-source AI trends.
not much happened today
helium-1 qwen-2.5 phi-4 sky-t1-32b-preview o1 codestral-25.01 phi-3 mistral llama-3 gpt-3.5 llama-3 gpt-3.5 llmquoter kyutai-labs lmstudio mistralai llamaindex huggingface langchainai hyperbolic-labs replit fchollet philschmid multilinguality token-level-distillation context-windows model-performance open-source reasoning coding retrieval-augmented-generation hybrid-retrieval multiagent-systems video large-video-language-models dynamic-ui voice-interaction gpu-rentals model-optimization semantic-deduplication model-inference reach_vb awnihannun lior_on_ai sophiamyang omarsar0 skirano yuchenj_uw fchollet philschmid
Helium-1 Preview by Kyutai Labs is a 2B-parameter multilingual base LLM outperforming Qwen 2.5, trained on 2.5T tokens with a 4096 context size using token-level distillation from a 7B model. Phi-4 (4-bit) was released in LM Studio, noted for speed and performance on an M4 Max. Sky-T1-32B-Preview is a $450 open-source reasoning model matching o1's performance with strong benchmark scores. Codestral 25.01 by Mistral AI is a new SOTA coding model supporting 80+ programming languages and offering 2x faster generation.
Innovations include AutoRAG for optimizing retrieval-augmented generation pipelines, Agentic RAG for autonomous query reformulation and critique, Multiagent Finetuning using societies of models like Phi-3, Mistral, LLaMA-3, and GPT-3.5 for reasoning improvements, and VideoRAG incorporating video content into RAG with LVLMs.
Applications include a dynamic UI AI chat app by skirano on Replit, LangChain tools like DocTalk for voice PDF conversations, AI travel agent tutorials, and news summarization agents. Hyperbolic Labs offers competitive GPU rentals including H100, A100, and RTX 4090. LLMQuoter enhances RAG accuracy by identifying key quotes.
Infrastructure updates include MLX export for LLM inference from Python to C++ by fchollet and SemHash semantic text deduplication by philschmid.
not much happened today
cosmos nvidia openai robotics autonomous-driving open-source fine-tuning foundation-models memory-optimization sama
NVIDIA has launched Cosmos, an open-source video world model trained on 20 million hours of video, aimed at advancing robotics and autonomous driving. The release sparked debate over its open-source status and technical approach. Additionally, NVIDIA announced Digits, a $3,000 personal AI supercomputer designed to democratize AI computing. The AI community expresses mixed feelings about rapid AI progress, with concerns about AGI, job displacement, and investment hype. Discussions also highlight upcoming tools for fine-tuning AI models at home and foundation models for AI robotics.
not much happened to end the year
deepseek-v3 code-llm o1 sonnet-3.5 deepseek smol-ai reinforcement-learning reasoning training-data mixed-precision-training open-source multimodality software-development natural-language-processing interpretability developer-tools real-time-applications search sdk-generation corbtt tom_doerr cognitivecompai alexalbert__ theturingpost svpino bindureddy
Reinforcement Fine-Tuning (RFT) is introduced as a data-efficient method to improve reasoning in LLMs using minimal training data with strategies like First-Correct Solutions (FCS) and Greedily Diverse Solutions (GDS). DeepSeek-V3, a 671B parameter MoE language model trained on 14.8 trillion tokens with FP8 mixed precision training, highlights advances in large-scale models and open-source LLMs. Predictions for AI in 2025 include growth in smaller models, multimodality, and challenges in open-source AI. The impact of AI on software development jobs suggests a need for higher intelligence and specialization as AI automates low-skilled tasks. Enhancements to CodeLLM improve coding assistance with features like in-place editing and streaming responses. Natural Language Reinforcement Learning (NLRL) offers better interpretability and richer feedback for AI planning and critique. AI hiring is growing rapidly with startups seeking strong engineers in ML and systems. New AI-powered tools such as Rivet, Buzee, and Konfig improve real-time applications, search, and SDK generation using technologies like Rust and V8 isolates.
not much happened today
deepseek-v3 chatgpt-4 openai deepseek google qwen overfitting reasoning misguided-attention model-evaluation model-architecture finetuning open-source sam-altman
Sam Altman publicly criticizes DeepSeek and Qwen models, sparking debate about OpenAI's innovation claims and reliance on foundational research like the Transformer architecture. DeepSeek V3 shows significant overfitting issues in the Misguided Attention evaluation, solving only 22% of test prompts, raising concerns about its reasoning and finetuning. Despite skepticism about its open-source status, DeepSeek V3 is claimed to surpass GPT-4 as an open-source model, marking a milestone 1.75 years after GPT-4's release on March 14, 2023. The discussions highlight competitive dynamics in AI model performance and innovation sustainability.
Genesis: Generative Physics Engine for Robotics (o1-2024-12-17)
o1 gemini-2.0-pro openai google carnegie-mellon-university universal-physics-engine robotics-simulation physics-simulation photo-realistic-rendering generative-data simulation-platform open-source function-calling vision performance-benchmarks sdk realtime-api zhou-xian aidan_mclau sundar-pichai
Genesis is a newly announced universal physics engine developed by a large-scale collaboration led by CMU PhD student Zhou Xian. It integrates multiple state-of-the-art physics solvers to simulate diverse materials and physical phenomena, targeting robotics applications with features like lightweight, ultra-fast simulation, photo-realistic rendering, and generative data capabilities. The engine is open source and designed for robotics simulation beyond just video generation. Additionally, OpenAI released the o1 model to the API with advanced features like function calling and vision support, showing strong math and coding performance. Google teased updates on Gemini 2.0 Pro, accelerating deployment for advanced users.
LMSys killed Model Versioning (gpt 4o 1120, gemini exp 1121)
gpt-4o-2024-11-20 gemini-exp-1121 deepseek-r1 openai google-deepmind anthropic deepseek mistral-ai model-release model-ranking open-source vision coding reasoning market-competition
AI News for 11/21/2024-11/22/2024 highlights the intense frontier lab race with OpenAI's gpt-4o-2024-11-20 and Google DeepMind's gemini-exp-1121 trading top spots on the Lmsys leaderboard. The trend of using date-based model identifiers instead of traditional versioning is noted across leading labs including Anthropic. DeepSeek R1 is gaining attention as a potent open-source alternative, especially in the context of the AI competition between China and the US. Gemini-Exp-1121 is praised for improvements in vision, coding, and reasoning, while MistralAI expands with a new Palo Alto office, signaling growth and hiring.
BitNet was a lie?
qwen-2.5-coder-32b-instruct gpt-4o llama-3 sambanova alibaba hugging-face quantization scaling-laws model-efficiency fine-tuning model-performance code-generation open-source unit-testing ci-cd tanishq-kumar tim-dettmers
A group led by Chris Ré has derived modified scaling laws for quantization, analyzing over 465 pretraining runs and finding that the benefits of lower precision plateau at FP6. Lead author Tanishq Kumar highlights that longer training and more data increase sensitivity to quantization, explaining challenges with models like Llama-3. Tim Dettmers, author of QLoRA, warns that the era of efficiency gains from low-precision quantization is ending, signaling a shift from scaling to optimizing existing resources. Additionally, Alibaba announced Qwen 2.5-Coder-32B-Instruct, which matches or surpasses GPT-4o on coding benchmarks, and open-source initiatives like DeepEval for LLM testing are gaining traction.
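To make the precision trade-off concrete, here is a minimal symmetric integer quantizer showing how rounding error grows as bits shrink — the quantity that, per this line of work, heavily trained models tolerate increasingly poorly (illustrative Gaussian weights, not a real checkpoint):

```python
import numpy as np

def quantize(w, bits):
    """Symmetric round-to-nearest quantization onto a 2^bits-level grid."""
    scale = np.abs(w).max() / (2 ** (bits - 1) - 1)
    q = np.round(w / scale)
    return q * scale  # dequantized weights on the low-precision grid

rng = np.random.default_rng(0)
w = rng.standard_normal(100_000).astype(np.float32)
for bits in (8, 6, 4, 3):
    mse = np.mean((w - quantize(w, bits)) ** 2)
    print(f"int{bits}: MSE {mse:.2e}")
```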
Not much happened today
grok-beta llama-3-1-70b claude-3-5-haiku claude-3-opus llama-3 chatgpt gemini meta-ai-fair scale-ai anthropic perplexity-ai langchainai weights-biases qwen pricing national-security defense open-source agentic-ai retrieval-augmented-generation election-predictions real-time-updates annotation ai-ecosystem memes humor alexandr_wang svpino aravsrinivas bindureddy teortaxestex jessechenglyu junyang-lin cte_junior jerryjliu0
Grok Beta surpasses Llama 3.1 70B in intelligence but is less competitive due to its pricing at $5/1M input tokens and $15/1M output tokens. Defense Llama, developed with Meta AI and Scale AI, targets American national security applications. SWE-Kit, an open-source framework, supports building customizable AI software engineers compatible with Llama 3, ChatGPT, and Claude. LangChainAI and Weights & Biases integrate to improve retrievers and reduce hallucinations in RAG applications using Gemini. Perplexity AI offers enhanced election tracking tools for the 2024 elections, including live state results and support for Claude 3.5 Haiku. AI Talk launched featuring discussions on Chinese AI labs with guests from Qwen. Memes highlight Elon Musk and humorous AI coding mishaps.
DeepSeek Janus and Meta SpiRit-LM: Decoupled Image and Expressive Voice Omnimodality
nemotron-70b claude claude-3.5-sonnet gpt-4o deepseek meta-ai-fair wandb nvidia anthropic hugging-face perplexity-ai multimodality image-generation speech-synthesis fine-tuning model-merging benchmarking open-source model-optimization reinforcement-learning bindureddy aravsrinivas danielhanchen clementdelangue cwolferesearch
DeepSeek Janus and Meta SpiRit-LM are two notable multimodal AI models recently released, showcasing advances in image generation and speech synthesis respectively. DeepSeek Janus separates vision encoders for image understanding and generation, achieving better results in both tasks. Meta's SpiRit-LM introduces an expressive speech and writing model generating pitch and style units, improving over standard TTS. Additionally, W&B Weave offers comprehensive LLM observability and multimodality fine-tuning tools. Industry updates include Nvidia's Nemotron 70B model underperforming, Meta open-sourcing Movie Gen Bench for media generation benchmarking, Perplexity launching internal search with multi-step reasoning, and Anthropic updating Claude apps. Open-source progress includes Hugging Face's gradient accumulation fix in transformers and advocacy for open-source AI to prevent Big Tech dominance. "Model merging for combining skills of multiple models" is also highlighted.
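The gradient accumulation fix concerns loss normalization: averaging each micro-batch's mean loss over-weights tokens from short batches, while the corrected estimator divides the summed loss by the total token count. A toy illustration of the discrepancy (plain Python, not the actual transformers patch):

```python
def naive_accumulated_loss(micro_batches):
    # Buggy: mean of per-micro-batch means.
    return sum(sum(l) / len(l) for l in micro_batches) / len(micro_batches)

def fixed_accumulated_loss(micro_batches):
    # Correct: total loss over total tokens, matching an unaccumulated batch.
    total = sum(sum(l) for l in micro_batches)
    n_tokens = sum(len(l) for l in micro_batches)
    return total / n_tokens

batches = [[1.0, 1.0, 1.0, 1.0], [3.0]]   # a 4-token and a 1-token micro-batch
print(naive_accumulated_loss(batches))     # 2.0 -- short batch over-weighted
print(fixed_accumulated_loss(batches))     # 1.4 -- true per-token mean
```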
Did Nvidia's Nemotron 70B train on test?
nemotron-70b llama-3.1-70b llama-3.1 ministral-3b ministral-8b gpt-4o claude-3.5-sonnet claude-3.5 nvidia mistral-ai hugging-face zep benchmarking reinforcement-learning reward-models temporal-knowledge-graphs memory-layers context-windows model-releases open-source reach_vb philschmid swyx
NVIDIA's Nemotron-70B model has drawn scrutiny despite strong benchmark performances on Arena Hard, AlpacaEval, and MT-Bench, with some standard benchmarks like GPQA and MMLU Pro showing no improvement over the base Llama-3.1-70B. The new HelpSteer2-Preference dataset improves some benchmarks with minimal losses elsewhere. Meanwhile, Mistral released Ministral 3B and 8B models featuring 128k context length and outperforming Llama-3.1 and GPT-4o on various benchmarks under the Mistral Commercial License. NVIDIA's Nemotron 70B also surpasses GPT-4o and Claude-3.5-Sonnet on key benchmarks using RLHF (REINFORCE) training. Additionally, Zep introduced Graphiti, an open-source temporal knowledge graph memory layer for AI agents, built on Neo4j.
The AI Nobel Prize
claude-3.5-sonnet reka-flash got openai anthropic reka-ai zep artificial-neural-networks nobel-prize knowledge-graphs memory-layers real-time-voice-api vision fine-tuning prompt-caching multimodality function-calling ocr open-source single-sign-on software-testing ai-assisted-coding ai-ethics geoff-hinton john-hopfield philschmid alexalbert mervenoyann clementdelangue svpino bindureddy ylecun rohanpaul_ai
Geoff Hinton and John Hopfield won the Nobel Prize in Physics for their work on artificial neural networks. The award citation spans 14 pages highlighting their contributions. Zep released a new community edition of their low-latency memory layer for AI agents, emphasizing knowledge graphs for memory. At OpenAI's DevDay, new features like the real-time voice API, vision model fine-tuning, and prompt caching with a 50% discount on reused tokens were introduced. Anthropic's Claude 3.5 Sonnet was recognized as the best model currently available. Reka AI Labs updated their Reka Flash model with enhanced multimodal and function-calling capabilities. The GOT (General OCR Theory) model achieved 98.79% accuracy on OCR benchmarks. Discussions on open-source AI models highlighted their role in fostering competition and decentralization. Software development insights included the importance of Single Sign-On (SSO), thorough testing, and AI-assisted coding workflows. Ethical and societal topics covered critiques of tax policies and the appointment of France's first Minister of AI.
Liquid Foundation Models: A New Transformers alternative + AINews Pod 2
llama-3-2 gemini-1.5-pro-002 gemini-1.5-flash-002 liquid-ai meta-ai-fair google-deepmind openai reinforcement-learning multimodality model-efficiency foundation-models audio-processing model-deployment open-source ylecun svpino
Liquid.ai emerged from stealth with three subquadratic foundation models demonstrating superior efficiency compared to state space models and Apple's on-device and server models, backed by a $37M seed round. Meta AI announced Llama 3.2 with multimodal vision-enabled models and lightweight text-only variants for mobile. Google DeepMind introduced production-ready Gemini-1.5-Pro-002 and Gemini-1.5-Flash-002 models with improved pricing and rate limits, alongside AlphaChip, an AI-driven chip design system using reinforcement learning for rapid superhuman layouts. OpenAI enhanced ChatGPT Plus and Teams with Advanced Voice Mode featuring Custom Instructions, Memory, and new nature-inspired voices. California's Governor vetoed the SB-1047 AI regulation bill, celebrated by AI community figures like ylecun and svpino as a win for open-source AI. Google upgraded NotebookLM with audio overviews supporting YouTube and audio files, turning documents into AI-generated podcasts. "Open source in AI is thriving," noted ylecun, highlighting 1 million models on GitHub and Hugging Face.
ChatGPT Advanced Voice Mode
o1-preview qwen-2.5 llama-3 claude-3.5 openai anthropic scale-ai togethercompute kyutai-labs voice-synthesis planning multilingual-datasets retrieval-augmented-generation open-source speech-assistants enterprise-ai price-cuts benchmarking model-performance sam-altman omarsar0 bindureddy rohanpaul_ai _philschmid alexandr_wang svpino ylecun _akhaliq
OpenAI rolled out ChatGPT Advanced Voice Mode with 5 new voices and improved accent and language support, available widely in the US. Ahead of rumored updates for Llama 3 and Claude 3.5, Gemini Pro saw a significant price cut aligning with the new intelligence frontier pricing. OpenAI's o1-preview model showed promising planning task performance with 52.8% accuracy on Randomized Mystery Blocksworld. Anthropic is rumored to release a new model, generating community excitement. Qwen 2.5 was released with models up to 32B parameters and support for 128K tokens, matching GPT-4 0613 benchmarks. Research highlights include PlanBench evaluation of o1-preview, OpenAI's release of a multilingual MMMLU dataset covering 14 languages, and RAGLAB framework standardizing Retrieval-Augmented Generation research. New AI tools include PDF2Audio for converting PDFs to audio, an open-source AI starter kit for local model deployment, and Moshi, a speech-based AI assistant from Kyutai. Industry updates feature Scale AI nearing $1B ARR with 4x YoY growth and Together Compute's enterprise platform offering faster inference and cost reductions. Insights from Sam Altman's blog post were also shared.
not much happened today
o1-preview o1-mini qwen-2.5 gpt-4o deepseek-v2.5 gpt-4-turbo-2024-04-09 grin llama-3-1-405b veo kat openai qwen deepseek-ai microsoft kyutai-labs perplexity-ai together-ai meta-ai-fair google-deepmind hugging-face google anthropic benchmarking math coding instruction-following model-merging model-expressiveness moe voice voice-models generative-video competition open-source model-deployment ai-agents hyung-won-chung noam-brown bindureddy akhaliq karpathy aravsrinivas fchollet cwolferesearch philschmid labenz ylecun
OpenAI's o1-preview and o1-mini models lead benchmarks in Math, Hard Prompts, and Coding. Qwen 2.5 72B model shows strong performance close to GPT-4o. DeepSeek-V2.5 tops Chinese LLMs, rivaling GPT-4-Turbo-2024-04-09. Microsoft's GRIN MoE achieves good results with 6.6B active parameters. Moshi voice model from Kyutai Labs runs locally on Apple Silicon Macs. Perplexity app introduces voice mode with push-to-talk. LlamaCoder by Together.ai uses Llama 3.1 405B for app generation. Google DeepMind's Veo is a new generative video model for YouTube Shorts. The 2024 ARC-AGI competition increases prize money and plans a university tour. A survey on model merging covers 50+ papers for LLM alignment. The Kolmogorov–Arnold Transformer (KAT) paper proposes replacing MLP layers with KAN layers for better expressiveness. Hugging Face Hub integrates with Google Cloud Vertex AI Model Garden for easier open-source model deployment. Agent.ai is introduced as a professional network for AI agents. "Touching grass is all you need."
not much happened this weekend
jamba-1.5 dream-machine-1.5 ideogram-v2 mistral-nemo-minitron-8b mistral-7b llama-3-8b nous-research cursor-ai gdm george-hotz agibot unitree eth-zurich disney uc-san-diego ai21-labs luma-labs ideogram nvidia mistral-ai meta-ai-fair distributed-ai optimizer inter-gpu-communication low-latency-training open-source humanoid-robots robotics physics-based-motion teleoperation multilingual-models long-context text-to-video text-to-image model-performance george-hotz adcock_brett aman
Nous Research announced DisTrO, a new optimizer that drastically reduces inter-GPU communication by 1,000x to 10,000x, enabling efficient training on slow networks and offering an alternative to GDM's DiLoCo. Cursor AI gained viral attention from an 8-year-old user and announced a new fundraise, with co-host Aman returning to their podcast. George Hotz launched tinybox for sale. In robotics, AGIBOT revealed 5 new humanoid robots with open-source plans, and Unitree showcased its G1 humanoid robot nearing mass production at $16,000. ETH Zurich and Disney developed an AI system for physics-based robot motion generation from text or images. UC San Diego released ACE, an open-source teleoperation system for controlling multiple robots. AI21 Labs unveiled Jamba 1.5, a multilingual model with 256k context length and permissive licensing. Luma Labs released Dream Machine 1.5 for improved text-to-video generation. Ideogram launched v2 of its text-to-image model with near-perfect text generation. Nvidia and Mistral released Mistral-NeMo-Minitron 8B, a small model outperforming Mistral-7B and Llama-3-8B on the Open LLM Leaderboard.
Gemini Live
gemini-1.5-pro genie falcon-mamba gemini-1.5 llamaindex google anthropic tii supabase perplexity-ai llamaindex openai hugging-face multimodality benchmarking long-context retrieval-augmented-generation open-source model-releases model-integration model-performance software-engineering linear-algebra hugging-face-hub debugging omarsar0 osanseviero dbrxmosaicai alphasignalai perplexity_ai _jasonwei svpino
Google launched Gemini Live on Android for Gemini Advanced subscribers during the Pixel 9 event, featuring integrations with Google Workspace apps and other Google services. The rollout began on 8/12/2024, with iOS support planned. Cosine released Genie, an AI software engineering system achieving a 57% improvement on SWE-Bench. TII introduced Falcon Mamba, a 7B attention-free open-access model scalable to long sequences. Benchmarking showed that longer context lengths do not always improve Retrieval-Augmented Generation. Supabase launched an AI-powered Postgres service dubbed the "ChatGPT of databases," fully open source. Perplexity AI partnered with Polymarket to integrate real-time probability predictions into search results. A tutorial demonstrated a multimodal recipe recommender using Qdrant, LlamaIndex, and Gemini. An OpenAI engineer shared success tips emphasizing debugging and hard work. The connection between matrices and graphs in linear algebra was highlighted for insights into nonnegative matrices and strongly connected components. Keras 3.5.0 was released with Hugging Face Hub integration for model saving and loading.
not much happened today
qwen2-math-72b gpt-4o claude-3.5-sonnet gemini-1.5-pro llama-3.1-405b idefics3-llama-8b anthropic google mistral-ai llamaindex math fine-tuning synthetic-data reinforcement-learning bug-bounty visual-question-answering open-source retrieval-augmented-generation agentic-ai ai-safety policy rohanpaul_ai anthropicai mervenoyann jeremyphoward omarsar0 ylecun bindureddy
Qwen2-Math-72B outperforms GPT-4o, Claude-3.5-Sonnet, Gemini-1.5-Pro, and Llama-3.1-405B on math benchmarks using synthetic data and advanced optimization techniques. Google AI cuts pricing for Gemini 1.5 Flash by up to 78%. Anthropic expands its bug bounty program targeting universal jailbreaks in next-gen safety systems. Tutorial on QLoRA fine-tuning of IDEFICS3-Llama 8B for visual question answering released. A Chinese open weights model surpasses previous MATH benchmark records. Surveys on Mamba models and LLM-based agents for software engineering highlight advancements and applications. Open-source tools like R2R RAG engine and LlamaIndex Workflows simplify building complex AI applications. Mistral AI introduces customizable AI agents. Concerns raised about California bill SB 1047's focus on existential risk and debates on banning open-source AI. Memes and humor continue in AI communities.
not much happened today
sam-2 gemini-1.5-pro chatgpt midjourney-v6.1 meta-ai-fair google-deepmind scale-ai apple canva hugging-face object-segmentation quantization web-development-framework adversarial-robustness on-device-ai open-source robotics voice vision jeremyphoward demis-hassabis ylecun maartengrootendorst jimfan
Meta released SAM 2, a unified model for real-time object segmentation with a new dataset 4.5x larger and 53x more annotated than previous ones. FastHTML, a new Python web framework by Jeremy Howard, enables easy creation and deployment of interactive web apps. Scale AI launched the SEAL Leaderboard on adversarial robustness, topped by Gemini 1.5 Pro from Google DeepMind. Apple published a technical report on their Intelligence Foundation Language Models for on-device and server use. Yann LeCun emphasized the importance of open-source AI in an article co-authored with Martin Casado and Ion Stoica. Maarten Grootendorst's "Visual Guide to Quantization" on efficient LLM inference went viral. ChatGPT started rolling out advanced voice and vision-enabled modes to select users. Leonardo AI was acquired by Canva. Jim Fan shared insights on Project GR00T, which augments human demonstration data for robotics. Midjourney v6.1 was released.
Mini, Nemo, Turbo, Lite - Smol models go brrr (GPT4o-mini version)
gpt-4o-mini deepseek-v2-0628 mistral-nemo llama-8b openai deepseek-ai mistral-ai nvidia meta-ai-fair hugging-face langchain keras cost-efficiency context-windows open-source benchmarking neural-networks model-optimization text-generation fine-tuning developer-tools gpu-support parallelization cuda-integration multilinguality long-context article-generation liang-wenfeng
OpenAI launched GPT-4o mini, a cost-efficient small model priced at $0.15 per million input tokens and $0.60 per million output tokens, positioned to replace GPT-3.5 Turbo with greater intelligence despite some performance limitations. DeepSeek open-sourced DeepSeek-V2-0628, topping the LMSYS Chatbot Arena Leaderboard and emphasizing their commitment to contributing to the AI ecosystem. Mistral AI and NVIDIA released Mistral NeMo, a 12B parameter multilingual model with a record 128k token context window under an Apache 2.0 license, sparking debates on benchmarking accuracy against models like Meta Llama 8B. Research breakthroughs include the TextGrad framework for optimizing compound AI systems via textual feedback differentiation and the STORM system improving article writing by 25% through simulating diverse perspectives and addressing source bias. Developer tooling trends highlight LangChain's evolving context-aware reasoning applications and the Modular ecosystem's new official GPU support, including discussions on Mojo and Keras 3.0 integration.
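At those per-token prices, request costs reduce to simple arithmetic; a minimal sketch (the token counts below are made-up examples, not benchmarks):

```python
def chat_cost_usd(input_tokens: int, output_tokens: int,
                  in_per_m: float = 0.15, out_per_m: float = 0.60) -> float:
    """Cost of one request at GPT-4o mini's launch prices ($ per 1M tokens)."""
    return input_tokens / 1e6 * in_per_m + output_tokens / 1e6 * out_per_m

# A hypothetical 10k-token prompt with a 1k-token reply:
print(f"${chat_cost_usd(10_000, 1_000):.4f}")  # ≈ $0.0021
```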
Mini, Nemo, Turbo, Lite - Smol models go brrr (GPT4o version)
gpt-4o-mini mistral-nemo llama-3 llama-3-400b deepseek-v2 openai nvidia mistral-ai togethercompute deepseek-ai lmsys model-quantization context-windows instruction-following model-performance cost-efficiency multimodality benchmarking open-source model-release sam-altman
GPT-4o-mini launches with a 99% price reduction compared to text-davinci-003, offering 3.5% the price of GPT-4o and matching Opus-level benchmarks. It supports 16k output tokens, is faster than previous models, and will soon support text, image, video, and audio inputs and outputs. Mistral Nemo, a 12B parameter model developed with Nvidia, features a 128k token context window, FP8 checkpoint, and strong benchmark performance. Together Lite and Turbo offer fp8/int4 quantizations of Llama 3 with up to 4x throughput and significantly reduced costs. DeepSeek V2 is now open-sourced. Upcoming releases include at least 5 unreleased models, and Llama 3 400B leaks circulated ahead of ICML 2024.
DeepSeek-V2 beats Mixtral 8x22B with >160 experts at HALF the cost
deepseek-v2 llama-3-120b llama-3-400b gpt-4 mistral phi claude gemini mai-1 med-gemini deepseek-ai mistral-ai microsoft openai scale-ai tesla nvidia google-deepmind mixture-of-experts multi-head-attention model-inference benchmarking overfitting robotics teleoperation open-source multimodality hallucination-detection fine-tuning medical-ai model-training erhartford maximelabonne bindureddy adcock_brett drjimfan clementdelangue omarsar0 rohanpaul_ai
DeepSeek V2 introduces a new state-of-the-art MoE model with 236B parameters and a novel Multi-Head Latent Attention mechanism, achieving faster inference and surpassing GPT-4 on AlignBench. Llama 3 120B shows strong creative writing skills, while Microsoft is reportedly developing a 500B parameter LLM called MAI-1. Research from Scale AI highlights overfitting issues in models like Mistral and Phi, whereas GPT-4, Claude, Gemini, and Llama maintain benchmark robustness. In robotics, Tesla Optimus advances with superior data collection and teleoperation, LeRobot marks a move toward open-source robotics AI, and Nvidia's DrEureka automates robot skill training. Multimodal LLM hallucinations are surveyed with new mitigation strategies, and Google's Med-Gemini achieves SOTA on medical benchmarks with fine-tuned multimodal models.
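A toy PyTorch sketch of the latent-KV idea behind Multi-Head Latent Attention: keys and values are down-projected to a small per-token latent (the only tensor that would be cached) and re-expanded at attention time. Dimensions and layer names are illustrative, and causal masking and rotary embeddings are omitted; this is not DeepSeek's actual implementation:

```python
import torch
import torch.nn as nn

class LatentKVAttention(nn.Module):
    def __init__(self, d_model=512, n_heads=8, d_latent=64):
        super().__init__()
        self.n_heads, self.d_head = n_heads, d_model // n_heads
        self.q_proj = nn.Linear(d_model, d_model)
        self.kv_down = nn.Linear(d_model, d_latent)  # only this output is cached
        self.k_up = nn.Linear(d_latent, d_model)
        self.v_up = nn.Linear(d_latent, d_model)
        self.out = nn.Linear(d_model, d_model)

    def forward(self, x):                       # x: (batch, seq, d_model)
        B, T, D = x.shape
        q = self.q_proj(x).view(B, T, self.n_heads, self.d_head).transpose(1, 2)
        latent = self.kv_down(x)                 # (B, T, d_latent) -- the KV "cache"
        k = self.k_up(latent).view(B, T, self.n_heads, self.d_head).transpose(1, 2)
        v = self.v_up(latent).view(B, T, self.n_heads, self.d_head).transpose(1, 2)
        att = (q @ k.transpose(-2, -1)) / self.d_head ** 0.5  # causal mask omitted
        y = (att.softmax(-1) @ v).transpose(1, 2).reshape(B, T, D)
        return self.out(y)
```

Caching a 64-dim latent per token instead of full-width keys and values is where the inference-speed and memory win comes from.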
Apple's OpenELM beats OLMo with 50% of its dataset, using DeLighT
openelm llama-3 llama-3-8b-instruct llama-3-70b apple meta-ai-fair google layer-wise-scaling context-length quantization ai-alignment open-source ai-regulation eric-schmidt sebastian-raschka
Apple advances its AI presence with the release of OpenELM, its first relatively open large language model available in sizes from 270M to 3B parameters, featuring a novel layer-wise scaling architecture inspired by the DeLighT paper. Meanwhile, Meta's LLaMA 3 family pushes context length boundaries with models supporting over 160K tokens and an 8B-Instruct model with 262K context length released on Hugging Face, alongside performance improvements in quantized versions. A new paper on AI alignment highlights KTO as the best-performing method, with sensitivity to training data volume noted. In AI ethics and regulation, former Google CEO Eric Schmidt warns about the risks of open-source AI empowering bad actors and geopolitical rivals, while a U.S. proposal aims to enforce "Know Your Customer" rules to end anonymous cloud usage.
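A minimal sketch of what linear layer-wise scaling can look like: early layers get fewer attention heads and narrower FFNs, later layers more. The specific head counts and multipliers are illustrative assumptions, not OpenELM's published configuration:

```python
def layerwise_widths(n_layers=12, heads_min=4, heads_max=12,
                     ffn_mult_min=2.0, ffn_mult_max=4.0):
    """Interpolate per-layer widths linearly from the first to the last layer."""
    configs = []
    for i in range(n_layers):
        t = i / max(n_layers - 1, 1)            # 0.0 at layer 0, 1.0 at the top
        configs.append({
            "layer": i,
            "heads": round(heads_min + t * (heads_max - heads_min)),
            "ffn_mult": round(ffn_mult_min + t * (ffn_mult_max - ffn_mult_min), 2),
        })
    return configs

for cfg in layerwise_widths(n_layers=4):
    print(cfg)   # heads and FFN width grow with depth
```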
Snowflake Arctic: Fully Open 10B+128x4B Dense-MoE Hybrid LLM
snowflake-arctic phi-3 llama-3-70b llama-3 stable-diffusion-3 sd3-turbo gpt-3.5-turbo snowflake databricks deepseek deepspeed nvidia stable-diffusion adobe apple llamaindex lmsys openai mixture-of-experts curriculum-learning model-release image-generation video-upscaling quantization inference-speed benchmarking model-comparison open-source on-device-ai
Snowflake Arctic is a notable new foundation language model released under Apache 2.0, claiming superiority over Databricks in data warehouse AI applications and adopting a mixture-of-experts architecture inspired by DeepSeekMoE and DeepSpeed-MoE. The model employs a 3-stage curriculum training strategy similar to the recent Phi-3 paper. In AI image and video generation, Nvidia introduced the Align Your Steps technique improving image quality at low step counts, while Stable Diffusion 3 and SD3 Turbo models were compared for prompt understanding and image quality. Adobe launched an AI video upscaling project enhancing blurry videos to HD, though with some high-resolution artifacts. Apple released open-source on-device language models with code and training logs, diverging from typical weight-only releases. The Llama-3-70b model ties for first place on the LMSYS leaderboard for English queries, and Phi-3 (4B params) outperforms GPT-3.5 Turbo in the banana logic benchmark. Fast inference and quantization of Llama 3 models were demonstrated on MacBook devices.
Multi-modal, Multi-Aspect, Multi-Form-Factor AI
gpt-4 idefics-2-8b mistral-instruct apple-mlx gpt-5 reka-ai cohere google rewind apple mistral-ai microsoft paypal multimodality foundation-models embedding-models gpu-performance model-comparison enterprise-data open-source performance-optimization job-impact agi-criticism technical-report arthur-mensch dan-schulman chris-bishop
Between April 12-15, Reka Core launched a new GPT4-class multimodal foundation model with a detailed technical report described as "full Shazeer." Cohere Compass introduced a foundation embedding model for indexing and searching multi-aspect enterprise data like emails and invoices. The open-source IDEFICS 2-8B model continues the open reproduction of DeepMind's Flamingo multimodal model. Rewind pivoted to a multi-platform app called Limitless, moving away from spyware. Reddit discussions highlighted Apple MLX outperforming Ollama and Mistral Instruct on M2 Ultra GPUs, GPU choices for LLMs and Stable Diffusion, and AI-human comparisons by Microsoft Research's Chris Bishop. Former PayPal CEO Dan Schulman predicted GPT-5 will drastically reduce job scopes by 80%. Mistral CEO Arthur Mensch criticized the obsession with AGI as "creating God."
Music's Dall-E moment
griffin command-r-plus gpt-4-0613 gpt-4-0314 mistral-8x22b codegemma stable-diffusion-1.5 command-r gemini-1.5 google mistral-ai lmsys cohere model-architecture benchmarking open-source model-quantization memory-optimization inference-speed multimodality finetuning performance-optimization audio-processing andrej-karpathy
Google's Griffin architecture outperforms transformers with faster inference and lower memory usage on long contexts. Command R+ climbs to 6th place on the LMSYS Chatbot Arena leaderboard, surpassing GPT-4-0613 and GPT-4-0314. Mistral AI releases an open-source 8x22B model with a 64K context window and around 130B total parameters. Google open-sources CodeGemma models with pre-quantized 4-bit versions for faster downloads. ELLA weights enhance Stable Diffusion 1.5 with an LLM for semantic alignment. Unsloth enables 4x larger context windows and 80% memory reduction for finetuning. Andrej Karpathy releases LLMs implemented in pure C for potential performance gains. Command R+ runs in realtime on an M2 Max MacBook using iMat q1 quantization. Cohere's Command R model offers low API costs and strong leaderboard performance. Gemini 1.5 impresses with audio capabilities, recognizing speech tone and identifying speakers from audio clips.
ReALM: Reference Resolution As Language Modeling
flan-t5 gpt-4 apple openai hugging-face stability-ai reference-resolution finetuning quantization retrieval-augmented-generation open-source coding-agents podcast-generation image-generation ai-industry-trends takuto-takizawa
Apple is advancing in AI with a new approach called ReALM: Reference Resolution As Language Modeling, which improves understanding of ambiguous references using three types of context (on-screen, conversational, and background entities) and finetunes a smaller FLAN-T5 model that outperforms GPT-4 on this task. In Reddit AI news, an open-source coding agent SWE-agent achieves 12.29% on the SWE-bench benchmark, and RAGFlow introduces a customizable retrieval-augmented generation engine. A new quantization method, QuaRot, enables efficient 4-bit inference. AI applications include a t-shirt design generator, podgenai for GPT-4 based podcast generation, and an open-source model from HuggingFace that runs without a GPU. Industry discussions focus on the impact of large language models on the AI field and efforts to decentralize AI development. Takuto Takizawa joins Stability AI Japan as Head of Sales & Partnerships.
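One way to picture the reference-resolution-as-language-modeling framing is to serialize candidate entities into numbered text the model can point back to. The entity schema and tags below are hypothetical illustrations, not Apple's actual encoding:

```python
def encode_entities(entities):
    """Turn candidate entities into numbered lines an LM can reference."""
    return "\n".join(f"[{i}] {e['type']}: {e['text']}"
                     for i, e in enumerate(entities, 1))

onscreen = [{"type": "button", "text": "Call 555-0100"},
            {"type": "label",  "text": "Pharmacy hours: 9-5"}]
prompt = (encode_entities(onscreen)
          + "\nUser: call that number\nWhich entity does the user mean? ")
# A finetuned model answers with an entity index, e.g. "[1]".
```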
Grok-1 in Bio
grok-1 mixtral miqu-70b claude-3-opus claude-3 claude-3-haiku xai mistral-ai perplexity-ai groq anthropic openai mixture-of-experts model-release model-performance benchmarking finetuning compute hardware-optimization mmlu model-architecture open-source memes sam-altman arthur-mensch daniel-han arav-srinivas francis-yao
Grok-1, a 314B parameter Mixture-of-Experts (MoE) model from xAI, has been released under an Apache 2.0 license, sparking discussions on its architecture, finetuning challenges, and performance compared to models like Mixtral and Miqu 70B. Despite its size, its MMLU benchmark performance is currently unimpressive, with expectations that Grok-2 will be more competitive. The model's weights and code are publicly available, encouraging community experimentation. Sam Altman highlighted the growing importance of compute resources, while Grok's potential deployment on Groq hardware was noted as a possible game-changer. Meanwhile, Anthropic's Claude continues to attract attention for its "spiritual" interaction experience and consistent ethical framework. The release also inspired memes and humor within the AI community.
MM1: Apple's first Large Multimodal Model
mm1 gemini-1 command-r claude-3-opus claude-3-sonnet claude-3-haiku claude-3 apple cohere anthropic hugging-face langchain multimodality vqa fine-tuning retrieval-augmented-generation open-source robotics model-training react reranking financial-agents yann-lecun francois-chollet
Apple announced the MM1 multimodal LLM family with up to 30B parameters, claiming performance comparable to Gemini-1 and beating larger older models on VQA benchmarks. The paper targets researchers and hints at applications in embodied agents and business/education. Yann LeCun emphasized that human-level AI requires understanding the physical world, memory, reasoning, and hierarchical planning, while François Chollet cautioned that NLP is far from solved despite LLM advances. Cohere released Command-R, a model for Retrieval Augmented Generation, and Anthropic highlighted the Claude 3 family (Opus, Sonnet, Haiku) for various application needs. Open-source hardware DexCap enables dexterous robot manipulation data collection affordably. Tools like CopilotKit simplify AI integration into React apps, and migration to Keras 3 with the JAX backend offers faster training. New projects improve reranking for retrieval and add financial agents to LangChain.
Welcome Interconnects and OpenRouter
mistral-large miqu mixtral gpt-4 mistral-7b mistral-ai openai perplexity-ai llamaindex qwen langchain model-comparison model-optimization quantization role-playing story-writing code-clarity ai-assisted-decompilation asynchronous-processing quantum-computing encoder-based-diffusion open-source hardware-experimentation rag-systems nathan-lambert alex-atallah
An analysis of 22 Discord guilds, 349 channels, and 12,885 messages revealed active discussions on model comparisons and optimizations involving Mistral AI, Miqu, and GGUF quantized models. Highlights include comparing Mistral Large with GPT-4, focusing on cost-effectiveness and performance, and exploring quantization techniques like GPTQ and QLoRA to reduce VRAM usage. Advanced applications such as role-playing, story-writing, code clarity, and AI-assisted decompilation were emphasized, alongside development of tools like an asynchronous summarization script for Mistral 7b. The intersection of quantum computing and AI was discussed, including DARPA-funded projects and encoder-based diffusion techniques for image processing. Community efforts featured new Spanish LLM announcements, hardware experimentation, and open-source initiatives, with platforms like Perplexity AI and LlamaIndex noted for innovation and integration. Speculation about Mistral AI's open-source commitment and tools like R2R for rapid RAG deployment highlighted the community's collaborative spirit.
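For the VRAM discussions around GPTQ and QLoRA, a back-of-envelope estimate of weight memory under different bit widths is useful; the 10% overhead factor for activations and KV cache is a rough assumption, not a measured figure:

```python
def weight_vram_gib(n_params_billion: float, bits: int, overhead: float = 1.1) -> float:
    """Weights occupy params * bits/8 bytes; add ~10% slack for runtime state."""
    return n_params_billion * 1e9 * bits / 8 / 2**30 * overhead

print(f"{weight_vram_gib(70, 16):.0f} GiB")  # fp16 70B  -> ~143 GiB
print(f"{weight_vram_gib(70, 4):.0f} GiB")   # 4-bit 70B -> ~36 GiB
```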
Karpathy emerges from stealth?
mistral-7b mixtral-8x7b zephyr-7b gpt-4 llama-2 intel mistral-ai audiogen thebloke tokenization quantization model-optimization fine-tuning model-merging computational-efficiency memory-optimization retrieval-augmented-generation multi-model-learning meta-reasoning dataset-sharing open-source ethical-ai community-collaboration andrej-karpathy
Andrej Karpathy released a comprehensive 2-hour tutorial on tokenization, detailing techniques up to GPT-4's tokenizer and noting the complexity of Llama 2 tokenization with SentencePiece. Discussions in AI Discord communities covered model optimization and efficiency, focusing on quantization of models like Mistral 7B and Zephyr-7B to reduce memory usage for consumer GPUs, including Intel's new weight-only quantization algorithm. Efforts to improve computational efficiency included selective augmentation reducing costs by 57.76% and memory token usage versus kNN for Transformers. Challenges in hardware compatibility and software issues were shared, alongside fine-tuning techniques such as LoRA and model merging. Innovative applications of LLMs in retrieval-augmented generation (RAG), multi-model learning, and meta-reasoning were explored. The community emphasized dataset sharing, open-source releases like SDXL VAE encoded datasets and Audiogen AI codecs, and ethical AI use with censorship and guardrails. Collaboration and resource sharing remain strong in these AI communities.
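The core byte-pair-encoding loop Karpathy walks through fits in a few lines; a minimal training-only sketch (no regex pre-splitting or special tokens, unlike GPT-4's production tokenizer):

```python
from collections import Counter

def bpe_train(text: str, num_merges: int):
    """Repeatedly merge the most frequent adjacent pair of token ids."""
    ids = list(text.encode("utf-8"))        # start from raw bytes (ids 0-255)
    merges, next_id = {}, 256
    for _ in range(num_merges):
        pairs = Counter(zip(ids, ids[1:]))
        if not pairs:
            break
        pair = pairs.most_common(1)[0][0]
        merges[pair] = next_id
        out, i = [], 0                      # rewrite ids with the merged token
        while i < len(ids):
            if i + 1 < len(ids) and (ids[i], ids[i + 1]) == pair:
                out.append(next_id)
                i += 2
            else:
                out.append(ids[i])
                i += 1
        ids, next_id = out, next_id + 1
    return ids, merges
```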
Companies liable for AI hallucination is Good Actually for AI Engineers
mistral-next large-world-model sora babilong air-canada huggingface mistral-ai quantization retrieval-augmented-generation fine-tuning cuda-optimization video-generation ai-ethics dataset-management open-source community-driven-development andrej-karpathy
Air Canada faced a legal ruling requiring it to honor refund policies communicated by its AI chatbot, setting a precedent for corporate liability in AI engineering accuracy. The tribunal ordered a refund of $650.88 CAD plus damages after the chatbot misled a customer about bereavement travel refunds. Meanwhile, AI community discussions highlighted innovations in quantization techniques for GPU inference, Retrieval-Augmented Generation (RAG) and fine-tuning of LLMs, and CUDA optimizations for PyTorch models. New prototype models like Mistral-Next and the Large World Model (LWM) were introduced, showcasing advances in handling large text contexts and video generation with models like Sora. Ethical and legal implications of AI autonomy were debated alongside challenges in dataset management. Community-driven projects such as the open-source TypeScript agent framework bazed-af emphasize collaborative AI development. Additionally, benchmarks like BABILong for evaluating contexts up to 10M tokens and tools from Andrej Karpathy were noted.
AI gets Memory
miqumaid-v2-70b mixtral-8x7b-qlora mistral-7b phi-2 medalpaca aya openai langchain thebloke cohere unsloth-ai mistral-ai microsoft rag memory-modeling context-windows open-source finetuning sequential-fine-tuning direct-preference-optimization rlhf ppo javascript-python-integration hardware-optimization gpu-overclocking quantization model-training large-context multilinguality joanne-jang
An analysis of AI Discords covered 20 guilds, 312 channels, and 6,901 messages. The report highlights the divergence of RAG-style operations into context retrieval and memory, with implementations like MemGPT rolling out in ChatGPT and LangChain. The TheBloke Discord discussed open-source large language models such as the Large World Model with contexts up to 1 million tokens, and the Cohere aya model supporting 101 languages. Roleplay-focused models like MiquMaid-v2-70B were noted for performance improvements with enhanced hardware. Finetuning techniques like Sequential Fine-Tuning (SFT) and Direct Preference Optimization (DPO) were explained, with tools like Unsloth AI's apply_chat_template preferred over Alpaca. Integration of JavaScript and Python via JSPyBridge in the SillyTavern project was also discussed. Training challenges with Mixtral 8x7b qlora versus Mistral 7b were noted. The LM Studio Discord focused on hardware limitations affecting large model loading, medical LLMs like medAlpaca, and hardware discussions around GPU upgrades and overclocking. Anticipation for IQ3_XSS 1.5 bit quantization support in LM Studio was expressed.
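A stripped-down illustration of the RAG-as-memory pattern mentioned above: store past turns, retrieve the most similar ones, and prepend them to the next prompt. Bag-of-words cosine stands in for real embeddings here; this mirrors the pattern, not MemGPT's or LangChain's actual API:

```python
import math
from collections import Counter

class SimpleMemory:
    def __init__(self):
        self.turns = []

    @staticmethod
    def _vec(text):
        return Counter(text.lower().split())

    @staticmethod
    def _cosine(a, b):
        dot = sum(n * b[w] for w, n in a.items() if w in b)
        na = math.sqrt(sum(n * n for n in a.values()))
        nb = math.sqrt(sum(n * n for n in b.values()))
        return dot / (na * nb) if na and nb else 0.0

    def add(self, turn: str):
        self.turns.append(turn)

    def recall(self, query: str, k: int = 3):
        qv = self._vec(query)
        return sorted(self.turns,
                      key=lambda t: self._cosine(qv, self._vec(t)),
                      reverse=True)[:k]

mem = SimpleMemory()
mem.add("User's dog is named Biscuit.")
mem.add("User prefers metric units.")
print(mem.recall("what is my dog called?", k=1))
```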
The Core Skills of AI Engineering
miqumaid olmo aphrodite awq exl2 mistral-medium internlm ssd-1b lora qlora loftq ai2 hugging-face ai-engineering quantization fine-tuning open-source model-deployment data-quality tokenization prompt-adherence distillation ai-security batching hardware role-playing eugene-yan
AI Discords for 2/2/2024 were analyzed across 21 guilds, 312 channels, and 4,782 messages, saving an estimated 382 minutes of reading time. Discussions included Eugene Yan initiating a deep dive into AI engineering challenges, highlighting overlaps between software engineering and data science skills. The TheBloke Discord featured talks on MiquMaid, OLMo (AI2's open-source 7B LLM under Apache 2.0), Aphrodite model batching, AWQ quantization, and LoRA fine-tuning techniques like QLoRA and LoftQ. The LAION Discord discussed SSD-1B distillation issues, data quality optimization with captioning datasets like BLIP, COCO, and LLaVA, and tokenization strategies for prompt adherence in image generation. Other topics included AI security with watermarking, superconductors and carbon nanotubes for hardware, and deployment of LLMs via Hugging Face tools.
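The LoRA variants mentioned share one core trick: freeze the base weight W and learn a low-rank update, so the effective weight becomes W + (alpha/r)·BA. A minimal sketch (the r and alpha values are common defaults, not prescriptions; QLoRA additionally quantizes the frozen base):

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():        # freeze the pretrained weight
            p.requires_grad = False
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))  # starts at zero
        self.scale = alpha / r                  # so the initial update is a no-op

    def forward(self, x):
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

layer = LoRALinear(nn.Linear(512, 512))
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
print(trainable)  # 8192 trainable params vs ~262k frozen
```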
CodeLLama 70B beats GPT4 on HumanEval
codellama miqu mistral-medium llama-2-70b aphrodite-engine mixtral flatdolphinmaid noromaid rpcal chatml mistral-7b activation-beacon eagle-7b rwkv-v5 openhermes2.5 nous-hermes-2-mixtral-8x7b-dpo imp-v1-3b bakllava moondream qwen-vl meta-ai-fair ollama nous-research mistral-ai hugging-face ai-ethics alignment gpu-optimization direct-prompt-optimization fine-tuning cuda-programming optimizer-technology quantization multimodality context-length dense-retrieval retrieval-augmented-generation multilinguality model-performance open-source code-generation classification vision
Meta AI surprised the community with the release of CodeLlama 70B, an open-source model now available on platforms like Ollama and MLX for local use. The Miqu model sparked debate over its origins, possibly linked to Mistral Medium or a fine-tuned Llama-2-70b, alongside discussions on AI ethics and alignment risks. The Aphrodite engine showed strong performance on A6000 GPUs with specific configurations. Role-playing AI models such as Mixtral and Flatdolphinmaid faced challenges with repetitiveness, while Noromaid and Rpcal performed better, with ChatML and DPO recommended for improved responses. Learning resources like fast.ai's course were highlighted for ML/DL beginners, and fine-tuning techniques with optimizers like paged 8-bit Lion and Adafactor were discussed.
At Nous Research AI, the Activation Beacon project introduced a method for unlimited context length in LLMs using "global state" tokens, potentially transforming retrieval-augmented models. The Eagle-7B model, based on RWKV-v5, outperformed Mistral in benchmarks with efficiency and multilingual capabilities. OpenHermes2.5 was recommended for consumer hardware due to its quantization methods. Multimodal and domain-specific models like IMP v1-3b, Bakllava, Moondream, and Qwen-vl were explored for classification and vision-language tasks. The community emphasized centralizing AI resources for collaborative research.
12/11/2023: Mixtral beats GPT3.5 and Llama2-70B
mixtral-8x7b gpt-4 gpt-3.5-turbo llama-3 openhermes-2.5 llava-v1.5-13b-gptq mistral-ai openai huggingface sparse-mixture-of-experts fine-tuning quantization gpu-hardware transformers model-deployment open-source coding-datasets
Mistral AI announced the Mixtral 8x7B model featuring a Sparse Mixture of Experts (SMoE) architecture, sparking discussions on its potential to rival GPT-4. The community debated GPU hardware options for training and fine-tuning transformer models, including RTX 4070s, A4500, RTX 3090s with NVLink, and A100 GPUs. Interest was expressed in fine-tuning Mixtral and generating quantized versions, alongside curating high-quality coding datasets. Resources shared include a YouTube video on open-source model deployment, an Arxiv paper, GitHub repositories, and a blog post on Mixture-of-Experts. Discussions also touched on potential open-source releases of GPT-3.5 Turbo and llama-3, and running OpenHermes 2.5 on a Mac M3 Pro with VRAM considerations.
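A toy version of the sparse routing such an architecture implies: a router scores all experts per token, only the top-2 run, and their outputs are mixed by the renormalized router weights. The per-expert Python loop is for clarity, not efficiency; real implementations batch tokens by expert:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseMoE(nn.Module):
    def __init__(self, d_model=512, d_ff=2048, n_experts=8, top_k=2):
        super().__init__()
        self.top_k = top_k
        self.router = nn.Linear(d_model, n_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.SiLU(),
                          nn.Linear(d_ff, d_model))
            for _ in range(n_experts))

    def forward(self, x):                       # x: (n_tokens, d_model)
        weights, idx = self.router(x).topk(self.top_k, dim=-1)
        weights = F.softmax(weights, dim=-1)    # renormalize over the chosen 2
        out = torch.zeros_like(x)
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, slot] == e        # tokens routed to expert e
                if mask.any():
                    out[mask] += weights[mask, slot].unsqueeze(1) * expert(x[mask])
        return out
```

Only 2 of the 8 expert MLPs run per token, which is why a model like Mixtral has far fewer active parameters than total parameters.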
12/10/2023: not much happened today
mixtral-8x7b-32kseqlen mistral-7b stablelm-zephyr-3b openhermes-2.5-neural-chat-v3-3-slerp gpt-3.5 gpt-4 nous-research openai mistral-ai hugging-face ollama lm-studio fine-tuning mixture-of-experts model-benchmarking inference-optimization model-evaluation open-source decentralized-ai gpu-optimization community-engagement andrej-karpathy yann-lecun richard-blythman gabriel-syme pradeep1148 cyborg_1552
Nous Research AI Discord community discussed attending NeurIPS and organizing future AI events in Australia. Highlights include interest in open-source and decentralized AI projects, with Richard Blythman seeking co-founders. Users shared projects like Photo GPT AI and introduced StableLM Zephyr 3B. The Mixtral model, based on Mistral, sparked debate on performance and GPU requirements, with comparisons to GPT-3.5 and potential competitiveness with GPT-4 after fine-tuning. Tools like Tensorboard, Wandb, and Llamahub were noted for fine-tuning and evaluation. Discussions covered Mixture of Experts (MoE) architectures, fine-tuning with limited data, and inference optimization strategies for ChatGPT. Memes and community interactions referenced AI figures like Andrej Karpathy and Yann LeCun. The community also shared resources such as GitHub links and YouTube videos related to these models and tools.
12/8/2023 - Mamba v Mistral v Hyena
mistral-8x7b-moe mamba-3b stripedhyena-7b claude-2.1 gemini gpt-4 dialogrpt-human-vs-machine cybertron-7b-v2-gguf falcon-180b mistral-ai togethercompute stanford anthropic google hugging-face mixture-of-experts attention-mechanisms prompt-engineering alignment image-training model-deployment gpu-requirements cpu-performance model-inference long-context model-evaluation open-source chatbots andrej-karpathy tri-dao maxwellandrews raddka
Three new AI models are highlighted: Mistral's 8x7B MoE model (Mixtral), Mamba models up to 3B by Together, and StripedHyena 7B, a competitive subquadratic attention model from Stanford's Hazy Research. Discussions on Anthropic's Claude 2.1 focus on its prompting technique and alignment challenges. The Gemini AI from Google is noted as potentially superior to GPT-4. The community also explores Dreambooth for image training and shares resources like the DialogRPT-human-vs-machine model on Hugging Face. Deployment challenges for large language models, including CPU performance and GPU requirements, are discussed with references to Falcon 180B and transformer batching techniques. User engagement includes meme sharing and humor.