Topic: "image-generation"

Nano Banana 2 aka Gemini 3.1 Flash Image Preview: the new SOTA Imagegen model

Qwen-Image 2.0 and Seedance 2.0

not much happened today

OpenAI GPT Image-1.5 claims to beat Nano Banana Pro, #1 across all Arenas, but completely fails Vibe Checks

not much happened today

Black Forest Labs FLUX.2 [pro|flex|dev|klein]: near-Nano Banana quality but Open Weights

AI Engineer Code Summit

Nano Banana Pro (Gemini Image Pro) solves text-in-images, infographic generation, 2-4k resolution, and Google Search grounding

not much happened today

not much happened today

not much happened today

Western Open Models get Funding: Cohere $500m @ 6.8B, AI2 gets $152m NSF+NVIDIA grants

not much happened today

Qwen-Image: SOTA text rendering + 4o-imagegen-level Editing Open Weights MMDiT

not much happened today

not much happened today

not much happened today

not much happened today

OpenAI releases Deep Research API (o3/o4-mini)

not much happened today

Granola launches team notes, while Notion launches meeting transcription

AI Engineer World's Fair: Second Run, Twice The Fun

Cursor @ $9b, OpenAI Buys Windsurf @ $3b

not much happened today

gpt-image-1 - ChatGPT's imagegen model, confusingly NOT 4o, now available in API

not much happened today

DeepCoder: A Fully Open-Source 14B Coder at O3-mini Level

not much happened today

not much happened today

not much happened today

not much happened today

not much happened today

Gemma 3 beats DeepSeek V3 in Elo, 2.0 Flash beats GPT4o with Native Image Gen

not much happened today

not much happened today

$200 ChatGPT Pro and o1-full/pro, with vision, without API, and mixed reviews

not much happened today

Vision Everywhere: Apple AIMv2 and Jina CLIP v2

Perplexity starts Shopping for you

Common Corpus: 2T Open Tokens with Provenance

s{imple|table|calable} Consistency Models

DeepSeek Janus and Meta SpiRit-LM: Decoupled Image and Expressive Voice Omnimodality

Replit Agent - How did everybody beat Devin to market?

Ideogram 2 + Berkeley Function Calling Leaderboard V2

The DSPy Roadmap

not much happened today

Cursor reaches >1000 tok/s finetuning Llama3-70b for fast file editing

Google I/O in 60 seconds

Not much happened today

Snowflake Arctic: Fully Open 10B+128x4B Dense-MoE Hybrid LLM

Llama-3-70b is GPT-4-level Open Model

Mergestral, Meta MTIAv2, Cohere Rerank 3, Google Infini-Attention

ReALM: Reference Resolution As Language Modeling

AdamW -> AaronD?

Jamba: Mixture of Architectures dethrones Mixtral

Andrew likes Agents

Not much happened today

Stable Diffusion 3 — Rombach & Esser did it again!

Google AI: Win some (Gemma, 1.5 Pro), Lose some (Image gen)

The Dissection of Smaug (72B)

RIP Latent Diffusion, Hello Hourglass Diffusion

1/10/2024: All the best papers for AI Engineers

1/6-7/2024: LlaMA Pro - an alternative to PEFT/RAG??

1/2/2024: Smol tweaks to Smol Talk

12/7/2023: Anthropic says "skill issue"