Frozen AI News archive

SOTA Video Gen: Veo 2 and Kling 2 are GA for developers

**Google's Veo 2** video generation model is now available in the **Gemini API** with a cost of **35 cents per second** of generated video, marking a significant step in accessible video generation. Meanwhile, China's **Kling 2** model launched with pricing around **$2 for a 10-second clip** and a minimum subscription of **$700 per month for 3 months**, generating excitement despite some skill challenges. **OpenAI** announced the **GPT-4.1 family** release, including **GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano**, highlighting improvements in **coding, instruction following, and a 1 million token context window**. The GPT-4.1 models are **26% cheaper than GPT-4o** and will replace the **GPT-4.5 Preview** API version by July 14. Performance benchmarks show GPT-4.1 achieving **54-55% on SWE-bench verified** and a **60% improvement over GPT-4o** in some internal tests, though some critiques note it underperforms compared to other models like OpenRouter and DeepSeekV3 in coding tasks. The release is API-only, with a prompting guide provided for developers.

Canonical issue URL

AI News for 4/14/2025-4/15/2025. We checked 7 subreddits, 433 Twitters and 29 Discords (211 channels, and 7102 messages) for you. Estimated reading time saved (at 200wpm): 557 minutes. You can now tag @smol_ai for AINews discussions!

We rarely cover video gen model advances here, partially because of the biases in sources towards text/coding topics, and also because they often aren't API available and it can be hard to quantify advances. However, it's not every day that the top 2 Video Arena Leaderboard models get general availability and a bunch of hype videos, so it's a nice excuse to check in on SOTA video gen.

Google's Veo 2 is now in Gemini's own API (after first releasing on Fal) and Gemini Advanced/Whisk, for a remarkably cheap 35 cents per second of generated video (actual experience may differ).

image.png

Kling 2 from China also released today, with pricing at around $2 for a 10 second clip, sold in packages of a minimum quantity of $700 a month for 3 months. People are very excited about the quality, but note that skill issues abound.

image.png


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

Okay, here is a summary of the tweets, categorized by topic and sorted by impression count:

GPT-4.1 and OpenAI Announcements

Model Releases & Capabilities

Agent-Based Systems and Tools

AI Infrastructure and Hardware

AI Industry Analysis

Humor/Memes


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. "Championing Llama.cpp: Recognizing Unsung AI Heroes"

Theme 2. Disappointment Over OpenAI's Open Source Release Delay

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding

Theme 1. "Exploring the Frontiers of AI: Innovations and Discoveries"

Theme 2. "Unlocking AI Productivity: Gemini Tools in Action"


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: Model Mania: GPT-4.1, Gemini 2.5, Sonar Lead the Pack

Theme 2: Tooling Up: Frameworks, Hardware, and Quantization Frenzy

Theme 3: Open Source & Community Collabs Shine

Theme 4: Gremlins in the Gears: Bugs, Limits, and Workarounds

Theme 5: Bleeding Edge Research: From Alignment to Apple's Privacy


PART 1: High level Discord summaries

Perplexity AI Discord


aider (Paul Gauthier) Discord


LMArena Discord


OpenRouter (Alex Atallah) Discord


LM Studio Discord


Unsloth AI (Daniel Han) Discord


OpenAI Discord


Cursor Community Discord


HuggingFace Discord


GPU MODE Discord


Manus.im Discord Discord


Nous Research AI Discord


Modular (Mojo 🔥) Discord


MCP (Glama) Discord


Notebook LM Discord


LlamaIndex Discord


Eleuther Discord


Latent Space Discord


Torchtune Discord


Cohere Discord


tinygrad (George Hotz) Discord


Nomic.ai (GPT4All) Discord


The DSPy Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Perplexity AI ▷ #announcements (1 messages):

Sonar-Reasoning-Pro-High, Gemini-2.5-Pro-Grounding, LM Arena's Search Arena, Search depth, Citations from community sources


Perplexity AI ▷ #general (1135 messages🔥🔥🔥):

GPT 4.1 vs GPT 4o, Perplexity AI Credit Card Issues, Gemini 2.5 Pro performance


Perplexity AI ▷ #pplx-api (2 messages):

Social Toggles in API, Search Domain Filters


aider (Paul Gauthier) ▷ #announcements (3 messages):

Aider v0.82.0 Release, GPT 4.1 support, Architect mode improvements, New Models Support (Grok-3, Optimus), Free Alpha Endpoints Retirement


aider (Paul Gauthier) ▷ #general (882 messages🔥🔥🔥):

Github Copilot Proxies, Aider Leaderboard Design, GaryML/DotCommands, Model Token Limits, Context Pruning


aider (Paul Gauthier) ▷ #questions-and-tips (34 messages🔥):

Confirm mode for architect, Context file importance, Gemini and Go, Multiple local models, Litellm proxy for multiple LLMs


aider (Paul Gauthier) ▷ #links (2 messages):

Y Combinator, YouTube videos


LMArena ▷ #general (555 messages🔥🔥🔥):

GPT-4.1 Analysis, RooCode IDE, NightWhisper vs DragonTail, Llama 4 Performance, OpenAI's Social Network Plans


OpenRouter (Alex Atallah) ▷ #app-showcase (1 messages):

Chrome Extension, OpenRouter API, Summarizer Tool, GitHub


OpenRouter (Alex Atallah) ▷ #general (375 messages🔥🔥):

GPT 4.1 vs Quasar, Free Coding LLM, GPT-4.1 Reasoning, Gemini 2.0 Flash Lite, Roo vs Cline


LM Studio ▷ #general (212 messages🔥🔥):

Gemma 3, Llama 3.3 70b, Model performance, LLM Translator, Download speeds


LM Studio ▷ #hardware-discussion (148 messages🔥🔥):

RTX 3090 CUDA 12 Performance, 5090 Justification, GPU for Hunyuan Video Generation, Nvidia DGX Spark, M3/M4 Memory Bandwidth Analysis


Unsloth AI (Daniel Han) ▷ #general (297 messages🔥🔥):

Unsloth BnB models not working on vLLM, GPT4.1 minor improvement, New micro model, Llama 3 8b fine tuning colab tutorial, Kimi VL 16B A3B notebook


Unsloth AI (Daniel Han) ▷ #off-topic (11 messages🔥):

OpenAI, Claude, Gemini 2.5 Pro, Frontend Coding


Unsloth AI (Daniel Han) ▷ #help (36 messages🔥):

Qwen 2.5 finetuning, LoRA and 4bit download, Multimodal Llama 4 in llama.cpp, Custom tokens in Gemma 3B, Run Unsloth on pre-RTX cards


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

Shisa-v2 models, Unsloth Llamafied Phi4


Unsloth AI (Daniel Han) ▷ #research (3 messages):

Prima.cpp, Low Level engineering details


OpenAI ▷ #ai-discussions (248 messages🔥🔥):

GPT-4.1 vs 2.5 Pro, GPT-4o Audio Generation, T3 Chat, Windsurf for free GPT-4.1, Veo 2 in Advanced


OpenAI ▷ #gpt-4-discussions (4 messages):

GPT-4o limits, Custom GPT function issues, Internet search interference


OpenAI ▷ #prompt-engineering (7 messages):

Complex Prompt Handling by 4.1, Generating Consistent Mythological Gods, Cartoon-Style Illustration Prompts


OpenAI ▷ #api-discussions (7 messages):

Consistent Character Generation, Generating Mythological Gods, Cartoon-Style Illustrations


Cursor Community ▷ #general (234 messages🔥🔥):

GPT-4.1 issues, Windsurf vs Cursor, VS Code Agent AI vs Cursor, Interactive README.md, Agent mode issues


HuggingFace ▷ #general (178 messages🔥🔥):

Hugging Face 500 Errors, consistent character generation, HF buying a robotics company, GPU Memory


HuggingFace ▷ #cool-finds (1 messages):

barrel_of_lube: https://youtu.be/wrkiMZ3SKH4


HuggingFace ▷ #i-made-this (13 messages🔥):

roo code, free inference credits, GPT-4.1, BwETAF-IID-100M


HuggingFace ▷ #reading-group (2 messages):

Society of Minds framework


HuggingFace ▷ #computer-vision (1 messages):

guarin_: https://github.com/lightly-ai/lightly-train


HuggingFace ▷ #NLP (1 messages):

LLM-chat-templates, python glue


HuggingFace ▷ #smol-course (1 messages):

kirubeldavid: What do you guys think of the RL, NLP courses compared to the ones in Coursera?


HuggingFace ▷ #agents-course (13 messages🔥):

Qwen 2.5 coder, Mistral Small 24b iq4xs, Certification details


GPU MODE ▷ #general (14 messages🔥):

CPU/GPU proximity, Data Normalization, AMD GPU Competition, Training Models


GPU MODE ▷ #triton (5 messages):

Triton Build, Blackwell support, PyTorch Nightly


GPU MODE ▷ #cuda (10 messages🔥):

nvdisasm instruction grouping, SASS vs PTX, Dual Issue


GPU MODE ▷ #torch (1 messages):

ZeRO Stage 3, PyTorch Lightning Tutorials


GPU MODE ▷ #announcements (1 messages):

Discord competition


GPU MODE ▷ #cool-links (1 messages):

Triton Lang, DeepSeek Inference Engine


GPU MODE ▷ #beginner (4 messages):

Triton for Model Training, Transformer Kernel Optimization, GPUMODE Lecture Series


GPU MODE ▷ #rocm (12 messages🔥):

ROCm 6.2 Upgrade, Runpod Instances, Docker Images, AMD Cloud


GPU MODE ▷ #tilelang (1 messages):

gau.nernst: https://huggingface.co/microsoft/bitnet-b1.58-2B-4T


GPU MODE ▷ #liger-kernel (1 messages):

0x000ff4: are the some kind of meetings for this group?


GPU MODE ▷ #metal (1 messages):

candle-metal-kernels, metal, reduce.metal


GPU MODE ▷ #self-promotion (4 messages):

Model Training Frameworks, Large Scale Inference Tooling, Speech/Audio Algorithm Development, Q.ai Boston Job Opportunity


GPU MODE ▷ #general (23 messages🔥):

POPCORN_API_URL, load_inline with HIP, Torch Headers, thrust/complex.h


GPU MODE ▷ #submissions (10 messages🔥):

Leaderboard Updates, MI300 Dominance, Matmul Performance on T4, Vectoradd Benchmark, Grayscale Leaderboard


GPU MODE ▷ #status (9 messages🔥):

AMD competition debugging, CLI submission fixes, Temporary account removals, CLI release and re-authentication, Discord login issues


GPU MODE ▷ #feature-requests-and-bugs (1 messages):

snektron: I think that any file that contains \ leads to

Error during creation of submission


GPU MODE ▷ #hardware (5 messages):

4-bit Inference Performance on A100, GPTQ-quantized LLMs, INT8/INT4 Limitations on Ampere, AWS FPGAs for Chip Design, Zero to ASIC Course


GPU MODE ▷ #amd-competition (42 messages🔥):

AMD Competition, Email Confirmation, FP8 GEMM Problem, Reference Kernels, Submission Errors


Manus.im Discord ▷ #general (129 messages🔥🔥):

Invitation Codes, Fellow Program, Gemini 2.5 Pro, Project EchoCore, Image Permissions


Nous Research AI ▷ #general (70 messages🔥🔥):

GPT-4.1 mini vs Gemini 2.5 Pro, OpenAI Social Network vs X, 4.1 and autocaching, Apple's privacy preserving distributed RL, 4.1-nano level


Nous Research AI ▷ #research-papers (2 messages):

DeepMath-103K Dataset, RLVR Applications


Nous Research AI ▷ #interesting-links (3 messages):

langwatch/scenario GitHub Repository, RFC5545, draft-ietf-calext-ical-tasks-13


Nous Research AI ▷ #research-papers (2 messages):

DeepMath-103K Dataset, RLVR, Massive math dataset


Modular (Mojo 🔥) ▷ #general (19 messages🔥):

Mojo extensions on OpenVSX, Closed vs Open Source VS Code, Microsoft vs Community VS Code Extensions, Modular business model, April 14 meeting recording


Modular (Mojo 🔥) ▷ #mojo (51 messages🔥):

Quantity type system in Mojo, Mojo compiler bugs, StringLiteral type system in Mojo, syscalls in Mojo, Linux syscall ABI

@value
struct Bar[S: StringLiteral]:
    pass

fn foo[A: StringLiteral, B: StringLiteral](out res: Bar[A or B]):
    return __type_of(res)()

fn main():
    print(foo['', 'baz']().S) # baz

MCP (Glama) ▷ #general (59 messages🔥🔥):

FastMcp library, Msty Studio for hot swapping LLMs, Best way to use MCP servers in RooCode/Cline, Open Empathic Project Plea for Assistance, Google Docs MCP with fast MCP


MCP (Glama) ▷ #showcase (10 messages🔥):

Klavis AI Launch, Open Source Remote MCP SSH Server & Client, Slack MCP, Google Docs MCP, Bidirectional Communication between Chat Services


Notebook LM ▷ #announcements (1 messages):

NotebookLM new features, User feedback on NotebookLM, NBLM users needed


Notebook LM ▷ #use-cases (12 messages🔥):

Google Keep vs. Google Docs for Note-Taking, Integrating Notebook LM with Google Apps, Using Notebook LM for Microsoft Documentation


Notebook LM ▷ #general (45 messages🔥):

Gemini 2.5 release date, No-code web builder, Career in DevOps, NotebookLM podcast translation, Character limit in chat


LlamaIndex ▷ #blog (6 messages):

GPT-4.1, Agent Benchmarks, Hierarchical Multi-Agent System, Agents and Data, AI Knowledge Agents


LlamaIndex ▷ #general (15 messages🔥):

Phoenix tracing for Anthropic, AnyAgent library, LlamaIndex managed agents, Pinecone multiple namespaces, Beta testers


Eleuther ▷ #announcements (1 messages):

ICLR, Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon, Bridging the Data Provenance Gap Across Text, Speech, and Video, PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs, Aria-MIDI: A Dataset of MIDI Files for Symbolic Music Modeling


Eleuther ▷ #general (5 messages):

Model selection for formalization/coder, Ceph project adds key/value storage to llama.cpp, Hidden state extraction script


Eleuther ▷ #research (12 messages🔥):

Alignment Tension in LLMs, Multimodal Data Approaches, Cross-Domain Applicability


Latent Space ▷ #ai-general-chat (13 messages🔥):

Quasar, Optimus, GPT-4o, LlamaCon, Meta Dev Conference


Latent Space ▷ #ai-announcements (1 messages):

swyxio: special pod on GPT 4.1 with OAI! https://www.youtube.com/watch?v=y__VY7I0dzU&t=415s


Latent Space ▷ #llm-paper-club-west (3 messages):

X-Ware.v0, Red - X-Ware.v0, Dylan522p Tweet


Torchtune ▷ #general (1 messages):

Office Hours


Torchtune ▷ #dev (2 messages):

Validation Set PR, GRPO Bug Fixes


Torchtune ▷ #papers (14 messages🔥):

Deep Cogito V1 Model Release, IDA Method Implementation, AI Alignment and Theoretical Ideas, Vibe Versioning


Cohere ▷ #「💬」general (3 messages):

vLLM, Docker, H100 GPUs, memory optimization


Cohere ▷ #「🔌」api-discussions (1 messages):

adonisthegoat: Hi, I'm curious when cohere plans to support embed-v4.0 in the Jobs api


Cohere ▷ #「💡」projects (1 messages):

Command A, Agent Mode, OpenAI compat API, Continuedev


tinygrad (George Hotz) ▷ #learn-tinygrad (4 messages):

Printing Bugs, Tinygrad Notes


Nomic.ai (GPT4All) ▷ #general (2 messages):

Webmaster Dreams, Appreciation in Web Development






{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}