Frozen AI News archive

Alibaba Yunqi: 7 models released in 4 days (Qwen3-Max, Qwen3-Omni, Qwen3-VL) and $52B roadmap

**Alibaba's Tongyi Qianwen (Qwen) team** launched major updates including the **1T parameter Qwen3-Max**, **Qwen3-Omni**, and **Qwen3-VL** models, alongside specialized versions like **Qwen3Guard**, **Qwen3-LiveTranslate**, **Qwen3-TTS-Flash**, **Qwen-Image-Edit**, and **Qwen3Coder**. At the **AliCloud Yunqi (Apsara) conference**, CEO **Eddie Wu** outlined a $52B roadmap emphasizing two AI development stages: "intelligence emergence" focusing on learning from humans and reasoning, and "autonomous action" highlighting AI's tool use and real-world task execution. The updates showcase advances in **tool use**, **large-model coding capabilities**, and AI's expanding role across industries such as logistics, manufacturing, biomedicine, and finance. Junyang Lin and Alibaba Wan are key spokespersons for these developments. The Qwen project is now seen as a "frontier lab" for AI innovation.

Canonical issue URL

Qwen is all you need?

AI News for 9/23/2025-9/24/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (194 channels, and 2236 messages) for you. Estimated reading time saved (at 200wpm): 188 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Today is both AI Engineer Paris and AliCloud's annual Yunqi aka Apsara conference, and the Tongyi Qianwen (aka Qwen) team has been working overtime to launch updates of all their models, including the major ones: the monster 1T model Qwen3-Max (previewed 3 weeks ago), Qwen3-Omni, and Qwen3-VL, with Qwen3Guard, Qwen3-LiveTranslate, Qwen3-TTS-Flash, and updates to Qwen-Image-Edit and Qwen3Coder. Here's how Junyang Lin, their primary spokesperson in AI Twitter, put it:

Just to visualize the step up of velocity, here's all the Qwen releases this year visualized:

Not to forget all the work from Alibaba Wan too, but Qwen is now being regarded as a "frontier lab" with all these releases.

Alibaba's CEO Eddie Wu took to the stage to map out their $52B USD roadmap:

Here's a translation of the speech:

They are also recent converts to the LLM OS thesis.


AI Twitter Recap

Compute buildout: OpenAI–NVIDIA deal, Stargate expansion, and the gigawatt era

Qwen’s multi-model salvo: Max, VL‑235B‑A22B, Omni, Coder‑Plus, Guard, and LiveTranslate

OpenAI’s GPT‑5‑Codex and agent tooling move to the fore

Retrieval, context engineering, and agent research

Video and 3D content: Kling 2.5 Turbo, Ray 3 HDR, and more

Systems, kernels, and inference

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Qwen3-Max Release and Benchmarks

2. Qwen Shipping Speed Memes/Discussion

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. Wan 2.2/2.5 Video Demos + Qwen-Image-Edit GGUF and LMarena Leaderboard

2. OpenAI Infrastructure, Funding, and Product Changes/User Feedback

3. AI Humor and Speculation Memes (cats, immortality, money glitch, seahorses)


AI Discord Recap

A summary of Summaries of Summaries by gpt-5

1. GPT-5-Codex Rolls Into IDEs and APIs

2. Qwen3 Multimodal Suite: Omni, VL, and Image Edit

3. Agent Benchmarks and Builder Tooling

4. Research Spotlight: Faster Diffusion, Smarter Audio

5. DSPy: Profiles, Prompts, and Practical GEPA


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


Cursor Community Discord


OpenRouter Discord


HuggingFace Discord


GPU MODE Discord


Latent Space Discord


Yannick Kilcher Discord


LM Studio Discord


aider (Paul Gauthier) Discord


Modular (Mojo 🔥) Discord


OpenAI Discord


DSPy Discord


tinygrad (George Hotz) Discord


Nous Research AI Discord


Eleuther Discord


Moonshot AI (Kimi K-2) Discord


Windsurf Discord


MCP Contributors (Official) Discord


Manus.im Discord Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (826 messages🔥🔥🔥):

Image Generation Limits on Perplexity, Qwen Model Releases, Using Custom Instructions, Perplexity Email Assistant, Open Router Web Search Functionality


Perplexity AI ▷ #sharing (9 messages🔥):

Shareable threads on Perplexity, Perplexity Pro Referral Codes


LMArena ▷ #general (294 messages🔥🔥):

Image Editing AI, Nano Banana, Seedream, Model Awareness of Conversation History, GPTs Agents


LMArena ▷ #announcements (1 messages):

deepseek-v3.1-terminus, LMArena, Model Evaluation


Cursor Community ▷ #general (248 messages🔥🔥):

Cursor line reading limits, GPT-5-CODEX rollout, Chrome DevTools MCP Server, Playwright MCP Alternative, Supernova model evaluation


Cursor Community ▷ #background-agents (2 messages):

Zombie process analysis, Zombie process escalation


OpenRouter ▷ #announcements (2 messages):

GPT-5-Codex launch, Agentic coding workflows, OpenRouter-compatible coding tools, Chatroom recommended parameters


OpenRouter ▷ #app-showcase (1 messages):

eofr: Scam


OpenRouter ▷ #general (173 messages🔥🔥):

Deepseek 3.1 uptime issues, OpenRouter iOS app, Qwen3 VL


OpenRouter ▷ #new-models (3 messages):

``


OpenRouter ▷ #discussion (2 messages):

4Wallai benchmarks


HuggingFace ▷ #general (100 messages🔥🔥):

TTS narration, Open Models for narration, ML Course recommendations, Private LLM


HuggingFace ▷ #i-made-this (4 messages):

Go wrapper for tokenizers library, Canis.lab launch


HuggingFace ▷ #computer-vision (1 messages):

Menu Translation, Gemini 2.5 Flash, Taiwanese Signage Menus, OCR for spaced characters


HuggingFace ▷ #smol-course (2 messages):

Canis.lab, Synthetic Data, Eval Dataset issues


HuggingFace ▷ #agents-course (1 messages):

RAG Courses, Bangla Retrieval, Multimodal Support


GPU MODE ▷ #general (14 messages🔥):

Python Profiling, DeepGEMM Benchmarking, NCU Clock Control, GPU Kernel Downclocking


GPU MODE ▷ #cuda (3 messages):

mbarrier instructions, cuda::barrier, cuda::memcpy_async, inline PTX, CCCL


GPU MODE ▷ #beginner (2 messages):

CUDA Documentation, Memory vs Compute Bound


GPU MODE ▷ #off-topic (20 messages🔥):

Slurm Reading Material, Sysadmin/Devops Channel, Kubernetes + Slurm + Docker, Flux from LLNL


GPU MODE ▷ #self-promotion (7 messages):

CuTe Layout Algebra, Colfax Team Paper, Categorical treatment, WMMA/MMA instruction, NVRTC MMA


GPU MODE ▷ #avx (2 messages):

AVX512, BPE, Tiktoken, Huggingface, Data Loading Optimization


GPU MODE ▷ #edge (2 messages):

Cubesat hardware, Cubesat software, Error Correction, Redundancy, RasPi Cubesats


GPU MODE ▷ #submissions (2 messages):

MI300x8, amd-gemm-rs leaderboard


GPU MODE ▷ #status (1 messages):

Runner Issues, Timeouts, Debugging with AMD and DigitalOcean


GPU MODE ▷ #factorio-learning-env (3 messages):

GEPA, Deepseek Neel eval


GPU MODE ▷ #amd-competition (33 messages🔥):

MI300X Environment, Docker Image for Benchmarks, GEMM Submission Timeout, Cluster Health Issue, All2All Custom Kernel Data Access


GPU MODE ▷ #cutlass (8 messages🔥):

Shape Compatibility, CUTE documentation, PTX Diagrams


GPU MODE ▷ #singularity-systems (2 messages):

Eager Mode, Graph Mode, Tinygrad's IR, Tensor Sugar, Torch vs. Jax


GPU MODE ▷ #cluster-management (8 messages🔥):

GPU Reservations, Slurm and Docker, Singularity vs Docker, llm-d.ai for cluster management


Latent Space ▷ #ai-general-chat (45 messages🔥):

Meta's ARE and Gaia2, Cline's Agentic Algorithm, Greptile's $25M Series A, Cloudflare's VibeSDK, GPT-5-Codex Release


Latent Space ▷ #genmedia-creative-ai (4 messages):

Foo Fighters, Artists using AI


Yannick Kilcher ▷ #general (2 messages):

Paper Reading Events, Yannick's Reading List


Yannick Kilcher ▷ #paper-discussion (17 messages🔥):

Diffusion ODE Solver, MiMo-Audio, Diversity is all you need


Yannick Kilcher ▷ #ml-news (12 messages🔥):

Gaia2, Meta Agents Research Environments (ARE), GPT5 Models, Cloudflare Vibesdk, Compilebench


LM Studio ▷ #general (21 messages🔥):

LM Studio Model Support, GGUF/MLX Models, Qwen-3-omni, Google Gemini Free Tier


LM Studio ▷ #hardware-discussion (2 messages):

Innosilicon GPU, DirectX12 Support, Ray Tracing Hardware


aider (Paul Gauthier) ▷ #general (11 messages🔥):

Response API Support, GPT-5-Codex Integration, aider and litellm


aider (Paul Gauthier) ▷ #questions-and-tips (8 messages🔥):

aider ollama setup, Aider reads MD file, Context Retransmitted, Prompt Caching


Modular (Mojo 🔥) ▷ #general (18 messages🔥):

RISC-V Performance, Tenstorrent's MMA accelerator + CPU combos, RISC-V 32-bit and 64-bit, RISC-V Bringup, RISC-V ISA


OpenAI ▷ #annnouncements (1 messages):

Stargate Sites, Oracle, SoftBank, 10-Gigawatt Commitment


OpenAI ▷ #ai-discussions (14 messages🔥):

Codex Fallback, Sora Issues, Ternary System Study, Github Copilot Alternative, kilocode


OpenAI ▷ #prompt-engineering (1 messages):

GPT4o Translations, Chain of Thought


OpenAI ▷ #api-discussions (1 messages):

GPT4o translation, Chain of thought in translation


DSPy ▷ #show-and-tell (4 messages):

DSPy profiles, dspy-profiles, LLM behavior


DSPy ▷ #general (8 messages🔥):

GEPA Multimodality Performance Issue, Passing images and PDFs into DSPy, VLMs for Data Extraction, OCR Approaches for Data Extraction, Best PDF or Image Parsing Stuff


DSPy ▷ #examples (5 messages):

Prompt Optimization, GEPA, AI Safety Research, Trusted Monitor, Comparative Metric with Feedback


tinygrad (George Hotz) ▷ #general (12 messages🔥):

High-Level IRs like Triton, Multi-Layer IR Stack, Hardware-Incomplete vs Complete IRs, Search and Learning in Compilers, Graph-Based Models for Compilers


Nous Research AI ▷ #general (6 messages):

TRL Assessor, Nous Tek


Nous Research AI ▷ #ask-about-llms (6 messages):

Distributed Learning, Code Genetics, Model Non-Homology


Eleuther ▷ #general (3 messages):

AI Behavioral Coherence, Mathematical AI Constraints, Davinci Architecture


Eleuther ▷ #research (8 messages🔥):

Zero Knowledge Proofs, SwiGLU up-projection, Model Tampering Defenses


Moonshot AI (Kimi K-2) ▷ #general-chat (3 messages):

pydantic-ai lib


Windsurf ▷ #announcements (2 messages):

GPT-5-Codex, Figma MCP server, Windsurf update, Remote Figma integration


MCP Contributors (Official) ▷ #mcp-dev-summit (2 messages):

MCP Dev Summit, Apify & Jentic Happy Hour