Frozen AI News archive

not much happened today

**Kling 2.5 Turbo** leads in text-to-video and image-to-video generation with competitive pricing. **OpenAI Sora 2** shows strong instruction-following but has physics inconsistencies. **Google Gemini 2.5 Flash** "Nano Banana" image generation is now generally available with multi-image blending and flexible aspect ratios. **IBM Granite 4.0** introduces a hybrid Mamba/Transformer architecture with large context windows and strong token efficiency, outperforming some peers on the Intelligence Index. **Qwen** models receive updates including fine-tuning API support and improved vision capabilities. **Tinker** offers a flexible fine-tuning API supporting LoRA sharing and CPU-only training loops. The ecosystem also sees updates like **Synthesia 3.0** adding video agents.

Canonical issue URL

a quiet day

AI News for 10/1/2025-10/2/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (196 channels, and 8860 messages) for you. Estimated reading time saved (at 200wpm): 629 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

It's a quiet day so you can check out the latest Latent Space with Dylan Field pod!

Also, invites for the first AI Engineer Code Summit have started going out.


AI Twitter Recap

Video generation: Sora 2, Kling 2.5 Turbo, and Google’s “Nano Banana” GA

Open-weight model releases: IBM Granite 4.0 and Qwen updates

Fine‑tuning and systems: Tinker, rank‑1 LoRA, MoE support, and inference speedups

RL and reasoning: search‑in‑training, broadened exploration, latent CoT, front‑loaded reasoning

Agents and toolchains: CLI + semantic search, Notebook MCP, browsers, and CLIs

Leaderboards and real‑world coding agent metrics

Top tweets (by engagement)


AI Reddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. Sora 2 and WAN 2.2 Video Generation Demos

2. OpenAI $500B Valuation + ChatGPT 'Think Longer' UX + Silicon Valley Foresight

3. AI Comedy Threads: 'Strangest Flea Market' Pt.7 and Related Skits


AI Discord Recap

A summary of Summaries of Summaries by gpt-5

1. IBM Granite 4.0 Hybrid Models Launch

2. Unsloth Training Stack: Docker, RL Speedups, and New Tricks

3. GPU Systems: Determinism, Flash‑MoE, and Kernel Fusion

4. OpenRouter: Routing Metrics, Fees, and New Models

5. LMArena: Reasoning Trace and Leaderboard Shifts


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


Unsloth AI (Daniel Han) Discord


OpenAI Discord


LM Studio Discord


Cursor Community Discord


OpenRouter Discord


Eleuther Discord


GPU MODE Discord


Latent Space Discord


Nous Research AI Discord


Yannick Kilcher Discord


Manus.im Discord Discord


DSPy Discord


aider (Paul Gauthier) Discord


MCP Contributors (Official) Discord


Modular (Mojo 🔥) Discord


Moonshot AI (Kimi K-2) Discord


tinygrad (George Hotz) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Windsurf Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #announcements (2 messages):

o3 Deprecation, GPT-5 Thinking


Perplexity AI ▷ #general (1281 messages🔥🔥🔥):

Comet Browser, Discord Quest, Troubleshooting, User Experience, AI and Personal Data


Perplexity AI ▷ #sharing (8 messages🔥):

PC BUILD, Perplexity AI apps, bootstrap-paradox


Perplexity AI ▷ #pplx-api (1 messages):

Sonar-Pro API, 404 Errors, Public Resources


LMArena ▷ #general (1348 messages🔥🔥🔥):

Sora 2, Gemini 3, Qwen3 4B 2507 instruct, 4o model, OpenAI safety


LMArena ▷ #announcements (5 messages):

Arena Champions Role, Reasoning Trace, New Model Update - reve-v1, New Model Update - claude-sonnet-4-5-20250929-thinking-32k, Leaderboard Update


Unsloth AI (Daniel Han) ▷ #general (324 messages🔥🔥):

Qwen3 deep research, Manual compiling of xformers on Blackwell, LLMs on blockchain, Unsloth supporting RWKV architecture, Synthetic dataset generation without vLLM


Unsloth AI (Daniel Han) ▷ #introduce-yourself (3 messages):

Blockchain and AI synergy, Trust in Code, Consensus mechanisms, AI problem-solving


Unsloth AI (Daniel Han) ▷ #announcements (1 messages):

Unsloth Docker image, IBM Granite-4.0, gpt-oss RL, Vision RL, GLM-4.6


Unsloth AI (Daniel Han) ▷ #off-topic (638 messages🔥🔥🔥):

WSL for development, Sonnet 4.5 for coding, Custom LSTM memory, Kaggle Notebook Training, Data extraction using LLMs


Unsloth AI (Daniel Han) ▷ #help (164 messages🔥🔥):

Fine-tuning subtitles into Q&A format, GGUF Conversion Issues, Gemma3 and vLLM Compatibility, ONNX conversion for Gemma3, Multiprocessing Problems with Unsloth


Unsloth AI (Daniel Han) ▷ #research (41 messages🔥):

Tversky-All GPT2 reproduction, Efficient Training Setup, AMXFP4 Precision, quack kernels


OpenAI ▷ #ai-discussions (722 messages🔥🔥🔥):

Cameo usage on TikTok, Sonnet 4.5 vs GLM 4.6 Cost, Overrun of Sora users, Deepfake generation, System Artifacts Log for Emerging Validation of Novelty


OpenAI ▷ #gpt-4-discussions (2 messages):

Sora as Social Media, Sora credits, Sora integration


OpenAI ▷ #prompt-engineering (11 messages🔥):

Human writing prompts, Sora camera control, Portrait vs Landscape in Sora


OpenAI ▷ #api-discussions (11 messages🔥):

Writing Prompts, Sora Camera Control, Portrait vs Landscape Generation


LM Studio ▷ #general (542 messages🔥🔥🔥):

Surface Pro Snapdragon X Elite, Artifacts as emergent validation of novelty, Model Quantization and Quality Tradeoffs, GPT-OSS, LM Studio Linux Install


LM Studio ▷ #hardware-discussion (115 messages🔥🔥):

4090 vs 5090 for vertical scaling, Arc B50 Pro benchmarks, GPT OSS 120b hardware recommendations, DDR3 vs DDR4 for GPU offloading, Unsloth vs LM Studio for LLM's


Cursor Community ▷ #general (601 messages🔥🔥🔥):

Git Worktree in Cursor, Beta Functions in Cursor, Typescript Refactor with Cursor, Memory Leaks with Cursor on MacOS, Cursor Hackaton?


OpenRouter ▷ #announcements (5 messages):

OpenRouter Performance Tab, Grok-4-Fast


OpenRouter ▷ #app-showcase (2 messages):

RPG, Mixture of LLMs


OpenRouter ▷ #general (495 messages🔥🔥🔥):

OpenRouter BYOK, Free Inference Providers, Grok vs Sonoma, Gemini Pro Performance Issues, Deepseek R1 0528 deprecation


OpenRouter ▷ #discussion (23 messages🔥):

Sora.com and new model, BYOK tokens, Latency vs E2E latency, Qwen image model, Cerebras removing Llama


Eleuther ▷ #general (7 messages):

Perplexity AI framework, Deepseek Sparse Attention, Underrated LLM pretraining papers, Attention Matrices, LLM Attention Research


Eleuther ▷ #research (28 messages🔥):

Gradient Descent Dynamics, Symmetry Transformer, ViT training, Compact Image Representation, Quantifying Scientific Impact


Eleuther ▷ #scaling-laws (91 messages🔥🔥):

SOTA scaling of MLPs, AUNN implementation and efficiency, Test-time training (TTT) framework vs AUNN, Inductive bias in sequence models, Computational cost of different model architectures


GPU MODE ▷ #general (12 messages🔥):

GEMM optimization, Tversky paper implementation, DeepSeek Sparse Attention in CUDA, GPU performance engineering career path


GPU MODE ▷ #cuda (18 messages🔥):

RF meaning, Volkov's paper, mbarriers vs barriers


GPU MODE ▷ #torch (2 messages):

LLM Training, Cross Entropy, Gradient Norm, Sparse Tensors, Torch Compile


GPU MODE ▷ #cool-links (4 messages):

Non-determinism in LLM Inference, Flash-MoE, Nvidia Compiler Techniques, Warp Specialization, Distributed Setting


GPU MODE ▷ #jobs (1 messages):

schizik12: <@325883680419610631> spam


GPU MODE ▷ #beginner (9 messages🔥):

Benchmarking Guides, Kernel Benchmarking, Career Opportunities in GPU Programming, Gaining Experience in GPU Programming, GEMM Optimization


GPU MODE ▷ #pmpp-book (2 messages):

Learning vs. Job Performance, C++ Requirement for a Book, 5090 GPU Learning Experience


GPU MODE ▷ #jax (1 messages):

Blackwell, matmuls, jax


GPU MODE ▷ #torchao (3 messages):

INT4 Quantization, TorchAO, TensorCore, A100 GPUs, Efficient Kernels


GPU MODE ▷ #self-promotion (2 messages):

GPU Engineering, MMA Tensor Cores


GPU MODE ▷ #submissions (5 messages):

MI300x8, amd-gemm-rs, amd-all2all, amd-ag-gemm


GPU MODE ▷ #tpu (9 messages🔥):

Cloud TPUs, JupyterLab, gcloud CLI, rclone


GPU MODE ▷ #factorio-learning-env (11 messages🔥):

Lab Play Interpretation, Open Play Development, PIP Stuff Discussion, GIF Updates


GPU MODE ▷ #cutlass (31 messages🔥):

permutation_mnk rules, tiled_mma, CooperativeGroup.__init__ alignment, Uniform Registers (URs)


GPU MODE ▷ #multi-gpu (2 messages):

nccl::all_to_all performance, bf16 vs fp8


GPU MODE ▷ #low-bit-training (4 messages):

LLM Training Acceleration, Linear Cross Entropy, Sequence Packed Training, Quack Optimization


Latent Space ▷ #ai-general-chat (103 messages🔥🔥):

AI-integrated MMO, Karpathy on Sutton and Bitter Lesson, Hume AI Octave 2, Mistral's formal-math team, Scalable Option Learning (SOL)


Latent Space ▷ #ai-announcements (4 messages):

Dylan Field, Figma, Latent Space, Make, MCP


Latent Space ▷ #genmedia-creative-ai (8 messages🔥):

Mosaic AI video editor launch, Sora-TikTok automation monetization


Nous Research AI ▷ #general (25 messages🔥):

Nous Research Model similar to GPT-4.5, Gemini answers, Veo3 gems, Granite language models, Qwen 30B A3B for CPU


Nous Research AI ▷ #research-papers (3 messages):

LLMs Strategically Lie, Sparse Autoencoder Tools, Goodfire AI, Model Dishonesty Detection


Nous Research AI ▷ #research-papers (3 messages):

LLM Deception, Sparse Autoencoders, Goodfire AI


Yannick Kilcher ▷ #general (17 messages🔥):

Deepmind Code Incompleteness, RoPE Implementation


Yannick Kilcher ▷ #paper-discussion (6 messages):

Knowledge Distillation, Semantic Equivalence, RL for Fuzzy Prediction


Yannick Kilcher ▷ #ml-news (7 messages):

IBM Granite 4.0, Mamba/Transformer architecture, ISO 42001 certification, Oracle Business Model, OpenAI datacenters


Manus.im Discord ▷ #general (27 messages🔥):

Credits Issue, Memory Key Protocol, Sora invite code, Manus API key, Neuro-cognitive agentic logic layer


DSPy ▷ #show-and-tell (2 messages):

AGI Introduction, Hugging Face Paper


DSPy ▷ #general (23 messages🔥):

Caching Prompt Order, DSPyWeekly Search Feature, JSONAdaptor vs ChatAdaptor vs XMLAdaptor, Tool Use RL for Models, OpenAI Function calling and MCP


aider (Paul Gauthier) ▷ #general (16 messages🔥):

Qwen Coder Models, aider Development, aider-desk UI, Model Discussions Channel


aider (Paul Gauthier) ▷ #questions-and-tips (8 messages🔥):

DeepWiki, Custom Chat Templates, GBNF, Multi-Line Prompts, LLM Polyglot Performance


MCP Contributors (Official) ▷ #mcp-dev-summit (10 messages🔥):

Ye Olde London meetup, Registry Team Livestream, Asynchronous Tool Calls Livestream, Security and Ops Track, Talk about Profiles


MCP Contributors (Official) ▷ #general (6 messages):

Tool Call Support, Reference Implementation, OCI Interface for MCP Servers


Modular (Mojo 🔥) ▷ #general (3 messages):

Qualcomm contacting Modular, Mojo Manual update, Level 2 badge unlocked


Modular (Mojo 🔥) ▷ #mojo (10 messages🔥):

Mojo notebook, GPU Compatibility, Mojo distributed computing


Moonshot AI (Kimi K-2) ▷ #general-chat (7 messages):

Kimi new features, Sora video quality, Pro Subscription watermarks