Frozen AI News archive

not much happened today

**Alibaba** released compact dense **Qwen3-VL** models at 4B and 8B sizes with FP8 options, supporting up to 1M context and open vocabulary detection, rivaling larger models like **Qwen2.5-VL-72B**. Ecosystem support includes **MLX-VLM**, **LM Studio**, **vLLM**, **Kaggle models**, and **Ollama Cloud**. In video AI, **Arena** added **Sora 2** models leading in video benchmarks, with **Higgsfield Enhancer** improving video quality. **Runway** launched domain-specific workflow apps for creative tasks. Research on **Representation Autoencoders for DiTs (RAE-DiT)** shows improved diffusion model performance. On local training, **NVIDIA DGX Spark** enables strong local fine-tuning, while **Nanochat** by **Karpathy** offers a minimal stack for training and inference. **Together AI** introduced **ATLAS**, a speculative decoding method achieving up to 4× faster inference on **DeepSeek-V3.1**. These developments highlight advances in efficient model deployment, video AI, local fine-tuning, and inference speed optimization.


a quiet day

AI News for 10/13/2025-10/14/2025. We checked 12 subreddits, 544 Twitters and 23 Discords (197 channels, and 6882 messages) for you. Estimated reading time saved (at 200wpm): 510 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!



AI Twitter Recap

Alibaba’s Qwen3‑VL Dense Models (4B/8B) and Rapid Ecosystem Support

Video Models and Creative Tools

Local Training and Inference: DGX Spark, Nanochat, and Inference Speculation

Agents, Tool Use, and RL

Search, Retrieval, and Data Tools

Policy, Product, and Platform Notes

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Local-Only AI Ownership Slogan

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. OpenAI ChatGPT Adult-Content Rollout and Personality Relaxation (Dec rollout)

2. Duplicate Reposts: Vintage TV/Music Clips (Elvis 1977; Mr Rogers 'Crashes Out')

3. AI/Robotics Visual Demos and Posters (Gunkata meme; Qwen+Wan I2V; Humanoid lineup)


AI Discord Recap

A summary of Summaries of Summaries by gpt-5

1. AI Hardware: Custom Silicon, GPUs, and Kernel Tricks

2. Open-Source Training Tools and Custom Devices

3. Massive Datasets and Embedding Nuances

4. Agent Platforms and Frameworks

5. DGX Spark: Reality Check on Bandwidth and Value


Discord: High level Discord summaries

Perplexity AI Discord


OpenAI Discord


Cursor Community Discord


LM Studio Discord


Unsloth AI (Daniel Han) Discord


OpenRouter Discord


HuggingFace Discord


GPU MODE Discord


Nous Research AI Discord


Modular (Mojo 🔥) Discord


Latent Space Discord


Yannick Kilcher Discord


Moonshot AI (Kimi K-2) Discord


Eleuther Discord


DSPy Discord


MCP Contributors (Official) Discord


tinygrad (George Hotz) Discord


Manus.im Discord Discord


aider (Paul Gauthier) Discord


Windsurf Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.




Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1135 messages🔥🔥🔥):

Opera GX vs Chrome, ChatGPT vs Perplexity, Comet browser security, Free Pro, Gemini 2.5 Pro


Perplexity AI ▷ #sharing (2 messages):

Palantir, US Government, Takeover


Perplexity AI ▷ #pplx-api (1 message):

haydon0864: Why is my spaces not allowing me to create a new chat within any of my existing spaces


OpenAI ▷ #annnouncements (2 messages):

OpenAI chips, Expert Council on Well-Being and AI


OpenAI ▷ #ai-discussions (727 messages🔥🔥🔥):

AI and Emotional Dependency, Sora Watermark Removal, Python vs Other Languages, Kilocode-CLI, PGVector setup


OpenAI ▷ #gpt-4-discussions (10 messages🔥):

GPT updates, Speech to Speech models, GPT-5 study stem


OpenAI ▷ #prompt-engineering (51 messages🔥):

DSM-VM critique, Quantum superpositioning debate, Token Cascade Model, LLM Crossword Solving Limitations, Prompt Engineering


OpenAI ▷ #api-discussions (51 messages🔥):

DSM-VM critique, Quantum Superposition debate, Token Cascade Model, LLM crossword solving, Prompt engineering resources


Cursor Community ▷ #general (647 messages🔥🔥🔥):

Cheetah model insane speed, Gemini 3.0, GPT-5 too stupid, Student Discount


Cursor Community ▷ #background-agents (2 messages):

Cursor stopped responding, Linear issues with Cursor, Cursor unresponsive with Linear


LM Studio ▷ #general (201 messages🔥🔥):

Whole Message strategies, LM Studio API Context Window, Deterministic Output Testing, LLM Determinism, MCP Servers


LM Studio ▷ #hardware-discussion (186 messages🔥🔥):

LM Studio, GPU vs CPU inference, M series mac for LLM, NPU on LMStudio, SSD tips and tricks


Unsloth AI (Daniel Han) ▷ #general (295 messages🔥🔥):

VLM fine-tuning with LoRA, Custom UI for loss trajectory, Qwen3-4B-Instruct fine-tuning, Kimi K2 Groq implementation, DGX Spark review


Unsloth AI (Daniel Han) ▷ #off-topic (33 messages🔥):

Linux Distro for Dev Server, Multimodal Question Logic, NVIDIA DGX Spark Comparison, Sydney Student's Unix OS in Rust, LLM OS


Unsloth AI (Daniel Han) ▷ #help (36 messages🔥):

MacBook battery issues, vLLM and RL for gpt-oss, RL learning resources, Saving and loading fine-tuned models, B200 vs T4 speed


Unsloth AI (Daniel Han) ▷ #showcase (4 messages):

Unsloth-powered R&D models, AI Podcast with TTS using Ollama


Unsloth AI (Daniel Han) ▷ #research (4 messages):

Job automation, Hack Week projects, Model improvement


OpenRouter ▷ #app-showcase (2 messages):

OpenRouter Bot, Feedback Request, Non-coder bot builder


OpenRouter ▷ #general (306 messages🔥🔥):

Google's Gemini Android Play Store publishing, OpenRouter embedding near 2026, inclusionai/ling-1t model, Kimi K2 model instability, DeepSeek models issues


OpenRouter ▷ #new-models (2 messages):



OpenRouter ▷ #discussion (21 messages🔥):

Chutes Provider Downvoting Scandal, Gemini Flash Preview issues, OpenRouter's Payments to Anthropic, SambaNova Status and DeepSeek Terminus Hosting


HuggingFace ▷ #general (183 messages🔥🔥):

Teacher Forcing Issues, Apriel-1.5-15b-Thinker-GGUF Model, Ollama vs HuggingFace Embeddings, Model Fine-Tuning, Civitai Content Removal


HuggingFace ▷ #today-im-learning (1 message):

Andrej Karpathy, fullstack LLMs, nanochat-students


HuggingFace ▷ #i-made-this (8 messages🔥):

Dataset Curation, ArXiv Papers Dataset, GitHub Code Dataset, Dataset Licensing


HuggingFace ▷ #computer-vision (2 messages):

Cloud GPUs, Object Detection


HuggingFace ▷ #NLP (1 message):

jazzco0151: https://discord.com/api/oauth2/token


HuggingFace ▷ #smol-course (4 messages):

nanochat course, Andrej Karpathy, LLMs guides


HuggingFace ▷ #agents-course (4 messages):

Certificate of Completion, Posting too quickly


GPU MODE ▷ #general (7 messages):

SOSP in S.Korea, Blackwell GEMM DSL, DSA Efficiency, GPU Programming Trend, vLLM and SGLang Determinism Tests


GPU MODE ▷ #triton (1 message):

Triton Kernel, Scalar value casting, Large double values, inf issue


GPU MODE ▷ #cuda (4 messages):

Threadblock 0 special case, Race Condition Detection with Compute Sanitizer, Warps behavior during cluster sync


GPU MODE ▷ #torch (5 messages):

PyTorch, Matrix Multiplication, CPU implementation, MKL


GPU MODE ▷ #beginner (12 messages🔥):

Matrix Multiplication Blog, Compiler Optimizations for GPUs, GPU programming starting point, Sites similar to leet gpu, Pearson Correlation kernel


GPU MODE ▷ #jax (1 message):

Pallas:MGPU, NVLINK comms with local compute, all-gather collective matmul


GPU MODE ▷ #irl-meetup (1 message):

Multi-node kernel hackathon


GPU MODE ▷ #intel (3 messages):

Crescent Island, LPDDR5X, Xe3P


GPU MODE ▷ #self-promotion (1 message):

AlphaFold 3, MegaFold


GPU MODE ▷ #🍿 (1 message):

Agent Hacking, Kernelbench v0.1, Sakana Paper Removal


GPU MODE ▷ #submissions (31 messages🔥):

MI300x8 Leaderboard Updates, amd-all2all performance, amd-gemm-rs benchmarks, amd-ag-gemm submissions


GPU MODE ▷ #status (5 messages):

Leaderboard Deadline, PST vs UTC, Time Discrepancies


GPU MODE ▷ #amd-competition (11 messages🔥):

MI300x Access, Competition Runners, HotAisle's Offer


GPU MODE ▷ #cutlass (3 messages):

MoE, GEMV, Qwen3 variants


GPU MODE ▷ #singularity-systems (6 messages):

Python/Rust interop, OpenCL kernels, Autograd and backward kernels, Correctness and speed testing, SITP/picograd with gpumode compute


GPU MODE ▷ #general (15 messages🔥):

VSCode extension for GPU Mode, Outdated documentation on GPU Mode website, Submitting kernels to PMPP v2, Bug in reference-kernels repo, Self-selecting working group roles


GPU MODE ▷ #multi-gpu (9 messages🔥):

Multi-GPU Systems, HPC Research, Data Movement, Latency and Bandwidth


GPU MODE ▷ #helion (5 messages):

Helion Contributions, GPU Mode Talk


Nous Research AI ▷ #general (96 messages🔥🔥):

Veo Model Annotation, Qwen VL Model inference, SAM 3 Model, DGX Spark, DeMO optimizer


Nous Research AI ▷ #ask-about-llms (4 messages):

Rage-bait attractor, Gemini's response


Nous Research AI ▷ #research-papers (1 message):

arxiv 2410.10450, model setup difficulty, good repo for llama


Nous Research AI ▷ #research-papers (1 message):

arXiv Paper Discussion, Model Setup Difficulty, Helpful Repository


Modular (Mojo 🔥) ▷ #general (6 messages):

ARM Linux support, DGX Spark compatibility, Mojo on Jetson Orin Nano


Modular (Mojo 🔥) ▷ #mojo (71 messages🔥🔥):

SCTP vs QUIC, WebRTC datachannels, Mojo testing framework deprecation, Mojo type reflection, Iroh cross platform


Modular (Mojo 🔥) ▷ #max (1 message):

TorchAX, Pure Python Custom Devices, JAX device in Torch


Latent Space ▷ #ai-general-chat (69 messages🔥🔥):

Salesforce Agent Scripting, Agentic Platforms like Devin, Google Gemini 3 with Jules, Nvidia DGX Spark, Anthropic Deepens Salesforce Partnership


Latent Space ▷ #private-agents (1 message):

AI Freelancing, Model Fine-Tuning, LLM Infra, AI Startups, AI Agent Development


Yannick Kilcher ▷ #general (35 messages🔥):

East China Normal University AI Call for Papers, State of AI 2025 Report, Cursor AI Code Editor, DGX Spark Availability, RTX 5090 vs DGX Spark


Yannick Kilcher ▷ #paper-discussion (12 messages🔥):

Cursor Review, SEAL Paper, Tab Completion & Agentic Coding, Multi-Agent Systems, AI Completions for Coding


Yannick Kilcher ▷ #ml-news (1 message):

erkinalp: https://clockss.org/


Moonshot AI (Kimi K-2) ▷ #general-chat (37 messages🔥):

Kimi Team contact, Trickle vibe coding website, Aspen leveraged 100x on bitcoin, Gemini vs GPT5


Eleuther ▷ #general (18 messages🔥):

Mixture of Experts, Sliding Window Attention, LM Evaluation Harness, ArXiv Papers Dataset, GitHub Code 2025 Dataset


Eleuther ▷ #research (15 messages🔥):

Less is More: Recursive Reasoning with Tiny Networks, backpropping only the last step of deep recursion, ARC rules, video models based on 3D rendered video clips, REPA


DSPy ▷ #show-and-tell (5 messages):

ACE Playbook, AgentLearningEE, StraughterG's X post


DSPy ▷ #general (14 messages🔥):

Big Paper Tease, CheshireCat 3.0 Release, Neo4j Integration Request, London Meetup?


MCP Contributors (Official) ▷ #general (17 messages🔥):

MCP Server Implementation, Binary data support in tool calls, Embedded resources, Host engineering, Mapping parts of the tool response


tinygrad (George Hotz) ▷ #general (2 messages):

Pylint Removal, Test Refactoring with ChatGPT


tinygrad (George Hotz) ▷ #learn-tinygrad (10 messages🔥):

Contributing to tinygrad, Tensor buffer is not writable, Freezing parts of a matrix for training, Virtual tensor creation, Accessing computed gradients


Manus.im Discord ▷ #general (11 messages🔥):

Manus Functionality, Job Openings at Manus, Community Moderator Perks, Product Dissatisfaction and Feedback, Daily Credits Issue


aider (Paul Gauthier) ▷ #general (5 messages):

aider alias, OpenCode GLM 4.6


aider (Paul Gauthier) ▷ #questions-and-tips (2 messages):

Adding files to long messages, Aider workflow tips


aider (Paul Gauthier) ▷ #links (1 message):

Agentic Tools, Aider's Capabilities