Frozen AI News archive

not much happened today

**ByteDance** showcased, but did not release, a state-of-the-art video generation model called **Seedance 1.0**, while **Morph Labs** announced **Trinity**, an autoformalization system for Lean. **Hugging Face Transformers** deprecated TensorFlow/JAX support. **Andrew Ng** of **DeepLearning.AI** highlighted the rise of the **GenAI Application Engineer** role, emphasizing skills in **AI building blocks** and **AI-assisted coding tools** like **Codex** and **Claude Code**. Engineering teams are increasingly testing API designs against LLMs for usability. **Figure AI**'s CEO stressed speed as a key competitive advantage, and **LangChain** introduced the concept of **Context Engineering** for AI agents. Reinforcement learning on LLMs shows transformative potential, and the community values **AI evals** and data work. **Sakana AI** released **Text-to-LoRA**, a hypernetwork method that generates task-specific LoRA adapters from natural-language task descriptions, enabling efficient model customization. The video generation race is heating up: **ByteDance**'s Seed-based model was praised for its quality and seen as a challenge to American labs, alongside models like **Kling 2.1** and **Veo 3**.

a quiet day

AI News for 6/11/2025-6/12/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (218 channels, and 7130 messages) for you. Estimated reading time saved (at 200wpm): 579 minutes. Our new website is now up with full metadata search and a beautiful vibe-coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

ByteDance showed off, but did not release, an impressive SOTA videogen model called Seedance 1.0; Morph Labs announced Trinity, an autoformalization system for Lean; and Hugging Face Transformers deprecated TensorFlow/JAX support.


AI Twitter Recap

AI Engineering Skills, Roles, and Development Philosophy

Model & Research Breakthroughs

Tooling, Frameworks, and Integrations

Infrastructure, Industry Events & Funding

Geopolitics, Critiques, and Broader Commentary

Humor & Memes


AI Reddit Recap

/r/LocalLlama Recap

1. OpenAI and Industry Model Release Activity and Delays

2. Open Source Model Releases and Ecosystem Tools

3. Unique LLM Deployments and Industry Investment in Superintelligence

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Claude Code: User Experiences, Productivity, and Agent Techniques

2. AI Video Generation, Animation, and Creative Uses (Veo, i2v, Midjourney, Kling, etc.)

3. Seminal AI Research, Industry Debates, and Global AI Impact


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: AI Model Performance & Capabilities Unleashed (and Compared)

Theme 2: When Clouds Cry: Infrastructure Woes and Platform Stability Saga

Theme 3: Squeezing AI Brains: Fine-Tuning, Quantization, and Optimization Frontiers

Theme 4: Dev Tooling & API Adventures: From Rate Limits to WASM Dreams

Theme 5: Research Ripples: New Papers and Projects Making Waves


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


OpenRouter (Alex Atallah) Discord


Cursor Community Discord


Unsloth AI (Daniel Han) Discord


Eleuther Discord


OpenAI Discord


Manus.im Discord


LM Studio Discord


HuggingFace Discord


GPU MODE Discord


tinygrad (George Hotz) Discord


Notebook LM Discord


Yannic Kilcher Discord


Modular (Mojo 🔥) Discord


Torchtune Discord


MCP (Glama) Discord


LlamaIndex Discord


Cohere Discord


DSPy Discord


LLM Agents (Berkeley MOOC) Discord


Nomic.ai (GPT4All) Discord


Codeium (Windsurf) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.



Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1408 messages🔥🔥🔥):

AI Combos, Power Grid Prompts, Text 2 Vid Arena, Deepsearch Delayed, Comet Browser Issues


Perplexity AI ▷ #sharing (3 messages):

RTX 4090, Windows Recall Security Flaws


Perplexity AI ▷ #pplx-api (1 message):

Sonar API Documentation, Perplexity API documentation feedback


LMArena ▷ #general (999 messages🔥🔥🔥):

G(n,k) program, Claude ultrathink option, O3 Pro Benchmarks, Kingfall, Titanforge


LMArena ▷ #announcements (1 message):

Cloud Provider Outage, Data Loss Incident


OpenRouter (Alex Atallah) ▷ #announcements (7 messages):

Cloudflare downtime, Google Cloud outage, Internet outage


OpenRouter (Alex Atallah) ▷ #app-showcase (1 message):

memgrafter: I will test it tomorrow, send it over


OpenRouter (Alex Atallah) ▷ #general (971 messages🔥🔥🔥):

Free Model Rate Limits, Paid Model Rate Limits, OpenRouter Global Outage, DeepSeek models and Chinese, Requesty as an alternative to OpenRouter


Cursor Community ▷ #general (619 messages🔥🔥🔥):

Opus vs Sonnet, Gemini 2.5 Pro fails, MCP servers, Cloudflare outage, Cursor Mobile App


Cursor Community ▷ #background-agents (44 messages🔥):

Background Agents, Code Storage, Privacy Mode, Windows Bugs, Non-Github Repositories


Unsloth AI (Daniel Han) ▷ #general (215 messages🔥🔥):

DeepSeek R1 fine-tuning issues, Safetensors to AWQ conversion, DeepSeek R1 8Q model fine-tuning, Aider Polyglot benchmark trustworthiness, QwenLong-32B model release


Unsloth AI (Daniel Han) ▷ #off-topic (19 messages🔥):

Hyperbolic Pricing, Synthetic Datasets, VRAM vs RAM, Typo in Advertisements


Unsloth AI (Daniel Han) ▷ #help (103 messages🔥🔥):

Unsloth version requirements, bias training issues, Granite biases, Qwen2.5-VL-7B-Instruct fine-tuning, Overfitting with LoRA


Unsloth AI (Daniel Han) ▷ #showcase (1 message):

not_easy_to: I’m fine-tuning Qwen 2.5 7B (using Unsloth) and need a small French math dataset


Unsloth AI (Daniel Han) ▷ #research (5 messages):

ABBA architecture, LoRA alternatives, Parameter-Efficient Fine-Tuning


Eleuther ▷ #announcements (1 message):

Volume Estimator, Neural Redshift, Generalization Heuristic, AI Alignment, Inductive Bias


Eleuther ▷ #general (86 messages🔥🔥):

AI Model Comparison Platforms, Open Science at CVPR, AI Safety India, GPT models behavior, Symbolica AI startup


Eleuther ▷ #research (210 messages🔥🔥):

Small LLM Training Epochs, Meta's V-JEPA 2 Self-Supervised Video Models, Building an AI Expert Agent for Google Ads, Parameter-Efficient Fine-Tuning (PEFT) with ABBA, CommonPile Data and the Role of Synthetic Data


Eleuther ▷ #interpretability-general (4 messages):

Knockoffs, Predictor-corrector methods, Realistic null distribution


Eleuther ▷ #lm-thunderdome (14 messages🔥):

EvalEval coalition, Standardized LM evaluation methods, Inspect standard, lm_eval multi-gpu progress bar


OpenAI ▷ #annnouncements (2 messages):

ChatGPT Projects, Canvas Updates, Model selector on mobile


OpenAI ▷ #ai-discussions (182 messages🔥🔥):

O3 Pro performance versus O3, Limits of LLMs, Google Ads expert AI agent, Discord activity drop, GPT-4o cost to train


OpenAI ▷ #gpt-4-discussions (22 messages🔥):

GPT Quantization, ChatGPT for Language Learning, Free GPT Credits for Training, GPT Memory Across Custom GPTs


OpenAI ▷ #prompt-engineering (26 messages🔥):

Prompt Security, LLM leakage, Forbidden tokens, Recency bias, Adversarial prompt injection


OpenAI ▷ #api-discussions (26 messages🔥):

Forbidden tokens, LLM Leakage, Prompt Security, AI moral values


Manus.im Discord ▷ #general (220 messages🔥🔥):

Manus Chat Mode, Veo 3 Video Generation, High Effort Mode Removal, Context Limits, Credit Usage and Pricing


LM Studio ▷ #general (85 messages🔥🔥):

Dual GPUs in LM Studio, SSD lifespan concerns with swapping, Speculative decoding with Qwen models, Model updates in LM Studio, LLM conciseness training


LM Studio ▷ #hardware-discussion (81 messages🔥🔥):

CPU vs GPU Setups, EPYC for LLMs, Strix Halo Memory, DeepSeek R1 on CPU, Unified Memory Comparison


HuggingFace ▷ #general (84 messages🔥🔥):

Screenplay and Filmmaker AI Tools, No Inference Provider Error, Image-to-Text Models, Hugging Face Spaces Runtime Errors, LLM Distillation with Qwen


HuggingFace ▷ #today-im-learning (17 messages🔥):

MCP servers study, AI avatar project, Deep3DFaceReconstruction and Face-vid2vid models, AI Agent course


HuggingFace ▷ #i-made-this (20 messages🔥):

Unlimited Text To Video, LLM exploring its own awareness, Structural Interactions in the input, Hy-Bio Agent vs ChatGPT, Building an AI avatar voice-by-voice


HuggingFace ▷ #computer-vision (2 messages):

model explainability, heatmap visualization, Kaggle datasets


HuggingFace ▷ #NLP (1 message):

ut_nkezins: ive sent you friends request, maybe i could help you out


HuggingFace ▷ #agents-course (21 messages🔥):

requirements.txt, llama-index issues, certification path deadline, course sign up link broken, Tool Calling agents error


GPU MODE ▷ #general (4 messages):

GPU Engineering Role Preparation, Parallel Programming Patterns, PMPP


GPU MODE ▷ #triton (2 messages):

Conv1d performance optimization, Triton kernel optimization, LeetGPU challenge


GPU MODE ▷ #cuda (5 messages):

PTX modifiers, cache eviction policies, Blackwell library


GPU MODE ▷ #torch (9 messages🔥):

torch.func.functional_call, nn.Linear.from_pretrained, torch.compile and RL training, Mojo + PyTorch, torch.compile speedup


GPU MODE ▷ #off-topic (1 message):

Image analysis, Running AI on anything


GPU MODE ▷ #irl-meetup (3 messages):

OSDI 2025, AMD Advancing AI day


GPU MODE ▷ #rocm (3 messages):

ROCm 6.4.1, MI50s, gfx906, rocprofiler-sdk, aqlprofile


GPU MODE ▷ #liger-kernel (5 messages):

Efficient Attention Varieties, MLA Implementation, GQA with GLA Benchmarks, Distillation Loss Function


GPU MODE ▷ #self-promotion (4 messages):

cuBLASDx 0.4.0 Release, Ozaki Scheme for FP64, cuBLASDx Python Bindings, MathDx Package, cuBLASDx and CuTe DSL Integration


GPU MODE ▷ #🍿 (3 messages):

AMD GPU Support, Triton evals, Backward prop, Roadmap, Undergrad collaboration


GPU MODE ▷ #submissions (5 messages):

conv2d leaderboard, H100 results


GPU MODE ▷ #hardware (1 message):

CUDA 12.9 Update 1, CC 10.3, B300


GPU MODE ▷ #factorio-learning-env (43 messages🔥):

Factorio capabilities/performance, FLE usability obstacles, Visual inputs usefulness, FLE interface alignment, FLE Docker image and mod


GPU MODE ▷ #amd-competition (44 messages🔥):

AMD Conference meetup, AMD Advancing AI sign, Workshop 202, Fireside chat, Official Photo Link


GPU MODE ▷ #cutlass (4 messages):

CUTLASS Matmul Optimizations, EVT API Epilogues, Fused LoRA Layers


GPU MODE ▷ #singularity-systems (2 messages):

j4orz.ai, picograd, picoc, CUDA C extension


tinygrad (George Hotz) ▷ #general (6 messages):

Usefulness of CS Degree, SVD Test Failure


tinygrad (George Hotz) ▷ #learn-tinygrad (55 messages🔥🔥):

eigh() bounty, Tensor.norm(), LLM Discord Chatbot, tinygrad vs numpy accuracy, QR algorithm discrepancies


Notebook LM ▷ #use-cases (16 messages🔥):

AI Audio Overview Customization, YouTube Channel Promotion, Podcast Compilation


Notebook LM ▷ #general (43 messages🔥):

NotebookLM Age Restrictions, NotebookLM Feature Requests, Audio Overview Issues, Image as sources


Yannic Kilcher ▷ #general (28 messages🔥):

oscar-c project, Sam Altman vs Gary Marcus, system prompts for agents, Adaptive Resonance Theory (ART)


Yannic Kilcher ▷ #paper-discussion (26 messages🔥):

World Models, Energy Based Models, Active Inference, Predictive Coding


Yannic Kilcher ▷ #ml-news (4 messages):

Mistral Compute, New video model


Modular (Mojo 🔥) ▷ #mojo (36 messages🔥):

Mojo on LeetGPU, FastxReader in Mojo, Modular Docs issues with nightly, Dynamic Dispatch/Type Lambdas in Mojo, String Performance Improvements in Mojo


Torchtune ▷ #general (14 messages🔥):

Memory Usage, Flex Attention, FSDP, TP, Loss Parallel


Torchtune ▷ #dev (8 messages🔥):

packing refactor, iterable datasets, contributing to torchtune, qwen3 and qwen2 builders


Torchtune ▷ #papers (6 messages):

Mistral 3.1 Small, Architectural Novelties, Multimodal Support, Devstral


MCP (Glama) ▷ #general (18 messages🔥):

Service Workers, MCP and Zapier, Playwright MCP Server, Hyper-MCP WASM


MCP (Glama) ▷ #showcase (1 message):

whoateit: having some fun with this. https://github.com/aj-geddes/fastfs-mcp


LlamaIndex ▷ #announcements (1 message):

Office Hour Reminder


LlamaIndex ▷ #blog (3 messages):

Order Completion Agent, LlamaCloud Stability, MistralAI Magistral


LlamaIndex ▷ #general (14 messages🔥):

Firebase outage, OpenRouter Down, Cloudflare Issues, GCP is down, BGP problems


Cohere ▷ #🧵-general-thread (10 messages🔥):

Multi-Model Re-Ranker, Amotions AI, Xarray-JAX library


Cohere ▷ #🔌-api-discussions (1 message):

Reranking profiles


Cohere ▷ #👋-introduce-yourself (1 message):

Introductions, Company/Industry/University, Tech/Tools, Community Goals


Cohere ▷ #🧭-status-feed (1 message):

GCP Outage, Infrastructure Degradation


DSPy ▷ #general (9 messages🔥):

DSPy 3.0 Release, Referencing Input Fields in Docstrings, Agent Bricks Introduction


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (3 messages):

AgentX summit, Research Paper Submission, Summit Attendance


Nomic.ai (GPT4All) ▷ #general (3 messages):

Model Speed, Token Count


Codeium (Windsurf) ▷ #announcements (1 message):

Windsurf Wave 10, UI/UX Upgrades, EU Cluster, Enterprise Offerings