Frozen AI News archive

not much happened today

**Google** released **Gemma 3n**, a multimodal model for edge devices available in **2B and 4B** parameter versions, with support across major frameworks like **Transformers** and **Llama.cpp**. **Tencent** open-sourced **Hunyuan-A13B**, a **Mixture-of-Experts (MoE)** model with **80B total parameters** and a **256K context window**, optimized for tool calling and coding. **Black Forest Labs** released **FLUX.1 Kontext [dev]**, an open image AI model gaining rapid Hugging Face adoption. **Inception AI Labs** launched **Mercury**, the first commercial-scale **diffusion LLM** for chat. The **FineWeb2** multilingual pre-training dataset paper was released, analyzing data quality impacts. The **Qwen** team released **Qwen-VLo**, a unified visual understanding and generation model. **Kyutai Labs** released a top-ranked open-source speech-to-text model running on Macs and iPhones. **OpenAI** introduced **Deep Research API** with **o3/o4-mini** models and open-sourced prompt rewriter methodology, integrated into **LangChain** and **LangGraph**. The open-source **Gemini CLI** gained over **30,000 GitHub stars** as an AI terminal agent.

Canonical issue URL

a super quiet day

AI News for 6/26/2025-6/27/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (220 channels, and 6364 messages) for you. Estimated reading time saved (at 200wpm): 564 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Congrats to Tencent Hunyuan A13B, and Inception Mercury!


AI Twitter Recap

Model & Dataset Releases

Developer Tools & Agent Frameworks

AI Techniques, Research, & Evaluation

Companies, Industry, & Funding

Geopolitics & Broader Implications

Humor, Satire & Memes


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Recent Open Source and Commercial Model Launches (Hunyuan-A13B, OmniGen 2, SYNTHETIC-2)

2. Innovative LLM Client Integrations on Consumer Devices (PS Vita, Gaming Dialogue)

3. AI Hardware Benchmarking and Market Trends (Smartphone SoCs, RTX 3090 Pricing, LLM Reasoning's Impact on Translation)

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Neuralink Human Trials and Integration with Tesla Optimus

2. FLUX & Kontext Features, Use Cases, and Licensing Updates

3. User Experiences and Impact of ChatGPT


AI Discord Recap

A summary of Summaries of Summaries by o1-preview-2024-09-12

Theme 1. AI Models and Tools Race Ahead

Theme 2. AI Safety and Privacy Alarms Sound Off

Theme 3. AI Supercharges Coding and Technical Tasks

Theme 4. AI Powers Creative Content Creation

Theme 5. Hardware Hurdles and Performance Tweaks


Discord: High level Discord summaries

Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


OpenAI Discord


LMArena Discord


Cursor Community Discord


LM Studio Discord


OpenRouter (Alex Atallah) Discord


GPU MODE Discord


HuggingFace Discord


Yannick Kilcher Discord


Latent Space Discord


tinygrad (George Hotz) Discord


Modular (Mojo 🔥) Discord


Eleuther Discord


aider (Paul Gauthier) Discord


Notebook LM Discord


Nous Research AI Discord


Torchtune Discord


Cohere Discord


DSPy Discord


LlamaIndex Discord


Manus.im Discord Discord


Nomic.ai (GPT4All) Discord


AI21 Labs (Jamba) Discord


MCP (Glama) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1264 messages🔥🔥🔥):

Gemini CLI, Audiobooks vs Podcasts, Perplexity max, Comet rollout updates


Perplexity AI ▷ #sharing (6 messages):

DeepSeek, NBA Draft, Armed Standoff, Lu Bu Diaochan, Fenghuang


Perplexity AI ▷ #pplx-api (21 messages🔥):

Credits Pending, Finance with Perplexity Sonar, Perplexity API Credits, SEC Filings with API


Unsloth AI (Daniel Han) ▷ #general (781 messages🔥🔥🔥):

GGUF conversion issues, Dynamic Unsloth Quantization, Gemma 3 finetuning, Devstral finetuning, GPU recommendations


Unsloth AI (Daniel Han) ▷ #off-topic (16 messages🔥):

Moderation Conflicts, Reddit Supermods, Local Llama


Unsloth AI (Daniel Han) ▷ #help (426 messages🔥🔥🔥):

Unsloth inference for model testing, Llama 3 templates, Memory leak problems with SFT, Loading datasets error, Qwen3 Vision tuning


Unsloth AI (Daniel Han) ▷ #showcase (3 messages):

Sandboxed Code, GitHub Code Uploads


Unsloth AI (Daniel Han) ▷ #research (3 messages):

Multi-agent AI evaluation, Automated GPU kernel optimization, Evolutionary programming for Metal kernels, OpenEvolve project, LLMs for low-level optimization


OpenAI ▷ #ai-discussions (969 messages🔥🔥🔥):

Dall-E 2, universal keys, OpenAI's policies, image prompts, Image generation models


OpenAI ▷ #gpt-4-discussions (7 messages):

OpenAI Conversation Recording, Privacy Concerns, NY Times Case


LMArena ▷ #general (623 messages🔥🔥🔥):

GPT-5 release, Gemini 3 speculation, Style control impacts on leaderboards, O3 vs 2.5 Pro benchmarks, OpenAI's development roadmap


Cursor Community ▷ #general (448 messages🔥🔥🔥):

MCP Issues, Snapshot sharing, Cursor and MacOS, Warp 2.0, Prompt Enhancers


Cursor Community ▷ #background-agents (37 messages🔥):

Python virtual environment in Dockerfile, Static HTML preview in background agent interface, BugBot workflow improvements, Docker in the agent's environment, Background agent pricing


LM Studio ▷ #general (252 messages🔥🔥):

LM Studio and Ollama, Roo Code Context Window, Magistral Tokenizer, Multi-Model ChatUI, Self-Expanding Programs


LM Studio ▷ #hardware-discussion (102 messages🔥🔥):

ROCm on 9070 with LMStudio, LLM tests, serverless pods, LMStudio server deployment on AWS, Hosted LLM serving 100+ users


OpenRouter (Alex Atallah) ▷ #announcements (3 messages):

LLM Presets, Morph v2 code patching, Llama 3.3 70B Discount


OpenRouter (Alex Atallah) ▷ #app-showcase (8 messages🔥):

Quicke.in, Multiple Models inference, PGaaS feedback


OpenRouter (Alex Atallah) ▷ #general (255 messages🔥🔥):

Preset API keys, LLM websearch, Gemini's Grounding, Morph, OpenAI SDK


OpenRouter (Alex Atallah) ▷ #new-models (1 messages):

Readybot.io: OpenRouter - New Models


GPU MODE ▷ #general (12 messages🔥):

GPU kernel-level scheduler introspection, Retrieving timestamps of sub-videos within a long video, Gemini context length limitations, Speeding up audio inputs for cost reduction


GPU MODE ▷ #cuda (19 messages🔥):

Tensor Cores in CUDA, Memory Bandwidth Experiments, GPU Mode Submission, CUDA vs HIP


GPU MODE ▷ #torch (7 messages):

Custom CUDA Kernels, LLM Inference in Torch, Torch Compile Randomness, PyTorch Nightly, Opcheck


GPU MODE ▷ #cool-links (1 messages):

marksaroufim: https://mobiusml.github.io/fp4_blogpost/


GPU MODE ▷ #beginner (6 messages):

GPU BruteForcers, CPU vs GPU Speed, Floating Point Precision


GPU MODE ▷ #youtube-recordings (1 messages):

alice_18898: hi


GPU MODE ▷ #rocm (17 messages🔥):

HIP support, PyTorch's HIP, aten, c10


GPU MODE ▷ #self-promotion (7 messages):

FP4 weights quantization, GPU kernel optimization, Apple Silicon, Two-pass softmax algorithm, Automated kernel optimization


GPU MODE ▷ #🍿 (2 messages):

CUDA Events, Kernel Timing


GPU MODE ▷ #thunderkittens (1 messages):

TK Kernels, INT8 Matmul Support in TK


GPU MODE ▷ #general (4 messages):

FP32 usage, tensor cores, MI300x kernel, fp16 usage


GPU MODE ▷ #submissions (52 messages🔥):

H100 sort performance, H100 vectorsum performance, H100 vectoradd performance, A100, B200, MI300 trimul performance, L4 vectoradd


GPU MODE ▷ #factorio-learning-env (54 messages🔥):

FLE structure, LuaPlayer, Rockets failing, Gym environment, Factorio Draftsman


GPU MODE ▷ #cutlass (2 messages):

Cutlass, cute DSL, atomic arrive and wait


GPU MODE ▷ #singularity-systems (3 messages):

Systems ML compiler project, Subset implementation (C, CUDA C, Triton, PyTorch), Compiler IRs, SoN compiler


HuggingFace ▷ #general (81 messages🔥🔥):

Tool for generating BibTex entries, SSML output models, Running Gemma-3n on Colab, Fine-tuning data from multiple sources to Jsonl, HuggingFace in HPC


HuggingFace ▷ #cool-finds (6 messages):

Artificial Human Project, Hunyuan Gamecraft, Roko's Basilisk


HuggingFace ▷ #i-made-this (18 messages🔥):

X-Spanformer, Tokenizer-Free Encoding, GPU Kernel Optimization, TorchDevice Release


HuggingFace ▷ #NLP (4 messages):

Tokenizer porting to Android, Rust to SO compilation, Cosine distance in KMeans, Text Tilling Paper


HuggingFace ▷ #smol-course (1 messages):

Certificate Extraction


HuggingFace ▷ #agents-course (11 messages🔥):

HF Pro subscription, AI agent builders, prompt engineers, LLM workflows, code reading


Yannick Kilcher ▷ #general (25 messages🔥):

Ghost in the Shell, Pretraining Corpus, Paper Discussion Recording, K-means Clustering


Yannick Kilcher ▷ #paper-discussion (50 messages🔥):

Old papers needing more love, Your Brain on ChatGPT paper, Conference proceedings and physical copies, Transformer understanding via associative memory, Using AI to predict content virility


Yannick Kilcher ▷ #agents (3 messages):

Git Repo Secrets, Public to Private Repo Leaks


Yannick Kilcher ▷ #ml-news (4 messages):

Deepseek's Model Release Cadence, Qwen VLo Model


Latent Space ▷ #ai-general-chat (77 messages🔥🔥):

Deep Research API, Mercor Valuation, AI Shutdown Mechanisms, Etched Funding, Stripe AI Index


tinygrad (George Hotz) ▷ #general (65 messages🔥🔥):

BERT Step Optimization, Multi-QP RDMA Transfers, PCIe Topology Impact on GPU-NIC, RoCE MTU Limitation, Kernel/BIOS Tweaks for RDMA


tinygrad (George Hotz) ▷ #learn-tinygrad (8 messages🔥):

Realtime Diffusion, f16 support on tinygrad, webui with websocket to diffusers


Modular (Mojo 🔥) ▷ #general (29 messages🔥):

Jupyter and Mojo, Pixi Installation Issues, Modular CLI Abandonment, GPU Puzzle P17 Broken


Modular (Mojo 🔥) ▷ #mojo (24 messages🔥):

LLVM intrinsics with packed result types, Graph compiler: Python vs Mojo, Performance cost: Mojo from Python vs standalone, Mojo crashes and bug reports, LayoutTensor saving/reading to file


Modular (Mojo 🔥) ▷ #max (7 messages):

model graph compilation caching, max serve, docker volume


Eleuther ▷ #general (27 messages🔥):

Ersatz Discord User, Institute for Defense Analyses (IDA), ML Engineer vs Research Engineer, Flow Matching


Eleuther ▷ #research (17 messages🔥):

SVD Optimizer Steps, Muon Approximation Speed, Japanese Hammer Weight Decay, Continuous Thought Machines


Eleuther ▷ #interpretability-general (1 messages):

Stochastic Parameter Decomposition, APD issues, Parameter-decomposition directions, SAEs problems


Eleuther ▷ #lm-thunderdome (4 messages):

Codex, TyDiQA, HumanEval


aider (Paul Gauthier) ▷ #announcements (1 messages):

Gemini 2.5 Models, o3-pro Model Support, Co-authored-by Attribution, Repository Map Updates, GitHub Copilot Token Handling


aider (Paul Gauthier) ▷ #general (37 messages🔥):

Qwen distillation, CoT for o3, Server tags, Sonnet, QLORA training examples


aider (Paul Gauthier) ▷ #questions-and-tips (7 messages):

Aider Blueprint Generation, Anthropic Bans, Aider Wrapper Script, Gemini 2.5 quirk


Notebook LM ▷ #use-cases (11 messages🔥):

Customer discovery conversations, Mind Maps Sharing, Book Upload Issue, Artistic Exploration Use Case


Notebook LM ▷ #general (23 messages🔥):

Podcast Creation, Image Upload Issues, PDF Upload Failures, Service Unavailability, Multilingual Support


Nous Research AI ▷ #general (22 messages🔥):

Agentic VLMs, RL Environments support, Tencent 80B MoE Model, Qwen VLO Day, Deepseek Focus on MoE


Nous Research AI ▷ #ask-about-llms (4 messages):

DeepSeek Token Usage, Nous API Inference


Nous Research AI ▷ #interesting-links (4 messages):

Thought Anchors, Visualizations


Torchtune ▷ #general (11 messages🔥):

sm100 support, Qwen3-235B-A22B finetune, VRAM saving techniques, FSDP limitations, torchaos optimizer


Torchtune ▷ #dev (18 messages🔥):

Memory increase with self.mask_ignored_tokens = False, Iterable Dataset and on-the-fly packing, Effective batch size with packing, Packing with chat_dataset gotchas, Position ID mask


Cohere ▷ #🧵-general-thread (6 messages):

Command A dataset, Command-r EOL


Cohere ▷ #👋-introduce-yourself (6 messages):

Real-time Inference Stacks, Federated Learning and Privacy-Preserving AI, Computational Linguistics & NLP, AI Job Hunt in Canada/India


Cohere ▷ #🔬-research (1 messages):

cryptic.girl: Anyone here working on Privacy Preserving AI?


DSPy ▷ #general (12 messages🔥):

DSPy Versioning, DSPy Evals, VLLM settings, Append Prompt


LlamaIndex ▷ #blog (4 messages):

Observability, Open Source Native, Klavis AI MCP Servers, LlamaCloud Native MCP Server, Gradio MCP Hackathon


LlamaIndex ▷ #general (7 messages):

LlamaParse with LlamaIndex, Context Window Limits for LLMs, Chunk + Map-Reduce Pattern


Manus.im Discord ▷ #general (11 messages🔥):

Manus browser issues, Manus Reddit blocking, Manus Proxy Usage, Manus API, Manus Promo Code


Nomic.ai (GPT4All) ▷ #general (7 messages):

LocalDocs persistence, ChatGPT-like local LLM, Qwen models for coding, Waiting for GPT4All Update


AI21 Labs (Jamba) ▷ #general-chat (5 messages):

Discord Server Redirects, New Server Migration, User Confusion, Server Legitimacy


MCP (Glama) ▷ #general (1 messages):

Tree Sitter MCP Server, Typescript, npmjs


MCP (Glama) ▷ #showcase (2 messages):

Prompt-MCP Tool, Obsidian-Semantic-MCP