Frozen AI News archive

Apple exposes Foundation Models API and... no new Siri

**Apple** released on-device foundation models for iOS developers, though their recent "Illusion of Reasoning" paper faced significant backlash for flawed methodology regarding LLM reasoning. **OpenAI** updated **ChatGPT's Advanced Voice Mode** with more natural voice and improved translation, demonstrated by Greg Brockman. **LangChain** and **LlamaIndex** launched new AI agents and tools, including a SWE Agent for software automation and an Excel agent using reinforcement learning for data transformation. The AI community engaged in heated debate over reasoning capabilities of LLMs, highlighting challenges in evaluation methods.

Canonical issue URL

We don't know what to say, Tim.

AI News for 6/6/2025-6/9/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (218 channels, and 12496 messages) for you. Estimated reading time saved (at 200wpm): 1124 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

A year after the excitement of Apple Intelligence last WWDC, with Qwen and Gemma 3 likely so far ahead of Apple Foundation Models that there are nothing to crow about, the one AI relevant update is that the on device models are at least now available for iOS developers to use in the standard modalities (documentation here):

The Siri delays were well telegraphed months ago, so nothing more to write home about, but this is sitll big news for Apple AI engineers.


AI Twitter Recap

Apple's "Illusion of Reasoning" Paper & Backlash

New AI Models, Tools & Features

AI Industry & Platform Dynamics

Technical Concepts & Research

Robotics & General AI Progress

Humor & Memes


AI Reddit Recap

/r/LocalLlama Recap

1. DeepSeek R1 0528 Coding Benchmark Achievements

2. Novel AI Hardware and Efficient Inference Techniques

3. Reasoning-Aware LLM Workflows in Open WebUI

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Apple 'Illusion of Thinking' Paper Debate and Reasoning in LLMs

2. OpenAI and Industry Revenue, Benchmarks, and GPU Race

3. AI Coding Benchmarks, Claude & Gemini Collaboration, and User Feedback


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: Bleeding-Edge Model Developments & Performance Showdowns

Theme 2: Dev Tooling & Infrastructure: The Nuts and Bolts of AI Ops

Theme 3: Hardware & Optimization: Squeezing Every Last FLOP

Theme 4: AI's Expanding Horizons: Novel Applications & Ethical Frontiers

Theme 5: Community-Driven Innovation & Open Source Ecosystem Flourishes


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


LM Studio Discord


OpenAI Discord


Unsloth AI (Daniel Han) Discord


HuggingFace Discord


Cursor Community Discord


OpenRouter (Alex Atallah) Discord


Eleuther Discord


aider (Paul Gauthier) Discord


Yannick Kilcher Discord


GPU MODE Discord


Notebook LM Discord


Nous Research AI Discord


Manus.im Discord Discord


Modular (Mojo 🔥) Discord


Latent Space Discord


MCP (Glama) Discord


tinygrad (George Hotz) Discord


LlamaIndex Discord


Torchtune Discord


Cohere Discord


Nomic.ai (GPT4All) Discord


DSPy Discord


LLM Agents (Berkeley MOOC) Discord


MLOps @Chipro Discord


Gorilla LLM (Berkeley Function Calling) Discord


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1168 messages🔥🔥🔥):

Perplexity CEO identity, Memory feature rollout, Silksong release, Pro role access, Samsung 1-year Perplexity Pro code


Perplexity AI ▷ #sharing (9 messages🔥):

Musk asylum, universe map, Shareable threads, forensic biology, North Korean surveillance


Perplexity AI ▷ #pplx-api (6 messages):

Perplexity API vs Web UI, API citations and details, Dyfi AI and API issues


LMArena ▷ #general (1237 messages🔥🔥🔥):

Sydney dataset, GPT-4.5 vs. Flash 2.5, Titanforge release, Grok 3.5 release, Apple's AI strategy


LM Studio ▷ #general (450 messages🔥🔥🔥):

LM Studio on Ubuntu Server, RooCode & LM Studio incompatibility, Context token limits with Qwen models, Importing models into LM Studio, API key Input with GPT


LM Studio ▷ #hardware-discussion (593 messages🔥🔥🔥):

Quantization impact on model performance, NPU focus vs memory bandwidth, Dual GPU setup considerations, VLLM advantages and GUI desires, Strix Halo performance expectations


OpenAI ▷ #ai-discussions (446 messages🔥🔥🔥):

Codex Internet Access, Gemini vs GPT, Context Window Sizes, GPT mobile app microphone update, DeepSeek on Apple


OpenAI ▷ #gpt-4-discussions (91 messages🔥🔥):

GPT feedback mechanisms, Mobile editing of GPTs, AI consciousness debates, GPT performance on scientific topics, File limits in GPT projects


OpenAI ▷ #prompt-engineering (232 messages🔥🔥):

Lazy Dungeon Master prompting, PDF vs TXT for models, Markdown formatting advantages, GPTs and YouTube videos, Breaking ChatGPT's memory


OpenAI ▷ #api-discussions (232 messages🔥🔥):

Lazy Dungeon Master Prompting, PDF vs TXT, Markdown Formatting, ChatGPT memory limitations, Prompt Engineering best practices


Unsloth AI (Daniel Han) ▷ #general (421 messages🔥🔥🔥):

DeepSeek-R1-0528, Gemma3 Quantization Results, Nemotron Ultra 253B, Qwen 3 235B i-quant Models, Chrome crashes


Unsloth AI (Daniel Han) ▷ #off-topic (8 messages🔥):

Android Diagnostics, Video on Independence, Stupification Beam


Unsloth AI (Daniel Han) ▷ #help (299 messages🔥🔥):

Unsloth Updates, Continued Pretraining, GRPO vs Dr GRPO, Deepseek-R1-0528-Qwen3-8B Troubleshooting, Qwen3 32B Memory Issues


Unsloth AI (Daniel Han) ▷ #research (57 messages🔥🔥):

Dapo Integration, Packing Contamination Paradox, Apple's 'Illusion of Thinking' Paper, Reasoning Models Reliability


HuggingFace ▷ #general (466 messages🔥🔥🔥):

EXL3 in Transformers, Hugging Chat 0.10, Claude API Performance, Factuality Evaluation Datasets, Qwen-2.5VL Hosting


HuggingFace ▷ #today-im-learning (22 messages🔥):

QKV latent space, Experiment tracking: wandb vs neptune, LLM reward models, AI model fatigue, iOS app dev access


HuggingFace ▷ #cool-finds (6 messages):

Manus referral, Reasoning models reliability, Attribution Graphs


HuggingFace ▷ #i-made-this (187 messages🔥🔥):

Dataset Tools, Unlimited Free FLUX Pro API, WhatsApp AI Assistant, Awesome Agent Learning, structured finance dataset


HuggingFace ▷ #reading-group (1 messages):

prandragon: Hello! What does this group do?


HuggingFace ▷ #computer-vision (2 messages):

Image Models Benchmark, Bias Datasets in Vision Language Models


HuggingFace ▷ #NLP (5 messages):

Document Comparison, Similarity Scores


HuggingFace ▷ #gradio-announcements (1 messages):

Hackathon Extension, Builder Community Growth, Project Submissions Surge


HuggingFace ▷ #smol-course (4 messages):

ML beginners, Ollama CLI, smol-course deadlines, ML theory


HuggingFace ▷ #agents-course (20 messages🔥):

GPT-4o Parsing Errors, Agent Course Final Project, Local Video/Image Processing vs OpenAI, Commutative Set Question Tool Usage, Certification Extension


Cursor Community ▷ #general (582 messages🔥🔥🔥):

Gemini Max Usage, Background Agents, Claude Code limitations, Cursor outage in Cuba, MCP Server Structure


Cursor Community ▷ #background-agents (58 messages🔥🔥):

Background Agents vs Regular Agents, Docker and Background Agents, Environment Configuration for Background Agents, GitHub Access Issues, Resource Exhaustion Errors


OpenRouter (Alex Atallah) ▷ #announcements (33 messages🔥):

Database issues, Platform fee simplification, BYOK subscription fee, RSS Chrome extension for new models, Model versioning


OpenRouter (Alex Atallah) ▷ #app-showcase (5 messages):

Dana AI launch, AI powered learning platform, Web app development, Excel macros


OpenRouter (Alex Atallah) ▷ #general (341 messages🔥🔥):

Model Sorting by Throughput, DAPO vs Dr GRPO, Small VLM like Gemma 3, Gemini+Claude deprecated OpenAI, Compromised Accounts


Eleuther ▷ #general (127 messages🔥🔥):

Common Pile Naming Controversy, LLM Chess Skills Explanation, Userbot Detection and Moderation, Synthetic Data in Pretraining, Sycophancy in Models


Eleuther ▷ #research (160 messages🔥🔥):

vixra vs arxiv, LM-based evolutionary algorithms, point cloud completion, Hyper Scaler point of view


Eleuther ▷ #scaling-laws (1 messages):

openMaMMUT-L/14, DataComp-1.4B, scaling laws, zero-shot IN1K


Eleuther ▷ #interpretability-general (22 messages🔥):

MPL weights visualization, Compute exhaustion measurement, Activation patterns in context length, Attention entropy measurement, Length Generalization


Eleuther ▷ #lm-thunderdome (23 messages🔥):

Ruler QA Tasks, LongBench evaluation Harness, RoPE length in config, Prompt Hacking


Eleuther ▷ #gpt-neox-dev (2 messages):

NeMo, NeoX


aider (Paul Gauthier) ▷ #general (203 messages🔥🔥):

Gemini 2.5 Pro vs Opus vs Deepseek, DeepMind Alpha, R1 0528 Unsloth IQ1_M, MCP Integration, Native Sparse Attention


aider (Paul Gauthier) ▷ #questions-and-tips (39 messages🔥):

vllm server configuration with aider, model selection for cpp/rust/embedded workloads, managing project progress and context with AI coding tools, frustrations with edit doom loops in aider, Claude Code vs Aider


Yannick Kilcher ▷ #general (146 messages🔥🔥):

Hyper projection, RL Transfer Learning, AI Peer Review


Yannick Kilcher ▷ #paper-discussion (58 messages🔥🔥):

Active Inference, Free Energy Principle, Apple's research, LLMs generalization capabilities


Yannick Kilcher ▷ #ml-news (25 messages🔥):

Cohere Business Model, Nvidia Nemotron-H Reasoning Models, Open Source vs On-Prem Services, Apache License Permissiveness, AI Patches BIOS


GPU MODE ▷ #general (2 messages):

GPU Pointers, HazyResearch ThunderKittens


GPU MODE ▷ #triton (4 messages):

vLLM ecosystem, llama3.1 architecture, Qwen2 architecture, memory bound, cache bound


GPU MODE ▷ #cuda (10 messages🔥):

nsys vs ncu discrepancies, Double Buffering slowdown, Async copy implementation


GPU MODE ▷ #torch (11 messages🔥):

MoE Expert Routing, Cutlass Kernel Selection, Predefined Weights for torch.nn.Linear, Meta Device Usage, functorch Integration


GPU MODE ▷ #announcements (1 messages):

Songlin Yang, Efficient Alternatives to Transformers


GPU MODE ▷ #algorithms (1 messages):

chrisw0473: Hey guys any algorithms I should know


GPU MODE ▷ #cool-links (2 messages):

NVIDIA GB200 NVL72, NVIDIA Dynamo, Mixture of Experts Models, Compiler Explorer


GPU MODE ▷ #jobs (1 messages):

Deepfake Detection, NVIDIA GPU, Training Data Collection


GPU MODE ▷ #beginner (2 messages):

Dual GPU PC for ML/CV, Importance of same model GPUs


GPU MODE ▷ #off-topic (14 messages🔥):

D-Matrix Chip Pricing, AMD Acquires Untether.ai, TPU


GPU MODE ▷ #rocm (7 messages):

ATT plugin for instruction latency profiling, rocprofv2, SQTT traces, Radeon GPU Analyzer (RGA)


GPU MODE ▷ #lecture-qa (1 messages):

geri8904: hi, where to ask questions for todays lecture?


GPU MODE ▷ #self-promotion (1 messages):

Voxel Bricks, GPU-heavy Tech, Spatial Structures, Rendering Tech


GPU MODE ▷ #🍿 (124 messages🔥🔥):

Scalable Environment for Kernel Generation, Synthetic Data Generation, Kernel-LLMs Data and Architecture, Coverage in Kernel Optimization, KernelTune Repo


GPU MODE ▷ #edge (3 messages):

Qualcomm inference, Adreno Vulkan


GPU MODE ▷ #reasoning-gym (1 messages):

zafstojano: http://incompleteideas.net/IncIdeas/KeytoAI.html


GPU MODE ▷ #gpu模式 (3 messages):

Triton compiler internals, MLA-decode solutions, FP8-MM solutions, Kernel References


GPU MODE ▷ #general (2 messages):

MLA-decode, FP8-mm, Reference Kernels


GPU MODE ▷ #submissions (3 messages):

MI300, mla-decode, fp8-mm, gitee.com


GPU MODE ▷ #status (2 messages):

Popcorn CLI, installation simplification, UX improvements


GPU MODE ▷ #amd-competition (19 messages🔥):

rocwmma fragments, MFMA atom, rocprof compute viewer, block id remapping, HIP kernels


GPU MODE ▷ #cutlass (10 messages🔥):

Physical Layout Visualization, CuTe and Cutlass, CUTLASS Profiler for Benchmarking, CuTe Docs vs. Cutlass Learning, PDL Weight Prefetching


Notebook LM ▷ #use-cases (31 messages🔥):

Audiobook Generation, Podcast Length, NoteTube, AI Documentation Platform, Discord Data Extraction


Notebook LM ▷ #general (142 messages🔥🔥):

NotebookLM Enterprise/Education Controls, Podcast Length Variations, YouTube as a Learning Source, NoteTube AI structured learning platform, Roskilde History Association Chatbot


Nous Research AI ▷ #general (132 messages🔥🔥):

Responsible Prompting API, RL for compression, Nous tags, Hermes-4 release, Holographic names


Nous Research AI ▷ #ask-about-llms (5 messages):

Psyche project, Remote MCP servers, LLM SDKs


Nous Research AI ▷ #research-papers (2 messages):

KV Compression, New Compression Method


Nous Research AI ▷ #interesting-links (7 messages):

AI Diplomacy, Reasoning Models


Nous Research AI ▷ #research-papers (2 messages):

KV Compression Method


Manus.im Discord ▷ #general (125 messages🔥🔥):

GPT-4 vs Claude for coding, Manus Credit Disappearance, AI UI/UX design limitations, GTA 8 Release Date, Open invitation email


Modular (Mojo 🔥) ▷ #general (3 messages):

DumPy, TPDE


Modular (Mojo 🔥) ▷ #announcements (2 messages):

Modular Forum Navigation, Community Meeting, Mojo in Bioinformatics, Accelerating Particle Physics with Mojo


Modular (Mojo 🔥) ▷ #mojo (103 messages🔥🔥):

macOS vs Linux for development, Mojo compile-time slicing with parametric __getitem__, Custom decorators in Mojo, Linear types in Mojo, Mojo parameterization syntax


Latent Space ▷ #ai-general-chat (78 messages🔥🔥):

Claude Subtweet, AI Model Quantization Transparency, Suno Restrictions, Linear MCP and Claude Code, AI Reasoning Models


MCP (Glama) ▷ #general (57 messages🔥🔥):

MCP Server Publishing, Image Uploading with MCPs, Accessing GitHub MCP Server, MCP and SSE vs. WebSockets, MCP Resources and Prompts


MCP (Glama) ▷ #showcase (11 messages🔥):

OAuth support in MCP, Slack's official MCP server, glama.ai blog post, MCP specification utility, Google MCP server


tinygrad (George Hotz) ▷ #general (31 messages🔥):

Lazy setitem in tinygrad, tinygrad meeting #74, Lovely Grad working with tinygrad, huggingface models on tinygrad, True float16


tinygrad (George Hotz) ▷ #learn-tinygrad (21 messages🔥):

Tensor Indexing, FUSE_ARANGE effect, ProcessPoolExecutor issues


LlamaIndex ▷ #announcements (1 messages):

Office Hours, Form Filling Agents, LlamaIndex MCP Servers, MCP Dev Summit, Spreadsheet Agent


LlamaIndex ▷ #blog (7 messages):

Vector Databases, Llama Cloud, Agentic Extraction Workflow, MCP Servers, AI Summit


LlamaIndex ▷ #general (21 messages🔥):

Gemini 2.5 streaming output, ACP protocol, Agent workflow termination, Plan upgrade issue, LlamaParse product down


LlamaIndex ▷ #ai-discussion (1 messages):

RAG, Llama Index, ReactAgent, Sparse Data


Torchtune ▷ #dev (21 messages🔥):

Issue #2470, Clipping logprobs, Adagrad errors on nightly


Cohere ▷ #🧵-general-thread (10 messages🔥):

Cohere AI, Cohere support, command-a, documentation


Cohere ▷ #📣-announcements (1 messages):

North Integration, GameWarden, Partnerships, Security Deployments


Cohere ▷ #🔌-api-discussions (5 messages):

Cohere's r7b model, Cohere signup issues, Cohere open source contributions, Cohere Developer Experience GitHub repository


Cohere ▷ #👋-introduce-yourself (3 messages):

Introductions, Discord App Development, AI and Machine Learning


Cohere ▷ #🔔-ping-settings (1 messages):

competent: Moved to id:customize


Nomic.ai (GPT4All) ▷ #announcements (1 messages):

Nomic team updates


Nomic.ai (GPT4All) ▷ #general (15 messages🔥):

save chats in plain text, Nomic team updates, GIGABYTE server, nomic-embed-text-v1.5


DSPy ▷ #general (4 messages):

Transfer Posttraining Learning, Blockchain Expertise, AI Agent Devs


LLM Agents (Berkeley MOOC) ▷ #hackathon-announcements (1 messages):

Agentic AI Summit, Early Bird Tickets, UC Berkeley, Vinod Khosla, Ion Stoica


MLOps @Chipro ▷ #general-ml (1 messages):

sebash6677: hi


Gorilla LLM (Berkeley Function Calling) ▷ #leaderboard (1 messages):

Leaderboard Updates, Project Continuation