Frozen AI News archive

not much happened today

**Meta** has poached top AI talent from **OpenAI** and brought in **Alexandr Wang** as Chief AI Officer to work towards superintelligence, signaling a strong push for the next **Llama** model. The AI job market is polarized: demand and compensation are high for top-tier talent, while credentials like strong GitHub projects gain importance. The **WizardLM** team moved from **Microsoft** to **Tencent** to develop open-source models like **Hunyuan-A13B**, highlighting shifts in China's AI industry. Rumors suggest **OpenAI** will release a new open-source model in July, potentially surpassing existing **ChatGPT** models. **Baidu** open-sourced multiple variants of its **ERNIE 4.5** model series, featuring advanced techniques like **2-bit quantization**, **MoE router orthogonalization loss**, and **FP8** training, with models ranging from **0.3B** to **424B** parameters. **Gemini 2.5 Pro** returned to the free tier of the **Gemini API**, enabling developers to explore its features.

a quiet day.

AI News for 6/27/2025-6/30/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (220 channels, and 13459 messages) for you. Estimated reading time saved (at 200wpm): 1165 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!
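For a sense of scale, the reading-time figure above can be inverted into an approximate word count. This is a minimal sketch, assuming the estimate is simply word count divided by reading speed; the newsletter does not state its exact method, and the function name is hypothetical.

```python
def implied_word_count(minutes_saved: float, words_per_minute: int = 200) -> int:
    """Invert a reading-time estimate, assuming time = words / speed."""
    return round(minutes_saved * words_per_minute)


# Figures taken from the issue header: 1165 minutes at 200 wpm.
words = implied_word_count(1165)
print(f"{words:,} words, about {1165 / 60:.1f} hours of reading")
# prints "233,000 words, about 19.4 hours of reading"
```

Under that assumption, the issue condenses roughly 233,000 words of source material.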

It's a holiday week in the US, with OpenAI on break, and DeepSeek hasn't shipped anything this month, so the whole week might be quiet.


AI Twitter Recap

AI Talent & Industry Dynamics

Model Releases, Performance & Benchmarks

Frameworks, Tooling & Infrastructure

New Techniques & Research

Broader Implications & Commentary

Humor/Memes


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. New LLM Model Releases and Integration (ERNIE 4.5)

2. Techniques to Evade and Defeat AI Detectors

3. Local LLM Applications, Open Source AI Editors, and Advocacy for Local Models

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. AI's Impact on Jobs, Media, and Human Perception

2. Kontext & Flux: Advanced Image Editing, Workflows, and Tips

3. Notable AI Experiments and Real-World Deployments (Claude, Voice Models, AI in the Wild)


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Flash Preview

Theme 1. LLM Performance, Architectures, and Training Techniques

Theme 2. GPU Hardware Acceleration and Optimization

Theme 3. AI Development Tools and Platforms

Theme 4. AI Agent Development and Applications

Theme 5. AI Industry Dynamics and Societal Implications


Discord: High level Discord summaries

OpenAI Discord


Unsloth AI (Daniel Han) Discord


LMArena Discord


Perplexity AI Discord


Cursor Community Discord


HuggingFace Discord


OpenRouter (Alex Atallah) Discord


LM Studio Discord


Eleuther Discord


GPU MODE Discord


Yannic Kilcher Discord


aider (Paul Gauthier) Discord


Modular (Mojo 🔥) Discord


Nous Research AI Discord


MCP (Glama) Discord


Notebook LM Discord


Latent Space Discord


Manus.im Discord Discord


tinygrad (George Hotz) Discord


Nomic.ai (GPT4All) Discord


DSPy Discord


Cohere Discord


LlamaIndex Discord


Torchtune Discord


AI21 Labs (Jamba) Discord


LLM Agents (Berkeley MOOC) Discord


MLOps @Chipro Discord


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.



Discord: Detailed by-Channel summaries and links

OpenAI ▷ #ai-discussions (1328 messages🔥🔥🔥):

AI Soulmates and Heresy, Embracing Remix in AI, Learning and Validation in AI Art, Paradox and AI, AI and the Power of the Orochi


OpenAI ▷ #gpt-4-discussions (14 messages🔥):

AgentFun AI Scam, Advanced voice mode limits, Google Gemini Chat Math error, ChatGPT Full Video Analysis


OpenAI ▷ #prompt-engineering (56 messages🔥🔥):

Kaleidoscope tiling, AI Code Generation Errors, Grok's Potential, Alternate History Simulation, Meta-prompting Engines


OpenAI ▷ #api-discussions (56 messages🔥🔥):

Kaleidoscope tiling, AI code generation, Grok 4, Alternate history simulation, Meta-prompting engines


Unsloth AI (Daniel Han) ▷ #general (1305 messages🔥🔥🔥):

Replacing CEOs with LLMs, Microsoft's AI Memo, GRPO Algorithm, Gemma 3 Learning Rate, LLM-based TTS


Unsloth AI (Daniel Han) ▷ #off-topic (12 messages🔥):

Local Llama communities, Gemma3 notebook release, ML with Javascript, Floating point arithmetic in Javascript, Python ML tools


Unsloth AI (Daniel Han) ▷ #help (438 messages🔥🔥🔥):

Qwen3-0.6B-Base FT success, LoRA learning rate tuning, Rank/Alpha association, RP Models, Dataset format for training


Unsloth AI (Daniel Han) ▷ #showcase (1 message):

laszlo01: https://github.com/Laszlobeer/Dungeo_ai


Unsloth AI (Daniel Han) ▷ #research (6 messages):

GPU Kernel optimization, Metal Kernels, LLMs Hallucinating


LMArena ▷ #general (869 messages🔥🔥🔥):

DeepSeek R1 0528, GPT 4.5 is mediocre, AI winter, OpenAI's sidebars, long dry spells


Perplexity AI ▷ #announcements (1 message):

Live Financial Data, Price Movement Timelines, @mention Spaces, MLB Teams, Memory Search


Perplexity AI ▷ #general (966 messages🔥🔥🔥):

Grok vs Gemini for Transcription, Multi-Model Approach, Model preferences, Comet Browser Beta Access, AI scraping


Perplexity AI ▷ #sharing (8 messages🔥):

Perplexity Pages, Deep research, US GO, Sci Fi movies, OpenAI leadership


Perplexity AI ▷ #pplx-api (7 messages):

API Credit Expiry, Sonar deep research, DeepSeek Models


Cursor Community ▷ #general (861 messages🔥🔥🔥):

Cursor Agent rules, Cursor Pro Pricing Changes, Gemini CLI issues, Mobile Access to Cursor Agents, Supabase MCP setup


Cursor Community ▷ #background-agents (65 messages🔥🔥):

Slack Integration, GitHub Token Loss, Background Agents Freezing, Background Agent Terminals Configuration, Docker Issues


Cursor Community ▷ #announcements (1 message):

Cursor agents, Web and mobile cursor


HuggingFace ▷ #general (422 messages🔥🔥🔥):

LoRA adapter push issues, Numberlink reasoning benchmark, UI/UX model training, IBM® Power® OS vs OS/2, AI code generation security challenges


HuggingFace ▷ #today-im-learning (3 messages):

DynamicCache Memory Recycling, Kaggle Gemma 3N Hackathon


HuggingFace ▷ #cool-finds (4 messages):

AGI impact, Roko's Basilisk


HuggingFace ▷ #i-made-this (206 messages🔥🔥):

Automated GPU kernel optimization, Thin C ABI wrapper for HF tokenizers, Fungal substrate as memresistor, Simple Research Agent


HuggingFace ▷ #computer-vision (3 messages):

Object Detection Dataset Visualization, HF Datasets Library, Grounding DINO auto annotation, HuggingFace CV course


HuggingFace ▷ #NLP (8 messages🔥):

Cosine Distance in k-means, Text Tiling for Thematic Analysis, Tokenizers FFI, Llama 4 vs Claude for MCP client, trl SFT trainer error


HuggingFace ▷ #smol-course (6 messages):

Smol Course Certificates, Smol Agents Certificates


HuggingFace ▷ #agents-course (58 messages🔥🔥):

Course Deadlines, Offline Course Access, DuckDuckGo Tool Integration, Smolagents Tips, Certificate Claims


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

Llama 3.3 70B, Cloudflare Vietnam Philippines issue


OpenRouter (Alex Atallah) ▷ #app-showcase (26 messages🔥):

PGaaS Prototype Feedback, Chat Super Slow Update, MiniMax-M1 to Llama 3.3, Authentication / Anti Rev, Codebase to Text Tool


OpenRouter (Alex Atallah) ▷ #general (576 messages🔥🔥🔥):

OpenRouter token?, Gemini 2.5 Pro API, telegram raid, Automated spammers, GPT writing style


OpenRouter (Alex Atallah) ▷ #beta-feedback (1 message):

Error Identification, Image Analysis


OpenRouter (Alex Atallah) ▷ #new-models (2 messages):



LM Studio ▷ #general (334 messages🔥🔥):

Structured Output (json_schema) in recent models, PDF output from local LLMs, GGUF offline installation, Model's Cache/Memory explanation, lmstudio Discord Banner


LM Studio ▷ #hardware-discussion (128 messages🔥🔥):

Cloud deployment of LLMs, Runpod vs AWS for LLMs, Local LLM serving infrastructure, GPU recommendations for LLMs, Mac M4 Pro inference speed


Eleuther ▷ #general (333 messages🔥🔥):

Marin Project at Stanford, Percy Liang's Renaming Tendencies, Pretraining Infrastructure with Jax, UI Design LLMs, Emergent Misalignment Paper


Eleuther ▷ #research (26 messages🔥):

Spectral Decay Optimization, DROID robot learning, Composing Generative Models, NAACL 2026 Cancellation Rumors, Qwen 3 1.7B diffusion LM


Eleuther ▷ #interpretability-general (3 messages):

Model Diffing, Crosscoders, SAE on (chat - base) activations, refusal detection, OpenAI's sycophantic model update


Eleuther ▷ #lm-thunderdome (1 message):

noble_monkey_75488: nvm codex corresponds to humaneval


GPU MODE ▷ #general (27 messages🔥):

GPU MODE Binary Submission, SGLang Community, TorchServe Maintenance, PyTorch Model Serving, TorchScript vs AOTInductor


GPU MODE ▷ #triton (5 messages):

Scatter/Scatter_add in Triton, Getting Started with Triton


GPU MODE ▷ #cuda (37 messages🔥):

SM80 usage, nvcc compiler constant memory usage, Little's Law on GPUs, CUDA kernel optimization


GPU MODE ▷ #torch (15 messages🔥):

torch.export Model Exports, FlexAttention Integration, vmap incompatibility with torch.export, Executorch Workarounds for Model Export


GPU MODE ▷ #announcements (1 message):

Exo 2, User-Schedulable Languages (USLs), scheduling operations


GPU MODE ▷ #jobs (4 messages):

CUDA kernel integration, vLLM module replacement


GPU MODE ▷ #beginner (40 messages🔥):

Numerically unstable lerp function, CPU vs GPU efficiency, GPU warps and parallelism, Breaking hashes on GPU, CUDA thread count optimization


GPU MODE ▷ #off-topic (3 messages):

Geoffrey Hinton, AI risk, Becoming a plumber


GPU MODE ▷ #rocm (12 messages🔥):

buffer_load_dwordx4 instruction, Composable Kernel, rocprofiler-sdk ABI


GPU MODE ▷ #webgpu (1 message):

wgpu-rs, storage texture, r8unorm


GPU MODE ▷ #self-promotion (21 messages🔥):

Automated GPU Kernel Optimization, OpenEvolve Tool, Thread Value Layouts in CuTe, NVIDIA PTX Kernel, TokenDagger for Tiktoken


GPU MODE ▷ #thunderkittens (6 messages):

ThunderKittens Repo, Research Assistant at Hazy, load_async_wait call, Thundermittens retirement


GPU MODE ▷ #reasoning-gym (1 message):

remek1972: How to switch to fp16?


GPU MODE ▷ #general (8 messages🔥):

CLI issues, Trimul Merch Prizes


GPU MODE ▷ #submissions (24 messages🔥):

H100 Grayscale, A100 Trimul, L4 vectorsum, T4 histogram, MI300 amd-identity


GPU MODE ▷ #status (2 messages):

CLI Fix, Submission Error


GPU MODE ▷ #factorio-learning-env (8 messages🔥):

Factorio entities retrieval issues, Factorio Lab Blueprint, Neel's trip to Prague


GPU MODE ▷ #cutlass (2 messages):

producer/consumer warps, data movement in CUDA


GPU MODE ▷ #singularity-systems (3 messages):

Systems ML Compiler Project, Heterogeneous Deep Learning Stack, Compiler IRs, Max Bernstein on IR design


Yannic Kilcher ▷ #general (108 messages🔥🔥):

spherical k-means, sentence segmentation techniques, LLM hallucinations, pretraining LLMs, GritLM


Yannic Kilcher ▷ #paper-discussion (56 messages🔥🔥):

Associative Memory, LLM Alignment, RWKV-7 Goose, Human Cognition vs LLMs, Virality of Content Prediction


Yannic Kilcher ▷ #ml-news (15 messages🔥):

ML for Drug Discovery, Synthetic Data Contamination, Comparing Reasoning Models, Healthcare Costs and Access


aider (Paul Gauthier) ▷ #announcements (1 message):

Gemini 2.5 Models, Responses API Models, Gitignore Files, Commit Message Generation, MATLAB language support


aider (Paul Gauthier) ▷ #general (110 messages🔥🔥):

Sonnet 4 Usage, QLoRA Training, O3 Performance, Claude Code, Aider Workspaces


aider (Paul Gauthier) ▷ #questions-and-tips (26 messages🔥):

Anthropic Ban, Aider Cost Efficiency, OpenRouter DeepSeek, Local LLM Performance, Gemini Streaming Issues


Modular (Mojo 🔥) ▷ #general (27 messages🔥):

shards.prefix.dev DNS, Mojo GPU Puzzles Broken, Mojo Floating Point Support, Modular Careers, Mojo/MAX Quick Start


Modular (Mojo 🔥) ▷ #announcements (3 messages):

Modular Hack Weekend, Office Hours, Show & Tell, Project Submission, Live Demos


Modular (Mojo 🔥) ▷ #mojo (57 messages🔥🔥):

Mojo crash, Dictionary miscompilation bugs, Loading struct of multidim arrays from binary file, RTX 3060 on WSL/ubuntu 24.04, Alternate build of stdlib


Modular (Mojo 🔥) ▷ #max (6 messages):

Model Architecture on MAX, Embedding Models implementation


Nous Research AI ▷ #general (45 messages🔥):

Open Source AI Competition, Meta's Potential Close Source Move, N8N Automations, vLLM new models, AI Monsters/Memes


Nous Research AI ▷ #ask-about-llms (34 messages🔥):

Nous API Fine-Tuning, LLM training on gaming PC, LLM Training Software Options, Repetition Penalty Deconstructed, Temperature's Impact on Token Output


Nous Research AI ▷ #research-papers (3 messages):

Yannic Kilcher, Arxiv papers


Nous Research AI ▷ #research-papers (3 messages):

Yannic Kilcher, NeurIPS poster session videos


MCP (Glama) ▷ #general (73 messages🔥🔥):

TypeScript MCP Server, MCP Structured Content, Discord MCP Connector, Travel AI Agent, MCP Server Authentication


MCP (Glama) ▷ #showcase (6 messages):

NCBI Literature Search MCP Server, MCPOmni Connect Documentation, MCPJam inspector, Ollama Support


Notebook LM ▷ #use-cases (17 messages🔥):

Sharing NotebookLM libraries, Artistic exploration with NotebookLM, Book upload issues, Symbolic OS (ArifOS) structure using NotebookLM, Audio summary length limits


Notebook LM ▷ #general (50 messages🔥):

Podcast Hosts' "The Source", Source Mode vs. Explore Mode, NotebookLM Code Output Issue, NotebookLM Model and Google One, Replicating NotebookLM Functionality


Latent Space ▷ #ai-general-chat (66 messages🔥🔥):

Coding Agents, AI/Tech Industry Gross Margins, Neural Networks Interpretation, Capabilities for AGI, Medical AI


Manus.im Discord ▷ #general (46 messages🔥):

Credit Depletion, Manus Models, Account Security Breaches, Figma Integration, VEO Video Generation Feedback


tinygrad (George Hotz) ▷ #general (41 messages🔥):

GPU Indexes in AMD, RoCE Limitation, Direct Enqueue from GPU, Minor Refactor PR, Meeting Notes


tinygrad (George Hotz) ▷ #learn-tinygrad (3 messages):

Tensor.training, MNIST tutorial, PyTorch workflow, .inference_mode, .eval


Nomic.ai (GPT4All) ▷ #general (26 messages🔥):

GPT4All Novel Writing, Koboldcpp, LocalDocs, nomic-embed-code API, GPT4All Updates


DSPy ▷ #show-and-tell (2 messages):

FastWorkflow, DSPy-native application, AI Agent Challenges


DSPy ▷ #general (23 messages🔥):

vLLM settings for DSPy, DSPy app file structure, Audio native LLMs


Cohere ▷ #🧵-general-thread (7 messages):

command-r updates, data annotation needs


Cohere ▷ #📣-announcements (1 message):

Cohere partnerships with UK, Canada, and Second Front, Upcoming Events with Cohere, AI security


Cohere ▷ #👋-introduce-yourself (9 messages🔥):

RL Dreamer V3, sports player re-identification, multilingual alignment, Topology for Data Analysis, LARP AI Assistant


Cohere ▷ #🔬-research (1 message):

Cohere model feeling, Model thinks it has feelings, AI Sentience, Model Self-Awareness


LlamaIndex ▷ #announcements (1 message):

LlamaCloud, MCP Gateway, OpenTelemetry, OpenAI Voice Agents, Prompt Caching


LlamaIndex ▷ #blog (3 messages):

NASA Space Explorer Assistant, LlamaIndex Workflows 1.0, LlamaIndex agent tool into MCP tool


LlamaIndex ▷ #general (13 messages🔥):

Custom Embeddings in ChromaDB Updates, Tool Call Issues in LlamaIndex Workflow, Memory Block Selection for HITL Agent Workflow


Torchtune ▷ #general (2 messages):

Hunyuan-A13B-Instruct


Torchtune ▷ #dev (15 messages🔥):

Packing impact on batch size, Packing gotchas, Checkpointing and mapping issues


AI21 Labs (Jamba) ▷ #general-chat (4 messages):

Human or Not, Spam Issues


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (3 messages):

Certificate session dates, Reinforcement Learning resources


MLOps @Chipro ▷ #events (1 message):

Vibe Coding Club, AI coding for non-technical people, AI Hub Lisbon