Frozen AI News archive

not much happened today

Over the holiday weekend, key AI developments included the upcoming release of **Grok 4**, **Perplexity** teasing new projects, and community reactions to **Cursor** and **Dia**. Research highlights included a paper showing that **Reinforcement Learning (RL)** improves generalization and reasoning across domains, in contrast to Supervised Fine-Tuning, which suffers from forgetting. **Energy-Based Transformers (EBTs)** were proposed as a promising alternative to traditional transformers. **AI21 Labs** updated its **Jamba** model family with enhanced grounding and instruction following while keeping a **256K** context window. **Baidu** open-sourced its massive **424 billion**-parameter **Ernie 4.5** model, and **Kontext-dev** became the top trending model on **Hugging Face**. Advances in length generalization for recurrent models and the introduction of **2-simplicial attention** were also noted. In biomedical AI, **Biomni**, powered by **Claude 4 Sonnet**, demonstrated superior accuracy and rare disease diagnosis capabilities. Additionally, the Python package manager `uv` received praise for improving Python installation workflows.


a quiet holiday weekend

AI News for 7/4/2025-7/7/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (222 channels, and 15367 messages) for you. Estimated reading time saved (at 200wpm): 1249 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Grok 4 is coming, Perplexity is teasing something, and people are upset with Cursor, excited about Dia, and monitoring the situation around more Meta Superintelligence hires.


AI Twitter Recap

AI Models, Research, and Techniques

Tooling, Frameworks, and Infrastructure

Industry, Companies, and Funding

Broader Implications and Philosophy

Humor and Memes


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Jamba and Qwen3 Model Releases

2. Llama Model Community Comics

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Major AI Model, Tool, and Hardware Launches & Benchmarks (2024/2025)

2. AI in Real-World Robotics, Medicine, and Military Applications

3. AI in Society: Ethics, Human Impact & Culture


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Flash Preview

Theme 1. Developer Tool Turmoil & Innovation

Theme 2. AI Training & Infrastructure Challenges

Theme 3. Cutting-Edge AI Agent Applications

Theme 4. AI's Policy, Market & Infrastructure Impact


Discord: High level Discord summaries

Cursor Community Discord


OpenAI Discord


Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


LMArena Discord


OpenRouter (Alex Atallah) Discord


LM Studio Discord


HuggingFace Discord


Yannic Kilcher Discord


Eleuther Discord


Nous Research AI Discord


aider (Paul Gauthier) Discord


GPU MODE Discord


Latent Space Discord


Modular (Mojo 🔥) Discord


MCP (Glama) Discord


Notebook LM Discord


tinygrad (George Hotz) Discord


DSPy Discord


Manus.im Discord


Torchtune Discord


Cohere Discord


LlamaIndex Discord


Nomic.ai (GPT4All) Discord


MLOps @Chipro Discord


LLM Agents (Berkeley MOOC) Discord


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.




Discord: Detailed by-Channel summaries and links

Cursor Community ▷ #general (989 messages🔥🔥🔥):

Claude UI output, Gemini UI output, Cursor new pricing issues, Cursor performance degrading, Windsurf vs Cursor


Cursor Community ▷ #background-agents (33 messages🔥):

GitHub IP allowlists, Background Agent Keeps Generating, Background Agents and Secrets, Port Forwarding, Final Checks for Background Agents


OpenAI ▷ #ai-discussions (834 messages🔥🔥🔥):

Google Translate vs Stella Translation, Posh Buffalo, LLMs' consciousness, Emergent AI


OpenAI ▷ #gpt-4-discussions (37 messages🔥):

GPT prompt engineering, DALL-E 3 image generation issues, ChatGPT content policy ambiguities, GPT-4o memory bleed, Gemini 2.5 Pro Canvas superiority


OpenAI ▷ #prompt-engineering (8 messages🔥):

ICP Prompting, Prompt Epigenetics, Recursive System Initializer, Symbolic Reasoning, Prompt Optimisation Loops


OpenAI ▷ #api-discussions (8 messages🔥):

ICP as Recursive System Initializer, Prompt Epigenetics Theory, AI-Affirmation, Symbolic Reasoning in LLMs, LLMs as Statistical Text Engines


Perplexity AI ▷ #general (1114 messages🔥🔥🔥):

Perplexity Labs, Is Perplexity Pro worth the $$$, Agentic browser Comet, Model selector button, Image generation


Perplexity AI ▷ #sharing (5 messages):

AI-Generated Code, Vibe Coding, Deskree AI Blog Post, Perplexity AI Links


Perplexity AI ▷ #pplx-api (3 messages):

Parameter Tweaking for Sonar, Reasoning Effort, Search Context Size


Unsloth AI (Daniel Han) ▷ #general (1245 messages🔥🔥🔥):

GPU Undervolting, RAM Overclocking, Gemma 3 Performance Issues, Moondream data filtering, Training with completions only


Unsloth AI (Daniel Han) ▷ #off-topic (15 messages🔥):

Synthetic dataset generation for GraphRAGs, Fine-tuning Whisper for audio event classification, Gemini's accuracy in audio processing, Improving Tokenizer of a Model


Unsloth AI (Daniel Han) ▷ #help (337 messages🔥🔥):

LoRA Loading in Cloud, Unsloth Installation & CUDA, Roleplay finetuning with Deepseek, Generating Audio with Orpheus-3B, WandB Evaluation with Ngrok


Unsloth AI (Daniel Han) ▷ #research (44 messages🔥):

Formal Logic Prompts, Cross Entropy Loss Datasets, LTL vs English Prompts, LLMs and Human Language, Translating English to LTL


Unsloth AI (Daniel Han) ▷ #unsloth-bot (270 messages🔥🔥):

Apple Silicon support, LoRA rank, Cohere, multi-GPU support, RuntimeError CUDA


LMArena ▷ #general (847 messages🔥🔥🔥):

DeepSeek Pricing, China AI influence, Qwen 3 Models, Grok 4 Release, Gemini censorship


LMArena ▷ #announcements (2 messages):

Grok-3-mini-high model, Image Edit Leaderboard, July Contest, Out of Place Objects in Space, June's Contest Winner


OpenRouter (Alex Atallah) ▷ #app-showcase (6 messages):

MCP for Claude Code, personality.gg, NipponHomes.com


OpenRouter (Alex Atallah) ▷ #general (862 messages🔥🔥🔥):

Llama 3.2 3B pricing anomaly, DeepSeek V3 setup guide, Perplexity API issues, Grok 4 Leaks, Monad Tag


OpenRouter (Alex Atallah) ▷ #new-models (2 messages):



LM Studio ▷ #general (238 messages🔥🔥):

LLMs with good function calling, LM Studio and GPUs, MCP web-search server, Qwen3 for Reasoning, LLM context size considerations


LM Studio ▷ #hardware-discussion (92 messages🔥🔥):

GPU Detection in LM Studio, AMD vs. Nvidia for LLMs, Token Generation Speed, VRAM Requirements for Models, Combining GPUs


HuggingFace ▷ #general (142 messages🔥🔥):

Face Recognition Models, Text-to-SQL with T5, HuggingChat Shutdown, ComfyUI and GPU Performance, HairStyle Spaces


HuggingFace ▷ #today-im-learning (1 message):

Building Neural Networks from Scratch, Challenges of Custom Neural Network Implementation


HuggingFace ▷ #cool-finds (8 messages🔥):

AI Model Identification, Claude AI Experience, same.dev comparison


HuggingFace ▷ #i-made-this (13 messages🔥):

JauAuth, PiTutor, BorgLLM, RecycloBot, Arena-RLHF


HuggingFace ▷ #reading-group (3 messages):

LLM Fine-Tuning Blogs, GPU Parallelism, Transformer Inference


HuggingFace ▷ #computer-vision (1 message):

Deepfake Detection System, AI and Cybersecurity Combination


HuggingFace ▷ #NLP (1 message):

GloVe Model, GloVe Paper, Co-occurrence Probability Symmetry


HuggingFace ▷ #agents-course (31 messages🔥):

Time commitment for Agents course, Course assignments and certifications, Access to Llama 3 Model, Issues with Unit 1 Notebook, Guidance on completing Quiz 2.1


Yannic Kilcher ▷ #general (180 messages🔥🔥):

US Copyright Office AI Policy Volumes, Gemini vs ChatGPT for Math, Logical Fallacies in AI Criticism, Material Science, Physics with LLMs


Yannic Kilcher ▷ #paper-discussion (7 messages):

log(n) scaling model exhibit, Hierarchical Reasoning Models, Quaternion products in LLMs


Yannic Kilcher ▷ #ml-news (5 messages):

The Ocean is Wet, Critical Look at Chain of Thought, AI Training Load Fluctuations


Eleuther ▷ #announcements (1 message):

EleutherAI Summer of Open AI Research, Open Science AI Research Project, Mentorship Program


Eleuther ▷ #general (81 messages🔥🔥):

AI Alignment & Interpretability, RoPE frequencies for sliding window, Decentralized AI, Language Modeling as Compression, GloVe Model Symmetry


Eleuther ▷ #research (41 messages🔥):

Aurko Paper, Cognitive Markers, Concat-and-chunk strategy, Flex Attention, CFG


Eleuther ▷ #interpretability-general (3 messages):

SAE expansion ratio, Set Autocomplete Model, Publishing Research


Eleuther ▷ #lm-thunderdome (7 messages):

MMLU-SR task subsets, datasets parquet conversion, lm eval running time


Eleuther ▷ #gpt-neox-dev (6 messages):

Small datasets for training, FineWeb dataset samples, FA3 support for H100/H200, dclm-dedup dataset


Nous Research AI ▷ #general (101 messages🔥🔥):

HF documentation 3B model, LLM Matching Generative Responses, Grok's Politically Incorrect Stance, PiTutor interactive learning, Grok's Knowledge Updates


Nous Research AI ▷ #ask-about-llms (18 messages🔥):

Ollama and Openwebui for AI Services, Training LLMs on Their Own Weights, Temperature and Token Usage, Math 500


Nous Research AI ▷ #research-papers (3 messages):

AI Mouse Tracking, Architecture for Mouse Path Training


Nous Research AI ▷ #interesting-links (8 messages🔥):

Chinese AI investment, Parameter-efficient fine-tuning, Trustless agents, Codeium 3.2, Autonomous optical network


Nous Research AI ▷ #research-papers (3 messages):

AI-Youtube-OG Architecture, Natural Human Mouse Paths Dataset, Model Training Architecture


aider (Paul Gauthier) ▷ #general (93 messages🔥🔥):

Grok 4 Leaks, Deepseek R2, Gemini-CLI Search Integration, MCP Integration with Aider, Documentation Fetching MCPs


aider (Paul Gauthier) ▷ #questions-and-tips (25 messages🔥):

InputOutput class in aider, OpenRouter provider settings, Git reset, sonnet-4 with ask/code, deepseek 70b

# Auto-answer "yes" to aider's confirmation prompts (the scripting equivalent of --yes)
from aider.io import InputOutput
io = InputOutput(yes=True)
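
The snippet above only constructs the `InputOutput` object. Here is a minimal sketch of how it might be handed to a coder, assuming aider's Python scripting interface (`Coder.create`, `Model`, `coder.run`), which the aider docs describe as not officially supported and liable to change between releases. The helper name `make_coder` and the default model name are hypothetical, chosen only for illustration.

```python
def make_coder(fnames, model_name="gpt-4-turbo"):
    """Build an aider Coder that auto-confirms prompts.

    Sketch only: requires the aider-chat package and a configured API key.
    `make_coder` and the default `model_name` are illustrative assumptions.
    """
    # Imported lazily so this module still loads where aider is not installed.
    from aider.coders import Coder
    from aider.io import InputOutput
    from aider.models import Model

    io = InputOutput(yes=True)   # answer "yes" to every confirmation prompt
    model = Model(model_name)    # model identifier is an assumption
    return Coder.create(main_model=model, fnames=fnames, io=io)
```

With aider installed, something like `make_coder(["hello.py"]).run("add a docstring")` would then run one instruction against the listed files without pausing for confirmation.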

GPU MODE ▷ #general (3 messages):

Tensor Compilers, CUDA, PyTorch, Vector Matrix Multiplication


GPU MODE ▷ #triton (1 message):

einops, einstein notation, triton


GPU MODE ▷ #cuda (36 messages🔥):

CUDA printf debugging, NCU memory bandwidth, GPU division vs multiplication, CUDA tile scheduler vs cutlass, CUDA unified memory


GPU MODE ▷ #jobs (1 message):

ML Platform Hiring, Remote Work, Quora


GPU MODE ▷ #rocm (2 messages):

ROCm traces, in-person hackathon, make flags, gpumode-amd-fp8-mm repo


GPU MODE ▷ #self-promotion (4 messages):

voxel raytracing, Hopper Pipelining, PiTutor, Deep Infra B200 Instances


GPU MODE ▷ #general-leaderboard (5 messages):

GPU Mode Challenges, Leaderboard Questions, Kernelbot Data


GPU MODE ▷ #submissions (6 messages):

A100 Performance, H100 Performance, Trimul Leaderboard


GPU MODE ▷ #factorio-learning-env (41 messages🔥):

instance.py refactoring, Github Actions integration, Ruff linting, Pydantic bump PR


GPU MODE ▷ #singularity-systems (4 messages):

picoc compiler for llm.c, picograd kernels, nn: zero to hero follow up, picoc frontend semantic analysis


GPU MODE ▷ #general (1 message):

tonic_1: 💪🏻 🏅 🇫🇷 🏆 🚀


Latent Space ▷ #ai-general-chat (100 messages🔥🔥):

Cursor.ai Pricing Controversy, Trae-Agent Open-Sourced by Bytedance, ChatGPT Diagnoses Rare Genetic Defect, ChatGPT's New 'Study Together' Feature, Books3 Dataset Lore


Modular (Mojo 🔥) ▷ #general (41 messages🔥):

Mojo for research and training, Arrow spec, Mojo and Python callbacks, Mojo server tag, Mojo as a Python superset


Modular (Mojo 🔥) ▷ #mojo (52 messages🔥):

StringLiteral Materialization, Parametric Traits vs Trait Objects, Cloud-Based Mojo Environment, Static Linking in Mojo, SIMD Bitcasting


Modular (Mojo 🔥) ▷ #max (5 messages):

Mojo GPU Puzzles, ModuleNotFoundError, Pixi


MCP (Glama) ▷ #general (58 messages🔥🔥):

EpicMe MCP, WinCalcMCP, MCP Python Interpreter, MCP for documentation indexing, LangGraph vs Custom Agents


MCP (Glama) ▷ #showcase (4 messages):

MCP Monetization, Agentic Payments with Crypto, Fast Agent's MCP Elicitation Support, EpicAI's MCP Search Engine


Notebook LM ▷ #use-cases (16 messages🔥):

Google Docs comments, NMC Standards Notebook, NHS adoption of NotebookLM, Mindmap embedding, Newsletter drafting from audio files


Notebook LM ▷ #general (36 messages🔥):

Saving Notes in NotebookLM, Interactive Mode Button missing, PDF Upload issues, Saving Chats in NotebookLM, New Model for NotebookLM


tinygrad (George Hotz) ▷ #general (50 messages🔥):

tinygrad vs pytorch, MLIR, Halide, Exo-lang, whisper example


DSPy ▷ #general (31 messages🔥):

Automatic Prompt Optimization (APO), Claude Code vs DSPy, DSPy 3.0, SIMBA vs MIPROv2, Tool Selection for LLMs


DSPy ▷ #examples (6 messages):

DSPy quickstart, A/B testing prompts in DSPy, signature.py exploration


Manus.im Discord ▷ #general (32 messages🔥):

ChatGPT personality, Claude 4 Comparison, Manus Airdrops, Manus Credits, Manus as project starter


Torchtune ▷ #general (3 messages):

QLoRA throughput, Fused RMSNorm, Linear cross entropy, LoRAMLP


Torchtune ▷ #dev (5 messages):

Custom Tokenizers, Mistral 2506-small, HF AutoTokenizer, CI Bug


Torchtune ▷ #papers (14 messages🔥):

Context-Parallel vs. Limited GPUs, Skepticism on Architectural Improvements, Data Cleaning vs. Architecture Iterations, MoE Training Techniques


Cohere ▷ #🧵-general-thread (4 messages):

Cohere Labs, Open Science Community Summer School


Cohere ▷ #🔌-api-discussions (2 messages):

Embed v4 API, Hybrid image/text embeddings


Cohere ▷ #👋-introduce-yourself (10 messages🔥):

Self-learning RL/DL, Agentic AI, Cohere Embed v4, Probabilistic Generative Models


LlamaIndex ▷ #blog (5 messages):

Open Source NotebookLM, LlamaCloud MCP Servers, AI Hack Night


LlamaIndex ▷ #general (9 messages🔥):

Document Intelligence for P&IDs, Handwritten Text Extraction, LlamaIndex UX for Business Users


Nomic.ai (GPT4All) ▷ #general (9 messages🔥):

Nomic API, GPT4All server mode, Jinja Chat Template, CrowdLLM, OpenAssistant


MLOps @Chipro ▷ #general-ml (3 messages):

Free Book PDF, Author's Blog, Copyright Issues