Frozen AI News archive

Bartz v. Anthropic PBC — "Training use is Fair Use

**Anthropic** won a significant fair use ruling allowing the training of **Claude** on copyrighted books, setting a precedent for AI training legality despite concerns over pirated data. **Replit** achieved a major milestone with **$100M ARR**, showing rapid growth. **Delphi** raised **$16M Series A** to scale digital minds, while **Thinking Machines Lab** focuses on reinforcement learning for business applications. **Disney** and **Universal** sued **Midjourney** over unauthorized use of copyrighted images. **Google DeepMind** released **Gemini Robotics On-Device**, a compact foundation model for robotics.

Canonical issue URL

An important ruling, but not a final one.

AI News for 6/23/2025-6/24/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (220 channels, and 3440 messages) for you. Estimated reading time saved (at 200wpm): 365 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Last August, a group of authors led by Andrea Bartz brought a class action lawsuit on Anthropic PBC for "illegally downloading" their works to train Claude. The scale of the destructive book scanning (perhaps <$2 per book esp used books) is impressive:

This is of course familiar to anyone who knows Authors Guild v Google, aka the Google Books lawsuit, which had a very similar setup, but this is the first direct ruling on the legality of pretraining on copyrighted content.

The filings from that case are here but the result today is from the Motion for Summary Judgment, where Anthropic arguably "won" with the explicit ruling that "training use [is] fair use".

It seems that the ghost of Books3 haunts Anthropic as there is a separate issue on using pirated books, but the judgment is pretty clear here and likely sets an important precedent for years to come: no less than 32 mentions of how "transformative" a use case that pretraining is, regardless of how much the LLM memorizes:


AI Twitter Recap

Companies, Funding, and Legal

Model & Tech Releases & Updates

New Techniques & Research

Frameworks, Tooling, and Infrastructure

Broader Implications & Community Discourse

Humor & Memes


AI Reddit Recap

/r/LocalLlama Recap

1. LocalLlama Subreddit Moderator Transition and Recovery

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Anthropic Copyright Lawsuit & Fair Use Ruling

2. Claude Code Advanced Uses and Community Response

3. AI’s Disruption of Careers and Education


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Pro Exp

Theme 1: New Models & Architectures: The Innovation Race Continues

Theme 2: Developer Experience & Tooling: Navigating the AI Frontier

Theme 3: Performance & Optimization: From Silicon Dreams to Speedy Realities

Theme 4: AI Applications & Integrations: Bridging Code, Content, and Conversation

Theme 5: The AI Ecosystem: Navigating Funding Rapids, Ethical Eddies, and Platform Quirks


Discord: High level Discord summaries

Unsloth AI (Daniel Han) Discord


Cursor Community Discord


Perplexity AI Discord


OpenAI Discord


LMArena Discord


HuggingFace Discord


LM Studio Discord


OpenRouter (Alex Atallah) Discord


GPU MODE Discord


tinygrad (George Hotz) Discord


aider (Paul Gauthier) Discord


Yannick Kilcher Discord


Latent Space Discord


Nous Research AI Discord


Eleuther Discord


Modular (Mojo 🔥) Discord


Manus.im Discord Discord


Torchtune Discord


Notebook LM Discord


Nomic.ai (GPT4All) Discord


DSPy Discord


MCP (Glama) Discord


LLM Agents (Berkeley MOOC) Discord


Cohere Discord


LlamaIndex Discord


AI21 Labs (Jamba) Discord


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Unsloth AI (Daniel Han) ▷ #general (537 messages🔥🔥🔥):

Polaris 4B Model, 3D Meshes, LoRA Hyperparameters, NVFP4 for Efficient Inference, Reddit Moderation


Unsloth AI (Daniel Han) ▷ #off-topic (4 messages):

QAT models, Recommendation systems hobby project


Unsloth AI (Daniel Han) ▷ #help (323 messages🔥🔥):

Profiling performance metrics for fine-tuning, Gradient accumulation strategies, Qwen GRPO Notebook issues, Unsloth checkpoint vs official checkpoint, Gemma-3 Vision Notebook issues


Unsloth AI (Daniel Han) ▷ #showcase (3 messages):

Nahuatl Translator, Unsloth fine-tuning


Unsloth AI (Daniel Han) ▷ #research (6 messages):

BNPO vs Dr.GRPO, RL-tuning performance, training instability, GRPO-lora and GRPO-Qlora


Cursor Community ▷ #general (411 messages🔥🔥🔥):

Cursor Setup, Cursor Terminal Issues, Windsurf vs Cursor, Rate Limits and Pricing, MCPs VisionCraft and Sequential Thinking


Cursor Community ▷ #background-agents (34 messages🔥):

Background Agents on Multiple Machines, Devcontainer support for Background Agents, Background Agent API, Background Agents and Git Initialization Issues, Accessing Private GitHub Repos During Install Step


Perplexity AI ▷ #general (396 messages🔥🔥):

Homeschooling prompts, Perplexity Pro version issues, ChessChamp AI release, Doctors charging fees, O4 Mini High better than Omni


Perplexity AI ▷ #sharing (4 messages):

Shareable threads, Trump ceasefire, Donation fatigue, Ubisoft patch


Perplexity AI ▷ #pplx-api (4 messages):

Perplexity AI tech support


OpenAI ▷ #ai-discussions (301 messages🔥🔥):

Memory context service, Multi-head Latent Attention, AI dubbing voice lines, Sora alternatives, Chat search connectors


OpenAI ▷ #gpt-4-discussions (5 messages):

OAI Server Tag, GPT-4o Cutoff, ChatGPT vs GPT Models, File Upload/Deletion Issues


OpenAI ▷ #prompt-engineering (2 messages):

PDF generation failures, Deep Research report PDF


OpenAI ▷ #api-discussions (2 messages):

PDF generation alternatives, Deep Research report format, ChatGPT PDF failures, Triggering DeepResearch output


LMArena ▷ #general (252 messages🔥🔥):

Grok3 SOTA, Claude niche, Apple Foundation Models, Google Flamesong, Kingfall release


HuggingFace ▷ #general (140 messages🔥🔥):

HuggingFace Site Issues, AI Jailbreaking, Gradio Loading Issues, Freelance AI Work, Fine-tuning Models


HuggingFace ▷ #today-im-learning (1 messages):

h2he3: Very useful, thank you.


HuggingFace ▷ #i-made-this (50 messages🔥):

Gradio Custom Component Packaging, Gradient Descent on LLM Input Space, Evaluating Language Models for Computer Graphics Code Completion, AI Dialogue with Ollama, Shader Graph Code Generation by LLM


HuggingFace ▷ #reading-group (1 messages):

LessWrong Post Acceptance, Gradient Descent on Token Input Embeddings


HuggingFace ▷ #core-announcements (1 messages):

Diffusers v0.34.0, New Release


HuggingFace ▷ #computer-vision (4 messages):

JAX models, Model Optimization


HuggingFace ▷ #NLP (27 messages🔥):

Docker crashes with sentence transformers, Input embeddings, Scaling Vector Search, Langchain’s FAISS, IndexIVFPQ


HuggingFace ▷ #smol-course (1 messages):

Hugging Face Certificates


HuggingFace ▷ #agents-course (26 messages🔥):

Unit 4 Final Project Submission Workflow, Certificate Access Issues, Final Assignment Evaluation Deadline, Unit 1 Quiz Access Problems, Challenges with HF environment variables in agent creation


LM Studio ▷ #general (81 messages🔥🔥):

Voice installation for language practice, Image generation feature in LM Studio, Roo code Discord issue with LM Studio context windows, Dynamic quant size estimation issues with Unsloth, Increasing chat history context length


LM Studio ▷ #hardware-discussion (41 messages🔥):

P40 on mATX boards, Multiple GPUs vs bottleneck, LM Studio performance on Ryzen 55900xt, 3x3090 slows down


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

Meta Provider Issues, Pricing Questions


OpenRouter (Alex Atallah) ▷ #general (103 messages🔥🔥):

OpenRouter Provider Preference, Novita's Incorrect Information on R1-528 Max Output Length, Stripe Payment Method Issues on OpenRouter, Reasoning Tokens vs Total Token Count, Cent-ML Provider Replacement


GPU MODE ▷ #general (9 messages🔥):

C++ CUDA build systems, Meson, Buck2, xmake, Zig


GPU MODE ▷ #triton (2 messages):

Triton AOT Compilation, Triton Community Meetings, Fused Attention Kernel


GPU MODE ▷ #cuda (11 messages🔥):

CUB with NVRTC, matmul overlap, JIT safe standard library headers, torch.cdist implementation


GPU MODE ▷ #torch (1 messages):

TorchTitan, SimpleFSDP, TP and FSDP collectives, Inductor


GPU MODE ▷ #algorithms (4 messages):

LLM, CUDA, algorithms


GPU MODE ▷ #jobs (1 messages):

PyTorch Tool, Machine Learning Efficiency, Optimization, Mentorship Opportunity, Medical Device CV


GPU MODE ▷ #beginner (6 messages):

cuML, NVIDIA driver, CUDA toolkit, threadIdx.y vs threadIdx.x


GPU MODE ▷ #pmpp-book (1 messages):

Reduction code correctness, Input length handling


GPU MODE ▷ #rocm (4 messages):

rocprofiler-sdk Integration, Chisel Performance Counters


GPU MODE ▷ #intel (1 messages):

Intel GPU atomic latency, Ponte Vecchio VTUNE, SYCL device cycle counters


GPU MODE ▷ #self-promotion (8 messages🔥):

GPU Rental, Chisel Tooling, CUDA Competition, 3D Gaussian Splatting


GPU MODE ▷ #🍿 (30 messages🔥):

KernelLLM, Triton Data, Kernelbot Data, Synthetic Datasets, PyTorch to Triton Conversion


GPU MODE ▷ #reasoning-gym (1 messages):

dragan.jovanovich: congrats👏


GPU MODE ▷ #general (2 messages):

CUDA Matmul precision issues, Triangle Multiplicative Update (Trimul) in AlphaFold


GPU MODE ▷ #submissions (7 messages):

prefixsum performance, sort performance, trimul performance on B200, trimul performance on A100


GPU MODE ▷ #status (1 messages):

New leaderboard problem, AMD + NVIDIA hardware


GPU MODE ▷ #factorio-learning-env (5 messages):

Factorio Client Authentication, FLE updates, Error cases in FLE


GPU MODE ▷ #cutlass (1 messages):

CuTe DSL, GEMM kernel, TMA transfers, MMA operations, sm90 architecture


tinygrad (George Hotz) ▷ #general (90 messages🔥🔥):

NVMe, Network cards, MI300x with AMDGPU, ResNet and BERT Training, GPU kernel


tinygrad (George Hotz) ▷ #learn-tinygrad (1 messages):

FP8 Conversion, Hardware Compatibility


aider (Paul Gauthier) ▷ #general (50 messages🔥):

Synthetic Data for Training, Meta's Synthetic Data Kit, Gemini Pro Stable's Instruction Following Issues, Aider Benchmark Framework, Claude Max Integration with Aider


aider (Paul Gauthier) ▷ #questions-and-tips (26 messages🔥):

Aider strange interactions, deepseek-r1 token limits, MCP support in aider, Gemini's intelligence


aider (Paul Gauthier) ▷ #links (10 messages🔥):

Claude Code, Backend, Subscription, API calls, SDK


Yannick Kilcher ▷ #general (37 messages🔥):

Efficient Pneumonia Detection with Vision Transformers, Scaling Vector Search with FAISS, GRPO for RL, FYP ML domain


Yannick Kilcher ▷ #paper-discussion (12 messages🔥):

Cloud GPU Platforms, AI in Education, RWKV v6 and Finch Series, Time Crystal Computer


Yannick Kilcher ▷ #ml-news (26 messages🔥):

Natural Selection and AI, Genetic Engineering vs Automation, AI as Calculator, Richer People Reproduce Less, Papers on RL & LLMs


Latent Space ▷ #ai-general-chat (70 messages🔥🔥):

Harvey AI Funding, Replit ARR, AI Agent Supervision, Startup vs Incumbent, Magenta RealTime


Nous Research AI ▷ #general (37 messages🔥):

grok3mini, humanizing AI agents, building llms from scratch, llm inference app llamabarn, COCONUT gating layer


Nous Research AI ▷ #ask-about-llms (2 messages):

Model Recommendations, LORA Training, GGUF Conversion, Local LLMs on GTX 1080


Nous Research AI ▷ #research-papers (4 messages):

MultiNet v0.2, Manifold platform, R1-Zero-Like Training, RL Incentivize Reasoning, Spurious Rewards in RLVR


Nous Research AI ▷ #interesting-links (2 messages):

Reward Models, PAIE Curator


Nous Research AI ▷ #research-papers (4 messages):

MultiNet v0.2, Manifold platform, Generalist AI evaluation, R1-Zero-Like Training, RL Incentivizes Reasoning


Eleuther ▷ #general (14 messages🔥):

Multiagent Cooperation, Prefix caching, red teaming conversational AI


Eleuther ▷ #research (30 messages🔥):

Spectral Normalization, Sleeping-DISCO Dataset, Generative Models and Dynamical Systems, Manifold Multimodal AI Benchmarks, RL incentive


Eleuther ▷ #interpretability-general (4 messages):

NNsight pre-release, Loss curve decomposition, NDIF update, Orthogonal Gradient Basis


Modular (Mojo 🔥) ▷ #general (4 messages):

Mojo GPU kernels, Mojo from Python Limitations


Modular (Mojo 🔥) ▷ #mojo (39 messages🔥):

Larecs Testing in Modular Community CI, Mojo as Rust Replacement, Mojo Async vs Rust Async, Statement Beginning Error in Mojo


Manus.im Discord ▷ #general (30 messages🔥):

Manus PDF reading issues, New AI architecture development, Credit promo issues, Manus credits, Manus down


Torchtune ▷ #general (3 messages):

TorchTune, Single Machine LORA, GitHub Issues


Torchtune ▷ #dev (25 messages🔥):

Expandable Segments Bug, max-autotune issue, clearing cache, L40S card bug, reward modeling RFC


Notebook LM ▷ #use-cases (5 messages):

NotebookLM Model, Latest Model Info, Model Options


Notebook LM ▷ #general (22 messages🔥):

New user options, Share the link feature, NotebookLM Alternatives, Audio Overview Generation, Vimeo Videos as Sources


Nomic.ai (GPT4All) ▷ #general (12 messages🔥):

Debian 12 vs Ubuntu Jammy, Python SDK update, GPT4All official website issues


DSPy ▷ #papers (1 messages):

Atom of Thought, GAIA benchmark, Agent Startup, Implementation code issues


DSPy ▷ #general (5 messages):

Ax for TypeScript, module status messages, OpenAI Issues, LiteLLM


MCP (Glama) ▷ #general (6 messages):

Google's A2A, Anthropic A2A, MCP Timeouts, Chrome AI APIs


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (5 messages):

Certificate Timing, Course Completion, Social Media Posts for Course


Cohere ▷ #🧵-general-thread (3 messages):

Cohere Reranker Pricing, Token Usage in Cohere API


Cohere ▷ #👋-introduce-yourself (1 messages):

Introductions, Community, Tech, Tools


LlamaIndex ▷ #blog (2 messages):

Open Source Resume Matching, Claude-Compatible MCP Server


LlamaIndex ▷ #general (1 messages):

FAISS Optimization, Vectorized Computation, Quantized FAISS Index, Dynamic Query Vectors