Frozen AI News archive

Western Open Models get Funding: Cohere $500m @ 6.8B, AI2 gets $152m NSF+NVIDIA grants

**OpenAI's GPT-5** achieved a speedrun of Pokemon Red 3x faster than **o3**. **Perplexity** raised **$200M** at a **$20B valuation**. **AI2** secured **$75M NSF grants** and **$77M from NVIDIA** for AI infrastructure projects like Olmo and Molmo. **Cohere** raised **$500M** and hired **Joelle Pineau** from **meta-ai-fair**, boosting models like Command A. **Google** released the **Gemma 3 270M** on-device tiny LLM with INT4 QAT checkpoints and large embedding tables, and made **Imagen 4** generally available with a fast version at $0.02/image. **Meta-ai-fair** introduced **DINOv3**, a family of self-supervised vision foundation models with high-resolution dense features and strong performance on benchmarks like COCO detection and ADE20K segmentation, under a permissive license. A **$150,000 MiniMax AI Agent Challenge** is ongoing with 200+ prizes, encouraging AI project builds by August 25.

Canonical issue URL

Funding for open models are all we need.

AI News for 8/13/2025-8/14/2025. We checked 12 subreddits, 544 Twitters and 29 Discords (227 channels, and 9744 messages) for you. Estimated reading time saved (at 200wpm): 710 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Congrats to GPT5 speedruning Pokemon Red 3x faster than o3, and to Perplexity raising $200m at $20B valuation, but the day belongs to the open models crew who announced big injections of cash this week:

Good guys and gals won today.


🚀 $150,000 MiniMax AI Agent Challenge — Bring Your A-Game!


AI Twitter Recap

Google’s Gemma 3 270M model and Imagen 4 Fast

Meta’s DINOv3: high-resolution dense vision features at scale (permissive)

Frontier model capability and efficiency: GPT‑5, FormulaOne, DetailBench, GFPO

Open ecosystem, scale, and infra

Agents: simulation, deep research, and browser-native assistants

Interactive video, robotics, and multimodality

Top tweets (by engagement)


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. Benchmarking and Popularity of Small Language Models

2. Upcoming and Open Source AI Model Releases (Grok 2, DeepSeek)

3. Hardware and Practical Challenges in AI Model Deployment (Qwen Context & GPUs)

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. GPT-5 Beats Previous Models at Pokémon Red Speedruns

2. Notable New Benchmarks: GPT-5, Google Image Model, SWE-bench

3. AI Model and Platform Feature Launches: Claude Code & Gemma 3 270M


AI Discord Recap

A summary of Summaries of Summaries by gpt-5

1. Tiny Titans & Trusty Benchmarks

2. Agent Tooling & Protocols Heat Up

3. Routers, Reliability & Receipts

4. Compilers, Kernels & Local Runtimes

5. AI IDEs Ship Serious Upgrades


Discord: High level Discord summaries

Perplexity AI Discord


Unsloth AI (Daniel Han) Discord


LMArena Discord


Cursor Community Discord


OpenRouter (Alex Atallah) Discord


HuggingFace Discord


OpenAI Discord


LM Studio Discord


Latent Space Discord


Moonshot AI (Kimi K-2) Discord


Nous Research AI Discord


GPU MODE Discord


Modular (Mojo đŸ”„) Discord


Yannick Kilcher Discord


aider (Paul Gauthier) Discord


Eleuther Discord


DSPy Discord


Notebook LM Discord


MCP (Glama) Discord


Cohere Discord


LlamaIndex Discord


Nomic.ai (GPT4All) Discord


Manus.im Discord Discord


tinygrad (George Hotz) Discord


Codeium (Windsurf) Discord


Gorilla LLM (Berkeley Function Calling) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Torchtune Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1184 messagesđŸ”„đŸ”„đŸ”„):

Meta's AI documents leaked, Perplexity gamification, Grok's uncensored versions, Claude's language mixing issue, AI in book writing


Perplexity AI ▷ #sharing (6 messages):

Comet Projects, Syntax Gang, AI-Designed Antibiotics, Puch AI's $50 Billion Counterfactual Bet


Perplexity AI ▷ #pplx-api (7 messages):

Disable Search on Sonar, Search Control Guide


Unsloth AI (Daniel Han) ▷ #general (1236 messagesđŸ”„đŸ”„đŸ”„):

Local LLMs and RAM upgrades, GPT-OSS Fine-tuning Updates, Gemma 3 270M Model, LLM Training and Benchmarking, Multi-GPU Training Paused


Unsloth AI (Daniel Han) ▷ #introduce-yourself (2 messages):

Discord server settings


Unsloth AI (Daniel Han) ▷ #off-topic (325 messagesđŸ”„đŸ”„):

Windows 12, Debian vs Ubuntu, Pantheon TV show, AI and Humanity, Steroid Use in Gyms


Unsloth AI (Daniel Han) ▷ #help (65 messagesđŸ”„đŸ”„):

Sagemaker Deployment with LMI Instances, Manual Notebook Configuration vs. Claude Code, VLLM for Fast Inference, P40 vs Mi50 for Fine-Tuning, Synthetic Dataset Generation from PDFs


Unsloth AI (Daniel Han) ▷ #showcase (18 messagesđŸ”„):

MoLA-LM, LoRA, Qwen, Gemini, Jan v1 model


Unsloth AI (Daniel Han) ▷ #research (7 messages):

Data Efficiency, Synthetic Data Generation, Two-Stage Training, Compute vs Data


LMArena ▷ #general (1117 messagesđŸ”„đŸ”„đŸ”„):

GPT-5 versions compared, Benchmarking nuances, AI Relationships, Censorship in AI models, File Uploads


LMArena ▷ #announcements (1 messages):

July Contest, Contest Voting


Cursor Community ▷ #general (968 messagesđŸ”„đŸ”„đŸ”„):

Cursor Disk Usage, GPT-5 Loop, Copilot Limitations, Cursor Pricing Change, CC versus Cursor


Cursor Community ▷ #background-agents (9 messagesđŸ”„):

Cursor API Access, Background Agents Beginner's Guide, Docker Compose with Background Agent, Linear Integration Repository Specification, Background Agent Docker Installation


OpenRouter (Alex Atallah) ▷ #announcements (2 messages):

Self-Serve Refunds, Activity Improvements, Token Usage Breakdown, 3rd party credit usage, Chutes Capacity Offline


OpenRouter (Alex Atallah) ▷ #app-showcase (1 messages):

Deno or-models tool, OpenRouter model list


OpenRouter (Alex Atallah) ▷ #general (525 messagesđŸ”„đŸ”„đŸ”„):

Deepseek v3 issues and outages, Chutes' rate limiting and API key management, Alternatives to Deepseek for roleplaying, Azure credits liquidation strategies, Sonnet 4 pricing inconsistencies


OpenRouter (Alex Atallah) ▷ #discussion (16 messagesđŸ”„):

Self-serve refunds, Chatroom app creation, Qwen Coder via Cerebras, Tool call evals, Model Select performance


HuggingFace ▷ #general (249 messagesđŸ”„đŸ”„):

HuggingFace search box, Gemini 2.5 Flash GGUF, Local text to video model, Qwen-3-4B-Thinking-2507 Model, CIRISAgent by ethicsengine.org


HuggingFace ▷ #today-im-learning (3 messages):

MultiLLM Access, AI Fraud Detector App


HuggingFace ▷ #cool-finds (1 messages):

jariullah: yo


HuggingFace ▷ #i-made-this (6 messages):

MLX Knife, ChonkyBin, TalkT2-0.1b, AI-powered Web App


HuggingFace ▷ #computer-vision (48 messagesđŸ”„):

Integrating Open-Source Models, Medical Image Analysis, Emotional Support System, First AI Project, Environmental Projects


HuggingFace ▷ #NLP (3 messages):

transformers_distillation, model compression, efficient NLP, Hugging Face Transformers


HuggingFace ▷ #agents-course (1 messages):

virtual environments, venv


OpenAI ▷ #ai-discussions (183 messagesđŸ”„đŸ”„):

Claude 1m context window, GPT-5 vs GPT-4, Gemini Update, Grok 5, Llama's Status


OpenAI ▷ #gpt-4-discussions (23 messagesđŸ”„):

Emotionless Goth Girl GPT-5, GPT-5 Tone Issues, GPT 5 bugginess, GPT android fighting voice models, GPT for AWS Design


OpenAI ▷ #prompt-engineering (12 messagesđŸ”„):

Positive prompts, Customizing ChatGPT-5, UI Buttons for ChatGPT, Suggestion Box


OpenAI ▷ #api-discussions (12 messagesđŸ”„):

Positive Prompts, Customizing ChatGPT-5 for Permanent Memories, Reasoning Process Changes, UI Buttons for Chatbot Interactions, Minimizing Chatbot Prompts with Custom Instructions


LM Studio ▷ #general (174 messagesđŸ”„đŸ”„):

LM Studio tool calling, Qwen3 Coder Flash, LM Studio TTS/STT, GPT-OSS settings, LM Studio's config override dot


LM Studio ▷ #hardware-discussion (36 messagesđŸ”„):

Framework 13 LLM Speed, AMD GPU ROCM Pytorch, Flash Attention KV Values, Maxsun Arc Pro B60, RTX PRO 4000SFF


Latent Space ▷ #ai-general-chat (193 messagesđŸ”„đŸ”„):

Multi-Layer SPVs, AI Employee Adoption Tactics, Agentic AI MOOC, OpenAI Operator vs Anthropic Fin, Claude 3.5 Sonnet Deprecation


Moonshot AI (Kimi K-2) ▷ #general-chat (95 messagesđŸ”„đŸ”„):

Kimi K2 PPT Generation, Kimi vs Grok Reddit Bot Policy, Kimi K2 vs K1.5 Model Performance, DeepSeek Next Gen Model Release, Kimi's Reasoning Model Parameter


Nous Research AI ▷ #announcements (1 messages):

Token Usage, Reasoning Models, Open Models vs Closed Models


Nous Research AI ▷ #general (66 messagesđŸ”„đŸ”„):

Hermes-3 dataset refusals, Menlo Research joining Interspeech2025, Uncensoring AI intelligence, Google released Gemma-3-270m, DeepSeek R2 release rumors


Nous Research AI ▷ #ask-about-llms (12 messagesđŸ”„):

Claude's Spying, Channel Privacy, AI Oversight


Nous Research AI ▷ #research-papers (3 messages):

Open WebUI setup difficulty, Emergent Behavior in Tiny LMs paper, DINOv3


Nous Research AI ▷ #research-papers (3 messages):

Open WebUI, Emergent Behavior, Dino V3


GPU MODE ▷ #general (33 messagesđŸ”„):

0xc0000409 exception with llama_model_load_from_file, CUDA backend initialization, STATUS_STACK_BUFFER_OVERRUN error


GPU MODE ▷ #triton (7 messages):

Triton Resources for Speculative Decoding, GPT-Fast PyTorch Implementation, Lucidrains Speculative Decoding Repo, Triton Developer Conference 2025


GPU MODE ▷ #beginner (12 messagesđŸ”„):

CUDA/C++ submission, Shared memory CUDA, GPU MODE Documentation


GPU MODE ▷ #triton-puzzles (5 messages):

Triton Puzzle Notebook, Triton Viz Compatibility, Colab for Triton Puzzles, Triton Version


GPU MODE ▷ #self-promotion (1 messages):

Apple Silicon Training, Cohere Labs Event


GPU MODE ▷ #submissions (1 messages):

Leaderboard results, A100, Trimul


GPU MODE ▷ #factorio-learning-env (20 messagesđŸ”„):

Agent Framework Integration, Entity-Ghost Warning, GameState Serialization


GPU MODE ▷ #singularity-systems (3 messages):

Picocuda, Picograd, Elements Repo, Graph Data Structures, Tensor Data Structures


Modular (Mojo đŸ”„) ▷ #announcements (1 messages):

Modular Meetup, High-Performance AI, Inworld AI Collaboration, Matrix Multiplication Optimization


Modular (Mojo đŸ”„) ▷ #mojo (2 messages):

MAX SDK LSP crashes, Mojo LSP, GitHub issue for MAX SDK


Modular (Mojo đŸ”„) ▷ #max (66 messagesđŸ”„đŸ”„):

MAX in ComfyUI, Kyutai benefits from MAX, Unet compile times, Pytorch Backends Comparisons, Memory Leaks when compiling


Yannick Kilcher ▷ #general (32 messagesđŸ”„):

LLM Providers Batching User Requests, MoE Scheduling, Non-Determinism in GPT-4, VTuber Sister


Yannick Kilcher ▷ #agents (1 messages):

AIxCC, DARPA, LLM Agents, Open Source


Yannick Kilcher ▷ #ml-news (28 messagesđŸ”„):

Huawei Ascend, Gemma 3-270M, Inference Time on Low-End Devices


aider (Paul Gauthier) ▷ #general (34 messagesđŸ”„):

gpt-oss-120b vs gpt-5-mini, Empty response received from LLM, Aider using completions vs responses


aider (Paul Gauthier) ▷ #questions-and-tips (4 messages):

aider native function calling, local inference providers, aider with MCP servers, aider tutorial with ollama/lmstudio/vllm


Eleuther ▷ #announcements (1 messages):

Multilingual Representation Learning Workshop, physical commonsense reasoning benchmark


Eleuther ▷ #general (12 messagesđŸ”„):

Multilingual Representation Learning Workshop, Portuguese vs Brazilian Portuguese datasets, ISO 639-3, NLP Resources for languages


Eleuther ▷ #research (7 messages):

Diffusion Language Models, Generative AI, Llada, Mercury


Eleuther ▷ #scaling-laws (18 messagesđŸ”„):

Scaling Laws, Chinchilla scaling laws paper, GPT scaling laws paper


DSPy ▷ #show-and-tell (1 messages):

Narrator Tool, LLMs iteratively learn, LLMs for creative writing, SIMBA optimizer


DSPy ▷ #general (26 messagesđŸ”„):

MLflow GEPA vs SIMBA, GEPA pronunciation, GEPA logprobs for evolutionary selection, Gemma 3-270m finetuning, Databricks sponsorship of DSPy


Notebook LM ▷ #use-cases (5 messages):

NLM extensions, QoL UI updates


Notebook LM ▷ #general (20 messagesđŸ”„):

NotebookLM, Gemini Integration, Recall.ai, Audio/Video Generation by AI, Bug Reporting


MCP (Glama) ▷ #general (16 messagesđŸ”„):

Bun executable path, Reddit post auto-removal, MCP authorization code flow, Elicitations in MCP client specification


MCP (Glama) ▷ #showcase (3 messages):

MCP Server, Hypertool-MCP, Tool-binding Limits, Persona-specific toolsets, Local MCP Server


Cohere ▷ #đŸ§”-general-thread (12 messagesđŸ”„):

Mapler returns, Caramel genetics researchers, Pineau joins Cohere, Treatment planner with RAG


Cohere ▷ #👋-introduce-yourself (4 messages):

Genetics Research, AI Researcher


Cohere ▷ #🔬-research (1 messages):

Treatment Planner, RAG, Open Source LLM


LlamaIndex ▷ #blog (5 messages):

LlamaExtract in TypeScript SDK, GPT-5 with LlamaParse, AI Agent Applications, AI Stock Portfolio Agent, Web-scraping AI agents


LlamaIndex ▷ #general (11 messagesđŸ”„):

Agent efficiency with large JSON dependencies, ReactAgent migration breaking changes, Structured outputs via tool calls, PGVectorStore Errors in 0.13.1


Nomic.ai (GPT4All) ▷ #general (10 messagesđŸ”„):

Strix Halo mini PC, HP Z2 Mini, Ryzen 7 7840HS, GPT-OSS 120B, Quantum Computers


Manus.im Discord ▷ #general (9 messagesđŸ”„):

Web application deployment improvements, Manus AI interface with Unitree robot, Manus account login issues, Session expiry issues with Google account, Internal server errors


tinygrad (George Hotz) ▷ #general (4 messages):

Kernelize and Codegen ordering, Tinygrad Compilation Process


tinygrad (George Hotz) ▷ #learn-tinygrad (4 messages):

CUDA_ERROR_UNSUPPORTED_PTX_VERSION, tinygrad SM support, tinygrad Op documentation


Codeium (Windsurf) ▷ #announcements (1 messages):

Windsurf Wave 12, DeepWiki Integration, Vibe and Replace, Smarter Cascade Agent


Gorilla LLM (Berkeley Function Calling) ▷ #discussion (1 messages):

qwen3, tool call arguments, streaming