Frozen AI News archive

Cognition's DeepWiki, a free encyclopedia of all GitHub repos

**Silas Alberti** of **Cognition** announced **DeepWiki**, a free encyclopedia of all GitHub repos providing Wikipedia-like descriptions and Devin-backed chatbots for public repos. **Meta** released **Perception Encoders (PE)** with A2.0 license, outperforming **InternVL3** and **Qwen2.5VL** on vision tasks. **Alibaba** launched the **Qwen Chat App** for iOS and Android. **Hugging Face** integrated the **Dia 1.6B SoTA** text-to-speech model via **FAL**. **OpenAI** expanded deep research usage with a lightweight version powered by **o4-mini** model, now available to free users. **Perplexity AI** updated their model selector with **Grok 3 Beta**, **o4-mini**, and support for models like **gemini 2.5 pro**, **claude 3.7**, and **gpt-4.1**. **vLLM** project introduced **OpenRLHF** framework for reinforcement learning with human feedback. **Surya OCR** alpha model supports 90+ languages and LaTeX. **MegaParse** open-source library was introduced for LLM-ready data formats.

Canonical issue URL

300k is all you need to index GitHub.

AI News for 4/25/2025-4/26/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (214 channels, and 5186 messages) for you. Estimated reading time saved (at 200wpm): 373 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Silas Alberti of Cognition (maker of Devin) announced DeepWiki today, "a free encyclopedia of all GitHub repos. You can replace any public GitHub repo url with "https://deepwiki.com/org/repo" and you get a remarkably accurate wikipedia-like description of the library as well as a Devin-backed chatbot on how to use it.

https://resend-attachments.s3.amazonaws.com/k0kw0QHMzZ5Urnn

Our intial tests with the React and Astro repos, which we are familiar with because AINews is now built on them, was VERY encouraging. Worth trying, especially for using open source code.


AI Twitter Recap

Model Releases and Updates

Frameworks, Tools, and Datasets

Agentic Systems and Tool Use

Interpretability and Evaluation

AI Ethics and Welfare

Industry and Business

ICLR Conference

Transportation and Infrastructure

Humor


AI Reddit Recap

/r/LocalLlama Recap

1. Lossless LLM Compression for Inference (DF11) Research and Release

2. Open-Source Local LLM & AI App Builders (Dyad)

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. CivitAI Controversy and Model Hosting Alternatives

2. Recent OpenAI Model Release Issues and Strategy

3. Frontier Model Benchmarks and Human vs AI Reasoning


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.5 Flash Preview

Theme 1. Model Updates and Performance Shifts

Theme 2. Hardware and Optimization Techniques

Theme 3. AI Frameworks and Tooling Updates

Theme 4. AI Research and Fundamental Concepts

Theme 5. Industry News and Platform Challenges


PART 1: High level Discord summaries

Perplexity AI Discord


Nous Research AI Discord


LMArena Discord


Manus.im Discord Discord


OpenAI Discord


Unsloth AI (Daniel Han) Discord


Eleuther Discord


aider (Paul Gauthier) Discord


HuggingFace Discord


Yannick Kilcher Discord


OpenRouter (Alex Atallah) Discord


Cursor Community Discord


Modular (Mojo 🔥) Discord


LM Studio Discord


GPU MODE Discord


Notebook LM Discord


MCP (Glama) Discord


LlamaIndex Discord


DSPy Discord


Torchtune Discord


Nomic.ai (GPT4All) Discord


Cohere Discord


The tinygrad (George Hotz) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


PART 2: Detailed by-Channel summaries and links

Perplexity AI ▷ #announcements (1 messages):

iOS Voice Assistant, GPT Image Generation, Grok 3 Beta, OpenAI o4-mini


Perplexity AI ▷ #general (1110 messages🔥🔥🔥):

Discord Roles Rework, GPTs Agent Training, OpenAI sidebars, Apple Music Discord Support, Grok 3 Mini


Perplexity AI ▷ #sharing (2 messages):

Perplexity AI Search, Spanish Inquisition


Perplexity AI ▷ #pplx-api (1 messages):

Perplexity API Introduction, Denis Yarats


Nous Research AI ▷ #general (1315 messages🔥🔥🔥):

TACQ compression, TRL reward functions, Paradigm invests in Nous Research, Psyche distributed training project, Nous API waitlist issues


Nous Research AI ▷ #ask-about-llms (36 messages🔥):

Max model size on Mac M4, LLMs and control vectors, Nous Psyche vs Petals, Hermes 3 70B settings, Open source coding models for 8GB VRAM


Nous Research AI ▷ #interesting-links (5 messages):

Fine-tuning LLMs, LoRA, LLaMa Factory, CPU-only fine-tuning, GGUF


LMArena ▷ #general (317 messages🔥🔥):

o3 can output large file code, AGI path for big tech companies, Yann LeCun's 'A Path Towards Machine Intelligence', Meta's Memory+, DeepMind's Titans


Manus.im Discord ▷ #general (293 messages🔥🔥):

Manus's strategic role, Geostrategic relevance of HK, Manus better than ChatGPT?, Double credits, Manus's issues


OpenAI ▷ #ai-discussions (80 messages🔥🔥):

Sora limitations and pricing, RAG + LLM + Word2Vec discussion, Fine-tuning a math model, Gemini Advanced video generation limits, Alternatives to Sora for photorealistic image generation


OpenAI ▷ #gpt-4-discussions (16 messages🔥):

OpenAI Pro vs Plus Deep Research limits, Deep Research Tier List, GPT generating images when code is expected


OpenAI ▷ #prompt-engineering (48 messages🔥):

Learning Python with ChatGPT, Python vs. Java for Beginners, Web Development Languages (JavaScript), The Odin Project


OpenAI ▷ #api-discussions (48 messages🔥):

Learning Python with ChatGPT, Python vs Java for Beginners, Python for Web Development, The Odin Project, Python's Role in AI


Unsloth AI (Daniel Han) ▷ #general (96 messages🔥🔥):

Unsloth Dynamic v2.0 GGUFs release, GLM 4 models, Tesslate Rust 7B model, Dynamic Quantization Workflow, QwQ 32b training


Unsloth AI (Daniel Han) ▷ #help (62 messages🔥🔥):

torch.compile issues, GGUF working, chat template, llama.cpp error


Unsloth AI (Daniel Han) ▷ #research (2 messages):

Kimi Audio, MoonshotAI


Eleuther ▷ #general (86 messages🔥🔥):

Llama-3.2 checkpoints, Nvidia drivers in TCC mode on Windows, Cost of computer specs and digital divide, Colaboratory for data analytics, Flash Linear Attention models


Eleuther ▷ #research (68 messages🔥🔥):

ReaSCAN results, User edits, Key replacement with learned LERP, Memory mosaics paper, tokenshift


Eleuther ▷ #interpretability-general (6 messages):

TinyStories replacement dataset, Computational Sparsity talk by Leo, Attribution based Parameter Decomposition (APD), Local Loss Landscape Decomposition (L3D)


aider (Paul Gauthier) ▷ #general (95 messages🔥🔥):

Aider Language Sorting, Grok-3-Mini Pricing, Gemini+Claude Combo, Aider Architect/Editor Workflow, Gemini Verbose Comments


aider (Paul Gauthier) ▷ #questions-and-tips (18 messages🔥):

Gemini 2.5 Pro Setup, Read only files in Aider, Work tracking markdown logs and Aider, Aider CLI vs IDE, Openrouter Fallbacks


aider (Paul Gauthier) ▷ #links (3 messages):

Aider Commands, Code Generation, Conversation Summarization


HuggingFace ▷ #general (53 messages🔥):

Framepack impressions, Multimodal LLMs, Hugging Face Inference API Issues, SD3.5 API refresh, LLM for python coding


HuggingFace ▷ #today-im-learning (1 messages):

Deeplearning.ai course, Smolagents and Gemma3, Code Agents vs LLMs


HuggingFace ▷ #cool-finds (3 messages):

GAN, Excel, GAN model


HuggingFace ▷ #i-made-this (4 messages):

Small Models Math Dataset, ingest-anything, Malware Analysis Agent


HuggingFace ▷ #NLP (1 messages):

ilham_xx: thank you so much, <:agree:1098629085955113011> ⭐


HuggingFace ▷ #smol-course (3 messages):

Agent Template Error, Local LLMs with SmolAgents, Code Agents vs LLMs for Reasoning


HuggingFace ▷ #agents-course (50 messages🔥):

Course Deadlines, Certificate verification, DuckDuckGoSearch timeout, Pokemon Showdown LLM Cheating, Unit 4 Assignment


Yannick Kilcher ▷ #general (109 messages🔥🔥):

Reward Shaping Best Practices, Symbolic World Models, Oscillatory State Space Models, LLM World Models, Generating New Agents/Models/Worlds


Yannick Kilcher ▷ #ml-news (4 messages):

Music AI, Perplexity Browser


OpenRouter (Alex Atallah) ▷ #announcements (20 messages🔥):

Gemini 2.5 Pro Experimental Free, Rate Limits, Error Messages


OpenRouter (Alex Atallah) ▷ #general (92 messages🔥🔥):

Baidu Models on OpenRouter, OpenRouter support for OpenAI's o3, Gemini 2.5 Pro rate limits, Nvidia Nemotron settings, OpenRouter credits exploited


Cursor Community ▷ #general (105 messages🔥🔥):

RTSP firmware package for ESP32, Gemini 2.5 and Sonnet 3.7 performance, Billing settings on Cursor, Model evaluations for Cursor


Modular (Mojo 🔥) ▷ #general (9 messages🔥):

Python interop capabilities, Mojo roadmap, Mac GPU support, Suspicious link shared


Modular (Mojo 🔥) ▷ #mojo (94 messages🔥🔥):

uom crate in Rust, Nm vs Joules, QuantityKind tag, radian confusion, integer support


LM Studio ▷ #announcements (1 messages):

LM Studio Update, NVIDIA RTX 50-series Support, GLM-4 Enabled, New System Prompt Editor, Tool Choice Parameter


LM Studio ▷ #general (75 messages🔥🔥):

Dataset Preparation for LLM Character, GGUF Conversion Script Issues, LM Studio vs Ollama Advantages, Google AIStudio Lora training, LM Studio community presets


LM Studio ▷ #hardware-discussion (20 messages🔥):

RTX 3060 vs Intel B580 for AI, AVX2 speeds with Xeon, OpenVINO vs llama.cpp


GPU MODE ▷ #general (3 messages):

Grayscale mode, ML Compilers, TVM/MLIR Project


GPU MODE ▷ #triton (4 messages):

Quantization, FP4, 5090, Matmul Kernel


GPU MODE ▷ #beginner (2 messages):

NVIDIA's cuda-python library, CUDA Functionality


GPU MODE ▷ #rocm (2 messages):

MI300 ISA, VGPRs, wavefront, occupancy


GPU MODE ▷ #submissions (15 messages🔥):

MI300, H100, amd-fp8-mm leaderboard, amd-identity leaderboard, vectoradd leaderboard


GPU MODE ▷ #amd-competition (17 messages🔥):

Profiling info, Personal best timing calculation, Kernel fails when benchmarking, Benchmark shapes, Azure MI300 compute time


Notebook LM ▷ #use-cases (5 messages):

Prompt engineering updates, Zotero collection integration, Analyzing US Code with NotebookLM, File size limitations


Notebook LM ▷ #general (31 messages🔥):

Content Security Policies Errors, Gemini Chat Deletion, NotebookLM Use Cases, Source Uploading Errors, Downgrading from Plus to Free


MCP (Glama) ▷ #general (26 messages🔥):

Claude Desktop Image Uploads to MCP Tools, MCP Server Public Description Updates, JSON Formatting Errors with MCP Hooks, MCP Server Streamable Client Support, mcp-remote Usage and Supergateway Comparison


MCP (Glama) ▷ #showcase (2 messages):

Kubernetes MCP, GitHub Repo MCP, StacklokLabs mkp


LlamaIndex ▷ #blog (2 messages):

CondoScan, LlamaIndex, LlamaParse, Agents


LlamaIndex ▷ #general (19 messages🔥):

``memoryvschat_history, AgentWorkflow Errors, FunctionCallingAgent timeout, FunctionAgent timeout

agent = FunctionAgent(
 tools=[multiply],
 llm=Ollama(model="llama3.1", request_timeout=360.0),
 system_prompt="You are a helpful assistant that can multiply two numbers.",
)

DSPy ▷ #general (8 messages🔥):

DSPy support for multimodal models, Chain of Thoughts and ReACT pattern, Frameworks like Langchain and LlamaIndex, Multimodal Reasoning Models, Ember and Compound AI Systems


Torchtune ▷ #general (1 messages):

Torchtune's GRPO code sharing


Torchtune ▷ #rl (4 messages):

PPO Epochs, GRPO Data Padding


Nomic.ai (GPT4All) ▷ #general (5 messages):

Memory Requirements for LLMs, Alternatives to Rust Coding, Llama4 and Scout 17x16


Cohere ▷ #「🤝」introductions (2 messages):

Healthcare AI/ML startup seeks Prompt Optimization, Cohere Community Discord server Introductions


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.