Frozen AI News archive

not much happened today

The AI news recap highlights several key developments: **nanoMoE**, a PyTorch implementation of a mid-sized Mixture-of-Experts (MoE) model inspired by Andrej Karpathy's nanoGPT, enables pretraining on commodity hardware within a week. An agentic leaderboard ranks LLMs powering **smolagents CodeAgent**, with **GPT-4.5** leading, followed by **Claude-3.7-Sonnet**. Discussions around **DeepSeek-R1** emphasize AI model commoditization, with DeepSeek dubbed the "OpenAI of China." **Q-Filters** offer a training-free method for KV cache compression in autoregressive models, achieving **32x compression** with minimal perplexity loss. The **PokéChamp** minimax language agent, powered by **GPT-4o** and **Llama-3-8b**, demonstrates strong performance in Pokémon battles. Other notable models include **TinyR1-32B-Preview** with Branch-Merge Distillation, **R1-Searcher** incentivizing search capability via reinforcement learning, and the **Forgetting Transformer** using a Forget Gate in softmax attention. These advancements reflect ongoing innovation in model architectures, compression, reinforcement learning, and agentic AI.

Canonical issue URL

AI News for 3/7/2025-3/10/2025. We checked 7 subreddits, 433 Twitters and 28 Discords (223 channels, and 14958 messages) for you. Estimated reading time saved (at 200wpm): 1424 minutes. You can now tag @smol_ai for AINews discussions!

Lots of folks are talking positives and negatives about Manus AI, and we wrote a recap of Why MCP Won, but neither story is really title worthy.


{% if medium == 'web' %}

Table of Contents

[TOC]

{% else %}

The Table of Contents and Channel Summaries have been moved to the web version of this email: [{{ email.subject }}]({{ email_url }})!

{% endif %}


AI Twitter Recap

AI Models, Architectures, and Benchmarks

AI Tools, Platforms, and Applications

Research and Development in AI

Industry News and Business Developments

AI Safety, Alignment, and Ethical Considerations

Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

Theme 1. Manus Agent: Claude Sonnet Integrated with 29 Tools

Theme 2. LLMs not Ready for Large Codebases Yet: Evidence from <70B Evaluations

Theme 3. Apple M3 Ultra: Challenges for AI Workloads Compared to Traditional Systems

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding

Theme 1. Open-Source Viral Squish Effect: Releasing a New Trend

Theme 2. WAN 2.1 I2V Provides Unprecedented Capabilities

Theme 3. Engine01 Humanoid: Advancements in Robotic Motion

Theme 4. Triton for Windows: Streamlining AI Workflows


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.0 Flash Thinking

Theme 1. Emerging AI Models and Agents

Theme 2. LLM Performance and Benchmarking

Theme 3. AI Development Tools and IDEs

Theme 4. AI Communication Protocols (MCP, SLOP, ANP)

Theme 5. Hardware and Performance Optimization


PART 1: High level Discord summaries

Cursor IDE Discord


Unsloth AI (Daniel Han) Discord


Perplexity AI Discord


OpenAI Discord


LM Studio Discord


aider (Paul Gauthier) Discord


Nous Research AI Discord


Nomic.ai (GPT4All) Discord


HuggingFace Discord


Yannick Kilcher Discord


GPU MODE Discord


Latent Space Discord


Notebook LM Discord


Interconnects (Nathan Lambert) Discord


Modular (Mojo 🔥) Discord


MCP (Glama) Discord


Eleuther Discord


Torchtune Discord


Codeium (Windsurf) Discord


LlamaIndex Discord


Cohere Discord


DSPy Discord


tinygrad (George Hotz) Discord


AI21 Labs (Jamba) Discord


LLM Agents (Berkeley MOOC) Discord


MLOps @Chipro Discord


Gorilla LLM (Berkeley Function Calling) Discord


PART 2: Detailed by-Channel summaries and links

{% if medium == 'web' %}

Cursor IDE ▷ #general (1035 messages🔥🔥🔥):

Opacity of Product Code, Fix Dumb Code Finding, Model Iteration, Tag Query-ability, Version 47

Links mentioned:


Unsloth AI (Daniel Han) ▷ #general (1059 messages🔥🔥🔥):

Taycan Tyres, Dopamine Based Learning, GRPO Reward Functions, Model Embedding, Context Embedding

Links mentioned:


Unsloth AI (Daniel Han) ▷ #off-topic (119 messages🔥🔥):

RLHF with Unsloth GRPO on Qwen7b, GRPO examples, KL divergence instability, LLM Inference Optimization, Unsloth Pro Subscription

Links mentioned:


Unsloth AI (Daniel Han) ▷ #help (277 messages🔥🔥):

Mac Studio RAM configuration, Unsloth's 1.58bit quantized model of deepseek-r1, RoPE Scaling, custom dataset, hyper params for Phi4 model

Links mentioned:


Unsloth AI (Daniel Han) ▷ #showcase (11 messages🔥):

ASCII Cats finetuning, LoRA rank and alpha, Decoding methods for ASCII art, Beam search vs top-p/top-k, Custom decoding methods for 2D grids

Link mentioned: Can I Finetune an LLM with LoRA to Generate ASCII Cats?: LLMs are reaching impressive levels of reasoning, but why do they still struggle to create something as seemingly simple as ASCII art? Can you fine-tune an L...


Unsloth AI (Daniel Han) ▷ #research (291 messages🔥🔥):

Diffusion Effect with Unsloth, MoE training with Unsloth, Proximal Policy Optimization (PPO) for LLMs, Model Collapse, Continued Pretraining (CPT) vs Supervised Fine-Tuning (SFT)

Links mentioned:


Perplexity AI ▷ #general (839 messages🔥🔥🔥):

Perplexity Pro Subscriptions, Claude 3.7 Sonnet, Deepseek R1 for Reasoning, Grok AI for Perplexity, Comet browser integration

Links mentioned:


Perplexity AI ▷ #sharing (34 messages🔥):

Foldable iPhone, OpenAI Agent, AI Dubbing, AI Search Option, US Crypto Reserve

Links mentioned:


Perplexity AI ▷ #pplx-api (3 messages):

70b-online model, sonar model, API billing, Citations in API response


OpenAI ▷ #ai-discussions (546 messages🔥🔥🔥):

ChatGPT rate limits, Real-time GPS with AI, Manus computer control AGI, Sonnet 3.7 code quality issues, AI's effect on developer coding ability

Links mentioned:


OpenAI ▷ #gpt-4-discussions (50 messages🔥):

Manus AI agent, O1 limits on Plus, LLM API for code review, SimTheory O1 message cap, ChatGPT app folders


OpenAI ▷ #prompt-engineering (26 messages🔥):

Model Steerability, GPT Vision Limitations, Prompting for Image Puzzles, Human-in-the-Loop problem solving


OpenAI ▷ #api-discussions (26 messages🔥):

Model presumption and user intent, Request evaluation and discussion before project start, Solving image-based puzzles with language models, Vision/OCR limitations in language models, Prompt engineering for puzzle-solving


LM Studio ▷ #announcements (1 messages):

LM Studio v0.3.12, QwQ Template Bug, RAG Chunking Speed, MLX Models on exFAT

Link mentioned: LM Studio 0.3.12: Bug fixes and document chunking speed improvements for RAG


LM Studio ▷ #general (311 messages🔥🔥):

Open Source LLM for coding tasks on M2 Macbook, Qwen Coder vs Claude for code generation, Managing context length with LLMs, Draft models for faster token generation, Hardware considerations for LLM performance

Links mentioned:


LM Studio ▷ #hardware-discussion (298 messages🔥🔥):

9070 XT vs 7900 XTX, ROCm support on Windows, Vulkan Performance, AMD Driver Issues, GPU Memory and Bandwidth

Links mentioned:


aider (Paul Gauthier) ▷ #announcements (1 messages):

Aider v0.76.0, Thinking/Reasoning Models, LLM Notifications, Model Support, Tree-sitter Language Pack

Link mentioned: Release history: Release notes and stats on aider writing its own code.


aider (Paul Gauthier) ▷ #general (451 messages🔥🔥🔥):

AI21 Maestro, Copilot suspension, DeepSeek R2 release, X cyberattack, Refact AI

Links mentioned:


aider (Paul Gauthier) ▷ #questions-and-tips (123 messages🔥🔥):

aider with no api key, MCP agents integration, aider scripting, OpenRouter slowness, remove tokens in repo map context

Links mentioned:


aider (Paul Gauthier) ▷ #links (5 messages):

Effective Commit Messages, Manus AI, Aider NotebookLM Integration

Links mentioned:


Nous Research AI ▷ #general (529 messages🔥🔥🔥):

Fine-tuning models with reward models, Tool use accessibility, Anthropic's marketing, AGI as a meaningful concept, Graph system on TinyStories dataset

Links mentioned:

  Adding memory to LLMs with Letta &middot; Terse Systems

: no description foundLoRA Learns Less and Forgets Less: Low-Rank Adaptation (LoRA) is a widely-used parameter-efficient finetuning method for large language models. LoRA saves memory by training only low rank perturbations to selected weight matrices. In t...

  LLM Complexity and Pricing &middot; Terse Systems

: no description foundAnthropic’s Recommendations to OSTP for the U.S. AI Action Plan : Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.Tutorial: How to Run QwQ-32B effectively | Unsloth Documentation: How to run QwQ-32B effectively with our bug fixes and without endless generations + GGUFs.General Reasoning: Making state-of-the-art reasoning more accessible to everyone.Tweet from AK (@_akhaliq): PokéChampan Expert-level Minimax Language AgentPokéChamp outperforms all existing LLM-based (76%) and rule-based bots (84%) by an enormous margin, including winning consistently (64%) against prior hu...China Releases WORLD'S FIRST AUTONOMOUS AI Agent... Open Source | Manus: The latest AI News. Learn about LLMs, Gen AI and get ready for the rollout of AGI. Wes Roth covers the latest happenings in the world of OpenAI, Google, Anth...Manus is out of control: The latest AI News. Learn about LLMs, Gen AI and get ready for the rollout of AGI. Wes Roth covers the latest happenings in the world of OpenAI, Google, Anth...Tweet from jian (@jianxliao): So... I just simply asked Manus to give me the files at "/opt/.manus/", and it just gave it to me, their sandbox runtime code... > it's claude sonnet > it's claude sonnet with 2...GeneralReasoning/GeneralThought-195K · Datasets at Hugging Face: no description foundScaling RL: 3B AI w Long Chain-of-Thought & 4 Patterns: In summary, these two new AI research studies (see below), while differing in experimental setups and focus areas, collectively offer a comprehensive roadmap...Manus tools and prompts: Manus tools and prompts. GitHub Gist: instantly share code, notes, and snippets.MasterControlAIML/R1-Reasoning-Unstructured-To-Structured · Datasets at Hugging Face: no description foundsimplescaling/s1K-1.1 · Datasets at Hugging Face: no description foundChina’s AI agent Manus gains traction amid growing demand for autonomous AI · TechNode: On March 6, China’s AI agent Manus trended on Chinese social media platform Weibo. According to its team, Manus is an autonomous AI agent designed to


Nous Research AI ▷ #ask-about-llms (11 messages🔥):

Vibe coding benchmark, LLM creativity, Sonnet's training meta-objective, Claude code inspecting images


Nous Research AI ▷ #research-papers (1 messages):

teknium: https://x.com/ksshumab_/status/1897560985315238046?s=46


Nous Research AI ▷ #interesting-links (1 messages):

rikufps: https://arena.hume.ai/


Nous Research AI ▷ #research-papers (1 messages):

teknium: https://x.com/ksshumab_/status/1897560985315238046?s=46


Nomic.ai (GPT4All) ▷ #general (518 messages🔥🔥🔥):

registry tweaking, Memory usage by programs, Quantization process, LocalDocs issues, Speech recognition and AI integration

Links mentioned:


HuggingFace ▷ #general (221 messages🔥🔥):

Implementing Research Papers, Hugging Face Pro Subscription, AI Security Research, Automating Data Creation with HF and MCP, Video Model Comparison: WAN, HUN, LTX

Links mentioned:


HuggingFace ▷ #today-im-learning (3 messages):

smolagents, PokemonLLMAgentBenchmark, Agent Course Study Focus

Link mentioned: GitHub - CalebDeLeeuwMisfits/PokemonLLMAgentBenchmark: Contribute to CalebDeLeeuwMisfits/PokemonLLMAgentBenchmark development by creating an account on GitHub.


HuggingFace ▷ #cool-finds (4 messages):

fxtwitter obsolescence, HighlightAI

Links mentioned:


HuggingFace ▷ #i-made-this (19 messages🔥):

Llama-3.2-3B-Instruct Distillation, Differential Privacy Blogpost, AI Neovim config, Qwen_QwQ-32B-GGUF_QX_k_f32 weights, Automated web app testing

Links mentioned:


HuggingFace ▷ #reading-group (1 messages):

chad_in_the_house: very cool! I esp like how you can somewhat prevent the distortion


HuggingFace ▷ #computer-vision (2 messages):

OCR Guidance Needed, Blendshapes Blogpost

Link mentioned: Blendshapes: a facial expressions representation: In computer vision and computer graphics, there are many methods to represent the face, such as Landmark vectors, Action units, valence and…


HuggingFace ▷ #NLP (8 messages🔥):

Hermes Function Calling Dataset, Gemma 2B Precision, Serverless API Input Conversion, LoRA Adapter with BitsAndBytes Error

Link mentioned: NousResearch/hermes-function-calling-v1 · Datasets at Hugging Face: no description found


HuggingFace ▷ #smol-course (9 messages🔥):

MCP module update, PokemonLLMAgentBenchmark, HuggingFace Token issues, Chat Template Exercise, HuggingFaceInferenceAPIEmbedding issues

Links mentioned:


HuggingFace ▷ #agents-course (217 messages🔥🔥):

Course Progress Tracking, Hugging Face PRO Subscription, LM Studio vs Ollama, Steam Account Scams

Links mentioned:


HuggingFace ▷ #open-r1 (2 messages):

Reasoning Datasets, Open Thought Dataset, ServiceNow-AI/R1-Distill-SFT

Link mentioned: Reasoning Datasets - a philschmid Collection: no description found


Yannick Kilcher ▷ #general (256 messages🔥🔥):

LinkedIn Premium Referral Codes, AI and the Zero Marginal Cost Society, DeepSeek Security Concerns, Power-Softmax equation, ManusAI Feedback

Links mentioned:


Yannick Kilcher ▷ #paper-discussion (13 messages🔥):

Latent Reasoning, Context Compression, Physical Intelligence and Cognitive Biases Toward AI, scilent paper

Links mentioned:


Yannick Kilcher ▷ #agents (7 messages):

DeepSeek efficiency, ScholarAgent updates, Arxiv papers search

Links mentioned:


Yannick Kilcher ▷ #ml-news (42 messages🔥):

LLMs hallucinating, Multi-step agentic workflows, Language Diffusion, China AI Agent Manus, Stanford Regex Ozempic alternative

Links mentioned:


GPU MODE ▷ #general (19 messages🔥):

SOTA Agentic Methods, Metal Kernel Launch Overhead, Torch.compile for MPS, Karpathy's Video


GPU MODE ▷ #triton (26 messages🔥):

SVD Quantization Kernel, Triton Autotuning, Kernel Fusion, Dynamic Activation Quantization

Links mentioned:


GPU MODE ▷ #cuda (26 messages🔥):

Learning PTX for CUDA, Inline PTX for microbenchmarking, memcpy_async slowdown, Debugging CUDA kernels, FP8 WMMA optimization

Link mentioned: Controlling Data Movement to Boost Performance on the NVIDIA Ampere Architecture | NVIDIA Technical Blog: The NVIDIA Ampere architecture provides new mechanisms to control data movement within the GPU and CUDA 11.1 puts those controls into your hands. These mechanisms include asynchronously copying data&#...


GPU MODE ▷ #torch (29 messages🔥):

DDP communication customization, FSDP communication customization, SimpleFSDP framework, Muon optimizer details

Links mentioned:


GPU MODE ▷ #announcements (1 messages):

Triton, CUDA, Flash Attention, YouTube tutorials, Performance

Link mentioned: GPU MODE: A GPU reading group and community https://discord.gg/gpumodeSupplementary content here https://github.com/gpu-modeCreated by Mark Saroufim and Andreas Köpf


GPU MODE ▷ #algorithms (3 messages):

Double Binary Tree vs. Ring Topology in NCCL, AllReduce Implementation Comparison, NCCL 2.4 and Double Binary Trees

Link mentioned: Massively Scale Your Deep Learning Training with NCCL 2.4 | NVIDIA Technical Blog: Imagine using tens of thousands of GPUs to train your neural network. Using multiple GPUs to train neural networks has become quite common with all deep learning frameworks, providing optimized…


GPU MODE ▷ #cool-links (16 messages🔥):

WoolyAI CUDA abstraction layer, Muon optimizer, Alternative to GPUs

Links mentioned:


GPU MODE ▷ #beginner (25 messages🔥):

GPU Memory Sharing on Apple, Cuda Graphs in Triton Autotune, Resources to get started with GPU and TPU programming, nvmlDeviceGetCudaComputeCapability Tuple Return, Cerebras Language vs CUDA

Link mentioned: A Conceptual View — SDK Documentation (1.3.0): no description found


GPU MODE ▷ #pmpp-book (1 messages):

PMPP 4th Edition, CUDA C, Latex text


GPU MODE ▷ #irl-meetup (6 messages):

GPU mode capacity increase at GTC, GTC and Game Developer's Conference, Semi-Analysis Hackathon team member search, CUTLASS kernels for GEMM or FMHA prefill/decoding


GPU MODE ▷ #rocm (3 messages):

AMD GPU, HIP Code Compilation, Runpod, MI300


GPU MODE ▷ #tilelang (9 messages🔥):

Kernel Compilation, Mixed Precision GEMM, TileLang GEMM Example

Links mentioned:


GPU MODE ▷ #metal (8 messages🔥):

Metal Parallel Reduction Kernels, Metal Shading Language, Metal-cpp, Swift, Objective-C


GPU MODE ▷ #self-promotion (19 messages🔥):

Cute Kernels, Triton vs CUTLASS, FA3's GEMM, LLVM compiler efficiency, GB200 access

Links mentioned:


GPU MODE ▷ #thunderkittens (2 messages):

LCF Concurrency, DDP+NCCL


GPU MODE ▷ #edge (1 messages):

FOSS CUDA developments, Open source platform for edgeai/TinyML, GPU "lab"


GPU MODE ▷ #reasoning-gym (44 messages🔥):

Reasoning Gym Curricula, Sonnet Context Expansion, Palindrome Partitioning Dataset, ACRE Dataset Integration, Reasoning Gym Goals

Links mentioned:


GPU MODE ▷ #gpu模式 (34 messages🔥):

Triton vs Cutlass, Zhihu registration, TileLang vs TVM, TileLang usage, CUDA optimization

Links mentioned:


GPU MODE ▷ #submissions (2 messages):

vectoradd benchmark, Modal runners success, GPU benchmarks


GPU MODE ▷ #ppc (2 messages):

AVX-256 Optimization, AVX-512 Optimization, Tiling, OpenMP


Latent Space ▷ #ai-general-chat (102 messages🔥🔥):

Minion.ai dead, Gemini Embedding Model, Muse AI Model, Manus AI Agent, RWKV7-G1 GooseOne

Links mentioned:


Latent Space ▷ #ai-announcements (13 messages🔥):

Model Context Protocol (MCP), AI Engineer Summit, SLOP Movement, Anthropic's Developer AI Brand

Links mentioned:


Latent Space ▷ #ai-in-action-club (132 messages🔥🔥):

web3 agents, HFT, creating their own cults, ElizaOS, AI persona

Links mentioned:


Notebook LM ▷ #use-cases (20 messages🔥):

NLM + Wondershare Podcast Creation, Data Encryption on Google Drive, Podcast Audio Language Change, Audio overview stammering, Ben Settle on Copywriting and Sales

Link mentioned: NotebookLM Podcasts - The Most Insane Content Creation Method Ever!: 🔥 LIMITED TIME: 50% OFF Wondercraft!Use this link and coupon code "MRC" https://mrc.fm/wondercraftIn this video, I walk you through a simple process to crea...


Notebook LM ▷ #general (220 messages🔥🔥):

Chrome extensions for uploading URLs, NotebookLM Android app, Automating document uploads, NotebookLM 'system unable to answer' errors, Source disappearing

Links mentioned:


Interconnects (Nathan Lambert) ▷ #news (41 messages🔥):

Microsoft's MAI Models, Reflection AI Launch, AMD MI300X Boxes, nGPT Implementation, Sutskever's New AI Venture

Links mentioned:


Interconnects (Nathan Lambert) ▷ #ml-questions (1 messages):

SOTA benchmark for bias, BBQ considerations


Interconnects (Nathan Lambert) ▷ #ml-drama (1 messages):

420gunna: https://x.com/sophiamyang/status/1897683402259591372


Interconnects (Nathan Lambert) ▷ #random (109 messages🔥🔥):

Claude Merch, AI-novelty-cake, Scale AI new CEO, Claude Pokemon Suicide, lmarena.ai super alpha

Links mentioned:


Interconnects (Nathan Lambert) ▷ #memes (8 messages🔥):

Vibe Coding, Claude Asshole, GPT Accuracy

Links mentioned:


Interconnects (Nathan Lambert) ▷ #rl (16 messages🔥):

SFT best practices, RLHF Book, Multi-turn prompts for coding

Link mentioned: Instruction Finetuning | RLHF Book by Nathan Lambert: The Reinforcement Learning from Human Feedback Book


Interconnects (Nathan Lambert) ▷ #reads (23 messages🔥):

Character training report, FrontierMath benchmark, In-Context RL, R1-Omni multimodal emotion recognition, Chain of Thought Monitoring

Links mentioned:


Interconnects (Nathan Lambert) ▷ #posts (9 messages🔥):

Metaphors on Twitter, Interp Data, SnailBot News


Interconnects (Nathan Lambert) ▷ #expensive-queries (7 messages):

GPT Architecture Variations, Surface-Level Summaries


Modular (Mojo 🔥) ▷ #general (199 messages🔥🔥):

Mojo Performance, Python Dynamicism, Compile-Time Correctness, Heterogeneous Compute, Mojo and MAX Relationship

Links mentioned:


Modular (Mojo 🔥) ▷ #mojo (9 messages🔥):

mojograd bigram model, Python standard library modules in Mojo, InlineArray usage in Mojo, Mojo formatting with fmt directives, Executing shell commands in Mojo

Links mentioned:


Modular (Mojo 🔥) ▷ #max (4 messages):

Max Serve Documentation, Autoscaling GPU Instances, Serving Multiple Models, GPU Utilization Metrics, Kubernetes Autoscaling


MCP (Glama) ▷ #general (157 messages🔥🔥):

MCP security concerns, Github Copilot support for MCP, Using MCP for Trading, Goose AI and MCP, RAG vs MCP

Links mentioned:


MCP (Glama) ▷ #showcase (45 messages🔥):

Typescript fetch server, Mastra file organization agent, Searxng MCP server, WebMCP tool exposure, GraphQL MCP server

Links mentioned:


Eleuther ▷ #general (88 messages🔥🔥):

Open Source AI Contribution, GPT-NeoX, ARIB subtitles on Transport Streams, Community driven organization building, Muon paper

Links mentioned:


Eleuther ▷ #research (41 messages🔥):

Token Assorted's latent codes, TorchTitan Embedding Sharding, Interpretabilty/Alignment Research Advice, NVLS vs TMA on H100, Lossless Compression

Links mentioned:


Eleuther ▷ #interpretability-general (36 messages🔥):

logit lens, emergent misalignment, open reproductions, model capabilities, activation patching

Links mentioned:


Torchtune ▷ #general (6 messages):

Research Paper Ideas, New SOTA BERT Model, MTEB Leaderboard Progress

Link mentioned: MTEB Leaderboard - a Hugging Face Space by mteb: no description found


Torchtune ▷ #dev (78 messages🔥🔥):

Audio modality in torchtune, GRPO recipe and LoRA, Memory issues on mac with mps, bitsandbytes on macOS, MPS support for Torchtune

Links mentioned:


Codeium (Windsurf) ▷ #discussion (84 messages🔥🔥):

IDE Telemetry and Codeium, Payment issues and account status, VS Code Extension Problems, JetBrains Plugin Context Retrieval Issues, VS Code Mobile on Android

Links mentioned:


LlamaIndex ▷ #blog (4 messages):

yFiles SDK, AnthropicAI cookbook, Task-Specific Agents, Multilingual Multimodal RAG system


LlamaIndex ▷ #general (70 messages🔥🔥):

SQLTableRetrieverQueryEngine prompt, Jina AI Package Install, LlamaExtract Beta Request, Reasoning model tool calling, Document Classification before Extraction

Links mentioned:


LlamaIndex ▷ #ai-discussion (1 messages):

AGiXT, AI Automation, Open Source AI

Link mentioned: GitHub - Josh-XT/AGiXT: AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.: AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features...


Cohere ▷ #「💬」general (51 messages🔥):

command R7B inference speed, Ollama langchain tool invocation errors, open-source AI projects, GPT-4o Arabic use cases, on-prem deployment costs

Link mentioned: Reddit - Dive into anything: no description found


Cohere ▷ #「🔌」api-discussions (9 messages🔥):

504 Gateway Errors, Multi-Modal Embeddings Availability, API Limit Issues, Rust Requirement for Cohere API


Cohere ▷ #「🤖」bot-cmd (5 messages):

Bot Response Problem


Cohere ▷ #「💡」projects (1 messages):

Knowledge Graphs, TogetherAI, Topic Modeling


Cohere ▷ #「🤝」introductions (4 messages):

Applied ML guidance with Cohere, Human Neural System as logic gates, Emotionally Intelligent AI


DSPy ▷ #general (37 messages🔥):

DSPy's batch function, MCP vs SLOP for agent communication, Error handling in DSPy's Refine module, Max token limit and error handling in LLM clients

Links mentioned:


tinygrad (George Hotz) ▷ #general (26 messages🔥):

tinygrad JIT time, Suspicious GPU listing, AMDGPU running hot, Why OpenCL failed, define_acc refactor

Link mentioned: Modular: Democratizing AI Compute, Part 5: What about CUDA C++ alternatives?: no description found


tinygrad (George Hotz) ▷ #learn-tinygrad (8 messages🔥):

NaN loss debugging, WebGPU long/ulong issue, TestLinearizerFailures bounty, Skipped tests in Python Backend CI, Optimizing big indexing

Link mentioned: tinygrad/tinygrad/device.py at master · tinygrad/tinygrad: You like pytorch? You like micrograd? You love tinygrad! ❤️ - tinygrad/tinygrad


AI21 Labs (Jamba) ▷ #jamba (8 messages🔥):

Jamba Workspace, Jamba conversational RAG, Jamba Mini Pricing, AI21 Maestro, Jamba multimodality

Link mentioned: Pricing: Our usage-based pricing helps reduce unnecessary spend. Find the right solution for your business needs at a cost-effective price point.


AI21 Labs (Jamba) ▷ #general-chat (9 messages🔥):

Jamba 1.6, AI21 Studio, Mamba1 optimizations, Batch API Solution

Link mentioned: AI21’s Jamba 1.6: The Best Open Model for Private Enterprise Deployment: AI21’s Jamba 1.6 outperforms models from Mistral, Meta, and Cohere to offer enterprises the best model for private LLM deployment at scale.


LLM Agents (Berkeley MOOC) ▷ #mooc-announcements (1 messages):

Multimodal Autonomous AI Agents, VisualWebArena, Internet-scale web-agent training, Ruslan Salakhutdinov

Link mentioned: CS 194/294-280 (Advanced LLM Agents) - Lecture 6, Ruslan Salakhutdinov: Questions: bli.do/rus-sal6


LLM Agents (Berkeley MOOC) ▷ #mooc-questions (8 messages🔥):

Research-track Availability, Quiz Retakes, Curriculum Release, Completion Certificates


LLM Agents (Berkeley MOOC) ▷ #mooc-lecture-discussion (4 messages):

Research Track Invites, Log Likelihood in Reinforcement Learning


MLOps @Chipro ▷ #events (1 messages):

AI4Legislation Competition, Civic Tech Entrepreneurs, SVCAF, AI-powered civic engagement


Gorilla LLM (Berkeley Function Calling) ▷ #discussion (1 messages):

Diffusion LLMs, Transformer-based models, LLaDA, Large Language Diffusion Models, autoregressive Transformers

Link mentioned: Diffusion LLMs - Revolutionary Language Model Architecture | LLaDA Research Hub: Discover how Diffusion LLMs are revolutionizing AI with parallel processing and advanced error correction. Learn about LLaDA architecture and stay updated with cutting-edge research.

{% else %}

The full channel by channel breakdowns have been truncated for email.

If you want the full breakdown, please visit the web version of this email: [{{ email.subject }}]({{ email_url }})!

If you enjoyed AInews, please share with a friend! Thanks in advance!

{% endif %}