Frozen AI News archive

Zuck goes Superintelligence Founder Mode: $100M bonuses + $100M+ salaries + NFDG Buyout?

**Meta AI** is reportedly offering **8-9 figure signing bonuses and salaries** to top AI talent, confirmed by **Sam Altman**. They are also targeting key figures like **Nat** and **Dan** from the AI Grant fund for strategic hires. **Essential AI** released the massive **24-trillion-token Essential-Web v1.0 dataset** with rich metadata and a 12-category taxonomy. **DeepLearning.AI** and **Meta AI** launched a course on **Llama 4**, featuring new MoE models **Maverick (400B)** and **Scout (109B)** with context windows up to **10M tokens**. **MiniMax** open-sourced **MiniMax-M1**, a long-context LLM with a 1M-token window, and introduced the **Hailuo 02** video model. **OpenAI** rolled out "Record mode" for **ChatGPT Pro, Enterprise, and Edu** on macOS. **Arcee** launched the **AFM-4.5B** foundation model for enterprise. **Midjourney** released its **V1 video model** enabling image animation. These developments highlight major advances in model scale, long-context reasoning, multimodality, and enterprise AI applications.

Canonical issue URL

Top AI Talent is all you need?

AI News for 6/17/2025-6/18/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (220 channels, and 6175 messages) for you. Estimated reading time saved (at 200wpm): 633 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

We try to keep things technical here and thought we were done after the Scale-Meta execuhire, but some stories are so extremely compelling that we just have to higlight them again to properly weight their importance in the year-end recap we will do.

Rumors of the 8-9 figure offers from Meta were circulating, and to some extent justified, but they cannot get more confirmed than Sam Altman saying it on a podcast with his brother: (coincidentally everyone and their dog is starting a podcast today - Stripe, OpenAI, Jack) - and you can watch all of them at once if you are sufficiently cracked) that makes it clear - it's not just confirmed, it's BOTH SIGNING BONUS -AND- SALARIES:

But Zuck is NOT STOPPING THERE. Later today The Information also broke that they were looking to hire Nat and Dan, of the famous NFDG aka AI Grant fund:

Considering that Dan is already CEO of SSI and Nat is doing some stuff with papyrus, they're unlikely to report to Alexandr and you cannot buy them off with just a paltry $100m. However the broader strategy makes sense if you look at the AIGrant portfolio that they have assembled.... potentially for Zuck to do something interesting with.


AI Twitter Recap

Model & Data Releases

AI Techniques & Research

Tooling, Frameworks & Infrastructure

Industry News & Company Strategy

Broader Implications & Commentary

Humor/Memes


AI Reddit Recap

/r/LocalLlama Recap

1. Google Gemini 2.5 Flash Price Increase Discussion

2. AI Model Visual Reasoning Challenges

3. Movie Meme Adaptations

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. Meta's Aggressive Recruitment of OpenAI Researchers

2. OpenAI Model Progress, Releases, and Perception

3. Philosophical and Societal Concerns Around AI's Impact


AI Discord Recap

A summary of Summaries of Summaries by Gemini 2.0 Flash Thinking

Theme 1. Model Performance and Benchmarks Spur Heated Debate

Theme 2. Training, Optimization, and Dataset Purity

Theme 3. AI Agent Development and the MCP Ecosystem

Theme 4. Hardware, Infrastructure, and Low-Level Optimization

Theme 5. Developer Tools, Platforms, and Ecosystem Evolution


Discord: High level Discord summaries

Cursor Community Discord


OpenAI Discord


Perplexity AI Discord


OpenRouter (Alex Atallah) Discord


LMArena Discord


Unsloth AI (Daniel Han) Discord


Eleuther Discord


LM Studio Discord


Nous Research AI Discord


HuggingFace Discord


aider (Paul Gauthier) Discord


Latent Space Discord


Modular (Mojo 🔥) Discord


Manus.im Discord Discord


LlamaIndex Discord


GPU MODE Discord


Cohere Discord


Torchtune Discord


Notebook LM Discord


Yannick Kilcher Discord


MCP (Glama) Discord


tinygrad (George Hotz) Discord


Nomic.ai (GPT4All) Discord


DSPy Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The MLOps @Chipro Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Cursor Community ▷ #general (971 messages🔥🔥🔥):

Cursor Rate Limits, Sonnet 4 Performance, Cursor Pricing Model, Context7 MCP, @Docs indexing feature


Cursor Community ▷ #background-agents (25 messages🔥):

Background Agents, IDE updates, Slack Integration, Code Storage Encryption, Version Control


OpenAI ▷ #annnouncements (3 messages):

OpenAI Podcast, Sam Altman, OpenAI to Z Challenge, Misalignment generalization


OpenAI ▷ #ai-discussions (808 messages🔥🔥🔥):

AI's Psychology, Text vs Voice Trust in AI, AI psychosis, Midjourney vs Veo, Free AI Art Generators


OpenAI ▷ #gpt-4-discussions (14 messages🔥):

File Tools vs. Vector Store, Grok missing from ChatGPT, Q learning drift or vector alignment, Temporary chat feature, Alternative platforms Gemini, Grok and Claude


OpenAI ▷ #prompt-engineering (224 messages🔥🔥):

Base44 implementation, GPTs, agent systems, multi-agent deliberation, Voltarre


OpenAI ▷ #api-discussions (224 messages🔥🔥):

Base44 context loss, recursive linguistics, agentic debate orchestration


Perplexity AI ▷ #general (730 messages🔥🔥🔥):

O3 Pro, Gemini 2.5 Pro, GPTs agents, Hallucination, Perplexity Labs


Perplexity AI ▷ #sharing (5 messages):

Food Network Chef, Humor Country, Random Subreddit, DreamOS Manifest


Perplexity AI ▷ #pplx-api (18 messages🔥):

Empty search results, Domain restriction, Location filter, Reasoning model


OpenRouter (Alex Atallah) ▷ #announcements (1 messages):

MiniMax M1, Gemini 2.5 Pro, Flash, and Flash Lite, Reasoning by default


OpenRouter (Alex Atallah) ▷ #general (316 messages🔥🔥):

Gemini 2.5 Pro, Key Credit Balances, AI Discord Bot Template, Minimax Models


LMArena ▷ #general (309 messages🔥🔥):

World knowledge in models, Graph AI, LM Arena issues, Gemini 2.5 Flash vs Claude Sonnet 3.7, Leaking system prompts


Unsloth AI (Daniel Han) ▷ #general (193 messages🔥🔥):

Magistral vision support, Optimizer per-group learn rates, Unsloth dual GPU support, Qwen 0.6b training speed, vLLM quants


Unsloth AI (Daniel Han) ▷ #off-topic (17 messages🔥):

BERT notebook issues, Colab compatibility, No-code AI app development


Unsloth AI (Daniel Han) ▷ #help (58 messages🔥🔥):

Troubleshooting Installation Errors, Gemma support in Unsloth, Orpheus TTS Model, Qwen2.5-VL image/text mismatch error, Llama3 local model path error


Unsloth AI (Daniel Han) ▷ #research (5 messages):

New Arxiv Papers, Gemini v2.5


Eleuther ▷ #general (82 messages🔥🔥):

AI voice cloning scams, EleutherAI name change, Gemini Diffusion, Thue Morse sequence


Eleuther ▷ #research (170 messages🔥🔥):

Spline Theory, Positional Encoding Ablations, Linear Attention vs Cosine Similarity, LLM Image Generation with RL, Byte Tokenization


Eleuther ▷ #lm-thunderdome (4 messages):

lm_eval steer_path, lm_eval multiple runs


LM Studio ▷ #general (146 messages🔥🔥):

Tool Calling in LM Studio, Open-Source Models Stagnation, LM Studio Feature Requests, MCP Server Connectivity, Quantized DeepSeek Models


LM Studio ▷ #hardware-discussion (57 messages🔥🔥):

DDR5 vs DDR6, NVLink Speed Delta, KV Cache Offload


Nous Research AI ▷ #general (107 messages🔥🔥):

CGS-GAN latent space, Google Gemini Canvas, LLMs compress information, Gemini is multimodal, Gemini Generated Native Images


Nous Research AI ▷ #ask-about-llms (2 messages):

Training LLMs, Evil Behavior Fine-Tuning, Response Filtering, Model Comparison


Nous Research AI ▷ #research-papers (8 messages🔥):

Meta Research, Llama team, Zuck Merge, world agent


Nous Research AI ▷ #interesting-links (1 messages):

Bigger Brains, Brain Implants


Nous Research AI ▷ #research-papers (8 messages🔥):

Meta Papers, Llama Team, Zuckerberg's vision, Generalist world agent


HuggingFace ▷ #announcements (1 messages):

Gradio MCP hackathon, SmolVLA Model, HF MCP server, HF Sheets, Google Colab x HF integration


HuggingFace ▷ #general (81 messages🔥🔥):

OS Agent, Codex Competition, Fine-tuning Mistral 7B, MLOps for Internships, Hugging Face Chat UI


HuggingFace ▷ #today-im-learning (6 messages):

HF AI Agents Course, Zig-Zag Ring Attention


HuggingFace ▷ #i-made-this (3 messages):

Chromium extension for speaking to readmes, Synthetic Data Generation for LLM Safeguards, memX: Shared memory layer for multi-agent LLM systems


HuggingFace ▷ #computer-vision (3 messages):

VSR Datasets, Attention Maps Visualization


HuggingFace ▷ #NLP (1 messages):

LLM Project Architecture, RAG Project Architecture, LLM API Design


HuggingFace ▷ #gradio-announcements (5 messages):

Gradio Agents & MCP Hackathon Winners, Modal Labs sponsors award, MCP Server Track, Custom Component Track, Agentic App Track


HuggingFace ▷ #agents-course (6 messages):

Ollama explanation, Final Assessment Template


aider (Paul Gauthier) ▷ #general (83 messages🔥🔥):

Gemini 2.5, Claude pricing, Aider and Gemini 2.5-pro, Open3 settings



  

---


### **aider (Paul Gauthier) ▷ #[questions-and-tips](https://discord.com/channels/1131200896827654144/1133060505792159755/1384614248558231662)** (9 messages🔥): 

> `Aider --restore-chat-history, Openrouter w/ Aider, Gemini copy-paste mode, Aider /read-only` 


- **Aider's history gets restored**: A user asked if there's a way to continue the previous session after a crash, and another member suggested using the **--restore-chat-history** flag.
   - A user hallucinated: *Sorry I hallunicated: --restore-chat-history*
- **Gemini enters copy-paste**: Someone wants to try **Gemini** in copy-paste mode, asking what the best quality editor model to use.
   - No recommendations were given.
- **Aider's /read-only prevents modification**: A user asked if **/read-only** is supposed to prevent a file from being modified in Aider.
   - A member confirms it should prevent modification.
- **Openrouter gets Budget 0 Error**: A member asks about getting a *Budget 0 is invalid* error from **OpenRouter** with Aider.
   - They clarify *this model only works in thinking* but no solution was provided.


  

---


### **Latent Space ▷ #[ai-general-chat](https://discord.com/channels/822583790773862470/1075282825051385876/1384612594052235456)** (79 messages🔥🔥): 

> `Midjourney Video Model V1, Krea AI, OpenHands CLI, Essential AI, AI and Proprietary Data` 


- **Midjourney Launches Video Model V1**: [Midjourney](https://xcancel.com/midjourney/status/1935377193733079452) has released **Version 1 of its Video Model**, allowing users to animate Midjourney-generated or external images at approximately **one image cost per second of video**.
   - The new **'Image-to-Video' feature** offers 'automatic' and 'manual' animation settings with 'high motion' and 'low motion' options, though video generation is web-only at launch.
- **KREA AI Releases Krea 1 Public Beta**: **KREA AI** has released **Krea 1** in public beta, aiming to provide superior aesthetic control and image quality, generating detailed textures, dramatic angles, and cinematic lighting, and moving away from the typical 'AI look'.
   - The new model supports various styles, style references, and custom trainings, and is available for free at [krea.ai/krea-1](https://xcancel.com/krea_ai/status/1934981993722466454?s=46).
- **OpenHands CLI Launched for Coding Agents**: **All Hands AI** introduces the **OpenHands CLI**, a new command-line interface for their coding agent offering top accuracy and simplifying installation by removing the need for Docker.
   - The CLI maintains the same accuracy as its previous Docker-based version, though without the web browser component, and offers slash commands and a command confirmation mode.
- **Essential AI Releases Massive 24T Token Dataset**: **Essential AI** has announced **Essential-Web v1.0**, a **24-trillion-token** pre-training dataset with detailed metadata designed for creating high-performing datasets.
   - Evaluations using domain-specific subsets of **Essential-Web v1.0** demonstrate improved performance in various fields like web code, STEM, and medical applications.
- **CoreWeave and Weights & Biases Launch New AI Inference Services**: **CoreWeave** and **Weights & Biases** are launching new AI inference services and Online Evaluation tools (monitors) for real-time LLM judgment.
   - These services, running on **CoreWeave GPUs**, include an inference endpoint for models like DeepSeek R1-0528 and LLama-4 Scout with OAI Compatible APIs, aiming to offer more competition and flexibility in the AI infrastructure space.


  

---


### **Latent Space ▷ #[ai-announcements](https://discord.com/channels/822583790773862470/1075282504648511499/1384675582884708402)** (5 messages): 

> `Andrej Karpathy's AI Talk, Software 3.0, LLM analogies, LLM Psychology, Partial Autonomy` 


- ****Latent Space** Reconstructs **Karpathy's** AI Talk**: **Latent Space** released a reconstructed version of **Andrej Karpathy's AI talk** with compiled slides and notes given the rapid value decrease of AI discussions, covering topics like **Software 3.0** and **LLM analogies**.
   - The content also touches on **LLM Psychology**, **Partial Autonomy**, **Vibe Coding**, and building for agents, with full slides available to **Latent Space** subscribers, linked [here](https://www.donnamagi.com/articles/karpathy-yc-talk).
- ****Karpathy's** Talk Covers Key Areas**: The talk encompasses areas such as **Software 3.0**, **LLM analogies** (Utilities, Fabs, OSes), **LLM Psychology**, and **Partial Autonomy** (including Human-AI Generation-Verification Loop).
   - It further explores **Vibe Coding** and building for agents, providing a comprehensive overview of current AI development concepts.


  

---


### **Modular (Mojo 🔥) ▷ #[general](https://discord.com/channels/1087530497313357884/1098713601386233997/1384818599302135808)** (4 messages): 

> `IR remapper in Mojo, Mojo bare metal kernel, Modular GitHub issue 4854, Mojo shirt without X` 


- **Mojo remapper and OS kernel arise**: One member plans to build an **IR remapper** and write an **OS bare metal kernel** in **Mojo**.
   - They linked to [Modular GitHub issue 4854](https://github.com/modular/modular/issues/4854) and exclaimed *"God help me persist. Mojo is lit"*.
- **Mojo merch and X requirment**: A member asked if there was any way to get a **Mojo shirt** without posting on **X**.
   - They mentioned sharing on **LinkedIn** instead and appreciating the **comic**.


  

---


### **Modular (Mojo 🔥) ▷ #[announcements](https://discord.com/channels/1087530497313357884/1098765954302873621/1384930608400171119)** (1 messages): 

> `AMD Partnership, GPU Support, Model Support, Open Source Code, Modular Hack Weekend` 


- **Modular Achieves GPU Agnostic Nirvana**: Modular Platform **25.4** introduces the ability to run the same code on both **AMD** and **NVIDIA** GPUs with zero code changes, marking the beginning of a partnership with **AMD**.
   - This release supports **AMD Instinct™ MI300X** and **MI325X GPUs**, enabling deployment across different hardware without vendor lock-in or extra configuration.
- **25.4 Turbocharges LLM Workloads**: Version **25.4** delivers up to **53%** better throughput on prefill-heavy BF16 workloads across state-of-the-art language models like **Llama 3.1**, **Gemma 3**, and **Mistral**.
   - This version expands hardware support to include **AMD MI300/325**, **NVIDIA Blackwell**, **RTX 40xx**, and **RDNA3**, broadening the platform's compatibility.
- **Modular Opens Kernel Code Vault**: Modular has open-sourced over **450k lines** of production-grade **Mojo** kernel code, enhancing transparency and community contribution.
   - The release also includes improved documentation, **PyTorch** ops tutorials, and kernel performance tools, fostering easier adoption and development.
- **Modular Hosts Hack Weekend with GPU Giveaways**: Modular is hosting a **Hack Weekend** on **June 27th** featuring a GPU Programming Workshop and a stacked GPU prize pool, inviting developers to participate virtually or in person via this [link](https://lu.ma/modular-gpu-workshop).
   - You can get the full story on the [Modular blog](https://www.modular.com/blog/modular-25-4-one-container-amd-and-nvidia-gpus-no-lock-in?utm_source=discord&utm_campaign=25_4).


  

---


### **Modular (Mojo 🔥) ▷ #[mojo](https://discord.com/channels/1087530497313357884/1151418092052815884/1384688130149584939)** (72 messages🔥🔥): 

> `Blackwell accelerators, Mojo bare metal, Kernel Development with Mojo, Mojo Async, Mojo Parametric Traits` 


- ****Blackwell Buyers Bummed by Buggy Business****: Early adopters of **Blackwell accelerators** are finding limited support and buggy drivers, leading to an unreliable and painful experience, despite [MAX supporting Blackwell](https://www.modular.com/max).
   - One user pointed out that **AMD** and **Intel** are launching GPUs with the same or more VRAM at half the price and even the cheaper **4090** might be a better choice.
- ****Mojo Goes Bare Metal, Boots Runtime Dependencies****: Members are excited that **Mojo** can now run bare metal with no system calls or runtime dependencies, illustrated by an [attached image](https://cdn.discordapp.com/attachments/1151418092052815884/1384859713388417126/image.png?ex=68549f5d&is=68534ddd&hm=d1af4a129aa4c5bd2cd0ebd8a5db27931f7454475f043fdf816af81fad245892&).
   - After simple function replacements like `KGEN_EE_JIT_GlobalConstructor` and `KGEN_EE_JIT_GlobalDestructor`, it can be used as *a modern systems programming language with zero-overhead abstractions* for kernel development.
- ****Kernel Konundrums: Mojo's Memory Management Marvels****: Discussed using Mojo for kernel development, highlighting its memory management capabilities, where *the compiler* can be trusted *on efficient and safe memory management*.
   - A developer noted the desire for *a clear strategy with error propagation/handling* to make the language golden, suggesting that manual async via FFI is already feasible.
- ****Mojo's Roadmap Riddles: Async and Traits Take Top Tier****: A member shared the top priorities for Mojo's development, including **Parametric Traits**, a big rework of **async**, and **dyn Traits** for automated vtable handling.
   - It was also noted that error handling might move to checked exceptions in the future.
- ****MAXimum Blackwell Boost: Modular's Secret Support****: Despite not being widely advertised, **MAX** supports **Blackwell** systems like the **5090**, with ongoing performance work before an official announcement.
   - Early testers are encouraged to kick the tires and report issues.


  

---


### **Modular (Mojo 🔥) ▷ #[max](https://discord.com/channels/1087530497313357884/1212827597323509870/1384731160801841193)** (3 messages): 

> `Max Inference, Max and Python, Orchestration of Model Graphs` 


- **Max Inference only controllable via Python Extension?**: A user inquired whether **Max** is only controllable via **Python extension**, to start an inference session and feed it numpy data.
   - Another user noted that the Mojo APIs have been removed for now.
- **Python Primary Interface to Graph Compiler**: It was clarified that the primary interface to the **graph compiler** (what MAX models are built with) is currently through **Python**.
   - A post was shared about why Python for orchestration of model graphs: [Mojo MAX Bindings](https://forum.modular.com/t/mojo-max-bindings/1499/3), and it was stated that *Python makes it really easy to integrate into existing tokenizers, processing logic, etc.*


  

---


### **Manus.im Discord ▷ #[general](https://discord.com/channels/1348819876348825620/1349440650495398020/1384610890988392498)** (74 messages🔥🔥): 

> `Web page generation credit costs, Website with scraping Facebook, Gumtree and Ebay, Manus Rival MiniMax AI with Agent Mode, Manus Video Generation with VEO, Manus Credit Refunds for Errors` 


- **High Traffic Thwarts Web Page Generation, Drains Credits**: Users reported that generating a simple web page consumed significant credits due to errors, with one user resorting to manual file edits.
   - One user noted that this was happening *after noon* and suggested that there may be different traffic at different times of the day.
- **Facebook, Gumtree, Ebay Scraping Website Botched**: A user spent **5k credits** attempting to create a website that scrapes **Facebook, Gumtree**, and **eBay** for stolen bike listings, but the AI failed, delivering fake results.
   - The user received a **2.5k credit refund**, but noted that it was *a waste of time and credits*.
- **MiniMax AI launches Agent Mode, Challenges Manus**: Users discussed **MiniMax AI's** new agent mode, viewing it as competition for Manus.
   - Some expressed concerns about **MiniMax's** credit system and subscription model, while praising Manus, others believe *competition will only bring the users more options on which AI to choose*.
- **Call for Credit Refunds on AI Task Errors**: Users debated whether **Manus** should refund credits when it encounters errors and has to re-run a process, or only charge upon successful task completion.
   - One user compared it to paying for a *burger that has rotten meat a stale bun* while another said *AI is programmed/trained to do much more then one human could tell it do in his whole life*.
- **Manus Gains Power with Free YouTube Boost**: A user posted a [YouTube link](https://m.youtube.com/watch?v=5PuofaVqXNI) saying that **Manus** can *grow even more powerful. Free of charge*.


  

---


### **LlamaIndex ▷ #[blog](https://discord.com/channels/1059199217496772688/1187460979064324127/1384633968153858049)** (3 messages): 

> `Model Context Protocol, AG-UI integration with CopilotKit, MCP vs Vector Search` 


- **Block designs MCP servers to assist Claude**: Block's engineering team shares their systematic approach to creating **Model Context Protocol (MCP) servers** that integrate seamlessly with Claude and other AI systems, starting with a clear [design](https://t.co/0vJajYzrfJ).
   - Their design patterns help build better **AI assistants**.
- **AG-UI integration with CopilotKit launches**: LlamaIndex launched official support for **AG-UI**, making it incredibly simple to bring your agents from the backend directly into user-facing applications to [create agent-powered frontends with zero boilerplate](https://t.co/hzxBrXKyTv).
   - The AG-UI integration is with **CopilotKit**.
- **MCP doesn't kill the need for vector search**: The **MCP** protocol creates new possibilities for agents to connect directly to data sources, but preprocessing and indexing are still needed for unstructured data like [PDFs and PPTs](https://t.co/OpUegepTAF).
   - It's estimated that *90% of enterprise data lives in PDFs, PPTs* and other similar formats.


  

---


### **LlamaIndex ▷ #[general](https://discord.com/channels/1059199217496772688/1059201661417037995/1384889675646238812)** (70 messages🔥🔥): 

> `FastAPI streaming issues, Agent workflow with streaming, Metadata filtering in chat/query engines, Response synthesizer refinement, Anthropic's AI Agent Frameworks` 


- **FastAPI Streaming Plagued by Pauses**: A user encountered **20+ second delays** in streaming events to the frontend using FastAPI, even though the backend processing was complete, which was later solved by adding `if ev.delta: yield`.
   - Adding newlines when yielding `yield json.dumps({..}) + "\n\n"` was suggested to delimit chunks, but ultimately, ensuring only non-empty deltas are yielded `if ev.delta: yield` solved the issue.
- **Agent Tool Call Timeouts with FastAPI**: A user debugging an agent workflow that streams events experienced delays with tool calls, and was advised to use the following code:
   - ```python
async for ev in handler.stream_events():
  if isinstance(ev, AgentStream):
    if ev.delta:
      print(ev.delta, end="", flush=True)
    elif ev.tool_calls:  # optionally print tool calls as they stream
      print(ev.tool_calls)
  elif isintance(ev, ToolCallResult):  # optionally print the complete tool call output
    print(ev.tool_name, ev.tool_kwargs, ev.tool_output)
  elif isintance(ev, ToolCallResult):  # optionally print the complete tool call input
    print(ev.tool_name, ev.tool_kwargs)

GPU MODE ▷ #general (11 messages🔥):

Stable Diffusion Performance, Custom Kernels, GPU Hardware Counter Monitoring, Expert Tiles Streaming


GPU MODE ▷ #triton (3 messages):

Triton, Shared Memory, Register Spilling


GPU MODE ▷ #cuda (16 messages🔥):

CUDA segfaults, CUDA gdb, NVIDIA bug reports, CUDA containers vs. CCCL/Thrust, CUDA Runtime error checking


GPU MODE ▷ #torch (2 messages):

Intervene in inductor kernels, Customize generated kernel code


GPU MODE ▷ #cool-links (4 messages):

Embed Issues with arXiv Links, Arctic Long Sequence Training, GitHub Repo Discovery


GPU MODE ▷ #beginner (2 messages):

Shared Memory Buffers, Tensor Data Types


GPU MODE ▷ #rocm (3 messages):

AMD bf16 instruction, MFMA supports bfp16, Triton gemvs


GPU MODE ▷ #liger-kernel (1 messages):

t_cc: Thanks! I'll take a look


GPU MODE ▷ #self-promotion (1 messages):

Distributed Training Course, Accelerate Maintainer


GPU MODE ▷ #general (4 messages):

PMPP Leaderboard, GPU MODE Competitions


GPU MODE ▷ #factorio-learning-env (19 messages🔥):

FLE Progress Concerns, Pull Request Delays, Write Access Democratization, Factorio Source Code Integration, Factorio Client Removal


GPU MODE ▷ #cutlass (4 messages):

CUTLASS Examples, Applied for grant


GPU MODE ▷ #singularity-systems (1 messages):

dumbpandabear: this sounds really exciting! looking forward to help get this off the ground


Cohere ▷ #🧵-general-thread (17 messages🔥):

Coordinate plane drawing models, AI Research and Development Channel, EU GDPR compliance for Embed v4, Cohere 4 AI projects


Cohere ▷ #🔌-api-discussions (28 messages🔥):

bufio.Scanner error, command-r-08-2024 issues, Go SDK issue


Cohere ▷ #👋-introduce-yourself (2 messages):

Introductions, Volunteer opportunities


Torchtune ▷ #dev (29 messages🔥):

Muon, AdamW, Kron, Qwen, torchtune


Torchtune ▷ #papers (4 messages):

Muon Optimizer, Jaguar Muon, Modern Optimizers


Notebook LM ▷ #use-cases (2 messages):

K-12 educators, podcast from sources, NotebookLM API


Notebook LM ▷ #general (25 messages🔥):

Gemini App Daily Limit, NotebookLM Model, LaTex support, Audio Overviews, Podcast issues


Yannick Kilcher ▷ #general (5 messages):

Typo Analysis, Cognitive Offloading to AI, Flow Matching in Production


Yannick Kilcher ▷ #paper-discussion (11 messages🔥):

Predictive Coding, V-JEPA-2 Paper Discussion, PRECO GitHub Repo


Yannick Kilcher ▷ #ml-news (8 messages🔥):

Keen Tech RL, Carmack Goals, Open Sourcing, Cursor Tier


MCP (Glama) ▷ #general (17 messages🔥):

Custom Transport in FastMCP, Corporate MCP Client/Host Alternatives, GraphQL API Tool Generation, Multi-Agent Systems with MCP, MCP Server Credentials


MCP (Glama) ▷ #showcase (2 messages):

Text-to-GraphQL, GraphQL Schemas, Arize-ai


tinygrad (George Hotz) ▷ #general (4 messages):

Hackathon Timing, tinygrad maturity


tinygrad (George Hotz) ▷ #learn-tinygrad (5 messages):

TinyJit, ShapeTracker, Variable Usage


Nomic.ai (GPT4All) ▷ #general (9 messages🔥):

Discord Spam, Mr. Beast


DSPy ▷ #show-and-tell (2 messages):

MIPROV2 Optimization, Agent Implementation, Workflow Metrics


DSPy ▷ #general (5 messages):

DSPy and LangGraph, DSPy in agentic coding IDEs, Finetuning Llama