Frozen AI News archive

ChatGPT Codex, OpenAI's first cloud SWE agent

**OpenAI** launched **Codex**, a cloud-based software engineering agent powered by **codex-1** (an optimized version of **OpenAI o3**) available in research preview for Pro, Enterprise, and Team ChatGPT users, featuring parallel task execution like refactoring and bug fixing. The **Codex CLI** was enhanced with quick sign-in and a new low-latency model, **codex-mini**. **Gemma 3** is highlighted as the best open model runnable on a single GPU. **Runway** released the Gen-4 References API for style transfer in generation. **Salesforce** introduced **BLIP3-o**, a unified multimodal model family using diffusion transformers for CLIP image features. The **Qwen 2.5** models (1.5B and 3B versions) were integrated into the PocketPal app with various chat templates. **Marigold IID**, a new state-of-the-art open-source depth estimation model, was released.

Canonical issue URL

Fire-and-forget is all you need.

AI News for 5/16/2025-5/17/2025. We checked 9 subreddits, 449 Twitters and 29 Discords (214 channels, and 3392 messages) for you. Estimated reading time saved (at 200wpm): 298 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Lots of people will be covering the Codex launch today, so we will just leave you with the Latent Space writeup and podcast:

https://www.youtube.com/watch?v=LIHP4BqwSw0


AI Twitter Recap

AI Model Releases and Updates

Research and Papers

AI Tools and Platforms

AI Engineering and Development Practices

AI Safety and Governance

Events

Memes and Humor


AI Reddit Recap

/r/LocalLlama Recap

1. LLM-Integrated Operating Systems and Edge Devices

2. Recent LLM/AI Model and Platform Security, Policy, and Compliance News

3. New LLM Model and Feature Releases (Ollama & Falcon-E) and Industry Progress Discussions

Other AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo

1. OpenAI and Claude New Feature & Research Preview Discussions

2. Job Automation and AI's Impact on Careers

3. AI-Generated Personalized Images: Reddit Username & Identity


AI Discord Recap

A summary of Summaries of Summaries by gpt-4.1-2025-04-14

1. Codex and Coding Agent Rollouts

2. LLM Infrastructure: Hardware, VRAM, and Performance

3. Dataset Quality and Training Strategies

4. Multi-Agent and Protocol Infrastructure

5. Open Source Tools, SDKs, and Ethics


Discord: High level Discord summaries

Perplexity AI Discord


LMArena Discord


Unsloth AI (Daniel Han) Discord


OpenAI Discord


Yannick Kilcher Discord


Latent Space Discord


LM Studio Discord


OpenRouter (Alex Atallah) Discord


aider (Paul Gauthier) Discord


HuggingFace Discord


Eleuther Discord


MCP (Glama) Discord


Notebook LM Discord


GPU MODE Discord


Nous Research AI Discord


DSPy Discord


tinygrad (George Hotz) Discord


Manus.im Discord Discord


LlamaIndex Discord


Torchtune Discord


Modular (Mojo 🔥) Discord


Nomic.ai (GPT4All) Discord


MLOps @Chipro Discord


Cohere Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Codeium (Windsurf) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The Gorilla LLM (Berkeley Function Calling) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


The AI21 Labs (Jamba) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

Perplexity AI ▷ #general (1080 messages🔥🔥🔥):

Dia browser review, OpenAI and Google browser development, Fellou browser UI, Perplexity down issues, Comet ad platform


Perplexity AI ▷ #sharing (1 messages):

kenthreetimes: https://www.perplexity.ai/search/ed5e41fd-0bda-447f-b05b-6152393b5195


LMArena ▷ #general (350 messages🔥🔥):

Gemini 3.5, AlphaEvolve vs AlphaExplore, Google API fears, ChatGPT Plus vs Gemini Advanced, Gemini's image generator


Unsloth AI (Daniel Han) ▷ #general (125 messages🔥🔥):

Gemma3 vision finetuning on T4, VRAM requirements for LLaMA 3.2 90B, Batch inference with vllm vs unsloth, Unsloth tool-calling RFT examples, GRPO and LASSA for TTS


Unsloth AI (Daniel Han) ▷ #help (208 messages🔥🔥):

Unsloth w/ DeepSpeed or Megatron LM, Gemma 3 install error, Qwen2-VL training ValueError, TPU support, GRPO notebook error


Unsloth AI (Daniel Han) ▷ #showcase (1 messages):

theyruinedelise: Wait how did I just see this damn


OpenAI ▷ #annnouncements (2 messages):

Codex, ChatGPT Livestream


OpenAI ▷ #ai-discussions (94 messages🔥🔥):

AI business to AI customer matching, ChatGPT's image generation with specific characters, ChatGPT successful diagnosis, ASILAB scam or not, Grok 3.5


OpenAI ▷ #gpt-4-discussions (6 messages):

Hello 4.1 Mathematics, STEM Model Teaching


OpenAI ▷ #prompt-engineering (60 messages🔥🔥):

ProtoMind_001 launch, Structure-aware AI peer, HyperEnglish vs English 2.0, Loading custom instructions on the fly


OpenAI ▷ #api-discussions (60 messages🔥🔥):

ProtoMind_001 launch, Structure-aware AI peer, HyperEnglish vs English 2.0, Loading custom instructions, Python tool for modes


Yannick Kilcher ▷ #general (186 messages🔥🔥):

AlphaTensor, matrix multiplication, quantum computing, NAND gates, classifier guidance


Yannick Kilcher ▷ #ml-news (32 messages🔥):

Trusting Corporations vs. Governments, Huang's Strategic Decision, AI Leadership Issues, AI Productivity, Meta's AI Research vs. Toxic Product


Latent Space ▷ #ai-general-chat (57 messages🔥🔥):

Codex rollout, Freeplay.ai Feedback, O3 debugging capabilities, Codex architecture and skills, Codex live stream


Latent Space ▷ #ai-in-action-club (141 messages🔥🔥):

Meta's Maverick LLM Arena Gate, Task Fatigue, Agent as Judge, Home Rolled Context Sharing, Value Curves and Negative Outcomes


LM Studio ▷ #general (32 messages🔥):

LM Studio and headless servers, Proxmox setup with LM Studio, RAG interface limitations in LM Studio, Multimodal model support, Silly Tavern samplers


LM Studio ▷ #hardware-discussion (142 messages🔥🔥):

GMKtec Design Speed, MoE Model Performance, Llama 3.3 Quantization, CUDA Driver Performance Boost, PCIE 7.0 Bandwidth


OpenRouter (Alex Atallah) ▷ #announcements (43 messages🔥):

Per-App Model Rankings, RooCode, OpenRouter App Dashboards, Passkeys, Gemini 2.5 Pro Experimental Rate Limits


OpenRouter (Alex Atallah) ▷ #general (126 messages🔥🔥):

Gemini 2.5 Pro inference, Google Gemini 2.0 Flash Experimental, AI resume builder, Recruiting hellscape, Extracting information from Gmail


aider (Paul Gauthier) ▷ #general (37 messages🔥):

DeepSeek Prover V2, Home workstation for local LLMs, Qwen3 235b memory requirements, Gemini 2.5 Pro's long context magic, Codex CLI vs Aider


aider (Paul Gauthier) ▷ #questions-and-tips (44 messages🔥):

aider install errors, repo map size, o3 API issues, adding projects to aider, complex prompts with cheaper models


HuggingFace ▷ #general (45 messages🔥):

Xet alternative uploader, MCP course channel, YoloX setup, Bing for professionals, Ollama remote


HuggingFace ▷ #cool-finds (3 messages):

Ethical AI Nurturing, Manifesto of Nurturing, Open Source AI Agents SDK


HuggingFace ▷ #i-made-this (4 messages):

3D Animation Arena video, Firebase storage, EcoArt Cellular Automaton, Realtime AI Visualization


HuggingFace ▷ #NLP (1 messages):

Dewey Decimal Classification, National Library of France, Open-source LLM Project, Prompt Design, LLM engineers


HuggingFace ▷ #smol-course (1 messages):

Hugging Face Hub Tools


HuggingFace ▷ #agents-course (19 messages🔥):

GAIA LLM, Inference Provider Credits, AI as a Living Presence, Multiagent Data Sharing, AI Agent Course Project Suggestions


Eleuther ▷ #general (11 messages🔥):

Visualize applications, UML diagrams, FinTech AI projects


Eleuther ▷ #research (21 messages🔥):

Alpha Evolve, RWKV-6, LM finetuning


Eleuther ▷ #interpretability-general (25 messages🔥):

TunedLens paper, translator trained for the final layer of the model, Autocorrect gives away your background in physics, GPT-2 XL, embedding layer


Eleuther ▷ #lm-thunderdome (6 messages):

vllm, lm_eval, Gemma 3 27 IT, Data Parallelism, Tensor Parallelism


MCP (Glama) ▷ #general (51 messages🔥):

MCP protocol ingestion for LLMs, TuringPaper hosted MCP, Local MCP server interaction, MCP Inspector debugging, MCP agent invoke method error


MCP (Glama) ▷ #showcase (2 messages):

cyberchef-mcp-sse, MCP UI, SDK to add UI to MCP


Notebook LM ▷ #use-cases (4 messages):

Deep Dive podcasting with music, Gemini Canvas, NBLM Gemini Integration


Notebook LM ▷ #general (48 messages🔥):

Organic Chemistry Gamification with NotebookLM, NTLM Beta App Experience, NotebookLM and Math Formatting, Source Formats for NotebookLM, NotebookLM's Web Access and Secondary Sources


GPU MODE ▷ #general (1 messages):

GPU Mode videos, Community Introduction


GPU MODE ▷ #triton (3 messages):

Native FP8 support in Triton, Shared Memory Calculation, Autotuning Failure Analysis


GPU MODE ▷ #cuda (2 messages):

fp16 GEMM, bf16 GEMM, cuBLAS, Lei Mao, cuda_hgemm


GPU MODE ▷ #torch (6 messages):

AOT Inductor Code Correlation, FSDP2 Device Mesh Performance, Torch Compile max-autotune batch sizes


GPU MODE ▷ #beginner (9 messages🔥):

Duff's Device in CUDA, Partial Unrolling with #pragma unroll, Thread Merging in Volta and Later GPUs


GPU MODE ▷ #rocm (1 messages):

snektron: https://github.com/ROCm/rocm-libraries ROCm libraries new monorepo


GPU MODE ▷ #self-promotion (1 messages):

X Post, Image Analysis


GPU MODE ▷ #submissions (19 messages🔥):

MI300, amd-mixture-of-experts, amd-fp8-mm


GPU MODE ▷ #factorio-learning-env (2 messages):

Server bumping, Code contribution


GPU MODE ▷ #amd-competition (3 messages):

AMD-FP8-MM leaderboard shapes, Mixture-of-experts submission errors


GPU MODE ▷ #cutlass (2 messages):

CuTe DSL, CUDA Python, Tensor Core Programming, Linear Algebra Programming Model


GPU MODE ▷ #mojo (1 messages):

GPU puzzles, pixi errors, 4090, PTXAS fatal error, KGEN_CompilerRT_AlignedAlloc


Nous Research AI ▷ #general (21 messages🔥):

OpenAI Release, Smart Glasses, Nous Research NYC Event, New Voice Model, Image Infilling Models


Nous Research AI ▷ #interesting-links (1 messages):

Augmentation Lab, Rhizome Futurism, Summer residency


DSPy ▷ #general (14 messages🔥):

DSpy Abstractions, Foundation Models, ChatAdapter, dspy.History, dspy.Suggest and dspy.Assert


tinygrad (George Hotz) ▷ #general (13 messages🔥):

Bounty Google Sheet, whitespace changes, PR closed because AI, GCC instead of Clang


Manus.im Discord ▷ #general (13 messages🔥):

Version Control, Manus GitHub Integration, OpenAI Codex Competition


LlamaIndex ▷ #general (10 messages🔥):

PropertyGraphIndex embeddings, Prompt caching, MCP Servers


Torchtune ▷ #general (6 messages):

Torchtune configurations, Alpaca dataset in LLM fine-tuning, Modern datasets for LLM training, Evaluation benchmark on Alpaca dataset, Torchtune performance increase


Torchtune ▷ #dev (1 messages):

segmenttreebeats: <@154226635338547200> Hey! Am I correct here? I would like to merge #2608


Modular (Mojo 🔥) ▷ #mojo (4 messages):

Bazel in Modular Repo, NDBuffer Multiplication, NDBuffer Deprecation


Nomic.ai (GPT4All) ▷ #general (4 messages):

koboldcpp, GPT4All, NMKD SDGUI, swarm-ui


MLOps @Chipro ▷ #general-ml (1 messages):

Designing Machine Learning Systems, Robotics AI


Cohere ▷ #🔌-api-discussions (1 messages):

SwiftUI, Cohere API