Frozen AI News archive

Gemini 3 Pro — new GDM frontier model 6, Gemini 3 Deep Think, and Antigravity IDE

**Google** launched **Gemini 3 Pro**, a state-of-the-art model with a **1M-token context window**, **multimodal reasoning**, and strong agentic capabilities, priced significantly higher than Gemini 2.5. It leads major benchmarks, surpassing **Grok 4.1** and competing closely with **Sonnet 4.5** and **GPT-5.1**, though GPT-5.1 excels in ultralong summarization. Independent evaluations from **Artificial Analysis**, **Vending Bench**, **ARC-AGI 2**, **Box**, and **PelicanBench** validate Gemini 3 as a frontier LLM. Google also introduced **Antigravity**, an agentic IDE powered by Gemini 3 Pro and other models, featuring task orchestration and human-in-the-loop validation. The launch marks Google's strong return to AI with more models expected soon. *"Google is very, very back in the business."*

Canonical issue URL

Google account is all you need?

AI News for 11/17/2025-11/18/2025. We checked 12 subreddits, 544 Twitters and 24 Discords (205 channels, and 14599 messages) for you. Estimated reading time saved (at 200wpm): 997 minutes. Our new website is now up with full metadata search and beautiful vibe coded presentation of all past issues. See https://news.smol.ai/ for the full news breakdowns and give us feedback on @smol_ai!

Finally we got the highly anticipated Gemini 3 launch — SOTA benchmarks across the board (except for one), 60% higher pricing than Gemini 2.5, and #1 across all major Arena leaderboards (yes, as suspected, Gemini 3 beats yesterday's Grok 4.1 which claimed #1 in Text Arena for one day, what a coincidence).

But you should carefully eyeball the HUGE jumps especially compared head on with Sonnet 4.5 and GPT-5.1, each competitor's literal frontier model:

Benchmark comparison of AI models showing performance across various tasks and evaluations, highlighting Gemini 3 Pro's impressive results across multiple domains.

Some notable independent evals from Aritifical Analysis, Vending Bench, ARC-AGI 2, Box, and of course PelicanBench (now with a v2) validate it as a frontier LLM. (we ran Gemini 3 vs GPT 5.1 and found GPT 5.1 clearly better for ultralong summarization/instruction following). The magnitude of the ARC AGI 2 results is also a pretty notable pareto improvement:

Leaderboard comparing AI models' performance across different benchmarks, showing their scores and cost per task on a logarithmic scale.

Oriol Vinyals confirmed there were both pretraining and posttraining improvements - no walls in sight.

On top of this, two more surprise launches: Gemini Deep Think, with better numbers but unreleased, and Antigravity, the former acquired Windsurf team's take on a Google-born agentic IDE with it's own domain name and a (un)surprisingly well executed YouTube channel.

Google is very, very back in the business. More models to come, and it's only Tuesday.


AI Twitter Recap

Google’s Gemini 3 and Antigravity: launch, specs, and availability

Benchmarks and empirical performance (incl. Deep Think)

Ecosystem rollouts and integrations

Anthropic x Microsoft x NVIDIA: multi-cloud Claude and massive capex

Open research agents and toolchain updates

Infra and ops notes

Top tweets (by engagement)

Notes for builders:


AI Reddit Recap

/r/LocalLlama + /r/localLLM Recap

1. AI Server Uptime and Outages

Less Technical AI Subreddit Recap

/r/Singularity, /r/Oobabooga, /r/MachineLearning, /r/OpenAI, /r/ClaudeAI, /r/StableDiffusion, /r/ChatGPT, /r/ChatGPTCoding, /r/aivideo, /r/aivideo

1. Gemini 3.0 Pro Benchmark and Release Discussions

2. Cloudflare Outage Impact on Major Platforms

3. Gemini 3.0 Pro Performance and User Feedback


AI Discord Recap

A summary of Summaries of Summaries by gpt-5.1

1. Gemini 3 Pro & Google Antigravity: Rollout, Benchmarks, and Ecosystem Integrations

2. Grok 4.1 and the Creativity/EQ Arms Race vs GPT-5.1 and Others

3. Tooling, Infra, and Governance: MCP, Graph-RAG, Sourcegraph Ads, Runlayer, Atlas

4. Security, Jailbreaking, and Abuse: Image-Based Prompts, Ransomware, CAPTCHAs, Fraud

5. Performance Engineering and Training Tooling: llama.cpp, Hugging Face Tricks, DSPy, Atropos


Discord: High level Discord summaries

LMArena Discord


Perplexity AI Discord


BASI Jailbreaking Discord


Cursor Community Discord


Unsloth AI (Daniel Han) Discord


OpenRouter Discord


OpenAI Discord


LM Studio Discord


Latent Space Discord


Nous Research AI Discord


HuggingFace Discord


Yannick Kilcher Discord


Eleuther Discord


Modular (Mojo 🔥) Discord


DSPy Discord


tinygrad (George Hotz) Discord


aider (Paul Gauthier) Discord


Manus.im Discord Discord


Windsurf Discord


MLOps @Chipro Discord


MCP Contributors (Official) Discord


The LLM Agents (Berkeley MOOC) Discord has no new messages. If this guild has been quiet for too long, let us know and we will remove it.


You are receiving this email because you opted in via our site.

Want to change how you receive these emails? You can unsubscribe from this list.


Discord: Detailed by-Channel summaries and links

LMArena ▷ #general (1404 messages🔥🔥🔥):

Gemini 3 Pro, Google AI Studio, LLM limitations, Antigravity IDE, LMArena Issues


LMArena ▷ #announcements (4 messages):

Grok-4.1-thinking, Expert leaderboard, November AI Generation Contest - Code Arena, Image-to-Video Leaderboard, Text-to-Image Leaderboard


Perplexity AI ▷ #general (1028 messages🔥🔥🔥):

Gemini 3 Pro, Antigravity IDE, Comet Android, Kimi Model


Perplexity AI ▷ #pplx-api (4 messages):

purrvv.me


BASI Jailbreaking ▷ #general (1056 messages🔥🔥🔥):

Grok's abrupt hardness, AI solving captchas, Builder.ai scandal, Grok for ransomware, Gemini 3 Pro


BASI Jailbreaking ▷ #jailbreaking (683 messages🔥🔥🔥):

Indirect Prompt Injection, Grok 4.1 System Prompt Leak, Bypassing AI Security, Jailbreaking Gemini 3.0, Exploiting AI for Malicious Purposes


BASI Jailbreaking ▷ #redteaming (19 messages🔥):

AI Jailbreaking, ChatGPT Agent Mode, Suit vs Business Suit prompt, Banning from ChatGPT, Random Email for ChatGPT


Cursor Community ▷ #general (1093 messages🔥🔥🔥):

Mac OS vs Linux for power users, Cursor support for multiple GitHub accounts, Composer free period ending, Gemini 3 Pro release and performance, Cloudflare outage


Unsloth AI (Daniel Han) ▷ #general (222 messages🔥🔥):

TPU support, Gemma 3 270M, LLama CPP Speed, Activation functions, 4bit merged


Unsloth AI (Daniel Han) ▷ #introduce-yourself (3 messages):

AI and Full Stack Development, LLM Integration


Unsloth AI (Daniel Han) ▷ #off-topic (275 messages🔥🔥):

SF vs NYC food, Breadclip classifier, LoRA training hacks, Gemini 3 code changes


Unsloth AI (Daniel Han) ▷ #help (83 messages🔥🔥):

Unsloth online training, Llama CPP Deployment with Unraid/Docker, Voice AI Agent LLM Training, Unsloth and HuggingFace TRL, Text to Speech inference with Unsloth


Unsloth AI (Daniel Han) ▷ #research (2 messages):

LLM Inference, Non-determinism, Determinism


OpenRouter ▷ #app-showcase (6 messages):

Efficiency in current projects, YouTube video link


OpenRouter ▷ #general (473 messages🔥🔥🔥):

editing images issues, Grok 4.1 Release, LiteAPI shady practices, Gemini 3 launch, Deepseek V3 0324 replacement


OpenRouter ▷ #new-models (14 messages🔥):

``


OpenRouter ▷ #discussion (34 messages🔥):

Cloudflare Replicate acquisition, Grok-4 compared to GPT-5, Hallucination theories, Gemini 3 Preview on OpenRouter, Key Expiration


OpenAI ▷ #annnouncements (1 messages):

Atlas Browser, OpenAI Podcast


OpenAI ▷ #ai-discussions (439 messages🔥🔥🔥):

Custom GPT efficiency measurement, Gemini 3 Pro vs GPT-5.1 comparison, Grok 4.1 capabilities, AGI and ASI discussion, Grok Imagine and video generation


OpenAI ▷ #gpt-4-discussions (4 messages):

GPT 4.5 vs 5, GPT Photo upload error


OpenAI ▷ #prompt-engineering (12 messages🔥):

Prompt Engineering Advice, Model Interpretations, Subjectivity of Model Quality, Model A/B Testing, Automated Filter Issues


OpenAI ▷ #api-discussions (12 messages🔥):

Prompt Engineering Tips, Image Generation Quality, Model Accuracy, A/B Testing Models, Automated Filters


LM Studio ▷ #general (234 messages🔥🔥):

E-commerce Seller Woes, LLM Performance Degradation, 2FA App Recommendations, LLM Tool Integration via MCP, Gemini3-Pro benchmarks


LM Studio ▷ #hardware-discussion (226 messages🔥🔥):

Silent PC Building, GPU Mods, RAM Prices, eBay selling, UPS Issues


Latent Space ▷ #ai-general-chat (178 messages🔥🔥):

Sourcegraph Ads, xAI Grok 4.1, Varun Mohan's Teaser, Poe Group Chat, Runlayer Funding


Nous Research AI ▷ #announcements (1 messages):

Atropos, RL Environments framework, Thinking Machines' Tinker training API


Nous Research AI ▷ #general (131 messages🔥🔥):

Amazon Nova Premier v1, Bedrock AWS issues, Jeff Bezos buff, Hermes 4 70B on dual 3090s, Factorio Agent project


Nous Research AI ▷ #ask-about-llms (6 messages):

Uncensored MoEs, LoRAing a Josified model, Kimi 1T Jailbreak, Kimi Linear for housekeeping, Model Censorship Frustrations


Nous Research AI ▷ #interesting-links (2 messages):

AI Paper, NousResearch Github Repo


HuggingFace ▷ #general (109 messages🔥🔥):

Graph-RAG database, Mimir project, Lablab hackathons controversy, Cloudflare outage, Gemini 3


HuggingFace ▷ #i-made-this (4 messages):

Cornserve, FastMayavangelis, Grounded Video Caption Generation, Medical Reasoning GPT OSS 20B


HuggingFace ▷ #computer-vision (1 messages):

grounded video caption generation, ICCV 2025, HowToGround1M, iGround dataset


HuggingFace ▷ #gradio-announcements (1 messages):

Gradio 6 Launch, Youtube Announcement, X Announcement


HuggingFace ▷ #smol-course (1 messages):

Jinja Code, instruct_tokenizer.apply_chat_template, HuggingFaceTB/SmolLM3-3B chat template, Bug Reporting, Pull Requests


Yannick Kilcher ▷ #general (36 messages🔥):

Understanding Machine Learning Book, More Technical ML Books, ReLU activation function, Gemini 3 Pro Model, Mathematical theory of Deep Learning


Yannick Kilcher ▷ #paper-discussion (9 messages🔥):

Circuit Sparsity, Yannic Kilcher Transformer Videos, Youtube Search


Yannick Kilcher ▷ #ml-news (26 messages🔥):

DS Star, WeatherNext-2, Grok-4, Project Prometheus, Gemini 3 Pro Model


Eleuther ▷ #general (31 messages🔥):

NeurIPS 2025, EleutherAI's Papers, Social events at NeurIPS 2025, Huggingface performance tips, ML Research guidance


Eleuther ▷ #research (36 messages🔥):

Tied weights in Transformers, Cohere's command A model, Virtual Width Networks (VWN), Linear Attention, Residual Matrix Transformers


Eleuther ▷ #interpretability-general (1 messages):

Paper review, Model review, Result analysis, Result discussion, Future research


Modular (Mojo 🔥) ▷ #announcements (1 messages):

Modular updates, Mojo 1.0 roadmap, Community Meeting, MMMAudio, Shimmer


Modular (Mojo 🔥) ▷ #mojo (34 messages🔥):

NVFP4 support in Mojo, Negative indexing and IntvsUInt, Poetry integration


DSPy ▷ #papers (1 messages):

the_4th_doctor: The structure of this is ideal for an optimizer: https://arxiv.org/abs/2511.09030


DSPy ▷ #general (22 messages🔥):

VSCode Extension for DSPy, MLflow Alternatives, LLM Determinism, DSPy in Production Community, Tracing and Observability


tinygrad (George Hotz) ▷ #general (14 messages🔥):

tinybox, Llama 1B, torch compile, Kernel imports, examples/extra


aider (Paul Gauthier) ▷ #general (6 messages):

Gemini 3 errors, Experimental Channel Move


Manus.im Discord ▷ #general (3 messages):

Manus.im, Manus 1.5, Skilled Developer


Windsurf ▷ #announcements (2 messages):

Gemini 3 Pro, Windsurf, Download editor


MLOps @Chipro ▷ #events (1 messages):

AI Governance, AI Control, AI Webinar


MCP Contributors (Official) ▷ #general (1 messages):

Image Analysis, Attachments, Discord CDN