All tags
Model: "gpt-5.1-codex"
OpenAI fires back: GPT-5.1-Codex-Max (API) and GPT 5.1 Pro (ChatGPT)
gpt-5.1-codex-max gpt-5.1-codex gemini-3-pro claude-3.5-sonnet openai google anthropic langchain-ai coding autonomous-systems benchmarking model-scaling multi-agent-systems model-performance reasoning model-architecture sama
OpenAI released GPT-5.1-Codex-Max, featuring compaction-native training, an "Extra High" reasoning mode, and claims of over 24-hour autonomous operation, showing significant performance gains on benchmarks like METR, CTF, and PaperBench. Google's Gemini 3 Pro demonstrates strong coding and reasoning capabilities, achieving new state-of-the-art results on SWE-bench Verified and WeirdML, with estimated model size between 5-10 trillion parameters. The AI coding agent ecosystem is rapidly evolving with integrations and tooling improvements from multiple companies. Sam Altman highlighted the significant improvements in GPT-5.1-Codex-Max. The news also covers educational offerings like ChatGPT for Teachers and multi-agent workflows involving Gemini 3, GPT-5.1-Codex-Max, and Claude Sonnet 4.5.
minor updates to GPT 5.1 and SIMA 2
gpt-5.1 gpt-5.1-codex gpt-5.1-codex-mini sima-2 gemini openai google-deepmind github microsoft cursor_ai perplexity-ai weaviate llamaindex adaptive-reasoning agentic-coding tool-use context-engineering memory-architecture self-improvement retrieval-augmentation database-query-planning chart-parsing robotics sama allisontam_ cline cognition demishassabis omarsar0 helloiamleonie
OpenAI released GPT-5.1 family models including 5.1-Codex and 5.1-Codex-Mini with improved steerability, faster responses, and new tools like apply_patch and shell command execution. Pricing remains unchanged from 5.0. Immediate integrations include GitHub Copilot, VS Code, Cursor, and Perplexity adopting GPT-5.1 models. Google DeepMind announced SIMA 2, a Gemini-powered agent capable of language instruction following, planning, and self-improvement without human feedback, targeting robotics applications. New research on context engineering and agentic tool use patterns was published, with contributions from Weaviate and LlamaIndex on database query planning and chart parsing respectively. "Adaptive reasoning" and agentic coding improvements are highlighted in GPT-5.1- Instant.