Company: "kling-ai"
not much happened today
qwen-image-layered kling-2.6 gwm-1 gen-4.5 gemini-3-flash gpt-5.2 codex-cli opus-4.5 alibaba kling-ai runway google anthropic openai image-decomposition motion-control video-generation agentic-reinforcement-learning long-context model-degradation benchmarking tool-use prompt-engineering ankesh_anand
Alibaba released Qwen-Image-Layered, an open-source model enabling Photoshop-grade layered image decomposition with recursive infinite layers and prompt-controlled structure. Kling 2.6 introduced advanced motion control for image-to-video workflows, supported by a creator contest and prompt recipes. Runway unveiled the GWM-1 family with frame-by-frame video generation, along with Gen-4.5 updates that add audio and multi-shot editing. In LLM platforms, Gemini 3 Flash leads GPT-5.2 on benchmarks, a lead attributed to agentic reinforcement learning improvements applied after distillation. Users note that GPT-5.2 excels at long-context tasks (~256k tokens) but report UX limitations that push some toward Codex CLI. Discussions around Anthropic Opus 4.5 suggest that the perceived model degradation is linked to user expectations.
Gemini 2.5 Pro/Flash GA, 2.5 Flash-Lite in Preview
gemini-2.5 gemini-2.5-flash-lite gemini-2.5-flash gemini-2.5-pro gemini-2.5-ultra kimi-dev-72b nanonets-ocr-s ii-medical-8b-1706 jan-nano deepseek-r1 minimax-m1 google moonshot-ai deepseek cognitivecompai kling-ai mixture-of-experts multimodality long-horizon-planning benchmarking coding-performance long-context ocr video-generation model-releases tulsee_doshi oriolvinyalsml demishassabis officiallogank _philschmid swyx sainingxie scaling01 gneubig clementdelangue mervenoyann
Gemini 2.5 Pro and Flash are now generally available, and the new Gemini 2.5 Flash-Lite is in preview; the models are sparse Mixture-of-Experts (MoE) transformers with native multimodal support. A detailed 30-page tech report highlights the long-horizon planning demonstrated by Gemini Plays Pokemon. The LiveCodeBench-Pro benchmark reveals that frontier LLMs struggle with hard coding problems, while Moonshot AI open-sourced Kimi-Dev-72B, which achieves state-of-the-art results on SWE-bench Verified. Smaller specialized models such as Nanonets-OCR-s, II-Medical-8B-1706, and Jan-nano show competitive performance, underscoring that bigger models are not always better. DeepSeek-r1 ties for #1 in WebDev Arena, and MiniMax-M1 sets new standards in long-context reasoning. Kling AI demonstrated video generation capabilities.