All tags
Model: "intellect-3"
not much happened today
kling-2.6 kling-o1 runway-gen-4.5 gemini-3 deepseek-v3.2 ministral-3 evoqwen2.5-vl hermes-4.3 intellect-3 openai anthropic google runway elevenlabs freepik openart deepseek mistral-ai alibaba nous-research video-generation audio-processing multimodality image-generation reasoning model-quantization sparse-attention model-pricing multimodal-models retrieval-augmentation model-training model-release
OpenAI's Code Red response and Anthropic's IPO are major highlights. In AI video and imaging, Kling 2.6 introduces native audio co-generation with coherent lip-sync, partnered with platforms like ElevenLabs and OpenArt. Runway Gen-4.5 enhances lighting fidelity, while Google's Gemini 3 Nano Banana Pro supports advanced image compositing. Open model releases include DeepSeek V3.2 with sparse attention and cost-effective pricing, and Mistral's Ministral 3 multimodal family with strong 14B variants. Retrieval and code models from Alibaba's EvoQwen2.5-VL and Nous Research's Hermes 4.3 show competitive performance with permissive licensing and HF availability. The community arena sees additions like INTELLECT-3 (106B MoE). "coherent looking & sounding output" and "auto-lighting to match scene mood" are noted advancements.