All tags
Person: "googleaidevs"
not much happened today
mai-thinking-1 mai-image-2.5 mai-code-1-flash gemma-4-12b microsoft google vllm-project ollama llama-cpp model-training reinforcement-learning model-architecture multimodality model-deployment model-efficiency fine-tuning on-device-ai eliebakouch nrehiew_ mustafasuleyman minjiyoon90 lateinteraction harold_matmul googlegemma googleaidevs mtschannen armandjoulin osanseviero
Microsoft released the detailed technical report for MAI-Thinking-1, a generalist reasoning model trained without third-party distillation, achieving 97% on AIME 2025 and outperforming Sonnet 4.6 in human preference tests. The report was praised for transparency, revealing no synthetic data use, a unique scaling ladder recipe, and detailed training data composition including 50% code and 17.5% STEM. Microsoft also introduced Frontier Tuning for workflow-specific model adaptation, claiming efficiency gains up to 10× and GPT-5.4-level quality in Excel tasks, alongside new models like MAI-Image-2.5 and MAI-Code-1-Flash. Meanwhile, Google launched Gemma 4 12B, an Apache 2.0 multimodal model with an innovative encoder-free architecture designed for on-device use with 16GB VRAM, collapsing vision and audio encoders into the LLM backbone, receiving positive community feedback and immediate tooling support.
not much happened today
kling-2.5-turbo sora-2 gemini-2.5-flash granite-4.0 qwen-3 qwen-image-2509 qwen3-vl-235b openai google ibm alibaba kling_ai synthesia ollama huggingface arena artificialanalysis tinker scaling01 video-generation instruction-following physics-simulation image-generation model-architecture mixture-of-experts context-windows token-efficiency fine-tuning lora cpu-training model-benchmarking api workflow-automation artificialanlys kling_ai altryne teortaxestex fofrai tim_dettmers sundarpichai officiallogank andrew_n_carr googleaidevs clementdelangue wzhao_nlp alibaba_qwen scaling01 ollama
Kling 2.5 Turbo leads in text-to-video and image-to-video generation with competitive pricing. OpenAI Sora 2 shows strong instruction-following but has physics inconsistencies. Google Gemini 2.5 Flash "Nano Banana" image generation is now generally available with multi-image blending and flexible aspect ratios. IBM Granite 4.0 introduces a hybrid Mamba/Transformer architecture with large context windows and strong token efficiency, outperforming some peers on the Intelligence Index. Qwen models receive updates including fine-tuning API support and improved vision capabilities. Tinker offers a flexible fine-tuning API supporting LoRA sharing and CPU-only training loops. The ecosystem also sees updates like Synthesia 3.0 adding video agents.