All tags
Person: "altryne"
not much happened today
qwen3.6-27b qwen3.5-397b-a17b privacy-filter mimo-v2.5-pro mimo-v2.5 gemini-3.1-pro gemini-3.1-flash-image alibaba openai xiaomi google google-deepmind vllm_project unsloth ggml ollama arena nous-research open-models multimodality vision tokenization pii-detection privacy enterprise-ai agentic-ai benchmarking long-context model-deployment hardware-optimization model-integration software-engineering alibaba_qwen clementdelangue altryne eliebakouch mervenoyann xiaomimo sundarpichai scaling01
Alibaba released Qwen3.6-27B, a dense, Apache 2.0 open coding model with thinking and non-thinking modes, outperforming the larger Qwen3.5-397B-A17B on multiple coding benchmarks including SWE-bench and Terminal-Bench. It supports native vision-language reasoning over images and video, with immediate ecosystem support from vLLM, Unsloth, ggml, and Ollama. OpenAI open-sourced a practical Privacy Filter model for PII detection and masking, a 1.5B parameter token-classification model with a 128k context window aimed at enterprise redaction tasks. Xiaomi announced MiMo-V2.5-Pro and MiMo-V2.5 models, emphasizing software engineering advances, long-horizon agents, and large context windows (up to 1M tokens), with strong benchmark results and integrations with Hermes and Nous. At Google Cloud Next, Google and Google DeepMind unveiled 8th-gen TPUs (TPU 8t for training and TPU 8i for inference) with claims of scaling to a million TPUs in a cluster, and launched the Gemini Enterprise Agent Platform evolving Vertex AI with Agent Studio and access to 200+ models including Gemini 3.1 Pro and Gemini 3.1 Flash Image. This marks a significant vertical integration of hardware, models, and enterprise tooling.
not much happened today
kling-2.5-turbo sora-2 gemini-2.5-flash granite-4.0 qwen-3 qwen-image-2509 qwen3-vl-235b openai google ibm alibaba kling_ai synthesia ollama huggingface arena artificialanalysis tinker scaling01 video-generation instruction-following physics-simulation image-generation model-architecture mixture-of-experts context-windows token-efficiency fine-tuning lora cpu-training model-benchmarking api workflow-automation artificialanlys kling_ai altryne teortaxestex fofrai tim_dettmers sundarpichai officiallogank andrew_n_carr googleaidevs clementdelangue wzhao_nlp alibaba_qwen scaling01 ollama
Kling 2.5 Turbo leads in text-to-video and image-to-video generation with competitive pricing. OpenAI Sora 2 shows strong instruction-following but has physics inconsistencies. Google Gemini 2.5 Flash "Nano Banana" image generation is now generally available with multi-image blending and flexible aspect ratios. IBM Granite 4.0 introduces a hybrid Mamba/Transformer architecture with large context windows and strong token efficiency, outperforming some peers on the Intelligence Index. Qwen models receive updates including fine-tuning API support and improved vision capabilities. Tinker offers a flexible fine-tuning API supporting LoRA sharing and CPU-only training loops. The ecosystem also sees updates like Synthesia 3.0 adding video agents.