All tags  
  Company: "ostrisai"
 not much happened today 
   qwen-image-edit  qwen-vl-max  kling-2.1  veo-3  deepseek-v3.1  genie-3  sima   google-deepmind  alibaba  google  deepseek  baseten  yupp   multimodality  embodied-ai  simulation  fine-tuning  quantization  video-generation  image-generation  local-inference  scaling  agent-training  real-time-control  spatial-memory   demishassabis  bonniesjli  shreyar  ostrisai  lmarena_ai  teortaxestex  ivanfioravanti  
 DeepMind released Genie 3, an interactive multimodal world simulator with advanced spatial memory and real-time avatar control, and SIMA, an embodied training agent operating inside generated worlds. Alibaba introduced Qwen-Image-Edit, an open-weights image editor scoring ELO 1098 (#2) in the Image Editing Arena, running on Qualcomm NPUs, alongside Qwen-VL-Max entering the Vision top-20. Video models like Kling 2.1 showed a 235% improvement in frame control, with new entrants Luma Ray 2 and Runway Gen-4 Turbo debuting. Google provided free Veo 3 generations in Gemini App and enhanced Google Photos with natural-language edits. DeepSeek v3.1 launched with focus on SWE and Search agents, supporting local inference on Apple Silicon with 4-bit quantization achieving ~21 tok/s on M3 Ultra. The news highlights advances in interactive simulation, vision editing, video synthesis, and scalable local AI inference.
  Databricks' $100B Series K 
   deepseek-v3.1-base  deepseek-v3.1-instruct  chatgpt-go  qwen-image-edit   databricks  openai  deepseek  hugging-face  alibaba   model-release  benchmarking  pricing-models  fine-tuning  model-architecture  image-editing  video-generation  api  agentic-ai   sama  nickaturley  kevinweil  gdb  sherwinwu  nptacek  reach_vb  clementdelangue  teortaxestex  quixiai  georgejrjrjr  scaling01  alibaba_qwen  linoy_tsaban  ostrisai  lmarena_ai  
 Databricks reached a $100 billion valuation, becoming a centicorn with new Data (Lakebase) and AI (Agent Bricks) products. OpenAI launched ChatGPT Go in India at ₹399/month (~$4.55), offering significantly increased usage limits and UPI payment support, with plans for global expansion. The DeepSeek V3.1 Base/Instruct models were quietly released on Hugging Face, showing strong coding benchmark performance and adopting an Anthropic-style hybrid system. The Qwen-Image-Edit model from Alibaba is gaining traction with integrations and community pruning experiments. "DeepSeek V3.1 Base outperforms Claude 4 Opus on coding benchmarks" and "ChatGPT Go offers 10x higher message limits and 2x longer memory" highlight key advancements.
  Figma's $50+b IPO 
   horizon-alpha  gpt-5  gemini-2.5-pro  qwen3-coder  qwen3-coder-flash-30b-a3b  command-a-vision  gpt-4.1  llama-4-maverick  flux-1-krea-dev  glm-4.5  voxtral   openai  openrouter  alibaba  unslothai  cohere  huggingface  black-forest-labs  diffusers  ostrisai  zhipu-ai  together-ai  mistral-ai   reasoning  svg-generation  agentic-ai  context-windows  vision  fine-tuning  inference-time-training  model-generalization  open-models  technical-reports   scaling01  teortaxestex  huybery  nickfrosst  aidangomez  reach_vb  zai_org  corbtt  jxmnop  teknuim1  
 OpenAI's stealth model horizon-alpha on OpenRouter sparks speculation as a precursor to GPT-5, showing strong reasoning and SVG generation capabilities, comparable to Gemini 2.5 Pro. Alibaba released the Qwen3-Coder family, including a fast Qwen3-Coder-Flash (30B-A3B) variant with agentic features and 1M context length support via UnslothAI. Cohere launched Command A Vision, a 111B parameter open-weights vision-language model outperforming GPT-4.1 and Llama 4 Maverick on enterprise benchmarks. Black Forest Labs introduced FLUX.1 Krea [dev], an open-weights photorealism model compatible with fine-tuning tools like diffusers and ostrisai. Zhipu AI unveiled GLM-4.5, a hybrid reasoning open model with agentic capabilities available on Together AI. Discussions highlight the rising importance of inference-time training and reasoning model generalization. Mistral AI released the technical report for Voxtral continuing its open science efforts.
  not much happened today 
   glm-4.5  glm-4.5-air  qwen3-coder  qwen3-235b  kimi-k2  grok-imagine  wan-2.2  smollm3  figure-01  figure-02  vitpose++  chatgpt   zhipu-ai  alibaba  moonshot-ai  x-ai  figure  openai  runway  mlx  ollama  deeplearningai   model-releases  model-performance  moe  image-generation  video-generation  pose-estimation  robotics  training-code-release  interactive-learning  in-context-learning   yuchenj_uw  corbtt  reach_vb  ollama  deeplearningai  gdb  sama  c_valenzuelab  adcock_brett  skalskip92  loubnabenallal1  hojonathanho  ostrisai  
 Chinese AI labs have released powerful open-source models like GLM-4.5 and GLM-4.5-Air from Zhipu AI, Qwen3 Coder and Qwen3-235B from Alibaba, and Kimi K2 from Moonshot AI, highlighting a surge in permissively licensed models. Zhipu AI's GLM-4.5 is a 355B parameter MoE model competitive with Claude 4 Opus and Gemini 2.5 Pro. Alibaba's Qwen3 Coder shows strong code generation performance with a low edit failure rate, while Moonshot AI's Kimi K2 is a 1 trillion-parameter MoE model surpassing benchmarks like LiveCodeBench. In video and image generation, xAI launched Grok Imagine, and Wan2.2 impressed with innovative image-to-video generation. Robotics advances include Figure's Figure-01 and Figure-02 humanoid robots and ViTPose++ for pose estimation in basketball analysis. SmolLM3 training and evaluation code was fully released under Apache 2.0. OpenAI introduced Study Mode in ChatGPT to enhance interactive learning, and Runway rolled out Runway Aleph, a new in-context video model for multi-task visual generation. The community notes a competitive disadvantage for organizations avoiding these Chinese open-source models. "Orgs avoiding these models are at a significant competitive disadvantage," noted by @corbtt.
  not much happened today 
   glm-4.5  glm-4.5-air  qwen3-coder  qwen3-235b  kimi-k2  wan-2.2  grok-imagine  smollm3  figure-01  figure-02  vitpose++   zhipu-ai  alibaba  moonshot-ai  x-ai  ideogram  figure  smollm  openai   model-releases  moe  model-benchmarking  image-generation  video-generation  pose-estimation  robotics  training-code-release  apache-license   yuchenj_uw  corbtt  cline  reach_vb  ollama  deeplearningai  ostrisai  hojonathanho  adcock_brett  skalskip92  loubnabenallal1  
 Chinese labs have released a wave of powerful, permissively licensed models in July, including Zhipu AI's GLM-4.5 and GLM-4.5-Air, Alibaba's Qwen3 Coder and Qwen3-235B, and Moonshot AI's Kimi K2. These models feature large-scale Mixture of Experts architectures with active parameters ranging from 3B to 32B and context windows up to 256K tokens. Zhipu AI's GLM-4.5 competes with Claude 4 Opus and Gemini 2.5 Pro in benchmarks. Moonshot AI's Kimi K2 is a 1 trillion-parameter MoE model surpassing other open-weight models on LiveCodeBench and AceBench. In video and image generation, xAI launched Grok Imagine, and Wan2.2 impressed with its Image-to-Video approach. Ideogram released a character consistency model. Robotics advances include Figure's Figure-01 and Figure-02 humanoid robots and ViTPose++ for pose estimation in basketball analysis. The SmolLM3 training and evaluation code was fully released under an Apache 2.0 license. "Orgs avoiding these Chinese open-source models are at a significant competitive disadvantage," noted by @corbtt.
  not much happened today 
   llama-3.1-nemotron-70b  golden-gate-claude  embed-3   liquid-ai  anthropic  cohere  openai  meta-ai-fair  nvidia  perplexity-ai  langchain  kestra  ostrisai  llamaindex   feature-steering  social-bias  multimodality  model-optimization  workflow-orchestration  inference-speed  event-driven-workflows  knowledge-backed-agents  economic-impact  ai-national-security  trust-dynamics   sam-altman  lmarena_ai  aravsrinivas  svpino  richardmcngo  ajeya_cotra  tamaybes  danhendrycks  jerryjliu0  
 Liquid AI held a launch event introducing new foundation models. Anthropic shared follow-up research on social bias and feature steering with their "Golden Gate Claude" feature. Cohere released multimodal Embed 3 embeddings models following Aya Expanse. There was misinformation about GPT-5/Orion debunked by Sam Altman. Meta AI FAIR announced Open Materials 2024 with new models and datasets for inorganic materials discovery using the EquiformerV2 architecture. Anthropic AI demonstrated feature steering to balance social bias and model capabilities. NVIDIA's Llama-3.1-Nemotron-70B ranked highly on the Arena leaderboard with style control. Perplexity AI expanded to 100M weekly queries with new finance and reasoning modes. LangChain emphasized real application integration with interactive frame interpolation. Kestra highlighted scalable event-driven workflows with open-source YAML-based orchestration. OpenFLUX optimized inference speed by doubling it through guidance LoRA training. Discussions on AI safety included trust dynamics between humans and AI, economic impacts of AI automation, and the White House AI National Security memo addressing cyber and biological risks. LlamaIndex showcased knowledge-backed agents for enhanced AI applications.