Company: "artificialanlys"

deepseek-v4-flash gpt-5.6-luna terra deepseek huggingface openai post-training agent-specialization quantization model-deployment api cost-efficiency cache-optimization long-context agentic-ai open-weights model-performance kimmonismus cline artificialanlys miaai_lab _akhaliq vllm_project unslothai danielhanchen jakevin7 arena omarsar0

DeepSeek launched the public-beta of DeepSeek-V4-Flash API, boasting a significant post-training performance leap without architecture or size changes, achieving a Terminal-Bench score of 82.7 and nearing GPT-5.6 Luna's 51 score at about 60% lower cost per task. The model features 284B total / 13B active parameters, supports 1M context length, and offers aggressive pricing with a 98% cache-hit discount. Open weights were released immediately under MIT license on Hugging Face, enabling local and quantized deployment with 4-bit and 3-bit quantization options. The update emphasizes improved agent specialization and tool use, with autonomous subagent swarm patterns and better harness sensitivity. This release also intensified the ongoing price competition with OpenAI's GPT-5.6 Luna and Terra models, highlighting a new era of "cheap intelligence" in AI agent benchmarks.