All tags
Topic: "cuda-graphs"
NVIDIA to invest $100B in OpenAI for 10GW of Vera Rubin rollout
qwen3-omni deepseek-v3.1 nvidia openai oracle intel enfabrica wayne gpu-infrastructure deterministic-inference reinforcement-learning fp8-precision gpu-performance ai-infrastructure strategic-partnerships investment datacenters cuda-graphs pipeline-parallelism data-parallelism artificialanlys gdb
NVIDIA and OpenAI announced a landmark strategic partnership to deploy at least 10 gigawatts of AI datacenters using NVIDIA's systems, with NVIDIA investing up to $100 billion progressively as each gigawatt is deployed, starting in the second half of 2026 on the Vera Rubin platform. This deal significantly impacts the AI infrastructure funding landscape, potentially supporting OpenAI's $300 billion commitment to Oracle. The announcement caused major stock market reactions, with NVIDIA's market cap surging by $170 billion. Additionally, advancements in deterministic inference for reinforcement learning and FP8 precision gains in GPU performance were highlighted by AI practitioners.
Quis promptum ipso promptiet?
llama-3-70b llama-3-120b llama-3 llama-cpp anthropic openai zoominfo neuralink prompt-engineering chain-of-thought rag quantization cuda-graphs gpu-optimization thought-controlled-devices modeling-consciousness conference sama gdb bindureddy svpino rohanpaul_ai alexalbert__ abacaj
Anthropic released upgrades to their Workbench Console, introducing new prompt engineering features like chain-of-thought reasoning and prompt generators that significantly reduce development time, exemplified by their customer Zoominfo. OpenAI teased a "magic" new development coming soon, speculated to be a new LLM replacing GPT-3.5 in the free tier or a search competitor. The open-source community highlighted Llama 3 70B as "game changing" with new quantized weights for Llama 3 120B and CUDA graph support for llama.cpp improving GPU performance. Neuralink demonstrated a thought-controlled mouse, sparking interest in modeling consciousness from brain signals. The ICLR 2024 conference is being held in Asia for the first time, generating excitement.