NVIDIA Other 2026-06-13

NVIDIA AgentPerf Benchmark: Blackwell Ultra Delivers 20x More Agents per Megawatt vs Hopper

NVIDIA and Artificial Analysis unveil AgentPerf, the first benchmark for agentic AI workloads. Results show the GB300 NVL72 platform delivers up to 20x more concurrent agents per megawatt than the HGX H200 when running DeepSeek V4 Pro, using real coding agent trajectories to measure throughput and responsiveness.

NVIDIA Other 2026-06-12

NVIDIA and SK Hynix Lock Down HBM4/5 Roadmap, Cementing Vera Rubin Supply Chain

NVIDIA and SK Hynix sign a multi-year agreement to co-define HBM4 production and HBM5 pre-research for Vera Rubin GPUs. Samsung also enters HBM4 supply as a second source. The deal elevates SK Hynix from vendor to co-developer, potentially creating a de facto memory standard barrier that marginalizes Micron and others.

Microsoft Other 2026-06-11

Microsoft & NVIDIA RTX Spark Brings 1 Petaflop AI to Windows, Reshaping Local Inference

At Computex 2026, Microsoft unveiled RTX Spark, an Arm-based AI superchip co-developed with NVIDIA and MediaTek, delivering up to 1 petaflop of AI performance and 128GB of unified memory for local 120B-parameter models. The Intel Arc G3 and Qualcomm Snapdragon X2 series also launched, accelerating the Windows AI PC ecosystem.

NVIDIA Other 2026-06-11

NVIDIA Locks Local AI Inference Control with DiffusionGemma Parallel Generation

NVIDIA optimizes Google DeepMind's DiffusionGemma open model, which generates 256 tokens in parallel for a 4x speedup over autoregressive models. It achieves 1,000 tokens/sec on H100 and 150 tokens/sec on DGX Spark, running fully locally with no cloud cost. This reinforces the centrality of NVIDIA GPUs in compute-bound local AI inference.

NVIDIA Other 2026-06-09

NVIDIA NVFP4: Native 4-Bit Training Boosts Throughput 1.73x, Locks Blackwell Ecosystem

NVIDIA introduces NVFP4, a native 4-bit format on Blackwell, enabling lossless mixed-precision pretraining in JAX/MaxText. It achieves a 1.73x throughput gain over FP8 on Llama 3.1 405B (GB300). Techniques like micro-block scaling and the Random Hadamard Transform boost performance but lock users into NVIDIA hardware.

NVIDIA Other 2026-06-08

NVIDIA's UK Sovereign AI Play: From Chip Vendor to National Infrastructure Controller

NVIDIA partners with the UK government to deploy sovereign AI infrastructure via Isambard-AI (5,400 GH200 superchips) and the Sovereign AI Fund, backing local startups. This move establishes a national AI control plane, locking compute into NVIDIA's ecosystem and bypassing traditional hyperscalers like AWS and Azure.

NVIDIA Other 2026-06-08

NVIDIA and LG Build AI Factory: DSX Platform Locks Physical AI Stack

NVIDIA and LG Group jointly build an AI factory on NVIDIA's DSX platform, integrating the Isaac Sim/Lab, Cosmos, and GR00T frameworks for robotics, autonomous driving, data centers, and sovereign AI. LG subsidiaries align cooling, robotics, and sensor components exclusively with NVIDIA, creating a fortified ecosystem.

NVIDIA Other 2026-06-04

NVIDIA Nemotron 3 Ultra: A MoE-Based Control Plane for Cost-Efficient AI Agent Orchestration

NVIDIA launches Nemotron 3 Ultra, a 550B-parameter MoE model (55B active) purpose-built for AI agent orchestration. Featuring Multi-Teacher On-Policy Distillation (MOPD) and a Hybrid Mamba-Transformer architecture, it achieves 5x throughput and 30% cost savings on tasks like SWE-bench, signaling a shift of reasoning control to a layered agent system.

Intel Other 2026-06-02

Intel and SambaNova Rackscale AI: CPU Regains Inference Control Plane

At Computex 2026, Intel unveiled rack-scale AI infrastructure combining Xeon 6+ with SambaNova SN-50 RDUs, plus a fully disaggregated inference cloud (prefill on NVIDIA Blackwell, decode on RDUs) by Vector Core Compute. This aims to reposition the CPU as the central orchestrator for inference, challenging GPU dominance.

ARM Other 2026-06-02

Arm-NVIDIA RTX Spark: Tightly Coupled CPU-GPU for Agentic AI PCs

The Arm-based NVIDIA RTX Spark integrates an Arm Grace CPU with an NVIDIA Blackwell RTX GPU via unified memory, enabling ultra-low-latency on-device AI inference for the agentic era. This platform marks a major milestone for Windows on Arm, targeting developers, creators, and gamers.

ARM Other 2026-06-02

Arm and NVIDIA RTX Spark: Unified Memory PC Architecture Targets Agentic AI, Encircles x86

Arm and NVIDIA unveil RTX Spark, an Arm-based Grace CPU + Blackwell RTX GPU platform with unified memory, targeting Windows on Arm for agentic AI inference. It delivers 1 petaflop, reduces token cost, and signals a PC paradigm shift from app-driven to agent-driven computing, backed by Microsoft.

NVIDIA Other 2026-06-01

NVIDIA FOX Blueprint Shifts Factory Control from PLCs to AI Agents on DGX

NVIDIA unveiled the Factory Operations Blueprint (FOX), a reference design for autonomous factory manager agents using NemoClaw, AI-Q Blueprint, and DGX Station (GB300 with 20 PFLOPS FP4, 748GB coherent memory). It unifies live machine signals, quality systems, and robot fleets under an AI decision layer. Foxconn, Pegatron, Advantech, and Wistron are early adopters, projecting 80% faster root cause analysis and 15% labor productivity gains.

NVIDIA Other 2026-06-01

NVIDIA Locks Taiwan Supply Chain with AI Factory Stack, Vera Rubin Production Tied to Proprietary Software

NVIDIA partners with TSMC, Foxconn, and others to embed its proprietary AI software (cuLitho, Omniverse, Isaac) into semiconductor manufacturing and server assembly, while ramping Vera Rubin NVL72 production. The move uses efficiency gains (e.g., 20-50% cycle time reduction) as bait to lock the supply chain into a full-stack ecosystem, increasing switching costs for partners.

NVIDIA Other 2026-06-01

NVIDIA Cosmos 3: Open-Source Physical AI Model with MoT for Ecosystem Lock-in

NVIDIA releases Cosmos 3, a unified physical AI foundation model with a Mixture-of-Transformers architecture combining reasoning, world generation, and action generation. It is open-sourced with training scripts and six synthetic datasets, but deployment is optimized for NVIDIA NIM and GPUs, signaling an ecosystem lock-in strategy.

NVIDIA Other 2026-06-01

NVIDIA RTX Spark: SoC Seizes PC Control, AI Compute Revolution with Ecosystem Lock-in

NVIDIA launches the RTX Spark SoC, integrating a Blackwell GPU with a 20-core Grace CPU (MediaTek co-designed), NVLink-C2C at 600GB/s, up to 128GB unified memory, 1 petaflop FP4 AI, and local 120B-parameter LLM support. This marks a shift from GPU vendor to platform provider, directly challenging Apple M, Qualcomm, and x86 incumbents.

NVIDIA Product Launch 2026-05-29

NVIDIA Blackwell Ultra GB300 NVL72: 1.44 EFLOPS FP4, 50x AI Factory Boost

NVIDIA launches the Blackwell Ultra GB300 NVL72 rack system with 72 Blackwell Ultra GPUs and 36 Grace CPUs, delivering 1,440 PFLOPS FP4 sparse, 20TB HBM3e, and 130TB/s NVLink. NVIDIA claims 50x AI factory output over Hopper. The system is available now.

NVIDIA Product Launch 2026-05-29

NVIDIA's Triple Play: Vera CPU, N1X Laptop Chip, and $6.5B Silicon Photonics Reshape AI Infra Control

NVIDIA delivers its first agent-specific Vera CPU (88 Arm v9.2 cores, 1.2TB/s memory bandwidth), teases the consumer N1X laptop chip, and invests $6.5B in silicon photonics. This shifts AI orchestration control from x86 to NVIDIA's Arm ecosystem. CPO addresses the memory wall, but volume production remains challenging until after 2028.

NVIDIA Other High Signal 2026-05-06

NVIDIA Opens MRC Protocol via OCP, Pushing Standardization of AI Ethernet Fabrics

NVIDIA announced the opening of its MRC (Multipath Reliable Connection) RDMA transport protocol via the Open Compute Project (OCP). The protocol, proven on Spectrum-X Ethernet hardware, aims to enhance throughput, resilience, and GPU utilization for large-scale AI training clusters through multi-path load balancing and hardware-level failure bypass.

NVIDIA Other 2026-05-05

NVIDIA Extreme Co-Design: Vera Rubin Platform Targets Agentic Inference TCO Inflection

NVIDIA unveils an extreme co-design stack for agentic systems, featuring Vera Rubin NVL72, NVLink 6, ConnectX-9, BlueField-4, and Spectrum-X. By disaggregating inference, optimizing KV cache management, and deploying low-latency fabrics, it aims to break the throughput-interactivity tradeoff, making high-context token processing economically viable.

NVIDIA Technology Update High Signal 2026-05-02

Global GPU Shortage to Persist Until 2027: Core Bottleneck for AI Infrastructure Expansion

The global GPU shortage is expected to extend to 2027-2028, rooted in surging AI data center demand, constrained HBM production, CoWoS packaging tightness, and geopolitical risks. Mass production of NVIDIA Rubin is hindered (target reduced from 2M to 1.5M units), with Blackwell capturing 71% of high-end GPU shipments in 2026. Consumer RTX 5080/5070 Ti cards are priced $200-$500 above MSRP, and enterprise AI infrastructure procurement cycles will extend further.