Filter

×
Active Filters Clear All
Keyword: Vera ×
205 Total Reports
1/11 Page
NVIDIA Other 2026-06-14

NVIDIA & SK hynix Deepen Memory Co-Engineering: Custom HBM for Vera Rubin and Jetson Thor

NVIDIA and SK hynix have announced a multiyear partnership to co-develop next-generation custom memory for NVIDIA's AI factory ecosystem, including Vera Rubin supercomputers, Vera CPUs, RTX Spark PCs, and Jetson Thor robotic platforms. SK hynix will also use NVIDIA CUDA-X libraries and Omniverse to accelerate semiconductor design and build fab digital twins.

NVIDIA Other 2026-06-14

NVIDIA Vera CPU: Seizing the AI Agent Control Plane from x86

NVIDIA unveils Vera CPU, purpose-built for AI agents, featuring 88 Olympus cores and 1.2TB/s LPDDR5X memory. Claiming 1.8x faster task completion over x86, it targets agentic AI workloads. Customers include Anthropic, OpenAI, and Oracle Cloud Infrastructure, signaling a shift of the AI control plane to NVIDIA's ecosystem.

NVIDIA Other 2026-06-13

NVIDIA AgentPerf Benchmark: Blackwell Ultra Delivers 20x More Agents per Megawatt vs Hopper

NVIDIA and Artificial Analysis unveil AgentPerf, the first benchmark for agentic AI workloads. Results show the GB300 NVL72 platform delivers up to 20x more concurrent agents per megawatt than the HGX H200 when running DeepSeek V4 Pro, using real coding agent trajectories to measure throughput and responsiveness.

NVIDIA Other 2026-06-12

NVIDIA and SK Hynix Lock Down HBM4/5 Roadmap, Cementing Vera Rubin Supply Chain

NVIDIA and SK Hynix sign a multi-year agreement to co-define HBM4 production and HBM5 pre-research for Vera Rubin GPUs. Samsung also enters HBM4 supply as a second source. The deal elevates SK Hynix from vendor to co-developer, potentially creating a de facto memory standard barrier that marginalizes Micron and others.

AMD Other 2026-06-12

AMD Zen 6 Venice 256-Core EPYC Claims 3.3x Rack Performance Over NVIDIA Vera, But Estimates Raise Questions

AMD unveils first estimated performance of Zen 6 Venice EPYC (2nm, 256 cores), claiming 3.3x rack-level integer throughput over NVIDIA Vera at 100kW total power. A direct counter to NVIDIA's Arm push, but based on projected estimates, not silicon.

NVIDIA Other 2026-06-11

NVIDIA Halos OS: A Certified Safety OS That Seizes Control of Autonomous Driving

NVIDIA introduces Halos OS, a full-stack safety system comprising ASIL D certified Halos Core, standardized Halos SDK, AI guardrails in Halos Applications, and cloud-based Safety Evaluation Framework. Built on DRIVE Hyperion, it aims to embed safety into L4 robotaxis from the ground up.

Cisco Other 2026-06-11

Cisco Cloud Control: The Control Plane Shift to AI-Native Unified Infrastructure and Observability

Cisco unveils Cisco Cloud Control, a new operating model integrating Splunk for AI-native observability and agentic operations. By unifying network infrastructure, data fabric, and AI trust, it aims to reduce MTTR and costs—but also tightens vendor lock-in on both networking and monitoring.

Microsoft Other 2026-06-11

Microsoft & NVIDIA RTX Spark Brings 1 Petaflop AI to Windows, Reshaping Local Inference

At Computex 2026, Microsoft unveiled RTX Spark, an Arm-based AI superchip co-developed with NVIDIA and MediaTek, delivering up to 1 petaflop AI performance and 128GB unified memory for local 120B parameter models. Intel Arc G3 and Qualcomm Snapdragon X2 series also launched, accelerating the Windows AI PC ecosystem.

Amazon Other 2026-06-10

Graviton5 + Nitro Formal Verification: AWS Locks AI CPU Control with ARM and Math

AWS launches Graviton5-based M9g/M9gd instances with 25% compute gain, PCIe Gen6, DDR5-8800, and the first formally verified cloud hypervisor (Nitro Isolation Engine). Meta deploys tens of millions of cores for agentic AI, marking a decisive ARM victory in cloud CPU.

Google Other 2026-06-10

Google Lightning Engine: 4.9x Spark Performance with Ecosystem Lock-in Risks

Google Cloud launches Lightning Engine GA for Apache Spark, delivering up to 4.9x faster performance via vectorized native execution on Gluten/Velox. Optimized Cloud Storage and BigQuery connectors boost throughput, but the premium tier and deep integration create vendor lock-in risks.

AMD Other 2026-06-10

AMD EPYC Challenges Rack-Scale Density for Agentic AI Control

AMD claims its EPYC processors lead in rack-scale performance for agentic AI's CPU-intensive services (orchestration, caching, databases). Under a 100kW rack model, EPYC 9965 'Turin' delivers 2.37x throughput over NVIDIA Vera, with next-gen 'Venice' projected at 3.30x. Emphasizes deployability on current x86 platforms, avoiding future architecture dependency.

Microsoft Other 2026-06-09

Microsoft Locks Enterprise AI Agent Control Plane via KPMG's Global Agent 365 Rollout

KPMG globally adopts Microsoft Agent 365 to govern AI agents and expands Copilot deployment. Agent 365 becomes the central orchestration layer within KPMG Workbench, coordinating agents across systems, data, and business processes. This embeds Microsoft's AI management plane into the world's largest consulting delivery network, creating vendor lock-in for enterprise AI agent lifecycle control.

Google Other 2026-06-09

GKE Inference Gateway Prefix Caching: 92% Faster AI Inference with Hidden Lock-in

Google Cloud launches GKE Inference Gateway with prefix caching and model-aware routing, achieving 92.8% lower TTFT and 15.7% higher throughput on Llama 3.1 8B. Snap reports 75-80% cache hit rates. However, deep integration with GKE Gateway API risks lock-in, limiting multi-cloud portability.

NVIDIA Other 2026-06-09

NVIDIA NVFP4: Native 4-Bit Training Boosts Throughput 1.73x, Locks Blackwell Ecosystem

NVIDIA introduces NVFP4, a native 4-bit format on Blackwell, enabling lossless mixed-precision pretraining in JAX/MaxText. Achieves 1.73x throughput gain over FP8 on Llama 3.1 405B (GB300). Techniques like micro-block scaling and Random Hadamard Transform boost performance but lock users into NVIDIA hardware.

NVIDIA Other 2026-06-08

NVIDIA's UK Sovereign AI Play: From Chip Vendor to National Infrastructure Controller

NVIDIA partners with the UK government to deploy sovereign AI infrastructure via Isambard-AI (5,400 GH200 superchips) and the Sovereign AI Fund, backing local startups. This move establishes a national AI control plane, locking compute into NVIDIA's ecosystem and bypassing traditional hyperscalers like AWS and Azure.

NVIDIA Other 2026-06-08

NVIDIA and Doosan: Full-Stack Physical AI Platform Restructures Industrial Automation

NVIDIA expands collaboration with Doosan Group to integrate its physical AI stack (Isaac Sim, Cosmos, Jetson Thor) into Doosan Robotics' Agentic Robot OS, explore AI factory power (SMR, hydrogen fuel cells), and MGX ecosystem PCB materials. This move transforms NVIDIA from a GPU vendor into the central platform for physical AI and AI factory infrastructure, deeply locking industrial automation partners.

Cloudflare Other 2026-06-08

Cloudflare Embeds Live Threat Intel into WAF, Shifting Control from Manual Rules to Automated Engine

Cloudflare announces integration of real-time threat intelligence (from Cloudforce One) into its WAF engine, enabling proactive rules based on IP, attacker names, target industries, etc. Uses always-on detection with O(1) constant-time lookup for negligible latency. Currently IP-based, with plans for JA3 and domain matching.

Cloudflare Other 2026-06-05

Cloudflare AI Gateway Adds Identity-Driven Budgets, Seizing AI Traffic Control

Cloudflare launches spend limits and identity-driven budgets (closed beta) in AI Gateway, integrating with Cloudflare Access. It enables per-user, per-team dollar budgets with fallback routing, shifting AI cost governance from model providers to the gateway control plane.

NVIDIA Other 2026-06-04

NVIDIA Nemotron 3 Ultra: A MoE-Based Control Plane for Cost-Efficient AI Agent Orchestration

NVIDIA launches Nemotron 3 Ultra, a 550B-parameter MoE model (55B active) purpose-built for AI agent orchestration. Featuring Multi-Teacher On-Policy Distillation (MOPD) and a Hybrid Mamba-Transformer architecture, it achieves 5x throughput and 30% cost savings on tasks like SWE-bench, signaling a shift of reasoning control to a layered agent system.

Cloudflare Other 2026-06-04

Cloudflare Acquires VoidZero: Capturing Dev Pipeline via Vite Integration

Cloudflare acquires VoidZero, bringing Vite, Rolldown, Oxc and other Rust-native tools into Workers, enabling one-click deploy from local code to global edge. This aims to unify the full dev lifecycle and push intent-based infrastructure provisioning.