Arm - AI Infrastructure Intelligence Search

NVIDIA Other 2026-07-31

NVIDIA Vera Rubin Goes Global: 10x Token per Megawatt, Locks In AI Factory Standard

NVIDIA announces full production and global delivery of Vera Rubin platform, with CoreWeave and cloud providers deploying NVL72 systems. Featuring Vera CPU and Rubin GPU, the platform delivers 10x token throughput per megawatt over Blackwell, enabling gigawatt-scale AI factories across 350+ sites.

ARM Other 2026-07-30

Arm Beat on Every Number — So Why Did the Stock Drop 6%?

...

ARM Other 2026-07-29

Arm's AGI CPU Targets x86 Dominance with 3x Performance Per Watt in AI Datacenters

Arm Neoverse dominates TOP500 and Green500, announces first custom AGI CPU for gigawatt-scale AI datacenters as central orchestrator. Claims 3x performance per watt over x86, signaling a control plane shift towards Arm. Also launches Arm Performix analysis tool.

NVIDIA Other 2026-07-29

NVIDIA Rubin GPU Detailed: 3nm Dual-Die, 336B Transistors, 288GB HBM4, NVLink 6 Doubles Bandwidth

NVIDIA unveiled the full Rubin GPU architecture at SIGGRAPH 2026: 3nm dual-die, 336B transistors, 288GB HBM4 with 22 TB/s bandwidth, and NVLink 6 at 3600 GB/s. The NVL72 rack integrates 72 GPUs with 36 Vera CPUs, requiring full liquid cooling due to >1000W TDP.

ARM Other 2026-07-28

Arm发布Neoverse CSS V3计算子系统单芯片AI性能提升50%

...

Microsoft Other 2026-07-26

Microsoft Launches MAI Models, Slashes GPU Costs 89%, Reducing OpenAI Dependency

Microsoft unveiled MAI-Image-2.5-Pro and MAI-Voice-2-Flash on Azure Foundry, achieving 96.8% text rendering accuracy at 8K and reducing GPU costs by 84-89% vs GPT. Integrated across Bing, PowerPoint, and Dynamics 365, it marks a strategic shift from OpenAI dependency. Also, NVIDIA Jetson heads to the moon for edge AI.

AMD Other 2026-07-26

AMD Helios Enters Production: 12-Stack HBM4 Outmuscles NVIDIA, UALoE Opens AI Network

AMD's second-generation Helios rack-scale AI server enters full production, featuring 72 MI455X GPUs with Samsung's exclusive 12-stack HBM4 (31TB per rack). Compared to NVIDIA's 8-stack design, it offers 50% more memory and 30% lower token cost. Microsoft Azure commits to large-scale deployment, solidifying hyperscaler dual-vendor strategy.

Microsoft Other 2026-07-25

Microsoft and Databricks Expand Partnership: Full Azure Migration with Cobalt 200 ARM, Locking AI Agent Control Plane

Databricks will fully migrate to Azure, using Microsoft Cobalt 200 ARM chips for data and AI workloads, with deep integration of Genie and Unity AI Gateway into Microsoft products, locking in long-term partnership. Databricks raises $18.8B valuation.

NVIDIA Other 2026-07-25

NVIDIA and Wistron Launch US-Based GB300 Production, Shifting AI Manufacturing Landscape

NVIDIA partnered with Wistron to open a $700M factory in Texas, achieving L6 integration of the GB300 Grace Blackwell Ultra system. The first US-made unit contains 1.5M parts, weighs 2 tons, and costs $4M. This milestone accelerates US AI capex localization from 5% to 30%, reshaping the global AI supply chain.

Microsoft Other 2026-07-24

AMD Helios Full-Stack AI Server Deployed on Azure, UALoE Open Standard Challenges NVLink

AMD and Microsoft Azure announce large-scale deployment of Helios full-stack AI servers, featuring 72 MI455X GPUs, 18 EPYC Venice CPUs, and Pensando DPUs, with 31TB HBM4 and 1.7PB/s memory bandwidth. Two new Azure VM series target Agentic AI and semiconductor design, marking Microsoft's multi-vendor strategy and challenging NVIDIA's dominance.

Google Cloud Other 2026-07-23

Google Cloud Unveils GKE AI Security Blueprint with Three-Layer Defense

Google Cloud launches a security blueprint for AI workloads on GKE, featuring a three-layer defense: Confidential GKE Nodes for hardware memory encryption, open-source k8s-aibom for AI bill of materials, and Model Armor for prompt injection and data leak detection.

ARM Other 2026-07-22

ARM架构驱动全球最快超算LineShine，首次突破2 exaflops sustained性能

...

NVIDIA Other 2026-07-22

NVIDIA Reveals Vera Rubin GPU and Vera CPU: 3360B Transistors, 88-Core Olympus, 10x Agentic AI Efficiency

NVIDIA fully discloses Vera Rubin GPU and Vera CPU specifications. The GPU features 3360B transistors, HBM4 288GB, and 10x agentic AI efficiency over Blackwell. The CPU has 88 custom Olympus cores, delivering 2.2x faster agentic AI performance than Intel Sapphire Rapids. This solidifies NVIDIA's full-stack strategy against x86 incumbents.

Hewlett Packard Enterprise Other 2026-07-20

HPE Expands Private Cloud AI with NVIDIA Vera Rubin, Enabling Agent-Native AI Factory

HPE expands its Private Cloud AI line with NVIDIA Vera Rubin NVL72 and HGX Rubin NVL8, introducing Compute XD700, Cray GX240 blade with Vera CPU, and Quantum-X800 InfiniBand. New software includes Agent Toolkit and NemoClaw for agent-native AI, with Alletra Storage MP X10000 in Q4 2026.

NVIDIA Other 2026-07-20

NVIDIA Agent Toolkit Shifts AI Agent Control from Cloud to Local DGX Station

NVIDIA launches Agent Toolkit for DGX Station, comprising NemoClaw, Nemotron 3 Ultra, Omniverse Libraries, and OpenShell. It enables local AI agent deployment in 30 minutes, locking developers into NVIDIA's hardware-software stack and shifting control from cloud services to on-premises hardware.

ARM Other 2026-07-19

ARM Launches AGI CPU, Achieves 3x Performance Per Watt, Tops Supercomputing Rankings

At ISC 2026, ARM announced the Armv9-based LineShine supercomputer as the first to exceed 2 exaflops, topping the TOP500. It also launched the AGI CPU with 136 Neoverse V3 cores for gigawatt-scale AI datacenters, with Neoverse systems achieving 3x performance per watt over x86 and leading the Green500.

NVIDIA Other 2026-07-16

NVIDIA Jetson Thor T3000/T2000: Blackwell GPU Crashes Edge AI Cost Barrier

NVIDIA unveils Jetson Thor T3000 and T2000 modules. The T3000 packs a Blackwell GPU and 8-core Neoverse CPU, delivering 865 FP4 TFLOPS at half the power of the T5000. New Jetson Agent Skills automate memory optimization, aiming to scale deployment of humanoid robots and edge AI.

Google Other 2026-07-15

Google Deeply Integrates Gemini Enterprise Telemetry with BigQuery for AI Governance

Google Cloud enables streaming Gemini Enterprise app telemetry (prompts, responses, activity logs) into BigQuery for real-time analysis. Leveraging BigQuery's AI capabilities (Conversational Analytics, auto-schema), it automates auditing, compliance, and insights for large-scale AI deployments, driving data-driven AI observability.

AMD Other 2026-07-15

AMD Confirms Zen 6 EPYC Venice: First 2nm Server CPU Launching July 2026

AMD confirms Zen 6 EPYC Venice launch at Advancing AI 2026 (July 22-23). As the first 2nm server CPU, it features triple-core hybrid architecture, up to 192 cores, ~29% single-thread and ~22% multi-thread gains, targeting AI inference and tight CPU-GPU synergy via Infinity Fabric.

Cisco Other 2026-07-13

Cisco Launches Cloud Control and AgenticOps to Consolidate Network Management

At Cisco Live 2026, Cisco unveiled Cloud Control to unify Meraki, Catalyst Center, Nexus Dashboard, Security Cloud Control, and Splunk, along with AgenticOps for AI-driven network automation. Concurrently, it laid off 471 employees to align with an AI-first strategy, shifting from hardware sales to operational subscriptions and creating vendor lock-in.

Reports

Filter