Agentic AI - AI Infrastructure Intelligence Search

Nokia Other 2026-07-29

Nokia Launches Industry First Commercial AI-RAN Platform with NVIDIA: AI-Native anyRAN, MantaRay SMO with Agentic AI, Doksuri AI-Ready Radios

...

ARM Other 2026-07-29

Arm's AGI CPU Targets x86 Dominance with 3x Performance Per Watt in AI Datacenters

Arm Neoverse dominates TOP500 and Green500, announces first custom AGI CPU for gigawatt-scale AI datacenters as central orchestrator. Claims 3x performance per watt over x86, signaling a control plane shift towards Arm. Also launches Arm Performix analysis tool.

Intel Other 2026-07-29

AI Evolution Resurrects Intel CPU Strength: Xeon 6 Fastest-Ramping Product, DCAI Revenue Up 59% YoY

...

AMD Other 2026-07-26

AMD Helios Enters Production: 12-Stack HBM4 Outmuscles NVIDIA, UALoE Opens AI Network

AMD's second-generation Helios rack-scale AI server enters full production, featuring 72 MI455X GPUs with Samsung's exclusive 12-stack HBM4 (31TB per rack). Compared to NVIDIA's 8-stack design, it offers 50% more memory and 30% lower token cost. Microsoft Azure commits to large-scale deployment, solidifying hyperscaler dual-vendor strategy.

Microsoft Other 2026-07-25

Microsoft and Databricks Expand Partnership: Full Azure Migration with Cobalt 200 ARM, Locking AI Agent Control Plane

Databricks will fully migrate to Azure, using Microsoft Cobalt 200 ARM chips for data and AI workloads, with deep integration of Genie and Unity AI Gateway into Microsoft products, locking in long-term partnership. Databricks raises $18.8B valuation.

Google Other 2026-07-24

Google Begins Gemini 4 Pre-training with 4M+ Context and Monthly Releases

Alphabet confirms start of largest pre-training run for Gemini 4, featuring 4M+ context and native multimodality with near-monthly releases. 2026 capex raised to $195-205B, Google Cloud Q2 up 82%, signaling full-stack AI acceleration.

AMD Other 2026-07-24

AMD Helios Rack Challenges NVIDIA NVLink with Open UALoE Interconnect

At Advancing AI 2026, AMD launched the Helios rack with 72 MI455X GPUs, 18 Venice EPYC CPUs, and Pensando networking, claiming 30% higher inference token/$ vs NVIDIA NVL72. It introduced UALoE open interconnect to break NVLink lock-in, partnering with Cerebras, Cisco, and major AI firms.

Microsoft Other 2026-07-24

AMD Helios Full-Stack AI Server Deployed on Azure, UALoE Open Standard Challenges NVLink

AMD and Microsoft Azure announce large-scale deployment of Helios full-stack AI servers, featuring 72 MI455X GPUs, 18 EPYC Venice CPUs, and Pensando DPUs, with 31TB HBM4 and 1.7PB/s memory bandwidth. Two new Azure VM series target Agentic AI and semiconductor design, marking Microsoft's multi-vendor strategy and challenging NVIDIA's dominance.

Hewlett Packard Enterprise Other 2026-07-22

HPE扩展Private Cloud AI产品线，集成NVIDIA Vera Rubin NVL72平台

...

NVIDIA Other 2026-07-22

NVIDIA Reveals Vera Rubin GPU and Vera CPU: 3360B Transistors, 88-Core Olympus, 10x Agentic AI Efficiency

NVIDIA fully discloses Vera Rubin GPU and Vera CPU specifications. The GPU features 3360B transistors, HBM4 288GB, and 10x agentic AI efficiency over Blackwell. The CPU has 88 custom Olympus cores, delivering 2.2x faster agentic AI performance than Intel Sapphire Rapids. This solidifies NVIDIA's full-stack strategy against x86 incumbents.

ARM Other 2026-07-19

ARM Launches AGI CPU, Achieves 3x Performance Per Watt, Tops Supercomputing Rankings

At ISC 2026, ARM announced the Armv9-based LineShine supercomputer as the first to exceed 2 exaflops, topping the TOP500. It also launched the AGI CPU with 136 Neoverse V3 cores for gigawatt-scale AI datacenters, with Neoverse systems achieving 3x performance per watt over x86 and leading the Green500.

Huawei Other 2026-07-17

Huawei unveils Atlas 950 SuperPoD: 1024-card memory-coherent AI supernode

Huawei unveiled the Atlas 950 SuperPoD at WAIC 2026, powered by the 950DT chip, supporting 1024 interconnected cards with 256TB unified memory addressing. Designed for trillion-parameter model training and Agentic AI inference, it marks a shift from chip-level stacking to system-level unified architecture.

Google Other 2026-07-15

Google Deeply Integrates Gemini Enterprise Telemetry with BigQuery for AI Governance

Google Cloud enables streaming Gemini Enterprise app telemetry (prompts, responses, activity logs) into BigQuery for real-time analysis. Leveraging BigQuery's AI capabilities (Conversational Analytics, auto-schema), it automates auditing, compliance, and insights for large-scale AI deployments, driving data-driven AI observability.

Microsoft Other 2026-07-15

Microsoft Releases Go SDK for Agent Framework, Challenging Google in Go Ecosystem

In July 2026, Microsoft released the Go SDK for Agent Framework in public preview, supporting MCP and multi-agent coordination. This positions Microsoft alongside Google as the only major cloud vendors offering native Go Agent SDKs, while OpenAI and Anthropic lag with Python-only support, risking developer ecosystem erosion.

NVIDIA Other 2026-07-08

NVIDIA Rigel Core: Single-Threaded CPU as the New Control Plane for Agentic AI

NVIDIA unveils Rosa CPU architecture with custom Rigel core (Arm v9.2), targeting single-threaded performance for Agentic AI workloads, paired with Feynman GPU (1.6nm, 50 PFLOPS) in 2028. This shifts CPU design from core-count scaling to serial-latency optimization, directly challenging AMD EPYC and Intel Xeon dominance.

NVIDIA Other 2026-07-07

NVIDIA Vera CPU: Max Single-Threaded Performance at Scale for Agentic AI

NVIDIA launches Vera CPU, a max single-threaded CPU at scale for agentic AI. With Olympus cores delivering 1.8x sustained per-core performance over x86, 1.2TB/s LPDDR5X bandwidth, and 3.4TB/s core-to-core bandwidth, Vera integrates into NVIDIA's unified AI factory architecture, aiming to lock users into its ecosystem.

NVIDIA Other 2026-07-07

AI Innovators Adopt NVIDIA Vera — Why Max Single-Threaded CPU at Scale Matters

...

Huawei Other 2026-06-25

Huawei Pushes Token-Based Billing at MWC Shanghai 2026: Shifting Carrier Monetization from Bytes to AI Inference Value

At MWC Shanghai 2026, Huawei urged carriers to shift from byte-based to token-based billing for AI workloads, showcasing a 372% token throughput improvement in long-sequence inference via its AI Inference Acceleration Solution. It also highlighted the Upper-6 GHz band as critical for AI wearables requiring 20 Mbps uplink, aiming to reposition 5G-A networks as AI compute delivery infrastructure.

OpenAI Other 2026-06-25

Oracle Defense Ecosystem Cohort 3: Offline AI on Roving Edge Devices Goes Operational

Oracle announced the third cohort of its Defense Ecosystem at the Brussels summit, adding 10 companies. Concurrently, Whitespace's Saga AI system deployed on Oracle Roving Edge Devices during Royal Navy's Operation HIGHMAST, running classified AI workloads completely offline, proving sovereign edge AI is operational.

NVIDIA Other 2026-06-25

Qualcomm Dragonfly: 250-core CPU, HBC memory, UALink interconnects target AI inference TCO

Qualcomm unveils full data center portfolio: Dragonfly C1000 250-core Oryon CPU (>5GHz, PCIe Gen7, CXL), HBC near-memory compute (133TB/s Gen1, 18x-54x effective BW), AI300 inference accelerator (UALink/ESUN scale-up), and 800G/1.6T connectivity. Multi-year Meta CPU deal. Commercial sampling 2027-2028. Targets inference TCO with tokens-per-watt leadership.

Reports

Filter