enterprise AI - AI Infrastructure Intelligence Search

NVIDIA Other 2026-06-23

NVIDIA Launches Agent Toolkit: Nemotron Models, OpenShell Runtime for Specialized AI Agents

NVIDIA unveils Agent Toolkit, an open modular foundation with Nemotron models, NemoClaw blueprints, and OpenShell runtime, enabling enterprises to build secure, specialized AI agents. It targets life sciences, cybersecurity, and industrial workflows, aiming to turn frontier models into domain-specific digital coworkers.

AMD Other 2026-06-23

AMD MI430X GPU Delivers >200 TFLOPS Native FP64, Reshaping HPC-AI Convergence Baseline

AMD powers 4 of top 10 TOP500 supercomputers and previews MI430X GPU with >200 TFLOPS native FP64. This targets AI-for-science workloads, making double-precision compute a key metric for converged HPC-AI infrastructure, directly challenging NVIDIA and Intel.

ARM Other 2026-06-23

Arm servers capture >45% data center revenue, x86 ecosystem under AI-driven assault

IDC reports Q1 2026 global server revenue hit a record $122.6B, with Arm-based servers capturing >45% share (x86 at 52%). Accelerated servers (GPU/ASIC/FPGA) generated >70% revenue. Nvidia's Grace CPU (NVL72) and hyperscaler custom Arm chips drive the shift; x86 still leads in unit volume but faces supply constraints.

Amazon Other 2026-06-21

AWS Seizes Agent Control Plane with MCP Gateway and AgentCore

AWS launches managed web search for Bedrock AgentCore, autonomous agents in Amazon Quick, subagent MicroVM orchestration with LangChain, and MCP Gateway, shifting enterprise AI agents from prototypes to governed infrastructure with cloud-native control planes and execution isolation.

Cisco Other 2026-06-18

Cisco Leverages NVIDIA Spectrum Silicon and Nexus One to Reshape AI Network Control Plane

Cisco launches N9100 switches with NVIDIA Spectrum-6/4 silicon, delivering 102.4T throughput. It also introduces Nexus One unified management plane spanning NX-OS and SONiC, and extends Hybrid Mesh Firewall to BlueField DPUs for AI workload security offload, aiming for a turnkey AI fabric control plane.

Hewlett Packard Enterprise Other 2026-06-17

HPE Consolidates Morpheus & GreenLake into Unified Agentic Control Plane for Hybrid Cloud and AI

HPE integrates Morpheus software into GreenLake, delivering a unified agentic orchestration and control plane for AI factories and traditional workloads. GreenLake Intelligence advances agentic AIOps, with partnerships with ServiceNow and Citrix, aiming to reduce virtualization costs and simplify hybrid cloud operations under a single operating model.

Google Cloud Other 2026-06-17

ASUS Launches NVIDIA GB300 Deskside AI Supercomputer, Shifting Control from Cloud to On-Prem

ASUS launches the ExpertCenter Pro ET900N G3, powered by NVIDIA's GB300 Grace Blackwell Ultra Desktop Superchip, delivering 20 PFLOPS and 748GB of coherent memory for near-trillion parameter models. Concurrently, Coherent expands InP fab in Texas for optical interconnects, and NVIDIA plans a $20-25B debt offering, signaling a systemic shift of AI control from cloud to localized enterprise hardware.

Google Cloud Other 2026-06-17

Google Cloud Embeds Legal Verifiability into AI Agents via SPIFFE and Kakunin

Google Cloud introduces SPIFFE-based Agent Identity for Gemini Enterprise and Vertex AI, then overlays Kakunin's compliance layer to map internal SPIFFE identifiers to X.509 certificates generated in AWS KMS, with all state changes committed to WORM audit logs. This converts secure cloud workloads into legally auditable market participants to meet EU AI Act and MiCA accountability mandates.

AMD Other 2026-06-17

AMD MLPerf 6.0: MI350 GPUs Achieve 3.5x Leap with MXFP4, Debut Multi-Node Training

AMD submitted its most comprehensive MLPerf Training 6.0 results, including first multi-node training (FLUX.1 on 512 GPUs) and MXFP4 training recipe. MI355X delivers 3.5x generational leap over MI300X on Llama 2-70B, within 5% of NVIDIA B200. 10 ecosystem partners validated reproducibility.

AMD Other 2026-06-16

AMD and Rackspace Deploy 30MW Governed AI Stack: Ecosystem Restructuring from Silicon to Outcomes

AMD and Rackspace sign a definitive agreement to deploy 30MW of AMD AI compute (Instinct GPUs including MI355X, EPYC CPUs) across Rackspace's data centers, creating a governed enterprise AI stack with single accountability from silicon to outcomes, targeting regulated industries.

AMD Other 2026-06-15

AMD Acquires MEXT: AI-Predicted Flash Nears DRAM Performance to Cut AI Memory TCO

AMD acquires MEXT, an AI-driven memory optimization startup. MEXT's predictive technology makes NAND Flash behave like DRAM, expanding effective memory capacity for AI workloads and lowering TCO. The tech will be integrated across AMD's data center portfolio (EPYC, Instinct) to address memory bottlenecks in large models.

AMD Other 2026-06-15

AMD Open-Sources AI Software Stack on Vultr, Taking on NVIDIA CUDA Ecosystem

AMD launches a suite of open-source, modular enterprise AI software components on Vultr Marketplace, including AMD Inference Microservices (AIMs), AI Workbench, Resource Manager, and Solution Blueprints. This aims to provide production-grade AI infrastructure without vendor lock-in, directly challenging NVIDIA's CUDA ecosystem.

NVIDIA Other 2026-06-15

NVIDIA's Desktop DGX Station with GB300 Shifts Control from Cloud to Local Hardware

ASUS launches ExpertCenter Pro ET900N G3, built on NVIDIA DGX Station GB300 architecture with GB300 Grace Blackwell Ultra chip, 748GB coherent memory, and 20 PFLOPS AI performance. This deskside AI supercomputer enables local LLM fine-tuning, inference, and agentic AI workflows via NVLink-C2C and the full NVIDIA AI software stack including NemoClaw.

Anthropic Other 2026-06-15

DXC and Anthropic Forge Multi-Year Alliance: Claude-Certified Engineers for Mission-Critical AI

DXC Technology and Anthropic announce a multi-year global partnership, making DXC a Global Premier partner in the Claude Partner Network. They will train tens of thousands of Claude-certified engineers to deploy Claude models in mission-critical environments via the DXC OASIS platform, using a 'Customer Zero' internal validation approach.

NVIDIA Other 2026-06-14

NVIDIA Partners SK Telecom for Gigawatt-Scale AI Cloud, Pushes DSX as Sovereign AI Factory Blueprint

SK Telecom plans to build a gigawatt-scale AI cloud in Korea using NVIDIA's DSX platform, with first AI factory online in 2027. The platform integrates NVIDIA accelerated computing, systems, and software to support sovereign, physical, and agentic AI services, targeting expansion across Asia.

NVIDIA Other 2026-06-11

NVIDIA Optimizes Google's DiffusionGemma for 1,000 tok/s Parallel Text Generation

NVIDIA optimizes Google DeepMind's DiffusionGemma, a diffusion-based text model generating 256 tokens per step in parallel. On a single H100, it achieves 1,000 tok/s, with deployment via NIM and NeMo. This breaks the sequential token bottleneck, slashing serving costs and latency for real-time AI.

AMD Other 2026-06-11

AMD, Dell, Cambridge Launch UK Sovereign AI Lab to Challenge NVIDIA's CUDA Dominance with Open ROCm

AMD, Dell, and the University of Cambridge launch the Sovereign AI Innovation Lab (SAIL) in the UK, deploying Zenith supercomputer with 5th Gen EPYC and Instinct MI355X GPUs, plus the Sunrise fusion AI system. The lab promotes open, interoperable AI infrastructure based on AMD ROCm, challenging NVIDIA's CUDA lock-in and offering long-term technology choice for national AI initiatives.

AMD Other 2026-06-10

AMD EPYC Challenges Rack-Scale Density for Agentic AI Control

AMD claims its EPYC processors lead in rack-scale performance for agentic AI's CPU-intensive services (orchestration, caching, databases). Under a 100kW rack model, EPYC 9965 'Turin' delivers 2.37x throughput over NVIDIA Vera, with next-gen 'Venice' projected at 3.30x. Emphasizes deployability on current x86 platforms, avoiding future architecture dependency.

Microsoft Other 2026-06-09

Microsoft Locks Enterprise AI Agent Control Plane via KPMG's Global Agent 365 Rollout

KPMG globally adopts Microsoft Agent 365 to govern AI agents and expands Copilot deployment. Agent 365 becomes the central orchestration layer within KPMG Workbench, coordinating agents across systems, data, and business processes. This embeds Microsoft's AI management plane into the world's largest consulting delivery network, creating vendor lock-in for enterprise AI agent lifecycle control.

NVIDIA Other 2026-06-08

NVIDIA's UK Sovereign AI Play: From Chip Vendor to National Infrastructure Controller

NVIDIA partners with the UK government to deploy sovereign AI infrastructure via Isambard-AI (5,400 GH200 superchips) and the Sovereign AI Fund, backing local startups. This move establishes a national AI control plane, locking compute into NVIDIA's ecosystem and bypassing traditional hyperscalers like AWS and Azure.

Reports

Filter