Reports
AI-generated structured vendor updates
AMD Announces Breakthrough MLPerf Inference 6.0 Results, Showcasing Multinode Scaling and Multimodal Capabilities
AMD's MLPerf Inference 6.0 submission, powered by Instinct MI355X GPUs, surpassed 1 million tokens per second for the first time on models like Llama 2 70B and GPT-OSS-120B. The results highlight efficient multinode scaling, rapid enablement of new workloads (e.g., text-to-video model Wan-2.2-t2v), and reproducible performance across a broad partner ecosystem.
Arm Launches AGI CPU Silicon, Extends AI Infrastructure Reach
Arm debuts its first self-designed AGI CPU silicon, moving beyond IP licensing to offer full-stack solutions from custom silicon to integrated platforms. This shift redefines control points in AI infrastructure supply chains, enabling enterprises to optimize AI workload deployment at the hardware layer.
Intel Demonstrates AI Performance with Xeon 6 and Arc Pro GPUs in MLPerf Inference
Intel showcased the performance of its Xeon 6 CPUs and Arc Pro B-Series GPUs in the MLPerf Inference v6.0 benchmarks, particularly on large language models (LLMs). The results indicate that a system with four Arc Pro B70 GPUs can run 120B-parameter models, delivering up to 1.8x higher inference performance in multi-GPU setups.
Cisco Introduces Full-Stack Post-Quantum Cryptography Architecture
At Cisco Live 2026, Cisco unveiled the industry's first full-stack post-quantum cryptography (PQC) architecture using NIST-approved quantum-resistant algorithms, spanning from device boot integrity to data-in-transit protection. This represents the most significant cryptographic advancement in two decades, addressing the 'harvest now, decrypt later' threat posed by quantum computing.
AWS Collaborates with Flagship to Accelerate Life Sciences AI Innovation
AWS announced a strategic collaboration with Flagship Pioneering, becoming the preferred cloud provider for Flagship's portfolio companies and offering cloud resources and AI capabilities to accelerate drug discovery and scientific platform development. Flagship's early-stage companies will receive AWS cloud credits, technical support, and go-to-market resources, while internal teams gain specialized support for company creation and scaling.
NVIDIA Collaborates with Energy Leaders to Position AI Factories as Smart Grid Assets
NVIDIA, in collaboration with Emerald AI, proposes treating large-scale AI data centers (AI factories) as flexible, intelligent grid assets rather than static power loads. The architecture integrates accelerated computing, power networking, and control to improve grid reliability and energy efficiency. Several major energy companies plan to collaborate on it to support AI workloads and speed up power connections.
NVIDIA Collaborates with Energy Leaders on AI Factory-Grid Integration Architecture
NVIDIA and Emerald AI introduced a new architecture treating AI factories as intelligent grid assets, combining accelerated computing, real-time energy orchestration and reference designs. The Vera Rubin DSX-based approach enables dynamic grid response and has gained support from multiple energy providers.
OpenAI Secures $122B Funding for Global AI Infrastructure Expansion
OpenAI has raised $122 billion to expand frontier AI capabilities globally, invest in next-generation compute infrastructure, and meet growing demand for ChatGPT, Codex and enterprise AI solutions. This record funding will significantly scale up its AI training clusters and inference infrastructure.
Meta Elevates Product Privacy Review to AI-Driven Company-Wide Risk Review
Meta announced it is expanding its product Privacy Review program into a broader, AI-centric, company-wide Risk Review. The program uses AI to automate compliance workflows, identify risks earlier in product development, and enable continuous monitoring, with the aim of making manual processes the fallback rather than the default.
Meta Elevates AI-Powered Risk Review to Cross-Company Program
Meta transforms its product Privacy Review into an AI-centric cross-company Risk Review program, automating documentation pre-filling, proactive development-phase scanning, and continuous monitoring for earlier risk identification. The initiative combines AI scalability with human expertise to establish an automation-first compliance culture.
Arm Partners with Malaysian University to Cultivate Semiconductor Talent for AI Era
Arm announced a collaboration with Monash University Malaysia's School of Engineering, donating IC design development boards and establishing a guest lecturer program. The initiative aims to provide students with hands-on experience in AI chip design based on Arm architecture, addressing the growing demand for advanced computing talent in the APAC region.
Cisco Extends Enterprise Agreement to Nutanix, Framing Procurement Flexibility as Architectural
Cisco has extended its Enterprise Agreement (EA) framework to include Nutanix, marking Nutanix's first such agreement with an OEM. The move offers customers a unified procurement model with predictable pricing, on-demand capacity expansion, and the flexibility to shift value within the Nutanix software portfolio. Cisco's SVP positions commercial flexibility as an integral part of modern infrastructure architecture.
AWS and TGS Strategic Partnership for Energy AI and HPC Transformation
TGS selected AWS as its preferred cloud provider, leveraging AWS HPC and generative AI for energy exploration solutions. The collaboration includes modernizing the TGS Imaging AnyWare platform and deploying a multimodal Subsurface Foundation Model with AWS Nitro security.
Arm Expands into Silicon Products with First Self-Designed AGI CPU
Arm is expanding its compute platform into production silicon for the first time, launching the self-designed Arm AGI CPU for AI data centers and agentic workloads. It targets over 2x performance per rack versus x86 platforms and is backed by lead partner Meta, customers like OpenAI, and a broad OEM/ODM ecosystem.
NVIDIA Introduces Physical AI Data Factory Blueprint, Transforming Compute into Synthetic Data
At GTC, NVIDIA introduced the Physical AI Data Factory Blueprint, an open reference architecture designed to transform compute into large-scale, high-quality synthetic training data. Built on Cosmos world models and the OSMO operator, it addresses the bottleneck of scaling real-world data, aiming to serve as the data engine for next-gen autonomous systems and robots.
NVIDIA Forms Nemotron Coalition to Advance Open Frontier Models
NVIDIA announced the Nemotron Coalition at GTC, a collaboration with model builders and AI labs like Mistral AI to advance open, frontier-level foundation models. The initiative aims to foster the open model ecosystem by sharing expertise, data, and compute, emphasizing a future where AI is powered by a system of both open and proprietary models.
Meta Partners with Arm to Develop New AI Data Center CPUs
Meta partners with Arm to co-develop data center CPUs optimized for AI workloads. The first product, the Arm AGI CPU, aims to boost rack performance density for large-scale AI deployments. It will be available through Arm's ecosystem, with board designs to be open-sourced via the Open Compute Project.
Arm Launches AGI CPU for Agentic AI Infrastructure Era
Arm introduces the Arm AGI CPU, its first silicon product, built on Neoverse and designed for agentic AI infrastructure. Optimized for massively parallel workloads, it supports 272 cores per blade in a 1OU design, delivering 8160 cores per rack and over 2x performance vs. x86 systems.
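The two core counts quoted above imply a blade count per rack that the summary does not state. The short sketch below derives it by simple division. The function name is hypothetical, and the resulting figure is an inference from the quoted numbers, not a published specification.

```python
# Figures quoted in the summary above.
CORES_PER_BLADE = 272
CORES_PER_RACK = 8160


def implied_blades_per_rack(cores_per_rack: int, cores_per_blade: int) -> int:
    """Return the blade count implied by the two core counts.

    Raises ValueError if the rack total is not a whole multiple of the
    per-blade figure, which would mean the quoted numbers are inconsistent.
    """
    blades, remainder = divmod(cores_per_rack, cores_per_blade)
    if remainder:
        raise ValueError("rack core count is not a multiple of blade core count")
    return blades


blades = implied_blades_per_rack(CORES_PER_RACK, CORES_PER_BLADE)
print(blades)  # 30
```

The division is exact, so the quoted per-blade and per-rack figures are at least mutually consistent with a 30-blade rack.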
Arm Launches AGI CPU Silicon for AI Infrastructure Market
Arm introduced its first production AGI CPU silicon in March 2026, marking a strategic shift from IP licensing to supplying full silicon solutions. Designed for next-gen AI infrastructure, the move may reshape the data center processor ecosystem.
Arm Neoverse Reshapes Control Layer in AI Infrastructure
Arm introduces Neoverse infrastructure CPU cores optimized for cloud, AI, and HPC workloads. NVIDIA, AWS, Microsoft, and Google have adopted them for their AI platforms, citing performance gains and energy efficiency. The architecture enables high-density AI workload deployment in cloud and edge environments with enhanced multi-tenant security.