Reports
AI-generated structured vendor updates
ARM Optimizes Gemma 4 On-Device AI Performance with Google
ARM's SME2 technology in Armv9 architecture accelerates Google's Gemma 4 model on mobile devices, achieving 5.5x prefill speedup and 1.6x faster decoding. The collaboration enables developers to access optimizations without code changes, shifting on-device AI toward default mobile app architecture.
AMD Announces Breakthrough MLPerf Inference 6.0 Results, Showcasing Multinode Scaling and Multimodal Capabilities
AMD's MLPerf Inference 6.0 submission, powered by Instinct MI355X GPUs, surpassed 1 million tokens per second for the first time on models like Llama 2 70B and GPT-OSS-120B. The results highlight efficient multinode scaling, rapid enablement of new workloads (e.g., text-to-video model Wan-2.2-t2v), and reproducible performance across a broad partner ecosystem.
ARM Launches AGI CPU Silicon, Extends AI Infrastructure Reach
ARM debuts its first self-designed AGI CPU silicon, moving beyond IP licensing to offer full-stack solutions from custom silicon to integrated platforms. This shift redefines control points in AI infrastructure supply chains, enabling enterprises to optimize AI workload deployment at hardware layer.
Intel Demonstrates AI Performance with Xeon 6 and Arc Pro GPUs in MLPerf Inference
Intel showcased the performance of its Xeon 6 CPUs and Arc Pro B-Series GPUs in the MLPerf Inference v6.0 benchmarks, particularly in handling large language models (LLMs). The results indicate that a system with four Arc Pro B70 GPUs can process 120B parameter models, delivering up to 1.8x higher inference performance in multi-GPU setups.
Qualcomm Launches NPU-Integrated Wearable Platform to Advance On-Device AI and Personal AI Ecosystem
Qualcomm unveiled the Snapdragon Wear Elite platform, its first wearable platform with an integrated NPU designed for on-device AI, capable of supporting up to two-billion-parameter models. It marks a strategic shift from smartphone-centric to agent-centric computing, leveraging wearables for continuous context and enabling intelligence to flow across a user's device ecosystem.
AWS and TGS Strategic Partnership for Energy AI and HPC Transformation
TGS selected AWS as preferred cloud provider, leveraging AWS HPC and generative AI for energy exploration solutions. Collaboration includes modernizing TGS Imaging AnyWare platform and deploying multimodal Subsurface Foundation Model with AWS Nitro security.
Arm Expands into Silicon Products with First Self-Designed AGI CPU
Arm is expanding its compute platform into production silicon for the first time, launching the self-designed Arm AGI CPU for AI data centers and agentic workloads. It targets over 2x performance per rack versus x86 platforms and is backed by lead partner Meta, customers like OpenAI, and a broad OEM/ODM ecosystem.
Intel and CrowdStrike Deepen AI PC Security Integration for Enhanced Endpoint Threat Detection
Intel and CrowdStrike expanded collaboration to deeply integrate Falcon platform with Intel AI PC hardware, leveraging CPU/GPU/NPU on-device AI acceleration and chip-level telemetry. The solution aims to enable real-time threat detection and intrusion prevention without performance loss, addressing generative AI data leakage risks at enterprise scale.
NVIDIA Demonstrates AI Factories as Flexible Grid Assets for Peak Demand Management
NVIDIA, in collaboration with EPRI, National Grid, and Emerald AI, demonstrated how AI factories powered by Blackwell GPU clusters can dynamically adjust power consumption in response to grid signals. This allows them to act as 'shock absorbers' during peak demand while maintaining performance for high-priority AI workloads.
Arm Launches Data Center Silicon Product Entering Server Hardware Market
Arm launched its first data center silicon product, Arm AGI CPU, featuring a 1OU dual-node reference server design. This marks Arm's strategic shift from IP licensing to providing complete server hardware reference designs, aimed at building the chip foundation for agent AI cloud.
Arm Launches Self-Developed AGI CPU for AI Data Center Market
Arm introduces its first self-developed AGI CPU for AI data centers, featuring Neoverse V3 architecture with claimed 2x performance per rack over x86 platforms. This marks Arm's strategic shift from IP licensing to silicon provider, with support from key customers including Meta and OpenAI.
Meta Partners with Arm to Develop New AI Data Center CPUs
Meta partners with Arm to co-develop data center CPUs optimized for AI workloads. The first product, the Arm AGI CPU, aims to boost rack performance density for large-scale AI deployments. It will be available through Arm's ecosystem, with board designs to be open-sourced via the Open Compute Project.
Meta and Arm Collaborate on AI-Optimized Data Center CPU
Meta partners with Arm to develop Arm AGI CPU optimized for AI workloads, targeting higher performance density and energy efficiency. As lead partner, Meta will open-source hardware designs via OCP and integrate with its proprietary MTIA chips.
ARM Launches AGI CPU for Agentic AI Infrastructure Era
ARM introduces the Arm AGI CPU, its first silicon product, designed for agentic AI infrastructure on Neoverse. Optimized for massively parallel workloads, it supports 272 cores per blade in a 1OU design, delivering 8160 cores per rack and over 2x performance vs. x86 systems.
ARM Launches AGI CPU Silicon for AI Infrastructure Market
ARM introduced its first production AGI CPU silicon in March 2026, marking a strategic shift from IP licensing to full silicon solutions provider. Designed for next-gen AI infrastructure, this move may reshape the data center processor ecosystem.
Arm Neoverse Reshapes Control Layer in AI Infrastructure
ARM introduces Neoverse infrastructure CPU cores optimized for cloud, AI, and HPC workloads, adopted by NVIDIA, AWS, Microsoft, and Google for their AI platforms, delivering performance gains and energy efficiency. This architecture enables high-density AI workload deployment in cloud and edge environments with enhanced multi-tenant security.
ARM and NVIDIA Drive Localization Revolution in AI Workstations
ARM and NVIDIA jointly launch DGX Spark AI workstations based on GB10 Grace Blackwell chips, with eight major OEMs releasing products simultaneously. The solution features unified memory architecture supporting 200B parameter models locally, with third-party tests showing 41% faster rendering and 3.2x AI processing speed versus x86 alternatives, enabling seamless cloud-to-edge toolchain migration.
Check Point AI Factory Blueprint: Security Control Shifts to NVIDIA DPU and LLM Layer
Check Point unveils AI Factory Security Blueprint, tightly integrating its firewall with NVIDIA BlueField DPU via DOCA. The architecture enforces security at four layers: LLM, AI infrastructure, perimeter, and workload. The new AI Factory Firewall delivers hardware-accelerated threat prevention without consuming CPU/GPU cycles, aiming to embed security into the AI fabric.
NVIDIA CEO Outlines Accelerated Computing Paradigm, Signaling AI Infrastructure Evolution
In an interview, NVIDIA CEO Jensen Huang systematically elaborated on accelerated computing as a fundamental shift in computer architecture. He emphasized the data center's transition from general-purpose CPUs to specialized acceleration platforms led by GPUs, and believes the future computing stack will be re-architected around accelerated computing.
NVIDIA Outlines Three-Stage Accelerated Computing Evolution and Software-Defined Data Center Strategy
NVIDIA CEO outlined a three-stage accelerated computing evolution, progressing from single GPU acceleration to full-stack acceleration, and now entering the software-defined, AI-driven data center phase. The company emphasizes dynamic resource allocation through software-defined infrastructure and reaffirms its full-stack AI strategy from chips to applications.