Filter

×
Active Filters Clear All
Keyword: CPU ×
134 Total Reports
5/7 Page
ARM Other High Signal 2026-04-03

ARM Optimizes Gemma 4 On-Device AI Performance with Google

ARM's SME2 technology in Armv9 architecture accelerates Google's Gemma 4 model on mobile devices, achieving 5.5x prefill speedup and 1.6x faster decoding. The collaboration enables developers to access optimizations without code changes, shifting on-device AI toward default mobile app architecture.

AMD Other High Signal 2026-04-02

AMD Announces Breakthrough MLPerf Inference 6.0 Results, Showcasing Multinode Scaling and Multimodal Capabilities

AMD's MLPerf Inference 6.0 submission, powered by Instinct MI355X GPUs, surpassed 1 million tokens per second for the first time on models like Llama 2 70B and GPT-OSS-120B. The results highlight efficient multinode scaling, rapid enablement of new workloads (e.g., text-to-video model Wan-2.2-t2v), and reproducible performance across a broad partner ecosystem.

ARM Other High Signal 2026-04-01

ARM Launches AGI CPU Silicon, Extends AI Infrastructure Reach

ARM debuts its first self-designed AGI CPU silicon, moving beyond IP licensing to offer full-stack solutions from custom silicon to integrated platforms. This shift redefines control points in AI infrastructure supply chains, enabling enterprises to optimize AI workload deployment at hardware layer.

Intel Other Medium Signal 2026-04-01

Intel Demonstrates AI Performance with Xeon 6 and Arc Pro GPUs in MLPerf Inference

Intel showcased the performance of its Xeon 6 CPUs and Arc Pro B-Series GPUs in the MLPerf Inference v6.0 benchmarks, particularly in handling large language models (LLMs). The results indicate that a system with four Arc Pro B70 GPUs can process 120B parameter models, delivering up to 1.8x higher inference performance in multi-GPU setups.

Qualcomm Other High Signal 2026-03-31

Qualcomm Launches NPU-Integrated Wearable Platform to Advance On-Device AI and Personal AI Ecosystem

Qualcomm unveiled the Snapdragon Wear Elite platform, its first wearable platform with an integrated NPU designed for on-device AI, capable of supporting up to two-billion-parameter models. It marks a strategic shift from smartphone-centric to agent-centric computing, leveraging wearables for continuous context and enabling intelligence to flow across a user's device ecosystem.

Amazon Other Medium Signal 2026-03-30

AWS and TGS Strategic Partnership for Energy AI and HPC Transformation

TGS selected AWS as preferred cloud provider, leveraging AWS HPC and generative AI for energy exploration solutions. Collaboration includes modernizing TGS Imaging AnyWare platform and deploying multimodal Subsurface Foundation Model with AWS Nitro security.

ARM Other High Signal 2026-03-27

Arm Expands into Silicon Products with First Self-Designed AGI CPU

Arm is expanding its compute platform into production silicon for the first time, launching the self-designed Arm AGI CPU for AI data centers and agentic workloads. It targets over 2x performance per rack versus x86 platforms and is backed by lead partner Meta, customers like OpenAI, and a broad OEM/ODM ecosystem.

Intel Other Medium Signal 2026-03-26

Intel and CrowdStrike Deepen AI PC Security Integration for Enhanced Endpoint Threat Detection

Intel and CrowdStrike expanded collaboration to deeply integrate Falcon platform with Intel AI PC hardware, leveraging CPU/GPU/NPU on-device AI acceleration and chip-level telemetry. The solution aims to enable real-time threat detection and intrusion prevention without performance loss, addressing generative AI data leakage risks at enterprise scale.

NVIDIA Other High Signal 2026-03-25

NVIDIA Demonstrates AI Factories as Flexible Grid Assets for Peak Demand Management

NVIDIA, in collaboration with EPRI, National Grid, and Emerald AI, demonstrated how AI factories powered by Blackwell GPU clusters can dynamically adjust power consumption in response to grid signals. This allows them to act as 'shock absorbers' during peak demand while maintaining performance for high-priority AI workloads.

ARM Other High Signal 2026-03-25

Arm Launches Data Center Silicon Product Entering Server Hardware Market

Arm launched its first data center silicon product, Arm AGI CPU, featuring a 1OU dual-node reference server design. This marks Arm's strategic shift from IP licensing to providing complete server hardware reference designs, aimed at building the chip foundation for agent AI cloud.

ARM Other High Signal 2026-03-25

Arm Launches Self-Developed AGI CPU for AI Data Center Market

Arm introduces its first self-developed AGI CPU for AI data centers, featuring Neoverse V3 architecture with claimed 2x performance per rack over x86 platforms. This marks Arm's strategic shift from IP licensing to silicon provider, with support from key customers including Meta and OpenAI.

Meta Other High Signal 2026-03-25

Meta Partners with Arm to Develop New AI Data Center CPUs

Meta partners with Arm to co-develop data center CPUs optimized for AI workloads. The first product, the Arm AGI CPU, aims to boost rack performance density for large-scale AI deployments. It will be available through Arm's ecosystem, with board designs to be open-sourced via the Open Compute Project.

Meta Other High Signal 2026-03-25

Meta and Arm Collaborate on AI-Optimized Data Center CPU

Meta partners with Arm to develop Arm AGI CPU optimized for AI workloads, targeting higher performance density and energy efficiency. As lead partner, Meta will open-source hardware designs via OCP and integrate with its proprietary MTIA chips.

ARM Other High Signal 2026-03-25

ARM Launches AGI CPU for Agentic AI Infrastructure Era

ARM introduces the Arm AGI CPU, its first silicon product, designed for agentic AI infrastructure on Neoverse. Optimized for massively parallel workloads, it supports 272 cores per blade in a 1OU design, delivering 8160 cores per rack and over 2x performance vs. x86 systems.

ARM Other High Signal 2026-03-25

ARM Launches AGI CPU Silicon for AI Infrastructure Market

ARM introduced its first production AGI CPU silicon in March 2026, marking a strategic shift from IP licensing to full silicon solutions provider. Designed for next-gen AI infrastructure, this move may reshape the data center processor ecosystem.

ARM Other High Signal 2026-03-25

Arm Neoverse Reshapes Control Layer in AI Infrastructure

ARM introduces Neoverse infrastructure CPU cores optimized for cloud, AI, and HPC workloads, adopted by NVIDIA, AWS, Microsoft, and Google for their AI platforms, delivering performance gains and energy efficiency. This architecture enables high-density AI workload deployment in cloud and edge environments with enhanced multi-tenant security.

ARM Other High Signal 2026-03-24

ARM and NVIDIA Drive Localization Revolution in AI Workstations

ARM and NVIDIA jointly launch DGX Spark AI workstations based on GB10 Grace Blackwell chips, with eight major OEMs releasing products simultaneously. The solution features unified memory architecture supporting 200B parameter models locally, with third-party tests showing 41% faster rendering and 3.2x AI processing speed versus x86 alternatives, enabling seamless cloud-to-edge toolchain migration.

Check Point Other 2026-03-23

Check Point AI Factory Blueprint: Security Control Shifts to NVIDIA DPU and LLM Layer

Check Point unveils AI Factory Security Blueprint, tightly integrating its firewall with NVIDIA BlueField DPU via DOCA. The architecture enforces security at four layers: LLM, AI infrastructure, perimeter, and workload. The new AI Factory Firewall delivers hardware-accelerated threat prevention without consuming CPU/GPU cycles, aiming to embed security into the AI fabric.

NVIDIA Other High Signal 2026-03-21

NVIDIA CEO Outlines Accelerated Computing Paradigm, Signaling AI Infrastructure Evolution

In an interview, NVIDIA CEO Jensen Huang systematically elaborated on accelerated computing as a fundamental shift in computer architecture. He emphasized the data center's transition from general-purpose CPUs to specialized acceleration platforms led by GPUs, and believes the future computing stack will be re-architected around accelerated computing.

NVIDIA Other High Signal 2026-03-21

NVIDIA Outlines Three-Stage Accelerated Computing Evolution and Software-Defined Data Center Strategy

NVIDIA CEO outlined a three-stage accelerated computing evolution, progressing from single GPU acceleration to full-stack acceleration, and now entering the software-defined, AI-driven data center phase. The company emphasizes dynamic resource allocation through software-defined infrastructure and reaffirms its full-stack AI strategy from chips to applications.