Filter

×
Active Filters Clear All
Keyword: CPU ×
134 Total Reports
3/7 Page
Huawei Other 2026-05-25

Huawei's Tao Law: LogicFolding Bypasses Lithography, 55% Density Gain on Fixed Node

At ISCAS 2026, Huawei's He Tingbo unveiled the Tao Law, replacing geometric scaling with temporal optimization targeting tau (characteristic time). LogicFolding vertically stacks active layers to shorten critical paths, achieving 55% transistor density increase and 41% energy efficiency gain on a fixed node. Kirin 2026 reaches 3.1GHz; Ascend series will adopt LogicFolding. The roadmap projects equivalent 1.4nm density by 2031, fundamentally challenging Moore's Law's lithography dependency.

Intel Other 2026-05-25

Intel CEO: AI Inference Flips CPU/GPU Ratio, Multi-Agent Pushes CPU Back to Core

Intel CEO Lip-Bu Tan forecasts AI inference driving CPU/GPU ratio from 1:8 to 1:1 or even 4:1, with Multi-Agent demands (OS scheduling, KV Cache offload, high-concurrency tool calls) elevating CPU from supporting role to lead. NVIDIA Vera, AMD Venice, and Intel 18A CPU mass production confirm a CPU demand super-cycle.

NVIDIA Other 2026-05-25

NVIDIA Vera CPU Threatens x86: 1.5x Performance, 4x Density, Full-Stack AI Lock-In

Rumors indicate NVIDIA will unveil its first general-purpose CPU Vera at Computex 2026, claiming 1.5x x86 performance, 2x throughput, and 4x rack density. Shipment targets: 1.2M units in FY2027, 4.2M in FY2028. Vera targets the AI inference shift from 1:8 to 1:1 CPU/GPU ratio, complementing Grace to create a full GPU+CPU stack.

Google Product Launch 2026-05-22

Google I/O 2026 Pivots to Agentic AI: Antigravity 2.0 and TPU 8t/8i Reshape Control Plane

At I/O 2026, Google unveiled Gemini 3.5 Flash (4x output speed), Antigravity 2.0 multi-agent orchestration, TPU 8t/8i (3x training, 2x inference perf/W), and Gemini Spark, signaling a full pivot to Agentic AI infrastructure. By integrating platform and silicon, Google shifts control from model APIs to orchestration and hardware lock-in.

Intel Other 2026-05-20

Intel Core Ultra 3 SoC Replaces Discrete GPUs in Edge Robotics, Slashing TCO

Intel Core Ultra Series 3 SoC integrates CPU, GPU, and NPU to power edge robotics, replacing discrete GPUs. Partners like Sensory AI run multi-agent AI (vision, language, motion) locally, cutting TCO and eliminating cloud latency. This shifts the cost-performance curve for service robots.

AMD Other 2026-05-20

AMD Ryzen AI Halo & Max PRO 400: Local 300B Parameter Inference, but Hidden Lock-in and Thermal Limits

AMD launches Ryzen AI Halo developer platform (128GB unified memory, 200B parameter models) and Ryzen AI Max PRO 400 series (first x86 client to run 300B parameter models locally). Unified memory, ROCm optimization, and OEM partnerships aim to shift agentic AI from cloud to local, but shared memory bandwidth and thermal constraints limit real-world throughput.

Intel Other 2026-05-16

AI Agent Workloads Trigger Structural CPU Shortage, Arm and AMD Reshape Server Value Chain

AI inference and agent orchestration surge CPU demand, shifting CPU-GPU ratio from 1:8 to 1:1. AMD EPYC lead time 8-12 weeks, Intel Xeon up to 6 months; Arm's 3nm 136-core AGI processor co-developed with Meta/Cerebras/Cloudflare/OpenAI sees demand exceeding 200 billion USD. CPU replaces GPU as the new AI infrastructure bottleneck, with Arm and AMD reshaping the value chain.

Microsoft Other 2026-05-14

Microsoft's DQI at WinHEC 2026: Shifting Driver Control from IHVs to Microsoft

At WinHEC 2026, Microsoft announced the Driver Quality Initiative (DQI), centered on transitioning third-party kernel-mode drivers to user-mode or Microsoft-authored class drivers, alongside enhanced trust verification, lifecycle management, and quality metrics. This aims to systematically improve Windows driver quality but effectively consolidates Microsoft's control over the driver ecosystem.

Cloudflare Other 2026-05-14

Cloudflare's Trio of Patches Breaks ClickHouse Partition Bloat Lock Contention

Cloudflare's billing pipeline slowed after a partitioning change to (namespace, day) in ClickHouse, causing massive lock contention from exploding part counts. Three patches—shared lock, deferred vector copy, and binary search—cut query latency by >50% and decoupled performance from part count.

Cisco Other 2026-05-13

Cisco N9300 Smart Switches Embed Security into AI Data Center Fabric

At ONUG 2026, Cisco unveiled Nexus One architecture and N9300 Smart Switches, embedding L4 segmentation, Hypershield, eBPF-based Live Protect, and DPU-integrated firewall directly into the network fabric. This aims to deliver bottleneck-free security for AI workloads while enabling AI-driven operations via AgenticOps and AI Canvas.

Amazon Other 2026-05-12

AWS AgentCore Payments: Autonomous AI Agent Spending Unlocks New Lock-in and Threat Surface

AWS previews managed payment capabilities in Bedrock AgentCore, enabling AI agents to autonomously pay for APIs, MCP servers, and web content, integrated with Coinbase and Stripe. Also launches Agent Toolkit for AWS and MCP Server GA. This pushes AI agents toward autonomous execution but introduces new security and lock-in risks.

ARM Other High Signal 2026-05-07

Arm Reports Record Results, AGI CPU Emerges as New AI Infrastructure Focal Point

Arm reported record FY2026 results with $4.92B revenue and over 20% growth for three consecutive years. The core highlight is the Arm AGI CPU designed for agentic AI, securing over $2B in customer demand and backing from Meta, AWS, Google, and others.

AMD Other Medium Signal 2026-05-07

AMD Backs SPEC CPU 2026 Benchmark, Emphasizing Open, Trusted Performance Measurement

AMD published a blog endorsing the upcoming SPEC CPU 2026 industry benchmark, emphasizing the critical role of open, reproducible CPU performance standards for customer infrastructure decisions in the AI era. The new benchmark updates its application suite and strengthens support for bare-metal cloud environments and parallel computing.

Google Other High Signal 2026-05-06

Google Launches Gemma 4 Open Models, Accelerating Local AI Agent Deployment

Google released the Gemma 4 open model family under Apache 2.0 license, introducing MoE architecture for the first time. It aims to deliver high-performance AI agent capabilities directly to mobile and edge hardware, reducing reliance on cloud clusters and enabling new local, private AI applications.

AMD Other High Signal 2026-05-06

AMD and OpenAI Contribute MRC Protocol to OCP for Scalable AI Networking

AMD, in collaboration with OpenAI, Microsoft, and others, contributed the MRC (Multipath Reliable Connection) protocol, designed for large-scale AI training, to the Open Compute Project (OCP). AMD co-authored the specification and has already deployed MRC on its programmable Pensando DPU/NIC products, positioning its networking technology as a key enabler for resilient and adaptive AI infrastructure.

AMD Other High Signal 2026-05-06

AMD and OpenAI Introduce MRC, a Next-Gen Transport Protocol for AI Training

AMD, in collaboration with OpenAI, Microsoft, and other industry leaders, has released the specification for the Multipath Reliable Connection (MRC) protocol. MRC addresses performance bottlenecks of RoCEv2 in hyperscale AI training clusters through intelligent packet spraying, selective retransmission, and network-signaled congestion control, aiming to improve bandwidth utilization and job resilience.

Intel Other Medium Signal 2026-05-06

Intel at Computex 2026 Emphasizes CPU's Critical Role in AI Compute

Intel will outline its vision for the AI-driven computing era at Computex 2026, centering on the resurgence of the CPU as a critical AI engine. It emphasizes CPU-GPU/accelerator synergy to build efficient, scalable AI systems atop the broad x86 ecosystem.

NVIDIA Other 2026-05-05

NVIDIA Extreme Co-Design: Vera Rubin Platform Targets Agentic Inference TCO Inflection

NVIDIA unveils an extreme co-design stack for agentic systems, featuring Vera Rubin NVL72, NVLink 6, ConnectX-9, BlueField-4, and Spectrum-X. By disaggregating inference, optimizing KV cache management, and deploying low-latency fabrics, it aims to break the throughput-interactivity tradeoff, making high-context token processing economically viable.

AMD Other Medium Signal 2026-05-04

AMD Showcases Heterogeneous Computing Strategy for Enterprise AI with Dell

At Dell Technologies World, AMD highlighted its heterogeneous computing portfolio, aiming to match the right compute engine to specific enterprise AI workloads, while emphasizing hardware-based security and manageability. This signals a shift in AI infrastructure from generic solutions to fine-tuned, scenario-specific deployments.

NVIDIA Partnership High Signal 2026-05-02

NVIDIA and Intel Announce $5 Billion Strategic Partnership: New AI Chip Supply Chain Landscape

NVIDIA and Intel announced a $5 billion strategic partnership on September 18, 2025: NVIDIA invests $5 billion for ~4% Intel stake, while Intel customizes x86 CPUs for NVIDIA AI infrastructure and x86 SoCs integrating RTX GPU chiplets for PC products. Through NVLink, the two companies form a coalition of 'AI Computing + NVIDIA CUDA + x86 Ecosystem'. This reshapes the AI chip supply chain landscape with far-reaching implications for AMD and independent chip designers.