AI计算 - AI Infrastructure Intelligence Search

Intel Other 2026-12-30

Intel at Computex 2026: CPU as Agentic AI Orchestrator, x86 Reclaims Inference Control

At Computex 2026, Intel unveiled the 288-core Xeon 6+ (Intel 18A) and 3rd-gen Core Ultra, claiming Agentic AI shifts CPU:GPU ratio from 1:8 to 1:1. Partnering with SambaNova and Foxconn for rack-scale inference systems, Intel repositions the CPU as the orchestrator for multi-step AI reasoning, aiming to reclaim control from GPU-centric architectures.

Microsoft Azure Other 2026-06-21

Microsoft Azure Debuts Blackwell Ultra AI Supercomputer, Training-as-a-Service Reshapes Ecosystem

Microsoft Azure launched an AI supercomputer cluster powered by NVIDIA Blackwell Ultra GPUs, delivering over 200 exaflops of AI compute. It introduced AI Training as a Service for on-demand model training and partnered with OpenAI to deploy GPT-6 training clusters by 2027. Liquid cooling achieves a PUE of 1.08, positioning Azure as the premier cloud for trillion-parameter models.

ARM Other 2026-06-21

ARMv10 Delivers 30% IPC Uplift and Native AI Acceleration, Tightening Ecosystem Lock-In

ARM launches v10 architecture with 30% IPC gain, SVE3 instructions, dedicated AI acceleration, and enhanced confidential computing. First cores (Cortex-X6, Cortex-A830) target 2027, aiming for leading per-watt AI performance across data center, PC, and mobile.

NVIDIA Product Launch 2026-05-29

NVIDIA Blackwell Ultra GB300 NVL72: 1.44 EFLOPS FP4, 50x AI Factory Boost

NVIDIA launches Blackwell Ultra GB300 NVL72 rack system with 72 Blackwell Ultra GPUs and 36 Grace CPUs, delivering 1,440 PFLOPS FP4 sparse, 20TB HBM3e, 130TB/s NVLink. Claims 50x AI factory output over Hopper. Available now.

Meta Other High Signal 2026-04-14

Meta-Broadcom Multi-Year 2nm AI Chip Partnership, Initial 1GW+ Deployment

Meta and Broadcom announced multi-year, multi-generation strategic partnership to co-develop MTIA (Meta Training and Inference Accelerator) chips through 2029. Initial deployment exceeds 1GW, with multi-gigawatt expansion planned. Industry-first 2nm AI compute accelerator, based on Broadcom XPU platform. Meta has planned MTIA 300/400/450/500 iterations for recommendation, ranking, and large-scale inference. Broadcom CEO Hock Tan to step down from Meta board, transition to strategic advisor.

NVIDIA Other High Signal 2026-03-18

NVIDIA Launches GRT Platform for Full-Stack Robotics AI Development

NVIDIA launches GRT platform integrating multi-modal AI models including Eureka, VIMA and Octo, with Isaac Lab simulator accelerating reinforcement learning. The platform enables end-to-end development from simulation to physical deployment, shifting robotics development from coding to AI model-driven paradigm.

NVIDIA Other High Signal 2026-03-18

NVIDIA and Telecom Operators Build AI Grids to Redistribute AI Inference

NVIDIA is partnering with global telecom operators like AT&T and Comcast to transform existing distributed network sites into 'AI Grids' for edge AI inference. This initiative aims to deploy AI compute closer to users and data, reducing latency and cost per token. It represents a strategic shift for telcos from being data carriers to distributed AI computing platforms.

NVIDIA Other High Signal 2026-03-18

NVIDIA Partners with Telecom Operators to Build Distributed AI Inference Grid

NVIDIA collaborates with telecom operators to transform 100,000 global network sites and 100GW backup power into a distributed AI computing platform for low-latency inference. The AI grid has been validated in IoT and cloud gaming scenarios, achieving sub-500ms latency and 50% cost reduction.

NVIDIA Other High Signal 2026-03-17

NVIDIA Partners with Industrial Software Giants to Advance AI in Manufacturing

NVIDIA collaborates with industrial software leaders like Siemens and Ansys to integrate AI into design and manufacturing. Leveraging Omniverse platform and generative AI to accelerate digital-to-physical workflows. Focuses on digital twins and generative AI for product development optimization.

NVIDIA Other High Signal 2026-03-17

NVIDIA Partners with Industrial Software Giants on AI-Driven Manufacturing Solutions

NVIDIA collaborates with Ansys, Cadence, and Siemens to integrate generative AI and physical AI technologies into product development using Omniverse and AI compute infrastructure. The solutions enable deep integration of digital twins, simulation, and automated design to address industrial efficiency challenges.

NVIDIA Other High Signal 2026-03-17

NVIDIA Mass Produces Dynamo 1.0 Inference OS, Strengthening AI Factory Platform Strategy

NVIDIA begins mass production of Dynamo 1.0 inference OS, providing a unified software layer to coordinate AI inference workloads across data centers, cloud and edge. The system simplifies large-scale AI model deployment through standardized runtime and scheduler, abstracting infrastructure management.

NVIDIA Other High Signal 2026-03-17

NVIDIA Collaborates with Telecom Giants to Build AI Grids for Distributed Inference

NVIDIA announced AI Grids architecture at GTC 2026, collaborating with telecom operators to dynamically distribute inference tasks to optimal network locations, reducing latency and improving efficiency. This represents deep integration of AI computing with communication infrastructure to support edge expansion of AI-native applications.

NVIDIA Other Medium Signal 2026-03-17

Adobe and NVIDIA Partner to Optimize AI PC Creative Workflows

Adobe and NVIDIA formed a strategic partnership to co-develop next-gen Firefly generative AI models and optimize performance on NVIDIA RTX AI PCs. The collaboration focuses on deep integration of AI capabilities into core products like Creative Cloud and Experience Cloud workflows, accelerating creative marketing and agentic processes.

NVIDIA Other Medium Signal 2026-03-17

NVIDIA Launches Spatial Computing for Physical AI Applications

NVIDIA introduces spatial computing technology to extend AI capabilities from digital to physical and orbital spaces. The technology enables real-time perception, reasoning and action for robots and physical systems in unstructured environments. This represents a key step in NVIDIA's physical AI strategy to build an AI+robotics+space ecosystem.

NVIDIA Other High Signal 2026-03-17

NVIDIA Releases AI Factory Reference Design and Digital Twin Blueprint

NVIDIA unveiled Vera Rubin DSX AI factory reference design and Omniverse DSX digital twin blueprint, built on Spectrum-X Ethernet, Quantum-X800 InfiniBand and BlueField-3 DPU. The architecture connects real-world sensors with digital twins for continuous AI model training and optimization, extending AI computing from data centers to physical world automation.

Hewlett Packard Enterprise Other Medium Signal 2026-03-16

HPE Deepens NVIDIA Partnership with Expanded AI Computing Portfolio

HPE announced a significant expansion of its 'NVIDIA AI Computing by HPE' portfolio, aiming to accelerate enterprise AI deployment, operationalization, and scaling through pre-integrated and validated systems that address scale, security, and governance requirements.

Hewlett Packard Enterprise Other Medium Signal 2026-03-16

HPE Deepens AI Factory Partnership with NVIDIA, Unveils Full-Stack Supercomputing Solutions

At GTC 2026, HPE announced enhancements to its NVIDIA AI Computing portfolio, introducing full-stack solutions for large-scale AI factories and supercomputers. The offerings integrate compute, GPUs, networking, liquid cooling, software, and services to improve deployment efficiency and time-to-insight.

TSMC Other High Signal 2026-03-05

TSMC Advances AI Hardware Innovation with Advanced Process and 3D Packaging

TSMC reveals AI technology research progress, focusing on N3/N2 advanced nodes and 3D Fabric heterogeneous integration. It enhances AI chip performance and efficiency through optimized transistor architecture and packaging, targeting memory bandwidth bottlenecks for cloud-to-edge AI applications.

Huawei Other High Signal 2026-03-04

Huawei Launches AI Data Platform with Compute-Storage Separation

Huawei launched an AI data platform featuring compute-storage separation architecture for efficient data flow. It integrates high-performance file system supporting EB-level data and accelerates AI training data preparation by 30%. Provides unified data management with seamless integration to major AI frameworks and Ascend hardware.

Apple Other Medium Signal 2026-03-03

Apple Introduces M5 Pro/Max Chips with Fusion Architecture for Enhanced AI Performance

Apple launches M5 Pro and M5 Max chips featuring a new fusion architecture that packages two 3nm dies into a single SoC, delivering over 4x AI performance improvement. The chips include an 18-core CPU and GPU with integrated neural accelerators, with unified memory bandwidth up to 614GB/s.

Reports

Filter

Intel at Computex 2026: CPU as Agentic AI Orchestrator, x86 Reclaims Inference Control

Microsoft Azure Debuts Blackwell Ultra AI Supercomputer, Training-as-a-Service Reshapes Ecosystem

ARMv10 Delivers 30% IPC Uplift and Native AI Acceleration, Tightening Ecosystem Lock-In

NVIDIA Blackwell Ultra GB300 NVL72: 1.44 EFLOPS FP4, 50x AI Factory Boost

Meta-Broadcom Multi-Year 2nm AI Chip Partnership, Initial 1GW+ Deployment

NVIDIA Launches GRT Platform for Full-Stack Robotics AI Development

NVIDIA and Telecom Operators Build AI Grids to Redistribute AI Inference

NVIDIA Partners with Telecom Operators to Build Distributed AI Inference Grid

NVIDIA Partners with Industrial Software Giants to Advance AI in Manufacturing

NVIDIA Partners with Industrial Software Giants on AI-Driven Manufacturing Solutions

NVIDIA Mass Produces Dynamo 1.0 Inference OS, Strengthening AI Factory Platform Strategy

NVIDIA Collaborates with Telecom Giants to Build AI Grids for Distributed Inference

Adobe and NVIDIA Partner to Optimize AI PC Creative Workflows

NVIDIA Launches Spatial Computing for Physical AI Applications

NVIDIA Releases AI Factory Reference Design and Digital Twin Blueprint

HPE Deepens NVIDIA Partnership with Expanded AI Computing Portfolio

HPE Deepens AI Factory Partnership with NVIDIA, Unveils Full-Stack Supercomputing Solutions

TSMC Advances AI Hardware Innovation with Advanced Process and 3D Packaging

Huawei Launches AI Data Platform with Compute-Storage Separation

Apple Introduces M5 Pro/Max Chips with Fusion Architecture for Enhanced AI Performance