AI inference - AI Infrastructure Intelligence Search

Intel Other Medium Signal 2026-04-01

Intel Demonstrates AI Performance with Xeon 6 and Arc Pro GPUs in MLPerf Inference

Intel showcased the performance of its Xeon 6 CPUs and Arc Pro B-Series GPUs in the MLPerf Inference v6.0 benchmarks, particularly in handling large language models (LLMs). The results indicate that a system with four Arc Pro B70 GPUs can process 120B parameter models, delivering up to 1.8x higher inference performance in multi-GPU setups.

NVIDIA Other High Signal 2026-03-26

NVIDIA Introduces Physical AI Data Factory Blueprint, Transforming Compute into Synthetic Data

At GTC, NVIDIA introduced the Physical AI Data Factory Blueprint, an open reference architecture designed to transform compute into large-scale, high-quality synthetic training data. Built on Cosmos world models and the OSMO operator, it addresses the bottleneck of scaling real-world data, aiming to serve as the data engine for next-gen autonomous systems and robots.

ARM Other High Signal 2026-03-25

Arm Neoverse Reshapes Control Layer in AI Infrastructure

ARM introduces Neoverse infrastructure CPU cores optimized for cloud, AI, and HPC workloads, adopted by NVIDIA, AWS, Microsoft, and Google for their AI platforms, delivering performance gains and energy efficiency. This architecture enables high-density AI workload deployment in cloud and edge environments with enhanced multi-tenant security.

NVIDIA Other 2026-03-24

NVIDIA IGX Thor: 8x Edge AI Compute with ConnectX-7 Network Lock-In

NVIDIA launches IGX Thor edge AI platform with Blackwell GPU, up to 5,581 FP4 TFLOPS, dual 200GbE RDMA via ConnectX-7, and ISO 26262 safety. Pin-compatible with Jetson Thor and 10-year lifecycle enable seamless migration, but create vendor lock-in through proprietary networking and GPU dependencies.

NVIDIA Other High Signal 2026-03-18

NVIDIA and Telecom Operators Build AI Grids to Redistribute AI Inference

NVIDIA is partnering with global telecom operators like AT&T and Comcast to transform existing distributed network sites into 'AI Grids' for edge AI inference. This initiative aims to deploy AI compute closer to users and data, reducing latency and cost per token. It represents a strategic shift for telcos from being data carriers to distributed AI computing platforms.

Hewlett Packard Enterprise Other High Signal 2026-03-17

HPE Unveils AI Grid Solution for AI WAN Fabric with NVIDIA

HPE announced a collaboration with NVIDIA to launch the AI Grid Solution, securely scaling edge AI. The solution transforms WAN into an AI WAN fabric, connecting distributed inference sites with AI factories for consistent policy and predictable performance. It enables service providers to evolve from connectivity to AI services.

Cisco Other High Signal 2026-03-17

Cisco Expands Secure AI Factory with NVIDIA to Edge and Security

Cisco expands its Secure AI Factory with NVIDIA to enable AI deployment from data centers to edge sites, adding security capabilities like firewall policy enforcement on DPUs and AI Defense integration, offering flexible architecture options to accelerate production scaling.

Google Other High Signal 2020-10-11

Google Cloud Integrates MCP with Apigee and Advances Agentic Platform to Evolve Enterprise APIs for AI Agents

Google Cloud announced the general availability of Model Context Protocol (MCP) in Apigee and the advancement of its Agentic Platform, aiming to transform traditional enterprise APIs into secure, governed tools for AI agents at scale. This move integrates API governance, security layers, and AI inference infrastructure, providing core platform capabilities for enterprises shifting from API-driven to agent-driven architectures.

Reports

Filter