inference - AI Infrastructure Intelligence Search

Cisco Other 2026-05-14

Cisco Unified Edge: Service Providers' New Ecosystem Bundle for Edge AI Services

Cisco launches its Unified Edge platform, which integrates compute, networking, storage, and security and is managed via Intersight, targeting service providers that want to deploy AI inference at thousands of edge sites. Verizon, an early adopter, plans to bundle edge capabilities into its enterprise connectivity offerings.

Cisco Other 2026-05-07

Cisco-AMD Benchmark Shifts AI Fabric Control from GPU to SmartNIC and Switch

Cisco and AMD jointly release benchmarks for AI scale-out fabrics using N9000 800G switches, Pensando Pollara 400 smartNICs, and MI300X GPUs. IBPerf and MLPerf tests show P01/P99 bandwidth near 400Gbps line rate under incast congestion, proving deterministic performance that eliminates GPU stalls.

ARM Other High Signal 2026-05-07

Arm Reports Record Results, AGI CPU Emerges as New AI Infrastructure Focal Point

Arm reported record FY2026 results with $4.92B revenue and over 20% growth for three consecutive years. The core highlight is the Arm AGI CPU designed for agentic AI, securing over $2B in customer demand and backing from Meta, AWS, Google, and others.

AMD Other Medium Signal 2026-05-07

AMD Backs SPEC CPU 2026 Benchmark, Emphasizing Open, Trusted Performance Measurement

AMD published a blog endorsing the upcoming SPEC CPU 2026 industry benchmark, emphasizing the critical role of open, reproducible CPU performance standards for customer infrastructure decisions in the AI era. The new benchmark updates its application suite and strengthens support for bare-metal cloud environments and parallel computing.

AMD Other High Signal 2026-05-06

AMD and OpenAI Contribute MRC Protocol to OCP for Scalable AI Networking

AMD, in collaboration with OpenAI, Microsoft, and others, contributed the MRC (Multipath Reliable Connection) protocol, designed for large-scale AI training, to the Open Compute Project (OCP). AMD co-authored the specification and has already deployed MRC on its programmable Pensando DPU/NIC products, positioning its networking technology as a key enabler for resilient and adaptive AI infrastructure.

AMD Other High Signal 2026-05-06

AMD and OpenAI Introduce MRC, a Next-Gen Transport Protocol for AI Training

AMD, in collaboration with OpenAI, Microsoft, and other industry leaders, has released the specification for the Multipath Reliable Connection (MRC) protocol. MRC addresses performance bottlenecks of RoCEv2 in hyperscale AI training clusters through intelligent packet spraying, selective retransmission, and network-signaled congestion control, aiming to improve bandwidth utilization and job resilience.

Anthropic Other High Signal 2026-05-06

Anthropic Secures Compute Deal with SpaceX, Significantly Boosting Claude Capacity

Anthropic announced a partnership with SpaceX to utilize all compute capacity at the Colossus 1 data center, gaining over 300MW of new capacity. This move aims to directly improve service for Claude Pro and Max subscribers, with immediate increases to Claude Code and API rate limits.

Intel Other Medium Signal 2026-05-06

Intel at Computex 2026 Emphasizes CPU's Critical Role in AI Compute

Intel will outline its vision for the AI-driven computing era at Computex 2026, centering on the resurgence of the CPU as a critical AI engine. It emphasizes CPU-GPU/accelerator synergy to build efficient, scalable AI systems atop the broad x86 ecosystem.

NVIDIA Other 2026-05-05

NVIDIA Extreme Co-Design: Vera Rubin Platform Targets Agentic Inference TCO Inflection

NVIDIA unveils an extreme co-design stack for agentic systems, featuring Vera Rubin NVL72, NVLink 6, ConnectX-9, BlueField-4, and Spectrum-X. By disaggregating inference, optimizing KV cache management, and deploying low-latency fabrics, it aims to break the throughput-interactivity tradeoff, making high-context token processing economically viable.

Cisco Other High Signal 2026-05-04

Cisco Shifts Network Paradigm from Bandwidth Carrier to Intelligent Platform

Cisco argues that AI-driven traffic patterns are fundamentally reshaping network architecture for service providers, requiring a shift from static, reactive systems to predictive and adaptive intelligent platforms. Cisco is enabling this transition through its full-stack solution portfolio to transform network design, operations, and monetization models.

Intel Other Medium Signal 2026-05-04

Intel Appoints Leadership to Integrate Client Computing and Physical AI

Intel appointed Alex Katouzian as EVP/GM of the Client Computing and Physical AI Group, and named Pushkar Ranade as CTO. This move aims to align its traditional PC business with physical AI systems (robotics, autonomous machines) and advance frontier technologies like quantum computing.

AMD Other Medium Signal 2026-05-04

AMD Showcases Heterogeneous Computing Strategy for Enterprise AI with Dell

At Dell Technologies World, AMD highlighted its heterogeneous computing portfolio, aiming to match the right compute engine to specific enterprise AI workloads, while emphasizing hardware-based security and manageability. This signals a shift in AI infrastructure from generic solutions to fine-tuned, scenario-specific deployments.

Cisco Other High Signal 2026-05-01

Cisco Report Reveals Fundamental Impact of Agentic AI on WAN Traffic Patterns

Cisco released a research report based on real-world network traffic data. The report quantifies, for the first time, the disruptive impact of agentic AI on WAN traffic patterns, symmetry, and critical paths, and predicts that AI inference traffic will comprise 25% of total network traffic by 2035.

NVIDIA Other High Signal 2026-05-01

NVIDIA Collaborates with OpenClaw via NemoClaw to Drive Secure Enterprise Autonomous AI Agent Deployment

NVIDIA introduces NemoClaw, a reference implementation that bundles OpenClaw with the OpenShell secure runtime and Nemotron open models, providing a blueprint for secure enterprise deployment of long-running autonomous AI agents. This move addresses the 1000x inference demand surge and security governance challenges, shifting the AI infrastructure control point towards local, secure, and auditable architectures.

AMD Other High Signal 2026-04-30

AMD Proposes New AI Infrastructure Networking Paradigm: From Lossless Fabrics to Intelligent Endpoints

AMD published a blog outlining seven key questions for building large-scale AI infrastructure, arguing that traditional lossless Ethernet or InfiniBand architectures face cost and complexity bottlenecks. It advocates shifting network intelligence and reliability functions from expensive, specialized switches to intelligent NICs, enabling reliable transport over standard (potentially lossy) Ethernet to reduce TCO and simplify operations.

Intel Other High Signal 2026-04-30

Intel Collaborates with ChatPPT to Launch Hybrid AI PC Edition, Driving AI Workload Localization

Intel partnered with the AI app ChatPPT to launch a hybrid AI PC edition using Intel's AI Super Builder technology. This version offloads certain AI workloads (e.g., formatting) from the cloud to the local PC, reducing cloud token costs by over 50%, boosting usage duration by 32%, and enhancing data privacy.

NVIDIA Other High Signal 2026-04-30

NVIDIA Releases Enterprise AI Factory Reference Architectures, Standardizing On-Premises AI Infrastructure

NVIDIA has released Enterprise AI Factory Reference Architectures, offering three standardized configurations from RTX PRO to NVL72 for on-premises deployments. This architecture integrates compute, networking, storage, and software, aiming to transform AI infrastructure from experimental setups into predictable, scalable industrial operational platforms.

AMD Other High Signal 2026-04-29

AMD and Liquid AI Discuss Efficient AI Architecture from Silicon to Systems

AMD's CTO and Liquid AI's CEO discuss the evolution of AI architecture, emphasizing efficiency as key to extending AI from the cloud to edge and endpoint devices. They argue that co-design from silicon to systems enables low-power, responsive AI inference, supporting always-on agents and multi-model orchestration.

Amazon Other High Signal 2026-04-29

AWS Platformizes AI Agents and Deepens Cloud Integration with OpenAI

At its annual event, AWS announced the productization of AI agent capabilities, launching Amazon Quick, a personal AI assistant for work, and expanding Amazon Connect into four vertical-specific agentic AI solutions. Concurrently, AWS and OpenAI expanded their partnership, deeply integrating the latest models, Codex, and managed agent services into the Amazon Bedrock platform.

NVIDIA Other High Signal 2026-04-29

NVIDIA Launches Nemotron 3 Nano Omni, Targeting AI Agent Perception Layer

NVIDIA released the open-source multimodal model Nemotron 3 Nano Omni, featuring a 30B-A3B hybrid MoE architecture. It unifies vision, audio, and language processing into a single model, designed to act as the 'eyes and ears' for AI agents. It claims to eliminate latency and context fragmentation from multi-model collaboration, achieving up to 9x higher throughput while maintaining interactivity, thereby reducing AI agent deployment and inference costs.
