Filter

×
Active Filters Clear All
Keyword: AI Infrastructure ×
95 Total Reports
3/5 Page
Cisco Other High Signal 2026-05-05

Cisco Launches Nexus Dashboard 4.2, Enhancing Network Monitoring and Security for AI Workloads

Cisco has released Nexus Dashboard 4.2, a data center management platform update. Key enhancements include Slurm integration for AI/HPC job monitoring, LLDP-based integration with NVIDIA NICs for adaptive routing, and Live Protect for zero-downtime vulnerability mitigation using eBPF. The release aims to provide a unified, intelligent, and secure operations plane for hybrid cloud and AI infrastructure.

AMD Other Medium Signal 2026-05-04

AMD Showcases Heterogeneous Computing Strategy for Enterprise AI with Dell

At Dell Technologies World, AMD highlighted its heterogeneous computing portfolio, aiming to match the right compute engine to specific enterprise AI workloads, while emphasizing hardware-based security and manageability. This signals a shift in AI infrastructure from generic solutions to fine-tuned, scenario-specific deployments.

Cisco Other High Signal 2026-04-30

Cisco Launches Liquid-Cooled Network Switch, Extending Cooling Architecture to AI Infrastructure Core

Cisco has officially launched its N9000 and 8000 systems with direct-to-chip liquid cooling, extending liquid cooling from GPU servers to network switches. The product doubles bandwidth density and reduces energy consumption by nearly 70%, addressing the thermal challenges of high-power AI clusters. This move signals a shift in data center cooling architecture from component-level optimization to systemic redesign.

AMD Other High Signal 2026-04-30

AMD Proposes New AI Infrastructure Networking Paradigm: From Lossless Fabrics to Intelligent Endpoints

AMD published a blog outlining seven key questions for building large-scale AI infrastructure, arguing that traditional lossless Ethernet or InfiniBand architectures face cost and complexity bottlenecks. It advocates shifting network intelligence and reliability functions from expensive, specialized switches to intelligent NICs, enabling reliable transport over standard (potentially lossy) Ethernet to reduce TCO and simplify operations.

Intel Other High Signal 2026-04-30

Intel Collaborates with ChatPPT to Launch Hybrid AI PC Edition, Driving AI Workload Localization

Intel partnered with AI app ChatPPT to launch a hybrid AI PC edition using Intel's AI Super Builder technology. This version offloads certain AI workloads (e.g., formatting) from the cloud to the local PC, reducing cloud token costs by over 50%, boosting usage duration by 32%, and enhancing data privacy.

Microsoft Other High Signal 2026-04-30

Microsoft Defines ‘Agentic Computing Era’, Positions AI Infrastructure and Agent Platform as Core Strategy

Microsoft's CEO, post-earnings, explicitly identifies the shift from end-user-driven workloads to those driven by both end-users and agents as a platform shift that will change the entire tech stack. The company's strategy is focused on building leading AI infrastructure and an agent platform, having already grown its AI business to a $37 billion annual run rate.

Google Other 2026-04-29

Google Opens TPU Hardware to On-Prem, 8th-Gen Chips Target Nvidia

Google announces 8th-gen TPUs (8t for training with 3x performance over Ironwood, 8i for inference with 80% better perf/dollar) and plans to deliver TPU hardware directly to customer data centers. Also closed Wiz acquisition to bolster AI security. This marks a strategic pivot from cloud-only to hardware supplier.

Cisco Other High Signal 2026-04-28

Cisco Leverages Hardware Refresh Cycle to Drive AI-Ready Data Center Architecture

Cisco argues that the core impediment to enterprise AI strategy is data center infrastructure. It advocates integrating AI readiness into routine hardware refresh cycles, emphasizing proactive operations, security embedded in the network fabric, end-to-end observability, and high-performance networking as foundational for AI infrastructure.

ARM Other High Signal 2026-04-28

Arm Launches Performix Performance Toolkit, Targeting AI Agent Era Optimization

Arm launched Performix, a free performance analysis toolkit designed to provide unified performance insights and optimization across the Arm platform for AI agent development. Integrated into mainstream AI dev environments via the Arm MCP Server, it turns runtime hardware data into actionable optimization guidance, with support from ecosystem partners like Microsoft and MongoDB.

NVIDIA Other High Signal 2026-04-24

NVIDIA Internalizes GPT-5.5 Powered AI Agents at Scale, Defining New Enterprise AI Infrastructure Paradigm

NVIDIA announced that over 10,000 employees have scaled the use of GPT-5.5 via the Codex app, running on NVIDIA GB200 NVL72 infrastructure. This demonstrates the technical feasibility of 'transformative' productivity gains from frontier model inference in enterprise workflows. It also provides a reference architecture for deploying AI agents with auditable, isolated security via dedicated cloud VMs.

Cisco Other High Signal 2026-04-22

Cisco Positions Network as Energy Control Layer for AI Infrastructure

Cisco's blog outlines energy as a critical bottleneck for AI scaling, citing a next-gen AI data center design for a European bank. It emphasizes the network's role at the convergence of digital and energy systems, positioning it as a control layer for visibility, coordination, and security to manage energy, cooling, and space constraints for AI workloads.

NVIDIA Other High Signal 2026-04-22

NVIDIA and Google Cloud Deepen Collaboration to Build Cloud Infrastructure for AI Factories and Physical AI

NVIDIA and Google Cloud have announced an expanded collaboration, introducing new Vera Rubin and Blackwell GPU-powered instances to build "AI factories" scaling to nearly a million GPUs. The integration of Gemini, Nemotron, and other platforms aims to accelerate production deployment of agentic and physical AI, such as robotics and digital twins.

Google Other 2026-04-22

Google Cloud Next '26: Agent Gateway Seizes Control Plane, TPU 8i Locks Inference

Google Cloud Next '26 announces 8th-gen TPUs (8t for training, 8i for inference), Agent Platform with Agent Gateway, Agent Identity, Agent-to-Agent Orchestration, Agentic Data Cloud, and Agentic Defense integrating Wiz. The move shifts control from infrastructure to agent orchestration, locking enterprises into a vertically integrated stack.

NVIDIA Other High Signal 2026-04-20

NVIDIA Partners with Adobe and WPP to Build Enterprise-Grade AI Agent Security Architecture Centered on OpenShell

NVIDIA deepens its strategic collaboration with Adobe and WPP to place intelligent AI agents at the center of enterprise marketing operations. The key move is the introduction and emphasis on the NVIDIA OpenShell secure runtime, which provides a policy-based, auditable, and isolated execution environment for AI agents handling multi-step workflows. This signals a shift from purely functional AI towards controlled and trustworthy enterprise-grade agentic architectures.

Cisco Other High Signal 2026-04-16

Cisco and NVIDIA Elevate Network to AI Media Processing Control Plane

Cisco and NVIDIA deepen collaboration with a validated design based on the open-standard Media Exchange Layer (MXL). This integration merges Cisco's IP media fabric with NVIDIA's Holoscan platform, transforming the network from a transport layer into an active processing layer that supports real-time AI inference, enabling low-latency, multilingual AI-driven live media production for broadcasters.

Microsoft Other High Signal 2026-04-16

Microsoft Activates Fairwater Hyperscale AI Datacenter Ahead of Schedule, Setting New Infrastructure Standard

Microsoft announced the early activation of its Fairwater datacenter in Wisconsin, positioned as the world's most powerful AI facility. It integrates hundreds of thousands of NVIDIA GB200 GPUs into a single seamless cluster via massive fiber interconnect, targeting unprecedented compute scale for next-generation AI training and inference workloads.

NVIDIA Other High Signal 2026-04-15

NVIDIA Shifts AI Infrastructure Metric from FLOPS to Cost Per Token

NVIDIA advocates for "cost per token" as the primary economic metric for AI infrastructure, replacing "FLOPS per dollar." This shift moves the focus from computational inputs to business outputs, requiring full-stack optimization across hardware, software, and networking to lower enterprise AI inference TCO.

Cisco Other High Signal 2026-04-14

Cisco Validates On-Premises AI Deployment Logic with Internal Case Study

Cisco's Customer Experience (CX) unit deployed on-premises AI infrastructure using UCS servers and Nexus switches to handle sensitive customer data, addressing cloud-related data sovereignty and unpredictable inferencing cost challenges. This move demonstrates an architectural shift from variable operational expenses to deterministic capital investment for AI workloads.

Intel Other High Signal 2026-04-09

Intel and Google Deepen Collaboration to Define Core of Heterogeneous AI Infrastructure

Intel and Google announced a multiyear collaboration to advance next-generation AI and cloud infrastructure. The core is reinforcing the central role of CPUs and custom IPUs in heterogeneous AI systems, optimizing performance and efficiency through multi-generational Xeon processors, and expanding co-development of ASIC-based IPUs to improve efficiency and predictable performance at hyperscale.

Intel Other High Signal 2026-04-09

Intel and Google Deepen Collaboration on CPU and IPU for Heterogeneous AI Infrastructure

Intel and Google announced a multi-year collaboration to advance next-generation AI and cloud infrastructure through aligned Xeon processor roadmaps and expanded co-development of custom ASIC-based IPUs. This reinforces the central role of CPUs in AI system orchestration and the critical value of IPUs in offloading infrastructure tasks to improve efficiency at hyperscale.