GPU - AI Infrastructure Intelligence Search

Intel Other Medium Signal 2026-02-24

Intel Partners with SambaNova to Expand AI Inference Infrastructure

Intel announces multi-year strategic partnership with SambaNova to develop AI inference solutions based on Xeon processor infrastructure. The collaboration integrates Intel's compute, networking, storage hardware with SambaNova's AI platform, offering rack-scale inference options for heterogeneous data centers. Intel confirms this doesn't alter its independent GPU roadmap and will continue investing in edge-to-cloud AI products.

Cisco Other High Signal 2026-02-23

Cisco Partners with NVIDIA to Launch Australia's First Sovereign AI Factory

Cisco collaborates with Sharon AI to deploy an AI factory in Australia powered by 1024 NVIDIA Blackwell Ultra GPUs, integrating UCS servers, Nexus Hyperfabric, and VAST Data storage for in-country AI processing.

NVIDIA Other Medium Signal 2026-02-19

NVIDIA Survey Shows Significant ROI Growth in Telecom Network AI Automation

NVIDIA's telecom industry survey reveals AI as a core driver of network automation. The survey predicts significant ROI for telecom operators by 2026, with applications in traffic prediction, fault diagnosis, and energy efficiency. Growing demand for high-performance computing infrastructure drives investments in GPU acceleration and dedicated AI platforms.

NVIDIA Other 2026-02-19

NVIDIA Expands GeForce NOW Library to 4,500 Games to Strengthen Cloud Gaming Platform

NVIDIA expanded its GeForce NOW cloud gaming library to over 4,500 titles, adding major games like Battlefield 2042. The service streams games via cloud RTX GPUs across multiple devices and integrates with popular game stores. This move strengthens NVIDIA's cloud gaming platform through content ecosystem expansion.

Cisco Other High Signal 2026-02-10

Cisco Launches AI Infrastructure Chip and AgenticOps Platform to Strengthen Unified Architecture Strategy

Cisco introduced Silicon One G300 chip and AgenticOps platform to optimize AI cluster network performance and job completion time, while simplifying hybrid cloud operations via unified Nexus One management plane. Its updated AI Defense solution focuses on AI supply chain governance and runtime protection.

Cisco Other High Signal 2026-02-10

Cisco Launches G300 Chip and Systems for AI Agent-Era Data Center Networking

Cisco introduces 102.4Tbps Silicon One G300 switching chip with liquid-cooled N9000/8000 systems delivering 70% energy efficiency, 1.6T optics support, and Nexus One unified management plane upgrade.

NVIDIA Other 2026-01-23

NVFP4 + TeaCache Drive 10x FLUX.2 Inference Speedup, Locking Blackwell Ecosystem

NVIDIA and BFL optimize FLUX.2 on DGX B200/B300 using NVFP4 4-bit quantization, TeaCache step skipping, CUDA Graphs, and torch.compile, achieving 6.3x (single GPU) to 10.2x (dual GPU) latency reduction vs H200, with 40% memory savings. The stack is tightly coupled to TensorRT-LLM visualgen and Blackwell hardware.

NVIDIA Other 2025-11-08

NVIDIA Launches Interactive AI Agent for GPU-Accelerated Data Science with Nemotron Nano-9B

NVIDIA unveils an interactive AI agent powered by Nemotron Nano-9B-v2 and CUDA-X libraries, enabling natural language orchestration of ML workflows. It achieves 3x-43x GPU acceleration over CPU for data processing, model training, and hyperparameter optimization.

NVIDIA Other Medium Signal 2025-10-22

NVIDIA Publishes Tutorial for Converting Lightweight LLM into Terminal AI Agent

NVIDIA released a developer tutorial guiding users to build an AI agent that understands natural language and executes Bash commands, using its open-source Nemotron Nano v2 model within roughly 200 lines of Python code. The tutorial emphasizes building from scratch and simplifying with LangGraph, focusing on safe tool calling and human-in-the-loop control.

NVIDIA Other 2025-06-06

NVIDIA and SK hynix Co-Architect Next-Gen Memory for AI Factories, Locking HBM4 to Vera Rubin

NVIDIA and SK hynix announce a multi-year tech partnership to co-develop next-gen memory for Vera Rubin, RTX Spark, and Jetson Thor. Separately, SK Telecom deploys a gigawatt-scale AI cloud using the full DGX stack, targeting 2027. This elevates SK hynix from supplier to co-architect, strengthening NVIDIA's lock-in on HBM and the AI ecosystem.

Intel Other 2025-06-02

Intel's 18A Xeon 6+ and Rack Scale AI: A CPU-Centric Challenge to NVIDIA's Inference Empire

At Computex 2026, Intel launched the 18A-node Xeon 6+ processor, the Rack Scale AI platform with SambaNova's SN-50 RDU, and a fully disaggregated inference service (Vector Core Compute). This CPU-centric hybrid architecture targets agentic AI inference workloads, directly challenging NVIDIA's Vera Rubin NVL72 and GPU-dominated ecosystem.

NVIDIA Other 2025-06-01

NVIDIA RTX Spark and Nemotron-3 Ultra: AI Control Shifts from Cloud to Personal Edge

NVIDIA launched RTX Spark personal AI supercomputer (co-developed with MediaTek) and Nemotron-3 Ultra open-source model at GTC Taipei 2026. The N1X chip delivers 1 PFLOPS local AI compute, bringing LLM inference to PCs. This marks NVIDIA's pivot from cloud GPU vendor to edge AI infrastructure monopolist, redefining the PC as an AI-native device.

Google Other High Signal 2020-10-11

Google Cloud Integrates MCP with Apigee and Advances Agentic Platform to Evolve Enterprise APIs for AI Agents

Google Cloud announced the general availability of Model Context Protocol (MCP) in Apigee and the advancement of its Agentic Platform, aiming to transform traditional enterprise APIs into secure, governed tools for AI agents at scale. This move integrates API governance, security layers, and AI inference infrastructure, providing core platform capabilities for enterprises shifting from API-driven to agent-driven architectures.

Reports

Filter