NVIDIA - AI Infrastructure Intelligence Search

NVIDIA Other 2025-06-06

NVIDIA and SK hynix Co-Architect Next-Gen Memory for AI Factories, Locking HBM4 to Vera Rubin

NVIDIA and SK hynix announce a multi-year tech partnership to co-develop next-gen memory for Vera Rubin, RTX Spark, and Jetson Thor. Separately, SK Telecom deploys a gigawatt-scale AI cloud using the full DGX stack, targeting 2027. This elevates SK hynix from supplier to co-architect, strengthening NVIDIA's lock-in on HBM and the AI ecosystem.

Intel Other 2025-06-02

Intel's 18A Xeon 6+ and Rack Scale AI: A CPU-Centric Challenge to NVIDIA's Inference Empire

At Computex 2026, Intel launched the 18A-node Xeon 6+ processor, the Rack Scale AI platform with SambaNova's SN-50 RDU, and a fully disaggregated inference service (Vector Core Compute). This CPU-centric hybrid architecture targets agentic AI inference workloads, directly challenging NVIDIA's Vera Rubin NVL72 and GPU-dominated ecosystem.

NVIDIA Other 2025-06-01

NVIDIA RTX Spark and Nemotron-3 Ultra: AI Control Shifts from Cloud to Personal Edge

NVIDIA launched RTX Spark personal AI supercomputer (co-developed with MediaTek) and Nemotron-3 Ultra open-source model at GTC Taipei 2026. The N1X chip delivers 1 PFLOPS local AI compute, bringing LLM inference to PCs. This marks NVIDIA's pivot from cloud GPU vendor to edge AI infrastructure monopolist, redefining the PC as an AI-native device.

Microsoft Other Medium Signal 2025-02-27

Microsoft Launches Phi-4 SLM Series to Enhance Edge AI and Multimodal Reasoning

Microsoft introduced the Phi-4 family of small language models (SLMs), featuring the 5.6B-parameter Phi-4-multimodal capable of processing speech, vision and text. The models are now available in Azure AI Foundry, HuggingFace and NVIDIA's API Catalog with optimized edge computing capabilities.

Google Other High Signal 2020-10-11

Google Cloud Integrates MCP with Apigee and Advances Agentic Platform to Evolve Enterprise APIs for AI Agents

Google Cloud announced the general availability of Model Context Protocol (MCP) in Apigee and the advancement of its Agentic Platform, aiming to transform traditional enterprise APIs into secure, governed tools for AI agents at scale. This move integrates API governance, security layers, and AI inference infrastructure, providing core platform capabilities for enterprises shifting from API-driven to agent-driven architectures.

Reports

Filter

NVIDIA and SK hynix Co-Architect Next-Gen Memory for AI Factories, Locking HBM4 to Vera Rubin

Intel's 18A Xeon 6+ and Rack Scale AI: A CPU-Centric Challenge to NVIDIA's Inference Empire

NVIDIA RTX Spark and Nemotron-3 Ultra: AI Control Shifts from Cloud to Personal Edge

Microsoft Launches Phi-4 SLM Series to Enhance Edge AI and Multimodal Reasoning

Google Cloud Integrates MCP with Apigee and Advances Agentic Platform to Evolve Enterprise APIs for AI Agents