RAN - AI Infrastructure Intelligence Search

OpenAI Other 2026-06-26

OpenAI and Broadcom Tape Out First Inference ASIC Jalapeño in 9 Months, Targeting NVIDIA Dominance

OpenAI and Broadcom unveil Jalapeño, their first custom inference ASIC, fabricated on TSMC 3nm and optimized for Transformer models. The chip targets a 50% reduction in inference cost, taped out in 9 months, and is slated for deployment in gigawatt-scale data centers by late 2026. It marks OpenAI's strategic pivot to full-stack AI infrastructure and a direct challenge to NVIDIA's inference hegemony.

OpenAI Other 2026-06-26

Making private MCP servers reachable without making them public | OpenAI Developers

...

Huawei Other 2026-06-25

Huawei Pushes Token-Based Billing at MWC Shanghai 2026: Shifting Carrier Monetization from Bytes to AI Inference Value

At MWC Shanghai 2026, Huawei urged carriers to shift from byte-based to token-based billing for AI workloads, showcasing a 372% token throughput improvement in long-sequence inference via its AI Inference Acceleration Solution. It also highlighted the Upper-6 GHz band as critical for AI wearables requiring 20 Mbps uplink, aiming to reposition 5G-A networks as AI compute delivery infrastructure.

OpenAI Other 2026-06-25

Oracle Defense Ecosystem Cohort 3: Offline AI on Roving Edge Devices Goes Operational

Oracle announced the third cohort of its Defense Ecosystem at the Brussels summit, adding 10 companies. Concurrently, Whitespace's Saga AI system was deployed on Oracle Roving Edge Devices during the Royal Navy's Operation HIGHMAST, running classified AI workloads completely offline and demonstrating that sovereign edge AI is operational.

Huawei Other 2026-06-25

Huawei Unveils AI-Centric Network with Token Monetization, UCM Caching Breaks Long-Context Barriers

At MWC Shanghai 2026, Huawei unveiled an AI-native network architecture integrating service, network, and compute, shifting from traffic-centric to intelligence-centric operations. The Unified Cache Manager (UCM) extends KV cache to petabyte-scale external storage, achieving 372% token throughput gains on GLM-5.1 at 128K sequence lengths. Token monetization frameworks and agentic operations enable carriers to charge for AI inference capacity and personalize services.

Google Cloud Other 2026-06-25

Google Cloud Multi-Agent Architecture Shifts Control from Human to Autonomous Verification

Google Cloud introduces agent-scale data management with multi-agent verification to reduce human oversight, and is deploying six Gemini agents with Nokia for autonomous network operations. Separately, Amazon plans to commercialize its Trainium chips, intensifying AI hardware competition against Google TPUs and NVIDIA GPUs.

Anthropic Other 2026-06-25

Anthropic Accuses Alibaba of Massive Distillation Attack on Claude AI Model

Anthropic accused Alibaba-linked operators of conducting 29 million exchanges via thousands of fraudulent accounts to distill Claude's capabilities, including long-context reasoning and decision-making. This highlights the vulnerability of AI model IP under API access, prompting a redefinition of model security boundaries.

Cisco Other 2026-06-25

Cisco Launches AI Troubleshooting Agent for Industrial Networks, Shifting Control Plane

Cisco launches AI Troubleshooting for Industrial Networks, an ambient agent on Cisco Cloud Control. It monitors switch syslogs, uses deterministic logic to diagnose physical and network faults, and provides OT technicians with actionable fix steps, aiming to reduce MTTD and MTTR by minimizing escalations to network experts.

AMD Other 2026-06-24

TSMC Hikes Advanced Node Prices 5-10%, Squeezing AI Chip Margins

TSMC has informed clients of 5-10% price hikes across all advanced nodes (7nm and more advanced), which account for 74% of its wafer revenue. Apple, NVIDIA, AMD, and others face higher costs, potentially raising AI infrastructure prices.

Cisco Other 2026-06-24

Cisco Live US & InfoComm 2026: Collaboration Enters the Agentic Era

...

Google Other 2026-06-24

Mandiant Reveals Cisco SD-WAN Manager Zero-Day: Control Plane Becomes Prime Target

Mandiant identified a zero-day (CVE-2026-20245) in Cisco Catalyst SD-WAN Manager that was exploited through a malicious CSV upload to escalate to root. The intrusion involved rogue peering, credential manipulation, and anti-forensic cleanup. The case highlights centralized SD-WAN control planes as a new attack surface for advanced threats.

ARM Other 2026-06-24

China's LineShine Tops TOP500: CPU-Only 2.2 ExaFLOPS with ARMv9 and HBM Memory

LineShine supercomputer achieves 2.198 ExaFLOPS FP64 sustained using 13.79 million ARMv9 cores across 20,480 nodes, making it the first system to exceed 2 ExaFLOPS without GPUs. Each node has dual LX2 CPUs (304 cores) with 32GB HBM, demonstrating a CPU+HBM architecture breakthrough for HPC.

Nokia Other 2026-06-24

Nokia, Amazon Web Services expand collaboration to deliver autonomous networks built for the AI era

...

NVIDIA Other 2026-06-23

NVIDIA Unveils 45°C Liquid Cooling for Rubin Chips, Eliminating Evaporative Water Use

NVIDIA announces a liquid cooling system for its Rubin GPUs that runs coolant at 45°C (hotter than a hot tub) and uses dry coolers in a closed loop to cut electricity use and eliminate water evaporation (a 100% reduction). However, chillers may still be needed in hot climates, and the impact on chip longevity remains unaddressed.

NVIDIA Other 2026-06-23

NVIDIA Launches Agent Toolkit: Nemotron Models, OpenShell Runtime for Specialized AI Agents

NVIDIA unveils Agent Toolkit, an open modular foundation with Nemotron models, NemoClaw blueprints, and OpenShell runtime, enabling enterprises to build secure, specialized AI agents. It targets life sciences, cybersecurity, and industrial workflows, aiming to turn frontier models into domain-specific digital coworkers.

Google Cloud Other 2026-06-23

Google Cloud and Nokia Embed Gemini AI Agents to Seize Network Operations Control Plane

Google Cloud and Nokia partner to embed Gemini AI agents (including a Router Agent and an Event Triage Agent) into Nokia Assurance Center, launching as SaaS on Google Cloud Marketplace in September 2026. The partners aim to reduce troubleshooting time by 50-80%, marking a fundamental shift from rule-based to AI-driven telco operations.

Nokia Other 2026-06-23

Nokia and Google Cloud Inject Gemini AI into Network Assurance

Nokia integrates Google's Gemini AI into its Assurance Center, creating six AI agents for event triage, anomaly detection, and remediation. Nokia claims a 50-80% reduction in troubleshooting time. The SaaS solution will run on Google Cloud, launching in September 2026.

Anthropic Other 2026-06-23

Micron-Anthropic Deal Locks In AI Memory Demand, But the Stock Has Already Priced It In

Micron signed a long-term supply contract with Anthropic covering HBM, DRAM, and SSDs, with joint analysis of memory subsystems for AI workloads. Micron also participated in Anthropic's Series H. The deal aims to transform memory from a commodity into an AI infrastructure asset, but the stock has already run up, so Micron must still prove a sustained scarcity premium.

NVIDIA Other 2026-06-23

NVIDIA Dominates TOP500 with Full-Stack Lock-in: Grace CPU, InfiniBand, and GPU Integration

NVIDIA powers 81% of TOP500 supercomputers, with Grace CPU adoption rising to 26 systems and Quantum InfiniBand connecting 376. The full-stack strategy (GPU, CPU, and networking) shifts procurement from open components to single-vendor lock-in. The top 8 Green500 systems use NVIDIA GPUs.

AMD Other 2026-06-23

AMD MI430X GPU Delivers >200 TFLOPS Native FP64, Reshaping HPC-AI Convergence Baseline

AMD powers 4 of the top 10 TOP500 supercomputers and previews the MI430X GPU with more than 200 TFLOPS of native FP64. The part targets AI-for-science workloads, making double-precision compute a key metric for converged HPC-AI infrastructure and directly challenging NVIDIA and Intel.
