Reports
AI-generated structured vendor updates
Ericsson Doubles Down on AI and 6G, Helping Operators Win the Next Decade
...
Winbond Joins TSMC's WoW Advanced Packaging Memory Supply Chain, Breaking the Big Three DRAM Makers' Monopoly
...
Huawei Pushes Token-Based Billing at MWC Shanghai 2026: Shifting Carrier Monetization from Bytes to AI Inference Value
At MWC Shanghai 2026, Huawei urged carriers to shift from byte-based to token-based billing for AI workloads, showcasing a 372% token throughput improvement in long-sequence inference via its AI Inference Acceleration Solution. It also highlighted the Upper-6 GHz band as critical for AI wearables requiring 20 Mbps uplink, aiming to reposition 5G-A networks as AI compute delivery infrastructure.
NVIDIA and AWS Make cuVS the Default for GPU Vector Search, G7 Instances Deliver 4.6x Inference
NVIDIA and AWS collaborate to embed cuVS as default GPU-accelerated vector search in OpenSearch Serverless, delivering 10x faster indexing at 1/4 cost. New EC2 G7 instances with RTX PRO 4500 Blackwell GPUs achieve up to 4.6x inference performance. AWS achieves GB300 Exemplar Cloud status for training.
NVIDIA Dominates TOP500 with Full-Stack Lock-in: Grace CPU, InfiniBand, and GPU Integration
NVIDIA powers 81% of TOP500 supercomputers, with Grace CPU adoption rising to 26 systems and Quantum InfiniBand connecting 376. The full-stack strategy (GPU+CPU+networking) shifts procurement from open components to single-vendor lock-in; top 8 Green500 systems use NVIDIA GPUs.
NVIDIA's AI Agents and Digital Twins Reshape Telecom Network Control Plane
At DTW Ignite 2026, NVIDIA showcases its AI agent platform integrating NeMo synthetic data, the NemoClaw secure runtime, the OpenShell sandbox, and RTX PRO 6000-accelerated digital twins, aiming for autonomous telecom operations. Partners include SoftBank, Amdocs, and NTT DATA, among others, and the stated goal is to move from task automation to full autonomy.
NVIDIA JUPITER Validates Grace Hopper: Exascale Science Goes Production
Europe's first exascale supercomputer JUPITER, powered by NVIDIA Grace Hopper Superchips and Quantum-X800 InfiniBand, achieves breakthroughs in brain mapping at cellular scale, 1km-resolution climate simulation, 6G AI, and 50-qubit quantum simulation, proving exascale is production-ready.
AMD MLPerf 6.0: MI350 GPUs Achieve 3.5x Leap with MXFP4, Debut Multi-Node Training
AMD submitted its most comprehensive MLPerf Training 6.0 results, including its first multi-node training run (FLUX.1 on 512 GPUs) and an MXFP4 training recipe. MI355X delivers a 3.5x generational leap over MI300X on Llama 2-70B, landing within 5% of NVIDIA B200. Ten ecosystem partners validated reproducibility.
HBM Bottleneck Reshapes AI Infrastructure: Asian Memory Makers Gain Leverage Over Nvidia
SK Hynix, Samsung, and Micron have crossed $1 trillion market cap as HBM becomes the hard limit in AI infrastructure. Asian suppliers now account for 90% of Nvidia's production costs, shifting the bottleneck from GPU compute to stacked memory and advanced packaging.
Google Lightning Engine: 4.9x Spark Performance with Ecosystem Lock-in Risks
Google Cloud launches Lightning Engine GA for Apache Spark, delivering up to 4.9x faster performance via vectorized native execution on Gluten/Velox. Optimized Cloud Storage and BigQuery connectors boost throughput, but the premium tier and deep integration create vendor lock-in risks.
NVIDIA's UK Sovereign AI Play: From Chip Vendor to National Infrastructure Controller
NVIDIA partners with the UK government to deploy sovereign AI infrastructure via Isambard-AI (5,400 GH200 superchips) and the Sovereign AI Fund, backing local startups. This move establishes a national AI control plane, locking compute into NVIDIA's ecosystem and bypassing traditional hyperscalers like AWS and Azure.
Cisco Cloud Control Unifies Management: Control Plane Shifts to Single Pane for AgenticOps
At Cisco Live 2026, Cisco unveils Cisco Cloud Control, a unified dashboard for networking, security, compute, and observability that enables human-AI agent collaboration. Cisco also expands Live Protect kernel-level patching to N9000 switches, outlines a quantum-safe roadmap, and launches C9550/C8600 hardware.
HBM Profitability Falls Below DDR5, TrendForce Warns of Multi-Fold Price Surge in 2027
TrendForce reports that HBM per-wafer revenue fell below DDR5 64GB RDIMM in Q1 2026, making HBM less profitable. Suppliers will reallocate capacity, leading to multi-fold HBM4 contract price increases in 2027. Demand from NVIDIA Rubin Ultra and AI ASICs will further tighten supply.
NVIDIA RTX Spark: SoC Seizes PC Control, AI Compute Revolution with Ecosystem Lock-in
NVIDIA launches the RTX Spark SoC, integrating a Blackwell GPU with a 20-core Grace CPU (co-designed with MediaTek), NVLink-C2C at 600GB/s, up to 128GB of unified memory, 1 petaflop of FP4 AI compute, and local support for 120B-parameter LLMs. This marks a shift from GPU vendor to platform provider, directly challenging Apple's M series, Qualcomm, and x86 incumbents.
Google Showcases AI-Native App Architecture Paradigm via Agent Platform
A Google Cloud customer case study demonstrates a "stream-of-consciousness to tasks" app built on Gemini Enterprise Agent Platform. The architecture leverages APIs for native audio streaming, proactive tool calling, and session resumption to enable seamless, low-latency conversion from speech to structured tasks, featuring a provider-agnostic abstraction layer for future voice features.
US AI Infrastructure Expansion Stalls: 30%-50% of 16GW Capacity Delayed
The US planned ~16GW of data center capacity this year, but 30%-50% of it is expected to face delays or cancellations, and only ~5GW is actually breaking ground. Power, supply chain, and workforce bottlenecks are holding back AI infrastructure deployment.
Intel, Nokia, and Dell Introduce Dedicated UPF Appliance for Far Edge
At MWC 2026, Intel, Nokia, and Dell previewed a far-edge UPF appliance powered by Intel Xeon 6 SoC. The solution aims to deliver high-performance, low-power 5G core user plane processing for telcos in space- and power-constrained far-edge environments, with integrated AI capabilities.
Nokia Opens R&D and Manufacturing Campus in Oulu Focused on AI-Driven Networks
Nokia has opened a new R&D and manufacturing campus in Oulu, Finland, dedicated to designing, testing, and delivering next-generation networks built for AI. The campus integrates R&D, smart manufacturing, and a partner ecosystem, aiming to advance 5G/6G and private networks to power the AI supercycle with essential connectivity.
Cisco Demonstrates Unified S/NOC with Agentic AI for Autonomous Security Operations at MWC 2026
At MWC 2026, Cisco operated a unified Security and Network Operations Center (S/NOC), demonstrating seamless integration across its Security Cloud, XDR, and Splunk platforms. The core innovation was the use of a beta Agentic AI to generate "Instant Attack Storyboards" for triage and investigation, with automated workflows bridging incidents to Splunk Enterprise Security for deeper threat hunting.
Nokia Partners with NVIDIA on AI-RAN Platform to Accelerate 6G Evolution
Nokia and NVIDIA have formed a strategic partnership, with NVIDIA investing $1 billion and jointly launching AI-RAN products based on NVIDIA's computing platform. The collaboration aims to embed AI data center capabilities into the RAN, driving the transition from 5G to AI-native 6G networks, with T-Mobile as the first deployment customer.