Reports
AI-generated structured vendor updates
ASUS Launches NVIDIA GB300 Deskside AI Supercomputer, Shifting Control from Cloud to On-Prem
ASUS launches the ExpertCenter Pro ET900N G3, powered by NVIDIA's GB300 Grace Blackwell Ultra Desktop Superchip, delivering 20 PFLOPS and 748GB of coherent memory for near-trillion parameter models. Concurrently, Coherent expands InP fab in Texas for optical interconnects, and NVIDIA plans a $20-25B debt offering, signaling a systemic shift of AI control from cloud to localized enterprise hardware.
Google Cloud Embeds Legal Verifiability into AI Agents via SPIFFE and Kakunin
Google Cloud introduces SPIFFE-based Agent Identity for Gemini Enterprise and Vertex AI, then overlays Kakunin's compliance layer to map internal SPIFFE identifiers to X.509 certificates generated in AWS KMS, with all state changes committed to WORM audit logs. This converts secure cloud workloads into legally auditable market participants to meet EU AI Act and MiCA accountability mandates.
NVIDIA & Coherent Expand 6-Inch InP Fab, Locking AI Optical Interconnect Supply Chain
Coherent breaks ground on the world's first 6-inch indium phosphide fab in Texas, backed by $2B from NVIDIA and multi-billion purchase commitments. The facility produces lasers, transceivers, and pluggable optics for silicon photonics interconnects, enabling NVIDIA's Vera Rubin Ultra NVL576 576-GPU clusters and signaling a mass shift from copper to optical backbones in AI data centers.
Huawei's LogicFolding: 3D Stacking Rewrites AI Chip Rules
Huawei's Tau Scaling Law and LogicFolding architecture boost transistor density by 55% and power efficiency by 41% via vertical logic stacking, targeting 1.4nm-class by 2031. Ascend 920/910C chips are now used for DeepSeek V4-Pro post-training, signaling real-world AI workload deployment and challenging Nvidia's dominance in China.
TSMC Reveals Glass Substrate Plan for CoWoS, Marking Packaging Inflection
TSMC publicly disclosed its glass substrate development plan for CoWoS, partnering with Ibiden and Innolux to validate feasibility. Glass substrates offer lower signal loss and higher thermal stability than organic substrates, addressing warpage and signal integrity in large AI chip packaging. Mass production is targeted for 2027-2028, directly competing with Intel's glass substrate roadmap.
Applied Materials Launches Deposition and Etch Systems for 3D Chip Scaling
Applied Materials unveils Centris Spectral SiN ALD for uniform dielectric deposition in GAA contacts and Producer Selectra Mo Etch for molybdenum-based 3D NAND word line separation, addressing high-aspect-ratio uniformity issues critical for AI chip manufacturing.
Intel Foundry Lands Google TPU Packaging Deal: EMIB-T Shakes TSMC's AI Chip Monopoly
Intel secures a multi-billion-dollar deal to package over 3 million Google TPUs using its advanced EMIB-T 2.5D packaging, while the chips themselves remain fabricated at TSMC. This marks Intel's strategic shift from CPU vendor to second-source AI packaging partner, targeting 2028 production. Intel's 18A node yields exceed expectations, but analysts caution the scope is limited to packaging.
Cisco AI Defense Adds Agent Harness Red Teaming for Agentic AI Security
Cisco introduces Agent Validation in AI Defense: Explorer Edition, a dedicated red-teaming capability for agentic AI systems. It autonomously probes agent harness attack surfaces, including tool routes, indirect content channels, and persistent state, providing verified findings beyond chat-based security assessments.
AWS S3 Annotations: 1GB Mutable Metadata Per Object, Killing External Metadata DBs
AWS launches S3 annotations, enabling up to 1,000 mutable annotations per object (each 1MB, total 1GB) in JSON/XML/YAML. Annotations auto-index into Apache Iceberg tables, queryable via Athena without retrieval charges. This embeds metadata into the storage layer, eliminating external metadata databases and reshaping AI agent data discovery.
NVIDIA and Coherent Scale 6-Inch InP Fab, Optical Interconnect Becomes AI Infrastructure's New Bottleneck Breaker
NVIDIA invests $2B and commits multi-billion purchases to Coherent's expanded 6-inch indium phosphide fab in Texas, scaling production of lasers and optical modules for AI interconnects. This addresses copper's distance and power limitations in large GPU clusters (e.g., Vera Rubin Ultra NVL576), pushing co-packaged optics into volume manufacturing.
Qualcomm's RISC-V Gamble: Tenstorrent Acquisition and Edge AI Pivot
Qualcomm pivots from ARM to open-source RISC-V, acquiring Ventana Micro and targeting Tenstorrent for $8-10B. Launches 'Dragonfly' brand for custom AI accelerators, aiming for $35B data-center revenue by 2031, betting on edge AI and AI agents.
NVIDIA ACE Goes Local: Control Shifts from Cloud to RTX GPU for Game AI
NVIDIA launches ACE Game Agent SDK (open-source C/C++ framework) and UE5 plugins (ASR/SLM/TTS), moving AI NPC inference fully on-device via GeForce RTX. DLSS 4.5 plugin adds multi-frame generation. This shifts control from cloud providers to NVIDIA GPU ecosystem, but masks hardware lock-in and local model limitations.
AMD MLPerf 6.0: MI350 GPUs Achieve 3.5x Leap with MXFP4, Debut Multi-Node Training
AMD submitted its most comprehensive MLPerf Training 6.0 results, including first multi-node training (FLUX.1 on 512 GPUs) and MXFP4 training recipe. MI355X delivers 3.5x generational leap over MI300X on Llama 2-70B, within 5% of NVIDIA B200. 10 ecosystem partners validated reproducibility.
NVIDIA and HPE Expand AI Factory with Vera CPU for Agentic AI, Full-Stack Integration
NVIDIA and HPE expand the HPE AI Factory with the Vera CPU, the first CPU built for agentic AI, plus the NVIDIA Agent Toolkit, Confidential Computing, and full-stack NVIDIA integration (Spectrum-X, BlueField, ConnectX). This turnkey solution targets enterprise agentic AI production, locking customers into NVIDIA's hardware-software stack.
OpenAI buys Ona: Control point shifts to persistent AI agent runtime
OpenAI acquires cloud infrastructure startup Ona to integrate its persistent execution environment into Codex, enabling AI agents to run independently for hours or days in enterprise-owned clouds. This addresses security, governance, and audit requirements, signaling OpenAI's shift from model provider to full-stack AI platform.
Cloudflare One Stack: AI Agent Skills to Automate SASE Migration, Targeting Zscaler Lock-in
Cloudflare launches the Cloudflare One Stack, a set of skill files for AI agents to automate Zero Trust deployment and migration, with built-in logic for migrating from Zscaler and Palo Alto Networks. It integrates with the MCP server for live API access, aiming to slash switching costs and accelerate defection from rival SASE platforms.
NVIDIA Blackwell Sweeps MLPerf: NVLink and NVFP4 Redefine AI Training Economics
NVIDIA Blackwell dominates MLPerf Training 6.0, submitting across all seven benchmarks including MoE workloads. GB300 NVL72 delivers up to 1.6x faster training than GB200, with fifth-gen NVLink unifying 72 GPUs as one giant GPU. NVFP4 low-precision training and massive scale (8,192 GPUs) set new industry standards.
Microsoft Agent 365: Control Plane Lock Replaces Model Lock, Building an Entra Empire for AI
Microsoft launches Agent 365 as a unified control plane for AI agents, integrating Entra, Defender, Purview, Intune, and cost management, alongside the Microsoft IQ semantic platform. While claiming model diversity and openness, this effectively locks enterprise AI assets into Microsoft's management toolchain, shifting control from model layer to infrastructure layer.
SiMa.ai Palette Neat: Natural-Language Agentic Environment Dismantles NVIDIA's GPU Moat
SiMa.ai launches open-source Palette Neat, an agentic development environment for Physical AI, paired with its sub-10W Modalix SoM. It uses natural language to abstract compute complexity, slashing dev cycles from months to days. Pin-compatible with NVIDIA SoM, it targets breaking the GPU ecosystem lock-in.
HPE Nonstop Embeds Agentic AI for Fraud: Control Shifts to Proprietary Inference Engine
HPE integrates Lusis TANGO AIF into Nonstop Compute, embedding Random Forest and deep learning models for real-time, adaptive anti-fraud operations. The solution offers self-healing infrastructure and linear scalability, shifting fraud detection from rule-based engines to AI-driven inference within the proprietary Nonstop environment.