Reports
AI-generated structured vendor updates
惠普与OpenAI达成战略合作,全面部署Frontier平台
...
Anthropic Alleges Largest AI Distillation Attack by Alibaba-Linked Operators, Exposing API Security Gaps
Anthropic alerted U.S. senators that Alibaba-linked operators conducted the largest known distillation attack, generating 28.8 million model exchanges via 25,000 fraudulent accounts to harvest Claude's frontier capabilities. The incident exposes a critical vulnerability in AI API security, forcing a rethinking of inference endpoint protection and usage monitoring.
Oracle Defense Ecosystem Cohort 3: Offline AI on Roving Edge Devices Goes Operational
Oracle announced the third cohort of its Defense Ecosystem at the Brussels summit, adding 10 companies. Concurrently, Whitespace's Saga AI system deployed on Oracle Roving Edge Devices during Royal Navy's Operation HIGHMAST, running classified AI workloads completely offline, proving sovereign edge AI is operational.
China's LineShine Tops TOP500: CPU-Only 2.2 ExaFLOPS with ARMv9 and HBM Memory
LineShine supercomputer achieves 2.198 ExaFLOPS FP64 sustained using 13.79 million ARMv9 cores across 20,480 nodes, making it the first system to exceed 2 ExaFLOPS without GPUs. Each node has dual LX2 CPUs (304 cores) with 32GB HBM, demonstrating a CPU+HBM architecture breakthrough for HPC.
NVIDIA Launches Agent Toolkit: Nemotron Models, OpenShell Runtime for Specialized AI Agents
NVIDIA unveils Agent Toolkit, an open modular foundation with Nemotron models, NemoClaw blueprints, and OpenShell runtime, enabling enterprises to build secure, specialized AI agents. It targets life sciences, cybersecurity, and industrial workflows, aiming to turn frontier models into domain-specific digital coworkers.
AMD MI430X GPU Delivers >200 TFLOPS Native FP64, Reshaping HPC-AI Convergence Baseline
AMD powers 4 of top 10 TOP500 supercomputers and previews MI430X GPU with >200 TFLOPS native FP64. This targets AI-for-science workloads, making double-precision compute a key metric for converged HPC-AI infrastructure, directly challenging NVIDIA and Intel.
Nvidia Vera Rubin CPU: 10-Wide Core Redefines CPU for Agentic Computing
At GTC Taipei 2026, Nvidia unveiled the Vera Rubin CPU with a custom 10-wide fetch/decode/execute pipeline, claiming world-leading IPC and bandwidth. Designed for agentic computing, it complements Nvidia GPUs. Nvidia also announced a partnership with Microsoft to reinvent the PC as a Personal AI and committed to returning 50% of free cash flow to shareholders.
Micron-Anthropic Deal: Memory Co-Architecture Locks in AI Supply Chain
Micron and Anthropic sign a strategic agreement covering joint memory/storage architecture design, multi-year supply, Claude adoption, and investment. This ties frontier AI model demands directly to infrastructure design, aiming to optimize token economics and power efficiency, but essentially locks in supply and restructures the ecosystem.
NVIDIA JUPITER Validates Grace Hopper: Exascale Science Goes Production
Europe's first exascale supercomputer JUPITER, powered by NVIDIA Grace Hopper Superchips and Quantum-X800 InfiniBand, achieves breakthroughs in brain mapping at cellular scale, 1km-resolution climate simulation, 6G AI, and 50-qubit quantum simulation, proving exascale is production-ready.
NVIDIA's French AI Push: Open Models as a Trojan Horse for Hardware Lock-in
NVIDIA partners with French entities to deploy GB200, Blackwell B300, and Vera Rubin NVL72 systems, while promoting the Nemotron open model coalition. This builds an NVIDIA-centric AI infrastructure ecosystem in Europe, masking hardware lock-in with open model rhetoric.
Nvidia ENPIRE: AI Agents Autonomously Train Robots to Install GPUs at 99% Success
Nvidia's ENPIRE framework enables AI coding agents (Codex, Claude Code) to autonomously write, test, and refine robot training code, achieving 99% pass@8 on GPU insertion and other contact-rich tasks. The system uses Git for collaboration, but token consumption scales faster than fleet size, and simulation-to-reality transfer remains imperfect.
AWS Trainium Hits 80% MFU on World Models, Reshaping AI Training Economics
AWS claims its Trainium chip achieves 80% Model FLOP Utilization (MFU) on world model training, nearly double the industry average. With a general-purpose instruction set and sustained thermal performance, Trainium is attracting startups like Odyssey and DeCart AI, challenging Nvidia's dominance in AI training infrastructure.
ASUS Launches NVIDIA GB300 Deskside AI Supercomputer, Shifting Control from Cloud to On-Prem
ASUS launches the ExpertCenter Pro ET900N G3, powered by NVIDIA's GB300 Grace Blackwell Ultra Desktop Superchip, delivering 20 PFLOPS and 748GB of coherent memory for near-trillion parameter models. Concurrently, Coherent expands InP fab in Texas for optical interconnects, and NVIDIA plans a $20-25B debt offering, signaling a systemic shift of AI control from cloud to localized enterprise hardware.
Huawei's LogicFolding: 3D Stacking Rewrites AI Chip Rules
Huawei's Tau Scaling Law and LogicFolding architecture boost transistor density by 55% and power efficiency by 41% via vertical logic stacking, targeting 1.4nm-class by 2031. Ascend 920/910C chips are now used for DeepSeek V4-Pro post-training, signaling real-world AI workload deployment and challenging Nvidia's dominance in China.
AMD MLPerf 6.0: MI350 GPUs Achieve 3.5x Leap with MXFP4, Debut Multi-Node Training
AMD submitted its most comprehensive MLPerf Training 6.0 results, including first multi-node training (FLUX.1 on 512 GPUs) and MXFP4 training recipe. MI355X delivers 3.5x generational leap over MI300X on Llama 2-70B, within 5% of NVIDIA B200. 10 ecosystem partners validated reproducibility.
NVIDIA and HPE Expand AI Factory with Vera CPU for Agentic AI, Full-Stack Integration
NVIDIA and HPE expand the HPE AI Factory with the Vera CPU, the first CPU built for agentic AI, plus the NVIDIA Agent Toolkit, Confidential Computing, and full-stack NVIDIA integration (Spectrum-X, BlueField, ConnectX). This turnkey solution targets enterprise agentic AI production, locking customers into NVIDIA's hardware-software stack.
Cloudflare One Stack: AI Agent Skills to Automate SASE Migration, Targeting Zscaler Lock-in
Cloudflare launches the Cloudflare One Stack, a set of skill files for AI agents to automate Zero Trust deployment and migration, with built-in logic for migrating from Zscaler and Palo Alto Networks. It integrates with the MCP server for live API access, aiming to slash switching costs and accelerate defection from rival SASE platforms.
NVIDIA Blackwell Sweeps MLPerf: NVLink and NVFP4 Redefine AI Training Economics
NVIDIA Blackwell dominates MLPerf Training 6.0, submitting across all seven benchmarks including MoE workloads. GB300 NVL72 delivers up to 1.6x faster training than GB200, with fifth-gen NVLink unifying 72 GPUs as one giant GPU. NVFP4 low-precision training and massive scale (8,192 GPUs) set new industry standards.
Microsoft Agent 365: Control Plane Lock Replaces Model Lock, Building an Entra Empire for AI
Microsoft launches Agent 365 as a unified control plane for AI agents, integrating Entra, Defender, Purview, Intune, and cost management, alongside the Microsoft IQ semantic platform. While claiming model diversity and openness, this effectively locks enterprise AI assets into Microsoft's management toolchain, shifting control from model layer to infrastructure layer.
CrowdStrike Continuous Identity for AI Agents Shifts Control Plane
At Identiverse 2026, CrowdStrike launched Continuous Identity for AI Agents, a Falcon Next-Gen Identity Security capability. Using SPIFFE for verifiable agent identity, it dynamically grants/revokes access based on real-time risk, eliminates standing privileges, and integrates with Falcon AIDR to detect privilege misuse, shifting the identity control plane from static policies to continuous risk assessment.