Reports
AI-generated structured vendor updates
AMD EPYC Challenges Rack-Scale Density for Agentic AI Control
AMD claims its EPYC processors lead in rack-scale performance for agentic AI's CPU-intensive services (orchestration, caching, databases). Under a 100kW rack model, EPYC 9965 'Turin' delivers 2.37x throughput over NVIDIA Vera, with next-gen 'Venice' projected at 3.30x. Emphasizes deployability on current x86 platforms, avoiding future architecture dependency.
NVIDIA Nemotron 3 Ultra: A MoE-Based Control Plane for Cost-Efficient AI Agent Orchestration
NVIDIA launches Nemotron 3 Ultra, a 550B-parameter MoE model (55B active) purpose-built for AI agent orchestration. Featuring Multi-Teacher On-Policy Distillation (MOPD) and a Hybrid Mamba-Transformer architecture, it achieves 5x throughput and 30% cost savings on tasks like SWE-bench, signaling a shift of reasoning control to a layered agent system.
Cisco Silicon One Expands to Campus: Chip-Embedded Control Locks Agentic AI Networks
Cisco extends Silicon One to campus with C9550/C9350 switches and Cloud Control, embedding distributed visibility, sustained high throughput, and adaptive programmability directly into the silicon. Deep on-chip buffering, identity-aware forwarding, and sub-second policy updates shift control from perimeter devices to chip and cloud-native orchestration, targeting agentic AI workloads.
Microsoft Build 2026: Unifying Agent Stack from Chip to Cloud
At Build 2026, Microsoft unveiled a comprehensive agent-era platform: Project Solara (chip-to-cloud), Microsoft IQ (unified grounding), Rayfin (backend generation), Azure HorizonDB, and GPU-accelerated analytics. The goal is to lock developers into Microsoft's ecosystem.
Cisco Shifts AI Network Control from K8s Black Box to Unified Fabric via Isovalent and VXLAN ESG
Cisco integrates Isovalent's eBPF into Nexus One for pod-to-fabric visibility and introduces VXLAN ESG-based AI job segmentation, embedding security and multi-tenancy into the network fabric. This targets the Kubernetes 'black box' bottleneck in AI inference, unifying control and troubleshooting.
NVIDIA Transaction Foundation Models Shift Financial AI Control to Unified GPU Stack
NVIDIA launches a developer example for transaction foundation models, partnering with Revolut, Mastercard, and others to replace siloed ML models with unified transformer-based systems. Leveraging Hopper GPUs, cuDF, and Nemotron, it shifts financial data processing from feature engineering to unified embeddings, effectively moving control to NVIDIA's hardware ecosystem.
Google AlloyDB Remote MCP Server GA: Standardizing AI Agent Data Access with Open Protocol
Google Cloud announces GA of AlloyDB Remote MCP Server, enabling AI agents to securely access operational data via HTTP endpoints. Built on open MCP protocol, it offers IAM fine-grained authorization, Model Armor protection, and audit logging, integrated with AlloyDB’s ScaNN vector index (10B+ vectors, 6x speed) and AI functions, positioning AlloyDB as the single source of truth for enterprise agentic workloads.
Cisco Full-Stack PQC Switches Lock Down Quantum Security with Hardware Trust Anchor
Cisco unveils C9000 Smart Switches, the first enterprise switches with full-stack post-quantum cryptography (PQC). A **Trust Anchor module (TAm)** embedded in FPGA enables quantum-resistant secure boot, while **IOS XE** integrates **ML-KEM** for key exchange in **SSH, MACsec, IPsec, TLS**. Aimed at harvest-now-decrypt-later threats, but no performance data disclosed.
AMD Ryzen AI Halo & Max PRO 400: Local 300B Parameter Inference, but Hidden Lock-in and Thermal Limits
AMD launches Ryzen AI Halo developer platform (128GB unified memory, 200B parameter models) and Ryzen AI Max PRO 400 series (first x86 client to run 300B parameter models locally). Unified memory, ROCm optimization, and OEM partnerships aim to shift agentic AI from cloud to local, but shared memory bandwidth and thermal constraints limit real-world throughput.
Cloudflare Tests Anthropic Mythos: AI-Driven Exploit Chain Construction and Proof Generation
Cloudflare's Project Glasswing tested Anthropic's Mythos Preview, revealing its ability to automatically chain multiple low-severity bugs into exploitable PoCs with runnable code. They built a multi-stage harness to manage noise and context limits, achieving a significant leap in vulnerability discovery quality.
AWS AgentCore Payments: Autonomous AI Agent Spending Unlocks New Lock-in and Threat Surface
AWS previews managed payment capabilities in Bedrock AgentCore, enabling AI agents to autonomously pay for APIs, MCP servers, and web content, integrated with Coinbase and Stripe. Also launches Agent Toolkit for AWS and MCP Server GA. This pushes AI agents toward autonomous execution but introduces new security and lock-in risks.
Cisco-AMD Benchmark Shifts AI Fabric Control from GPU to SmartNIC and Switch
Cisco and AMD jointly release benchmarks for AI scale-out fabrics using N9000 800G switches, Pensando Pollara 400 smartNICs, and MI300X GPUs. IBPerf and MLPerf tests show P01/P99 bandwidth near 400Gbps line rate under incast congestion, proving deterministic performance that eliminates GPU stalls.
AMD Backs SPEC CPU 2026 Benchmark, Emphasizing Open, Trusted Performance Measurement
AMD published a blog endorsing the upcoming SPEC CPU 2026 industry benchmark, emphasizing the critical role of open, reproducible CPU performance standards for customer infrastructure decisions in the AI era. The new benchmark updates its application suite and strengthens support for bare-metal cloud environments and parallel computing.
AMD and OpenAI Contribute MRC Protocol to OCP for Scalable AI Networking
AMD, in collaboration with OpenAI, Microsoft, and others, contributed the MRC (Multipath Reliable Connection) protocol, designed for large-scale AI training, to the Open Compute Project (OCP). AMD co-authored the specification and has already deployed MRC on its programmable Pensando DPU/NIC products, positioning its networking technology as a key enabler for resilient and adaptive AI infrastructure.
AMD and OpenAI Introduce MRC, a Next-Gen Transport Protocol for AI Training
AMD, in collaboration with OpenAI, Microsoft, and other industry leaders, has released the specification for the Multipath Reliable Connection (MRC) protocol. MRC addresses performance bottlenecks of RoCEv2 in hyperscale AI training clusters through intelligent packet spraying, selective retransmission, and network-signaled congestion control, aiming to improve bandwidth utilization and job resilience.
NVIDIA Extreme Co-Design: Vera Rubin Platform Targets Agentic Inference TCO Inflection
NVIDIA unveils an extreme co-design stack for agentic systems, featuring Vera Rubin NVL72, NVLink 6, ConnectX-9, BlueField-4, and Spectrum-X. By disaggregating inference, optimizing KV cache management, and deploying low-latency fabrics, it aims to break the throughput-interactivity tradeoff, making high-context token processing economically viable.
Cisco Launches Nexus Dashboard 4.2, Enhancing Network Monitoring and Security for AI Workloads
Cisco has released Nexus Dashboard 4.2, a data center management platform update. Key enhancements include Slurm integration for AI/HPC job monitoring, LLDP-based integration with NVIDIA NICs for adaptive routing, and Live Protect for zero-downtime vulnerability mitigation using eBPF. The release aims to provide a unified, intelligent, and secure operations plane for hybrid cloud and AI infrastructure.
Cisco Provides AI Defense Proactive Testing Platform via DevNet Lab
Cisco released a hands-on DevNet lab for its AI Defense Explorer Edition, enabling developers to conduct agent-driven red teaming on AI models and applications in a self-service manner. The tool uses natural language to define attack objectives and simulates multi-turn adaptive attacks, aiming to shift security testing left in the development lifecycle.
White House Considers Pre-Release Security Review for AI Models, a 180-Degree Regulatory Pivot
The Trump administration is considering an executive order requiring new AI models to pass federal security review before public release. Anthropic Mythos was singled out for demonstrating powerful cyberattack capabilities, with NSA and intelligence agencies leading the review rather than the Commerce Department.
AMD Showcases Heterogeneous Computing Strategy for Enterprise AI with Dell
At Dell Technologies World, AMD highlighted its heterogeneous computing portfolio, aiming to match the right compute engine to specific enterprise AI workloads, while emphasizing hardware-based security and manageability. This signals a shift in AI infrastructure from generic solutions to fine-tuned, scenario-specific deployments.