NSA - AI Infrastructure Intelligence Search

AMD Other 2026-06-10

AMD EPYC Challenges Rack-Scale Density for Agentic AI Control

AMD claims its EPYC processors lead in rack-scale performance for agentic AI's CPU-intensive services (orchestration, caching, databases). Under a 100kW rack model, EPYC 9965 'Turin' delivers 2.37x throughput over NVIDIA Vera, with next-gen 'Venice' projected at 3.30x. Emphasizes deployability on current x86 platforms, avoiding future architecture dependency.

NVIDIA Other 2026-06-04

NVIDIA Nemotron 3 Ultra: A MoE-Based Control Plane for Cost-Efficient AI Agent Orchestration

NVIDIA launches Nemotron 3 Ultra, a 550B-parameter MoE model (55B active) purpose-built for AI agent orchestration. Featuring Multi-Teacher On-Policy Distillation (MOPD) and a Hybrid Mamba-Transformer architecture, it achieves 5x throughput and 30% cost savings on tasks like SWE-bench, signaling a shift of reasoning control to a layered agent system.

Cisco Other 2026-06-03

Cisco Silicon One Expands to Campus: Chip-Embedded Control Locks Agentic AI Networks

Cisco extends Silicon One to campus with C9550/C9350 switches and Cloud Control, embedding distributed visibility, sustained high throughput, and adaptive programmability directly into the silicon. Deep on-chip buffering, identity-aware forwarding, and sub-second policy updates shift control from perimeter devices to chip and cloud-native orchestration, targeting agentic AI workloads.

Microsoft Other 2026-06-02

Microsoft Build 2026: Unifying Agent Stack from Chip to Cloud

At Build 2026, Microsoft unveiled a comprehensive agent-era platform: Project Solara (chip-to-cloud), Microsoft IQ (unified grounding), Rayfin (backend generation), Azure HorizonDB, and GPU-accelerated analytics. The goal is to lock developers into Microsoft's ecosystem.

Cisco Other 2026-06-02

Cisco Shifts AI Network Control from K8s Black Box to Unified Fabric via Isovalent and VXLAN ESG

Cisco integrates Isovalent's eBPF into Nexus One for pod-to-fabric visibility and introduces VXLAN ESG-based AI job segmentation, embedding security and multi-tenancy into the network fabric. This targets the Kubernetes 'black box' bottleneck in AI inference, unifying control and troubleshooting.

NVIDIA Other 2026-06-02

NVIDIA Transaction Foundation Models Shift Financial AI Control to Unified GPU Stack

NVIDIA launches a developer example for transaction foundation models, partnering with Revolut, Mastercard, and others to replace siloed ML models with unified transformer-based systems. Leveraging Hopper GPUs, cuDF, and Nemotron, it shifts financial data processing from feature engineering to unified embeddings, effectively moving control to NVIDIA's hardware ecosystem.

Google Other 2026-06-01

Google AlloyDB Remote MCP Server GA: Standardizing AI Agent Data Access with Open Protocol

Google Cloud announces GA of AlloyDB Remote MCP Server, enabling AI agents to securely access operational data via HTTP endpoints. Built on open MCP protocol, it offers IAM fine-grained authorization, Model Armor protection, and audit logging, integrated with AlloyDB’s ScaNN vector index (10B+ vectors, 6x speed) and AI functions, positioning AlloyDB as the single source of truth for enterprise agentic workloads.

Cisco Other 2026-05-26

Cisco Full-Stack PQC Switches Lock Down Quantum Security with Hardware Trust Anchor

Cisco unveils C9000 Smart Switches, the first enterprise switches with full-stack post-quantum cryptography (PQC). A **Trust Anchor module (TAm)** embedded in FPGA enables quantum-resistant secure boot, while **IOS XE** integrates **ML-KEM** for key exchange in **SSH, MACsec, IPsec, TLS**. Aimed at harvest-now-decrypt-later threats, but no performance data disclosed.

AMD Other 2026-05-20

AMD Ryzen AI Halo & Max PRO 400: Local 300B Parameter Inference, but Hidden Lock-in and Thermal Limits

AMD launches Ryzen AI Halo developer platform (128GB unified memory, 200B parameter models) and Ryzen AI Max PRO 400 series (first x86 client to run 300B parameter models locally). Unified memory, ROCm optimization, and OEM partnerships aim to shift agentic AI from cloud to local, but shared memory bandwidth and thermal constraints limit real-world throughput.

Cloudflare Other 2026-05-18

Cloudflare Tests Anthropic Mythos: AI-Driven Exploit Chain Construction and Proof Generation

Cloudflare's Project Glasswing tested Anthropic's Mythos Preview, revealing its ability to automatically chain multiple low-severity bugs into exploitable PoCs with runnable code. They built a multi-stage harness to manage noise and context limits, achieving a significant leap in vulnerability discovery quality.

Amazon Other 2026-05-12

AWS AgentCore Payments: Autonomous AI Agent Spending Unlocks New Lock-in and Threat Surface

AWS previews managed payment capabilities in Bedrock AgentCore, enabling AI agents to autonomously pay for APIs, MCP servers, and web content, integrated with Coinbase and Stripe. Also launches Agent Toolkit for AWS and MCP Server GA. This pushes AI agents toward autonomous execution but introduces new security and lock-in risks.

Cisco Other 2026-05-07

Cisco-AMD Benchmark Shifts AI Fabric Control from GPU to SmartNIC and Switch

Cisco and AMD jointly release benchmarks for AI scale-out fabrics using N9000 800G switches, Pensando Pollara 400 smartNICs, and MI300X GPUs. IBPerf and MLPerf tests show P01/P99 bandwidth near 400Gbps line rate under incast congestion, proving deterministic performance that eliminates GPU stalls.

AMD Other Medium Signal 2026-05-07

AMD Backs SPEC CPU 2026 Benchmark, Emphasizing Open, Trusted Performance Measurement

AMD published a blog endorsing the upcoming SPEC CPU 2026 industry benchmark, emphasizing the critical role of open, reproducible CPU performance standards for customer infrastructure decisions in the AI era. The new benchmark updates its application suite and strengthens support for bare-metal cloud environments and parallel computing.

AMD Other High Signal 2026-05-06

AMD and OpenAI Contribute MRC Protocol to OCP for Scalable AI Networking

AMD, in collaboration with OpenAI, Microsoft, and others, contributed the MRC (Multipath Reliable Connection) protocol, designed for large-scale AI training, to the Open Compute Project (OCP). AMD co-authored the specification and has already deployed MRC on its programmable Pensando DPU/NIC products, positioning its networking technology as a key enabler for resilient and adaptive AI infrastructure.

AMD Other High Signal 2026-05-06

AMD and OpenAI Introduce MRC, a Next-Gen Transport Protocol for AI Training

AMD, in collaboration with OpenAI, Microsoft, and other industry leaders, has released the specification for the Multipath Reliable Connection (MRC) protocol. MRC addresses performance bottlenecks of RoCEv2 in hyperscale AI training clusters through intelligent packet spraying, selective retransmission, and network-signaled congestion control, aiming to improve bandwidth utilization and job resilience.

NVIDIA Other 2026-05-05

NVIDIA Extreme Co-Design: Vera Rubin Platform Targets Agentic Inference TCO Inflection

NVIDIA unveils an extreme co-design stack for agentic systems, featuring Vera Rubin NVL72, NVLink 6, ConnectX-9, BlueField-4, and Spectrum-X. By disaggregating inference, optimizing KV cache management, and deploying low-latency fabrics, it aims to break the throughput-interactivity tradeoff, making high-context token processing economically viable.

Cisco Other High Signal 2026-05-05

Cisco Launches Nexus Dashboard 4.2, Enhancing Network Monitoring and Security for AI Workloads

Cisco has released Nexus Dashboard 4.2, a data center management platform update. Key enhancements include Slurm integration for AI/HPC job monitoring, LLDP-based integration with NVIDIA NICs for adaptive routing, and Live Protect for zero-downtime vulnerability mitigation using eBPF. The release aims to provide a unified, intelligent, and secure operations plane for hybrid cloud and AI infrastructure.

Cisco Other Medium Signal 2026-05-05

Cisco Provides AI Defense Proactive Testing Platform via DevNet Lab

Cisco released a hands-on DevNet lab for its AI Defense Explorer Edition, enabling developers to conduct agent-driven red teaming on AI models and applications in a self-service manner. The tool uses natural language to define attack objectives and simulates multi-turn adaptive attacks, aiming to shift security testing left in the development lifecycle.

Anthropic Other High Signal 2026-05-05

White House Considers Pre-Release Security Review for AI Models, a 180-Degree Regulatory Pivot

The Trump administration is considering an executive order requiring new AI models to pass federal security review before public release. Anthropic Mythos was singled out for demonstrating powerful cyberattack capabilities, with NSA and intelligence agencies leading the review rather than the Commerce Department.

AMD Other Medium Signal 2026-05-04

AMD Showcases Heterogeneous Computing Strategy for Enterprise AI with Dell

At Dell Technologies World, AMD highlighted its heterogeneous computing portfolio, aiming to match the right compute engine to specific enterprise AI workloads, while emphasizing hardware-based security and manageability. This signals a shift in AI infrastructure from generic solutions to fine-tuned, scenario-specific deployments.

Reports

Filter