Reports
AI-generated structured vendor updates
AMD Highlights AI PC as Critical Infrastructure for Enterprise Agentic AI in IDC White Paper
AMD released an IDC white paper indicating that over 80% of enterprises are planning, piloting, or deploying AI PCs to support scaled Agentic AI. The report highlights high-performance NPUs and on-device AI processing as critical for enabling real-time, secure workflows, signaling a shift in enterprise AI infrastructure from cloud to endpoint.
Cisco Optimizes Developer Portals via Product Sprints, Focusing on AI Agent Workflow Data
Cisco's DevNet team detailed its practice of optimizing developer portals and content through product sprints, focusing on establishing measurable product-market fit indicators. Notably, the newly added analytics events specifically track how developer content is consumed by AI coding assistants or agents, such as copying Markdown and downloading OpenAPI/SDK/MCP documents.
NVIDIA and Google Cloud Deepen Collaboration to Build Cloud Infrastructure for AI Factories and Physical AI
NVIDIA and Google Cloud have announced an expanded collaboration, introducing new Vera Rubin and Blackwell GPU-powered instances to build "AI factories" scaling to nearly a million GPUs. The integration of Gemini, Nemotron, and other platforms aims to accelerate production deployment of agentic and physical AI, such as robotics and digital twins.
Cisco Launches AI Agent Security Scanner, Shifting Security Control Point to IDEs
Cisco has launched an AI Agent Security Scanner IDE extension designed to identify and mitigate new attack surfaces in the AI development toolchain. The tool provides local, multi-layered protection by statically scanning MCP server configurations and agent skill definitions, embedding secure coding rules during code generation, and continuously monitoring file integrity at runtime.
NVIDIA Partners with Adobe and WPP to Build Enterprise-Grade AI Agent Security Architecture Centered on OpenShell
NVIDIA deepens its strategic collaboration with Adobe and WPP to place intelligent AI agents at the center of enterprise marketing operations. The key move is the introduction and emphasis on the NVIDIA OpenShell secure runtime, which provides a policy-based, auditable, and isolated execution environment for AI agents handling multi-step workflows. This signals a shift from purely functional AI towards controlled and trustworthy enterprise-grade agentic architectures.
NVIDIA Shifts AI Infrastructure Metric from FLOPS to Cost Per Token
NVIDIA advocates for "cost per token" as the primary economic metric for AI infrastructure, replacing "FLOPS per dollar." This shift moves the focus from computational inputs to business outputs, requiring full-stack optimization across hardware, software, and networking to lower enterprise AI inference TCO.
AMD Announces Breakthrough MLPerf Inference 6.0 Results, Showcasing Multinode Scaling and Multimodal Capabilities
AMD's MLPerf Inference 6.0 submission, powered by Instinct MI355X GPUs, surpassed 1 million tokens per second for the first time on models like Llama 2 70B and GPT-OSS-120B. The results highlight efficient multinode scaling, rapid enablement of new workloads (e.g., text-to-video model Wan-2.2-t2v), and reproducible performance across a broad partner ecosystem.
Cisco Advances Cloud-Native Service Architecture with Isovalent
Telefónica's acens adopts Cisco's Isovalent Enterprise for Cilium to build a high-performance, observable, and secure Kubernetes platform, meeting enterprise needs in multi-cloud environments. The solution leverages eBPF technology to provide granular network policies and transparent encryption, enhancing security in multi-tenant environments.
Cisco Open Sources DefenseClaw for AI Agent Security Governance
Cisco launched open-source DefenseClaw, providing three-layer security architecture for AI agents like OpenClaw: supply chain scanning, runtime inspection, and system boundary control. The solution integrates NVIDIA's OpenShell sandbox for end-to-end automated governance.
NVIDIA Introduces Physical AI Data Factory Blueprint, Transforming Compute into Synthetic Data
At GTC, NVIDIA introduced the Physical AI Data Factory Blueprint, an open reference architecture designed to transform compute into large-scale, high-quality synthetic training data. Built on Cosmos world models and the OSMO operator, it addresses the bottleneck of scaling real-world data, aiming to serve as the data engine for next-gen autonomous systems and robots.
NVIDIA Forms Nemotron Coalition to Advance Open Frontier Models
NVIDIA announced the Nemotron Coalition at GTC, a collaboration with model builders and AI labs like Mistral AI to advance open, frontier-level foundation models. The initiative aims to foster the open model ecosystem by sharing expertise, data, and compute, emphasizing a future where AI is powered by a system of both open and proprietary models.
NVIDIA Donates GPU Dynamic Resource Allocation Driver to Kubernetes Community
NVIDIA donated its GPU Dynamic Resource Allocation (DRA) driver to the CNCF, making it an upstream Kubernetes project. This move aims to shift the core control point of GPU orchestration from proprietary vendor layers to the open-source community, and drive standardization in collaboration with major cloud providers.
NVIDIA Launches OpenShell, Establishing Runtime Sandbox for Secure Autonomous AI Agents
NVIDIA introduces OpenShell, an open-source project designed as a secure-by-design runtime for autonomous AI agents. It employs a "browser tab" model, isolating agent operations from policy enforcement at the system level to prevent policy overrides and data leaks. NVIDIA is collaborating with key security vendors to establish a unified policy layer for enterprise AI agents.
Cisco Extends Zero Trust Security to AI Agent Ecosystem
At RSA 2026, Cisco introduced security innovations for AI agents, extending Zero Trust Access with agent discovery in Identity Intelligence, agentic IAM in Duo, and MCP enforcement in Secure Access SSE. It launched AI Defense: Explorer Edition for self-serve testing and DefenseClaw open source framework to automate security deployment.
AMD and NAVER Cloud Collaborate on Sovereign AI Infrastructure in Korea
AMD and NAVER Cloud announced a strategic collaboration to accelerate sovereign AI infrastructure in Korea. NAVER Cloud will expand deployment of AMD EPYC "Venice" CPUs and gain early access to next-gen Instinct MI455X GPUs, with joint optimization of AI services and software stacks on AMD platforms.
AMD and Samsung Deepen Collaboration, Locking HBM4 Supply and Exploring Foundry Partnership
AMD and Samsung signed an MOU, designating Samsung as the primary HBM4 supplier for the next-gen Instinct MI455X GPU and collaborating on DDR5 memory optimized for 6th Gen EPYC CPUs. The companies will also explore opportunities for Samsung to provide foundry services for future AMD products.
NVIDIA Extends CUDA Tile Programming Model to Julia Language
NVIDIA introduces its CUDA Tile high-level GPU programming model to the Julia ecosystem via the cuTile.jl package. This move aims to lower the barrier to high-performance GPU kernel development by abstracting low-level thread and memory management with a tile-based data model, while maintaining high syntax and performance parity with the Python version.
Cisco Defines Security Architecture for Agentic AI Era with Expanded AI Defense and SASE Capabilities
Cisco announced major updates to its AI Defense solution, adding AI supply chain governance and runtime protections to mitigate risks of agentic AI compromise. Concurrently, Cisco SASE introduced AI traffic detection and optimization capabilities to ensure secure and reliable agentic workflows. These developments reflect Cisco's strategic focus on converging AI security with networking architectures.
NVFP4 + TeaCache Drive 10x FLUX.2 Inference Speedup, Locking Blackwell Ecosystem
NVIDIA and BFL optimize FLUX.2 on DGX B200/B300 using NVFP4 4-bit quantization, TeaCache step skipping, CUDA Graphs, and torch.compile, achieving 6.3x (single GPU) to 10.2x (dual GPU) latency reduction vs H200, with 40% memory savings. The stack is tightly coupled to TensorRT-LLM visualgen and Blackwell hardware.
NVIDIA Technologies and GPU Architectures | NVIDIA
NVIDIA Home NVIDIA Home ...