Reports
AI-generated structured vendor updates
NVIDIA Advances Physical AI Integration in Robotics
NVIDIA showcases physical AI breakthroughs for robotics, accelerating deployment via Isaac Sim simulation and Jetson Orin edge modules. Case study: Aigen leverages synthetic data training and open-world foundation models to enable solar-powered robots for precision weeding, reducing herbicide use by 90%.
NVIDIA and Google Optimize Gemma 4 for Enhanced Local AI Agent Infrastructure
NVIDIA announces collaboration with Google to deeply optimize the Gemma 4 series of open models for its RTX, DGX Spark, and Jetson platforms. This move aims to extend high-performance, multimodal AI inference from the cloud to edge devices and personal workstations, providing full-stack model support (2B to 31B) for local AI agents.
NVIDIA Optimizes Gemma 4 Models for Local Agentic AI Acceleration
NVIDIA collaborates with Google to optimize the Gemma 4 family of models for efficient performance across a range of NVIDIA hardware, from edge devices to high-performance GPUs. These models support various tasks including reasoning, coding, and agent capabilities, making them suitable for local agentic AI applications.
Google Launches Gemma 4 Open Models, Targeting Edge Inference and AI Agent Architecture
Google introduces the Gemma 4 open model family, with four sizes from 2B to 31B parameters, emphasizing breakthrough intelligence-per-parameter and native support for agentic workflows, multimodality, and long context. The small models are engineered for edge devices, aiming to bring frontier reasoning to mobile and IoT scenarios.
Google Launches Gemma 4 Open Model Family
Google introduces Gemma 4 open model family with four size variants, optimized for edge and mobile devices. The series supports multimodal processing, long context windows and 140+ languages under Apache 2.0 license.
Cisco Launches Validated AI Infrastructure Solution
Cisco introduced validated AI infrastructure designs in collaboration with NVIDIA and Red Hat, offering pre-integrated AI POD solutions to address compatibility and security challenges in enterprise DIY AI infrastructure. The solution encompasses complete compute, networking, storage and AI software stacks with modular scalability.
AMD Announces Breakthrough MLPerf Inference 6.0 Results, Showcasing Multinode Scaling and Multimodal Capabilities
AMD's MLPerf Inference 6.0 submission, powered by Instinct MI355X GPUs, surpassed 1 million tokens per second for the first time on models like Llama 2 70B and GPT-OSS-120B. The results highlight efficient multinode scaling, rapid enablement of new workloads (e.g., text-to-video model Wan-2.2-t2v), and reproducible performance across a broad partner ecosystem.
Intel Demonstrates AI Performance with Xeon 6 and Arc Pro GPUs in MLPerf Inference
Intel showcased the performance of its Xeon 6 CPUs and Arc Pro B-Series GPUs in the MLPerf Inference v6.0 benchmarks, particularly in handling large language models (LLMs). The results indicate that a system with four Arc Pro B70 GPUs can process 120B parameter models, delivering up to 1.8x higher inference performance in multi-GPU setups.
Cisco Proposes Unified AI Fabric Architecture for Training/Inference Traffic
Cisco introduces unified AI fabric architecture using N9000 switches to intelligently route both training and inference traffic, addressing resource inefficiencies in dual-fabric setups. The solution features silicon-level low latency, real-time telemetry and automated policy tuning, targeting neocloud providers' platform transformation.
NVIDIA Collaborates with Energy Leaders to Position AI Factories as Smart Grid Assets
NVIDIA, in collaboration with Emerald AI, proposes treating large-scale AI data centers (AI factories) as flexible, intelligent grid assets rather than static power loads. This architecture integrates accelerated computing, power networking, and control to enhance grid reliability and optimize energy efficiency. Several major energy companies plan to collaborate on this architecture to support AI workloads and accelerate power connection.
NVIDIA Collaborates with Energy Leaders on AI Factory-Grid Integration Architecture
NVIDIA and Emerald AI introduced a new architecture treating AI factories as intelligent grid assets, combining accelerated computing, real-time energy orchestration and reference designs. The Vera Rubin DSX-based approach enables dynamic grid response and has gained support from multiple energy providers.
NVIDIA Expands AI Ecosystem via NVLink Fusion
NVIDIA announces Marvell joining its AI ecosystem through NVLink Fusion technology, enabling more efficient AI computing interconnects. This collaboration enhances data transfer efficiency in large-scale AI training and inference scenarios.
Cisco Open Sources DefenseClaw for AI Agent Security Governance
Cisco launched open-source DefenseClaw, providing three-layer security architecture for AI agents like OpenClaw: supply chain scanning, runtime inspection, and system boundary control. The solution integrates NVIDIA's OpenShell sandbox for end-to-end automated governance.
AWS and TGS Strategic Partnership for Energy AI and HPC Transformation
TGS selected AWS as preferred cloud provider, leveraging AWS HPC and generative AI for energy exploration solutions. Collaboration includes modernizing TGS Imaging AnyWare platform and deploying multimodal Subsurface Foundation Model with AWS Nitro security.
Fortinet and NVIDIA Collaborate on Isolated Infrastructure Acceleration for AI Factories
Fortinet and NVIDIA have partnered to launch an isolated infrastructure acceleration solution for AI factories, integrating Fortinet's security technologies with NVIDIA's accelerated computing capabilities to enhance the security and performance of AI workloads. The solution focuses on achieving network isolation and security acceleration in AI training and inference environments.
Cisco Launches Nexus Hyperfabric AI with 800G Switch and HGX B300 GPU Integration
Cisco introduces Nexus Hyperfabric AI infrastructure, integrating 800G Ethernet switches and NVIDIA HGX B300 GPUs, offering both fully integrated and flexible 'bring-your-own' deployment models. The solution aligns with NVIDIA's Cloud Partner program to streamline AI infrastructure deployment and operations.
Arm Expands into Silicon Products with First Self-Designed AGI CPU
Arm is expanding its compute platform into production silicon for the first time, launching the self-designed Arm AGI CPU for AI data centers and agentic workloads. It targets over 2x performance per rack versus x86 platforms and is backed by lead partner Meta, customers like OpenAI, and a broad OEM/ODM ecosystem.
NVIDIA Introduces Physical AI Data Factory Blueprint, Transforming Compute into Synthetic Data
At GTC, NVIDIA introduced the Physical AI Data Factory Blueprint, an open reference architecture designed to transform compute into large-scale, high-quality synthetic training data. Built on Cosmos world models and the OSMO operator, it addresses the bottleneck of scaling real-world data, aiming to serve as the data engine for next-gen autonomous systems and robots.
NVIDIA Unveils Physical AI Data Factory Blueprint and Frontier Models
NVIDIA launched three physical AI frontier models and an open Physical AI Data Factory reference architecture at GTC 2026, converting computation into synthetic training data via Cosmos world model and OSMO operators. The Omniverse DSX digital twin blueprint enables validation and real-time AI inference integration with Jetson modules.
NVIDIA Forms Nemotron Coalition to Advance Open Frontier Models
NVIDIA announced the Nemotron Coalition at GTC, a collaboration with model builders and AI labs like Mistral AI to advance open, frontier-level foundation models. The initiative aims to foster the open model ecosystem by sharing expertise, data, and compute, emphasizing a future where AI is powered by a system of both open and proprietary models.