CUDA - AI Infrastructure Intelligence Search

Nokia Other High Signal 2026-04-09

Nokia Partners with NVIDIA on AI-RAN Platform to Accelerate 6G Evolution

Nokia and NVIDIA have formed a strategic partnership, with NVIDIA investing $1 billion and jointly launching AI-RAN products based on NVIDIA's computing platform. The collaboration aims to embed AI data center capabilities into the RAN, driving the transition from 5G to AI-native 6G networks, with T-Mobile as the first deployment customer.

NVIDIA Other High Signal 2026-04-03

NVIDIA and Google Optimize Gemma 4 for Enhanced Local AI Agent Infrastructure

NVIDIA announces collaboration with Google to deeply optimize the Gemma 4 series of open models for its RTX, DGX Spark, and Jetson platforms. This move aims to extend high-performance, multimodal AI inference from the cloud to edge devices and personal workstations, providing full-stack model support (2B to 31B) for local AI agents.

NVIDIA Other Medium Signal 2026-04-03

NVIDIA Optimizes Gemma 4 Models for Local Agentic AI Acceleration

NVIDIA collaborates with Google to optimize the Gemma 4 family of models for efficient performance across a range of NVIDIA hardware, from edge devices to high-performance GPUs. These models support various tasks including reasoning, coding, and agent capabilities, making them suitable for local agentic AI applications.

Cisco Other Medium Signal 2026-03-27

Cisco Launches Nexus Hyperfabric AI with 800G Switch and HGX B300 GPU Integration

Cisco introduces Nexus Hyperfabric AI infrastructure, integrating 800G Ethernet switches and NVIDIA HGX B300 GPUs, offering both fully integrated and flexible 'bring-your-own' deployment models. The solution aligns with NVIDIA's Cloud Partner program to streamline AI infrastructure deployment and operations.

NVIDIA Other High Signal 2026-03-17

NVIDIA cuDF Accelerates Spark Data Processing for Enterprise A/B Testing

NVIDIA accelerates Apache Spark workflows on Google Kubernetes Engine using cuDF GPU DataFrame and CUDA-X libraries, delivering 4x performance gain and 76% cost reduction for Snap. The solution enables code-free migration of Spark applications and processes over 10PB data.

NVIDIA Other 2026-03-17

Project Rheo: NVIDIA Shifts Robot Training Control from Real Hospitals to Simulation

NVIDIA unveils Project Rheo, a blueprint combining Isaac Sim, GR00T VLA models, and synthetic data generation for hospital robotics. Developers train Physical AI policies in digital twins—loco-manipulation (surgical tray pick-and-place) and precision bimanual tasks (trocar assembly)—with Cosmos Transfer 2.5 for cross-scene generalization.

NVIDIA Other 2026-03-13

NVIDIA Warp: Differentiable Physics Simulation for AI Training on GPU

NVIDIA Warp is a framework for GPU-accelerated, differentiable physics simulation. It enables writing high-performance kernels in Python, with automatic differentiation, and integrates with PyTorch/JAX. The 2D Navier-Stokes example demonstrates end-to-end optimization, reducing the cost of generating training data for physics AI.

NVIDIA Other Medium Signal 2026-03-12

NVIDIA and Dassault Systèmes Integrate Virtual Twin and AI Physics Models

NVIDIA partners with Dassault Systèmes to integrate virtual twin platforms with NVIDIA accelerated computing, AI physics models, and CUDA-X/Omniverse libraries. The integration enables AI-based physical behavior simulation through SIMULIA software for real-time prediction across industries.

NVIDIA Other Medium Signal 2026-03-04

NVIDIA Extends CUDA Tile Programming Model to Julia Language

NVIDIA introduces its CUDA Tile high-level GPU programming model to the Julia ecosystem via the cuTile.jl package. This move aims to lower the barrier to high-performance GPU kernel development by abstracting low-level thread and memory management with a tile-based data model, while maintaining high syntax and performance parity with the Python version.

Trend Micro Other High Signal 2026-03-03

Trend Micro Report Highlights AI Supply Chain Risks and Model Attack Surfaces

Trend Micro's 'Fault Lines in the AI Ecosystem' report systematically analyzes security risks in the AI supply chain, including training data poisoning, third-party plugin vulnerabilities, and model theft attacks. It indicates that enterprise AI security boundaries have expanded from traditional IT infrastructure to the model layer and data pipelines.

AMD Other Medium Signal 2026-03-02

AMD Launches Enterprise AI Suite for Hardware-Software Integration

AMD released an Enterprise AI Suite integrating hardware and software ecosystems, offering an end-to-end toolchain from model optimization to deployment. The suite is optimized for Instinct accelerators and Ryzen AI processors to enhance AI workload performance and reduce development complexity.

AMD Other High Signal 2026-03-02

AMD Launches AI Developer Program to Strengthen Software Ecosystem

AMD launched a centralized AI developer portal offering ROCm software stack, optimized frameworks, and tools to lower development barriers and enhance hardware performance. The program systematically strengthens AMD's AI software ecosystem through pre-optimized models and community support, directly challenging NVIDIA's CUDA dominance.

NVIDIA Other 2026-01-23

NVFP4 + TeaCache Drive 10x FLUX.2 Inference Speedup, Locking Blackwell Ecosystem

NVIDIA and BFL optimize FLUX.2 on DGX B200/B300 using NVFP4 4-bit quantization, TeaCache step skipping, CUDA Graphs, and torch.compile, achieving 6.3x (single GPU) to 10.2x (dual GPU) latency reduction vs H200, with 40% memory savings. The stack is tightly coupled to TensorRT-LLM visualgen and Blackwell hardware.

NVIDIA Other 2025-11-08

NVIDIA Launches Interactive AI Agent for GPU-Accelerated Data Science with Nemotron Nano-9B

NVIDIA unveils an interactive AI agent powered by Nemotron Nano-9B-v2 and CUDA-X libraries, enabling natural language orchestration of ML workflows. It achieves 3x-43x GPU acceleration over CPU for data processing, model training, and hyperparameter optimization.

NVIDIA Other 2025-06-06

NVIDIA and SK hynix Co-Architect Next-Gen Memory for AI Factories, Locking HBM4 to Vera Rubin

NVIDIA and SK hynix announce a multi-year tech partnership to co-develop next-gen memory for Vera Rubin, RTX Spark, and Jetson Thor. Separately, SK Telecom deploys a gigawatt-scale AI cloud using the full DGX stack, targeting 2027. This elevates SK hynix from supplier to co-architect, strengthening NVIDIA's lock-in on HBM and the AI ecosystem.

NVIDIA Other 2025-06-01

NVIDIA RTX Spark and Nemotron-3 Ultra: AI Control Shifts from Cloud to Personal Edge

NVIDIA launched RTX Spark personal AI supercomputer (co-developed with MediaTek) and Nemotron-3 Ultra open-source model at GTC Taipei 2026. The N1X chip delivers 1 PFLOPS local AI compute, bringing LLM inference to PCs. This marks NVIDIA's pivot from cloud GPU vendor to edge AI infrastructure monopolist, redefining the PC as an AI-native device.

Reports

Filter