AI Infrastructure Intelligence

NVIDIA Other 2026-07-16

NVIDIA CUDA 13.3 Introduces clmad for Hardware-Accelerated Carryless Multiplication on GPUs

NVIDIA CUDA 13.3 adds the clmad hardware instruction for carryless multiply-accumulate on Ampere and newer GPUs. GHASH throughput reaches 6.3 TB/s on B200, up to 18.8x faster than a bitsliced implementation. The sum-check protocol speeds up by 3-13x. The instruction also benefits CRC, Reed-Solomon, and post-quantum cryptography.
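For readers unfamiliar with the operation, carryless multiplication is ordinary binary long multiplication with the additions replaced by XOR, which is polynomial multiplication over GF(2). The Python sketch below is a plain software reference for that arithmetic only. The function names `clmul` and `clmad_reference` are hypothetical, and modeling the "accumulate" step as an XOR with a third operand is an assumption; the summary does not give the instruction's actual operand widths or semantics.

```python
def clmul(a: int, b: int) -> int:
    """Carryless product of two non-negative integers (polynomials over GF(2))."""
    result = 0
    while b:
        if b & 1:
            # Add the shifted multiplicand without carries, i.e. XOR it in.
            result ^= a
        a <<= 1
        b >>= 1
    return result


def clmad_reference(a: int, b: int, c: int) -> int:
    """Assumed multiply-accumulate semantics: clmul(a, b) XOR c (hypothetical model)."""
    return clmul(a, b) ^ c


if __name__ == "__main__":
    # (x + 1) * (x + 1) = x^2 + 1 over GF(2), so 0b11 * 0b11 gives 0b101.
    print(bin(clmul(0b11, 0b11)))
    print(bin(clmad_reference(0b101, 0b11, 0b1)))
```

GHASH and CRC both reduce to chains of such products followed by reduction modulo a fixed polynomial, which is why a single hardware instruction for this step can matter for their throughput.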

Cisco Other 2026-07-15

Cisco, Aliro, zerothird Demo Operational QKD Network with MACsec

Cisco, Aliro, and zerothird demonstrated an operational entanglement-based QKD network at the Cisco Photonics Center. The Aliro Orchestrator manages quantum key distribution and feeds keys through Cisco's SKIP interface into Cisco 8000 routers for MACsec encryption, marking a transition from lab to production.

Huawei Other 2026-06-25

Huawei Pushes Token-Based Billing at MWC Shanghai 2026: Shifting Carrier Monetization from Bytes to AI Inference Value

At MWC Shanghai 2026, Huawei urged carriers to shift from byte-based to token-based billing for AI workloads, showcasing a 372% token throughput improvement in long-sequence inference via its AI Inference Acceleration Solution. It also highlighted the Upper-6 GHz band as critical for AI wearables requiring 20 Mbps uplink, aiming to reposition 5G-A networks as AI compute delivery infrastructure.

AMD Other 2026-06-24

TSMC Hikes Advanced Node Prices 5-10%, Squeezing AI Chip Margins

TSMC informs clients of 5-10% price hikes across all advanced nodes (7nm and more advanced), which account for 74% of its wafer revenue. Apple, Nvidia, AMD, and others face higher costs, potentially raising AI infrastructure prices.

NVIDIA Other 2026-06-23

NVIDIA Unveils 45°C Liquid Cooling for Rubin Chips, Slashes Water Use 100%

NVIDIA announces a liquid cooling system for its Rubin GPUs that runs on 45°C coolant (hotter than a hot tub) and uses dry coolers in a closed loop to cut electricity use and eliminate evaporative water loss (a 100% reduction). However, chillers may still be needed in hot climates, and the effect on chip longevity remains unaddressed.

AMD Other 2026-06-23

AMD MI430X GPU Delivers >200 TFLOPS Native FP64, Reshaping HPC-AI Convergence Baseline

AMD powers 4 of the top 10 TOP500 supercomputers and previews the MI430X GPU with >200 TFLOPS of native FP64. The part targets AI-for-science workloads, making double-precision compute a key metric for converged HPC-AI infrastructure and directly challenging NVIDIA and Intel.

NVIDIA Other 2026-06-23

NVIDIA's AI Agents and Digital Twins Reshape Telecom Network Control Plane

At DTW Ignite 2026, NVIDIA showcases its AI agent platform, which integrates NeMo synthetic data, the NemoClaw secure runtime, the OpenShell sandbox, and RTX PRO 6000-accelerated digital twins, aiming for autonomous telecom operations. Partners include SoftBank, Amdocs, NTT DATA, and others, as the focus moves from task automation to full autonomy.

Amazon Other 2026-06-21

AWS Seizes Agent Control Plane with MCP Gateway and AgentCore

AWS launches managed web search for Bedrock AgentCore, autonomous agents in Amazon Quick, subagent MicroVM orchestration with LangChain, and MCP Gateway, shifting enterprise AI agents from prototypes to governed infrastructure with cloud-native control planes and execution isolation.

NVIDIA Other 2026-06-18

NVIDIA's French AI Push: Open Models as a Trojan Horse for Hardware Lock-in

NVIDIA partners with French entities to deploy GB200, Blackwell B300, and Vera Rubin NVL72 systems, while promoting the Nemotron open model coalition. This builds an NVIDIA-centric AI infrastructure ecosystem in Europe, masking hardware lock-in with open model rhetoric.

AMD Other 2026-06-17

AMD Mustang Peak Threadripper: 144 cores, PCIe 6.0, TR6 socket – Power and memory challenges loom

AMD's Zen 6 Threadripper 'Mustang Peak' is confirmed on TSMC's 2nm process with DDR5, PCIe 6.0, and a new TR6 socket. Using Powderhorn CCDs, it scales to 144 cores (288 threads) with clocks above 6 GHz. However, massive power draw and memory bandwidth demands (possibly requiring MRDIMM) raise platform cost concerns.

NVIDIA Other 2026-06-17

NVIDIA RTX Remix 1.5: RTX IO Shrinks Game Sizes, AI Agents Reshape Modding

NVIDIA releases RTX Remix 1.5, featuring RTX IO compression that slashes Half-Life 2 RTX from 80GB to 50GB and reduces CPU overhead. The update also introduces AI agent integration via 'RTX Remix Skills,' allowing AI coding agents to automate complex modding tasks, lowering the barrier for non-programmers.

AMD Other 2026-06-17

AMD MLPerf 6.0: MI350 GPUs Achieve 3.5x Leap with MXFP4, Debut Multi-Node Training

AMD submitted its most comprehensive MLPerf Training 6.0 results, including its first multi-node training run (FLUX.1 on 512 GPUs) and an MXFP4 training recipe. MI355X delivers a 3.5x generational leap over MI300X on Llama 2-70B, within 5% of NVIDIA B200. Ten ecosystem partners validated reproducibility.

AMD Other 2026-06-16

AMD and Rackspace Deploy 30MW Governed AI Stack: Ecosystem Restructuring from Silicon to Outcomes

AMD and Rackspace sign a definitive agreement to deploy 30MW of AMD AI compute (Instinct GPUs including MI355X, EPYC CPUs) across Rackspace's data centers, creating a governed enterprise AI stack with single accountability from silicon to outcomes, targeting regulated industries.

AMD Other 2026-06-15

AMD Acquires MEXT: AI-Predicted Flash Nears DRAM Performance to Cut AI Memory TCO

AMD acquires MEXT, an AI-driven memory optimization startup. MEXT's predictive technology makes NAND Flash behave like DRAM, expanding effective memory capacity for AI workloads and lowering TCO. The tech will be integrated across AMD's data center portfolio (EPYC, Instinct) to address memory bottlenecks in large models.

AMD Other 2026-06-15

AMD Open-Sources AI Software Stack on Vultr, Taking on NVIDIA CUDA Ecosystem

AMD launches a suite of open-source, modular enterprise AI software components on Vultr Marketplace, including AMD Inference Microservices (AIMs), AI Workbench, Resource Manager, and Solution Blueprints. This aims to provide production-grade AI infrastructure without vendor lock-in, directly challenging NVIDIA's CUDA ecosystem.

NVIDIA Other 2026-06-15

NVIDIA's Desktop DGX Station with GB300 Shifts Control from Cloud to Local Hardware

ASUS launches the ExpertCenter Pro ET900N G3, built on the NVIDIA DGX Station GB300 architecture with the GB300 Grace Blackwell Ultra chip, 748GB of coherent memory, and 20 PFLOPS of AI performance. This deskside AI supercomputer enables local LLM fine-tuning, inference, and agentic AI workflows via NVLink-C2C and the full NVIDIA AI software stack, including NemoClaw.

NVIDIA Other 2026-06-11

NVIDIA Optimizes Google's DiffusionGemma for 1,000 tok/s Parallel Text Generation

NVIDIA optimizes Google DeepMind's DiffusionGemma, a diffusion-based text model generating 256 tokens per step in parallel. On a single H100, it achieves 1,000 tok/s, with deployment via NIM and NeMo. This breaks the sequential token bottleneck, slashing serving costs and latency for real-time AI.

AMD Other 2026-06-11

AMD, Dell, Cambridge Launch UK Sovereign AI Lab to Challenge NVIDIA's CUDA Dominance with Open ROCm

AMD, Dell, and the University of Cambridge launch the Sovereign AI Innovation Lab (SAIL) in the UK, deploying the Zenith supercomputer with 5th Gen EPYC CPUs and Instinct MI355X GPUs, plus the Sunrise fusion AI system. The lab promotes open, interoperable AI infrastructure based on AMD ROCm, challenging NVIDIA's CUDA lock-in and offering long-term technology choice for national AI initiatives.

AMD Other 2026-06-10

AMD EPYC Challenges Rack-Scale Density for Agentic AI Control

AMD claims its EPYC processors lead in rack-scale performance for agentic AI's CPU-intensive services (orchestration, caching, databases). Under a 100kW rack model, EPYC 9965 'Turin' delivers 2.37x the throughput of NVIDIA Vera, with next-gen 'Venice' projected at 3.30x. AMD emphasizes deployability on current x86 platforms, which avoids dependence on a future architecture.

Cisco Other 2026-06-04

Cisco AI Defense + AppOmni Extends Runtime Guardrails to SaaS AI Agents

Cisco integrates AI Defense with AppOmni, using AgentGuard as a real-time intercept layer inside SaaS environments. Custom guardrails now apply to Microsoft 365 Copilot, ServiceNow Now Assist, and other SaaS agents, monitoring MCP, chat, and agent-to-agent channels to block prompt injection, tool exploitation, and data exfiltration with a unified policy engine.
