iOS - AI Infrastructure Intelligence Search

Fortinet Other 2026-07-28

Fortinet Launches FortiGate 3500G/400G AI Firewalls to Protect Hybrid Enterprise Networks

...

AMD Other 2026-07-28

AMD Unveils 6th Gen EPYC Venice, MI400 GPUs, Helios Rack for AI Inference

AMD launches 6th Gen EPYC Venice (2nm) and MI400 series GPUs, with the MI455X claiming a 34x token throughput improvement. The Helios rack solution integrates 72 MI455X GPUs with 18 EPYC CPUs via Pensando networking and ROCm software, offering 30% more inference tokens per dollar than competitors. It has been adopted by OpenAI, Meta, and others.

NVIDIA Other 2026-07-28

NVIDIA Invests $5B in SSI, Opens Vera Rubin Platform to Lock In AI Safety Research

NVIDIA makes a major equity investment in Safe Superintelligence (SSI) and provides access to its next-generation Vera Rubin GPU platform. The partnership goes beyond hardware sales, giving NVIDIA rare access to SSI's confidential research, with insights feeding back into NVIDIA's platform roadmap, marking a strategic shift from hardware vendor to deep research partner.

AMD Other 2026-07-27

AMD Launches 6th Gen EPYC Venice Processors and Helios Rack-Scale AI Solution

...

Microsoft Azure Other 2026-07-26

Microsoft Azure Integrates AMD Helios Rack-Scale AI Platform, Breaks NVIDIA GPU Monopoly

Microsoft Azure announces deployment of the AMD Helios rack-scale AI platform in H2 2026. Each rack integrates 72 MI455X GPUs, 18 Venice CPUs, and liquid cooling, delivering 2.9 exaflops of FP4 inference. This signals a major industry shift away from NVIDIA's GPU monopoly toward multi-vendor, heterogeneous AI infrastructure.

AMD Other 2026-07-26

AMD Launches Helios Rack-Scale AI Platform with MI400 GPUs, Targeting Inference TCO

At Advancing AI 2026, AMD unveiled the Helios rack-scale platform, integrating 72 MI455X GPUs and 18 EPYC Venice CPUs per rack and delivering 2.9 exaflops of FP4 inference with 31TB of HBM4 memory. The MI430X offers 288 TFLOPS of FP64 for HPC. AMD claims up to 30% more inference tokens per dollar vs. competitors.

Microsoft Other 2026-07-26

Microsoft Launches MAI Models, Slashes GPU Costs 89%, Reducing OpenAI Dependency

Microsoft unveiled MAI-Image-2.5-Pro and MAI-Voice-2-Flash on Azure Foundry, achieving 96.8% text rendering accuracy at 8K and reducing GPU costs by 84-89% vs. GPT. Integrated across Bing, PowerPoint, and Dynamics 365, the models mark a strategic shift away from OpenAI dependency. Also, NVIDIA Jetson heads to the moon for edge AI.

AMD Other 2026-07-26

AMD Helios Enters Production: 12-Stack HBM4 Outmuscles NVIDIA, UALoE Opens AI Network

AMD's second-generation Helios rack-scale AI server enters full production, featuring 72 MI455X GPUs with Samsung's exclusive 12-stack HBM4 (31TB per rack). Compared to NVIDIA's 8-stack design, it offers 50% more memory and 30% lower token cost. Microsoft Azure commits to large-scale deployment, solidifying hyperscaler dual-vendor strategy.

AMD Other 2026-07-25

AMD and Cerebras Unveil Disaggregated AI Inference with Wafer-Scale Engine

AMD and Cerebras launch a disaggregated AI inference solution that combines the Helios rack-scale system (6th-gen EPYC Venice CPUs plus up to 72 Instinct MI455X GPUs) with the Cerebras WSE-3 (4 trillion transistors) via Infinity Fabric. It targets ultra-low latency and high throughput for AI inference, challenging traditional GPU clusters.

NVIDIA Other 2026-07-25

NVIDIA Leads 25 Companies in Open Letter Against Restricting Open-Weight AI and Distillation

NVIDIA CEO Jensen Huang posted his first tweet on X, attaching an open letter signed by 25 companies including Microsoft, Meta, and IBM, urging Congress not to restrict open-weight AI models and distillation. The letter marks the formal split of the AI industry into open-weight and closed-source camps, with OpenAI, Anthropic, and Google notably absent, reshaping industry alliances and influencing global AI governance.

Cisco Other 2026-07-24

Cisco Proposes Logically Air-Gapped Model with eBPF, Shifting Security to Kernel

Cisco introduces a logically air-gapped governance model using eBPF and Cilium to create a software-defined cryptographic perimeter at the kernel level. Integrating Cisco Secure Workload with Isovalent, it aims to provide data residency and regulatory compliance for containerized, virtualized, and bare-metal environments without sacrificing cloud agility.

AMD Other 2026-07-24

AMD Launches Helios Rack-Scale AI Platform and MI455X GPU

...

Google Other 2026-07-24

Google Begins Gemini 4 Pre-training with 4M+ Context and Monthly Releases

Alphabet confirms the start of the largest pre-training run for Gemini 4, featuring 4M+ context and native multimodality, with near-monthly releases. 2026 capex was raised to $195-205B and Google Cloud was up 82% in Q2, signaling full-stack AI acceleration.

AMD Other 2026-07-24

AMD Helios Rack Challenges NVIDIA NVLink with Open UALoE Interconnect

At Advancing AI 2026, AMD launched the Helios rack with 72 MI455X GPUs, 18 EPYC Venice CPUs, and Pensando networking, claiming 30% more inference tokens per dollar than NVIDIA's NVL72. It introduced the open UALoE interconnect to break NVLink lock-in, partnering with Cerebras, Cisco, and major AI firms.

Microsoft Other 2026-07-24

AMD Helios Full-Stack AI Server Deployed on Azure, UALoE Open Standard Challenges NVLink

AMD and Microsoft Azure announce large-scale deployment of Helios full-stack AI servers, featuring 72 MI455X GPUs, 18 EPYC Venice CPUs, and Pensando DPUs, with 31TB HBM4 and 1.7PB/s memory bandwidth. Two new Azure VM series target Agentic AI and semiconductor design, marking Microsoft's multi-vendor strategy and challenging NVIDIA's dominance.

Microsoft Other 2026-07-24

Microsoft launches MAI-Image-2.5-Pro and MAI-Voice-2-Flash, deepening in-house AI ecosystem

Microsoft introduces MAI-Image-2.5-Pro and MAI-Voice-2-Flash, proprietary models that now power Bing, PowerPoint, OneDrive, and Dynamics 365 in place of third-party models. Microsoft claims up to an 84% GPU cost reduction and 2x faster voice, signaling a strategic shift to in-house AI.

AMD Other 2026-07-23

Microsoft Azure Deploys AMD Helios Rack with MI455X GPUs, Launches Three New VM Families

Microsoft Azure announces the deployment of the AMD Helios rack-scale AI platform, featuring 72 Instinct MI455X GPUs, 31TB of HBM4 memory, and 1.4PB/s of bandwidth per rack. Three new VM families target AI inference, data engineering, and HPC, powered by 6th-gen EPYC Venice CPUs and Pensando DPUs.

AMD Other 2026-07-23

AMD and Anthropic Form Strategic Partnership: 2GW MI450 GPU Deployment and $5B Investment

...

AMD Other 2026-07-23

AMD Invests $5B in Anthropic, Secures 2GW MI450 Deployment, Reshaping AI Compute Ecosystem

AMD and Anthropic announce a strategic partnership: Anthropic will deploy up to 2GW of AMD Instinct MI450 GPUs, with AMD investing up to $5B in Anthropic. They will collaborate on ROCm optimization and Claude workload tuning, marking AMD's transition from chip vendor to AI ecosystem investor and accelerating multi-sourcing in AI compute.

NVIDIA Other 2026-07-23

NVIDIA-OpenAI $100B Partnership: 10GW Vera Rubin AI Factories Reshape Ecosystem

NVIDIA and OpenAI announce a strategic partnership to deploy at least 10GW of NVIDIA systems using the Vera Rubin platform (Rubin GPU, Vera CPU, HBM4, NVLink 6). NVIDIA will invest up to $100B. First facilities go online in H2 2026, powering OpenAI's next-gen models, marking the era of multi-GW AI factories.
