Keyword: AI Infrastructure
95 Total Reports
Page 4 of 5
Cisco Other High Signal 2026-04-09

Cisco Demonstrates Unified S/NOC with Agentic AI for Autonomous Security Operations at MWC 2026

At MWC 2026, Cisco operated a unified Security and Network Operations Center (S/NOC), demonstrating seamless integration across its Security Cloud, XDR, and Splunk platforms. The core innovation was the use of a beta Agentic AI to generate "Instant Attack Storyboards" for triage and investigation, with automated workflows bridging incidents to Splunk Enterprise Security for deeper threat hunting.

Intel Other High Signal 2026-04-08

Intel and SambaNova Announce Heterogeneous Inference Architecture for Agentic AI

Intel and SambaNova have announced a collaborative blueprint for Agentic AI production workloads. The heterogeneous design combines GPUs, SambaNova RDUs, and Intel Xeon 6 processors to address performance, efficiency, and software compatibility issues, with availability expected in H2 2026.

Cisco Other Medium Signal 2026-04-08

Cisco Deepens Nutanix Partnership, Extending HCI to AI and Edge

Cisco announced multiple advancements in its partnership with Nutanix, focusing on integrating the Nutanix Cloud Platform into Cisco AI PODs, Cisco Unified Edge, and FlashStack. The goal is to provide a unified, validated blueprint and operational model for both AI and traditional workloads from core to edge.

ARM Other 2026-04-07

Arm Partners with Monash University Malaysia to Advance Semiconductor Talent for AI Era

Arm announced a collaboration with Monash University Malaysia's School of Engineering, donating IC design development boards and appointing an executive as a guest lecturer. The initiative aims to cultivate semiconductor talent with hands-on Arm architecture and modern system design experience for the AI era.

Microsoft Other High Signal 2026-04-06

Microsoft Partners with Domestic Operators to Build Sovereign AI Infrastructure in Japan

Microsoft announced a $10B investment in Japan over four years, with a key pillar being a collaboration with Sakura Internet and SoftBank. This partnership will offer GPU-based AI compute services through Azure, managed by domestic providers to ensure data residency within Japan. This addresses the demand for sovereign AI infrastructure for sensitive workloads.

Anthropic Other Medium Signal 2026-04-06

Anthropic Establishes Fourth APAC Office in Sydney, Explores Local Compute Capacity

Anthropic announced it will open its fourth Asia-Pacific office in Sydney, Australia, to serve the ANZ market. The company plans to deepen engagement with local institutions and explore expanding compute capacity in Australia via third-party partners to address enterprise data residency requirements.

Google Other High Signal 2026-04-03

Google Launches Gemma 4 Open Models, Targeting Edge Inference and AI Agent Architecture

Google introduces the Gemma 4 open model family, with four sizes from 2B to 31B parameters, emphasizing breakthrough intelligence-per-parameter and native support for agentic workflows, multimodality, and long context. The small models are engineered for edge devices, aiming to bring frontier reasoning to mobile and IoT scenarios.

Google Other Medium Signal 2026-04-03

Google Launches Gemma 4 Open Model Family

Google introduces the Gemma 4 open model family in four size variants, optimized for edge and mobile devices. The series supports multimodal processing, long context windows, and 140+ languages, and is released under the Apache 2.0 license.

Cisco Other Medium Signal 2026-04-02

Cisco Launches Validated AI Infrastructure Solution

Cisco introduced validated AI infrastructure designs in collaboration with NVIDIA and Red Hat, offering pre-integrated AI POD solutions to address compatibility and security challenges in enterprise DIY AI infrastructure. The solution encompasses complete compute, networking, storage and AI software stacks with modular scalability.

AMD Other High Signal 2026-04-02

AMD Announces Breakthrough MLPerf Inference 6.0 Results, Showcasing Multinode Scaling and Multimodal Capabilities

AMD's MLPerf Inference 6.0 submission, powered by Instinct MI355X GPUs, surpassed 1 million tokens per second for the first time on models like Llama 2 70B and GPT-OSS-120B. The results highlight efficient multinode scaling, rapid enablement of new workloads (e.g., text-to-video model Wan-2.2-t2v), and reproducible performance across a broad partner ecosystem.

ARM Other High Signal 2026-04-01

Arm Launches AGI CPU Silicon, Extends AI Infrastructure Reach

Arm debuts its first self-designed AGI CPU silicon, moving beyond IP licensing to offer full-stack solutions from custom silicon to integrated platforms. This shift redefines control points in AI infrastructure supply chains, enabling enterprises to optimize AI workload deployment at the hardware layer.

Intel Other Medium Signal 2026-04-01

Intel Demonstrates AI Performance with Xeon 6 and Arc Pro GPUs in MLPerf Inference

Intel showcased the performance of its Xeon 6 CPUs and Arc Pro B-Series GPUs in the MLPerf Inference v6.0 benchmarks, particularly on large language models (LLMs). The results indicate that a system with four Arc Pro B70 GPUs can process 120B-parameter models, delivering up to 1.8x higher inference performance in multi-GPU setups.

NVIDIA Other High Signal 2026-03-31

NVIDIA Collaborates with Energy Leaders to Position AI Factories as Smart Grid Assets

NVIDIA, in collaboration with Emerald AI, proposes treating large-scale AI data centers (AI factories) as flexible, intelligent grid assets rather than static power loads. This architecture integrates accelerated computing, power networking, and control to enhance grid reliability and optimize energy efficiency. Several major energy companies plan to collaborate on this architecture to support AI workloads and accelerate power connection.

NVIDIA Other High Signal 2026-03-31

NVIDIA Collaborates with Energy Leaders on AI Factory-Grid Integration Architecture

NVIDIA and Emerald AI introduced a new architecture treating AI factories as intelligent grid assets, combining accelerated computing, real-time energy orchestration and reference designs. The Vera Rubin DSX-based approach enables dynamic grid response and has gained support from multiple energy providers.

ARM Other High Signal 2026-03-27

Arm Expands into Silicon Products with First Self-Designed AGI CPU

Arm is expanding its compute platform into production silicon for the first time, launching the self-designed Arm AGI CPU for AI data centers and agentic workloads. It targets over 2x performance per rack versus x86 platforms and is backed by lead partner Meta, customers like OpenAI, and a broad OEM/ODM ecosystem.

ARM Other High Signal 2026-03-25

Arm Launches AGI CPU for Agentic AI Infrastructure Era

Arm introduces the Arm AGI CPU, its first silicon product, based on Neoverse and designed for agentic AI infrastructure. Optimized for massively parallel workloads, it supports 272 cores per blade in a 1OU design, delivering 8160 cores per rack and over 2x performance vs. x86 systems.
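
The per-rack figure quoted above follows from the per-blade count. A minimal sketch of that arithmetic, assuming (the summary does not say so) that a rack is filled only with identical 272-core blades; the function and constant names are illustrative, not from the source:

```python
# Figures quoted in the report summary.
CORES_PER_BLADE = 272
CORES_PER_RACK = 8160


def blades_per_rack(cores_per_rack: int, cores_per_blade: int) -> int:
    """Return the blade count implied by the two core counts.

    Assumes every blade in the rack is identical; raises if the
    totals do not divide evenly, since the assumption would then fail.
    """
    blades, remainder = divmod(cores_per_rack, cores_per_blade)
    if remainder:
        raise ValueError("core counts do not divide evenly")
    return blades


if __name__ == "__main__":
    print(blades_per_rack(CORES_PER_RACK, CORES_PER_BLADE))  # 30
```

Under that assumption the quoted numbers imply 30 blades per rack, which is consistent with a dense 1OU layout.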

ARM Other High Signal 2026-03-25

Arm Launches AGI CPU Silicon for AI Infrastructure Market

Arm introduced its first production AGI CPU silicon in March 2026, marking a strategic shift from IP licensing to becoming a full silicon solutions provider. Designed for next-gen AI infrastructure, the chip may reshape the data center processor ecosystem.

ARM Other High Signal 2026-03-25

Arm Neoverse Reshapes Control Layer in AI Infrastructure

Arm introduces Neoverse infrastructure CPU cores optimized for cloud, AI, and HPC workloads. NVIDIA, AWS, Microsoft, and Google have adopted them for their AI platforms, citing performance gains and energy efficiency. The architecture enables high-density AI workload deployment in cloud and edge environments with enhanced multi-tenant security.

NVIDIA Other High Signal 2026-03-24

NVIDIA Donates GPU Dynamic Resource Allocation Driver to Kubernetes Community

NVIDIA donated its GPU Dynamic Resource Allocation (DRA) driver to the CNCF, making it an upstream Kubernetes project. This move aims to shift the core control point of GPU orchestration from proprietary vendor layers to the open-source community, and drive standardization in collaboration with major cloud providers.

Check Point Other 2026-03-23

Check Point AI Factory Blueprint: Security Control Shifts to NVIDIA DPU and LLM Layer

Check Point unveils its AI Factory Security Blueprint, tightly integrating its firewall with the NVIDIA BlueField DPU via DOCA. The architecture enforces security at four layers: LLM, AI infrastructure, perimeter, and workload. The new AI Factory Firewall delivers hardware-accelerated threat prevention without consuming CPU/GPU cycles, aiming to embed security into the AI fabric.