推理 - AI Infrastructure Intelligence Search

Ericsson Other Medium Signal 2026-03-25

Ericsson Collaborates on 6G AI Network Sensing and Optimization

Ericsson partners with Forschungszentrum Jülich to develop 6G AI technologies, focusing on neuromorphic and quantum computing for network sensing and optimization. The collaboration addresses 6G network complexity, energy efficiency, and real-time data processing challenges through non-von Neumann computing paradigms.

ARM Other High Signal 2026-03-25

Arm Launches Self-Developed AGI CPU for AI Data Center Market

Arm introduces its first self-developed AGI CPU for AI data centers, featuring Neoverse V3 architecture with claimed 2x performance per rack over x86 platforms. This marks Arm's strategic shift from IP licensing to silicon provider, with support from key customers including Meta and OpenAI.

Cisco Other Medium Signal 2026-03-23

Cisco and Digital Realty Launch Unified AI Infrastructure Solution

Cisco partners with Digital Realty to deliver a pre-validated AI infrastructure reference architecture integrating 8000 series routers, SRv6 networking and AI security solutions, supporting 20-50kW high-density POD deployment. The solution leverages Digital Realty's global data center platform for distributed AI inference, simplifying enterprise AI scaling.

NVIDIA Other High Signal 2026-03-21

NVIDIA Blackwell Architecture Achieves 25x Energy Efficiency Gain

NVIDIA's Blackwell GPU architecture delivers 25x energy efficiency improvement over Hopper through Transformer Engine and NVLink innovations. This architectural breakthrough significantly reduces AI training/inference operational costs, directly impacting data center TCO and sustainability metrics.

NVIDIA Other Medium Signal 2026-03-20

NVIDIA Extends RTX AI Capabilities to Local Agentic AI, Accelerating Gemma 4 Inference

At GTC 2026, NVIDIA announced it is extending its RTX platform capabilities to the domain of local Agentic AI, aiming to accelerate the inference performance of open models like Gemma 4 on end-user devices. This move seeks to leverage local, real-time context to enhance the value of AI agents, driving innovation beyond the cloud.

AMD Other Medium Signal 2026-03-19

AMD and Celestica Launch Rack-Scale AI Platform Helios

AMD partners with Celestica to launch Helios rack-scale AI platform, integrating Instinct accelerators and EPYC processors for chip-to-rack optimization. The platform targets AI training and inference workloads with performance and efficiency enhancements for data center and cloud providers.

AMD Other Medium Signal 2026-03-19

AMD Highlights CPU's Critical Role in Agentic AI Orchestration and Inference

AMD states Agentic AI workloads require serial decision-making and context management, better suited for CPUs. The company emphasizes high-core-count, high-memory-bandwidth server CPUs will lead in agent orchestration and lightweight inference, complementing GPUs in training. This signals a strategic repositioning of CPUs in AI data center architecture.

AMD Other Medium Signal 2026-03-19

AMD and Upstage Collaborate on Sovereign AI Infrastructure with MI325X

AMD expands partnership with Upstage to deliver sovereign AI infrastructure using Instinct MI325X accelerators. The solution integrates Solar LLM with optimized ROCm software stack to enhance AI training and inference efficiency, addressing Korea's data sovereignty requirements.

Amazon Other High Signal 2026-03-19

AWS and Cerebras Introduce Decoupled Inference Architecture for AI Performance

AWS collaborates with Cerebras on a heterogeneous inference solution using Trainium and CS-3, featuring a decoupled architecture for compute and memory stages connected via EFA. It targets interactive AI applications with claimed 10x performance gain, deployed on Nitro-secured infrastructure.

Amazon Other Medium Signal 2026-03-19

Parrot Analytics Deploys Amazon Bedrock AgentCore for High-Throughput Agent Orchestration

Parrot Analytics integrates Amazon Bedrock AgentCore and Amazon Nova models to achieve 25 TPS sustained agent throughput, building an intelligent operating system for the media industry. It combines proprietary data with AWS AI infrastructure to orchestrate batch AI workloads at industrial scale, shifting the industry from retrospective to predictive capital allocation.

Cisco Other Medium Signal 2026-03-18

Cisco UCS Integrates NVIDIA Blackwell GPU with Dynamic Resource Pooling

Cisco integrates NVIDIA RTX PRO 4500 Blackwell GPU into UCS platform, supporting deployment from data center to edge. Intersight management enables dynamic GPU resource pooling with real-time PCIe allocation. Validated design blueprints accelerate scalable AI inference and vision AI workloads.

NVIDIA Other High Signal 2026-03-18

NVIDIA and Telecom Operators Build AI Grids to Redistribute AI Inference

NVIDIA is partnering with global telecom operators like AT&T and Comcast to transform existing distributed network sites into 'AI Grids' for edge AI inference. This initiative aims to deploy AI compute closer to users and data, reducing latency and cost per token. It represents a strategic shift for telcos from being data carriers to distributed AI computing platforms.

NVIDIA Other High Signal 2026-03-18

NVIDIA Partners with Telecom Operators to Build Distributed AI Inference Grid

NVIDIA collaborates with telecom operators to transform 100,000 global network sites and 100GW backup power into a distributed AI computing platform for low-latency inference. The AI grid has been validated in IoT and cloud gaming scenarios, achieving sub-500ms latency and 50% cost reduction.

Samsung Electronics Other Medium Signal 2026-03-18

Samsung and AMD Deepen AI Hardware Collaboration with HBM4 Supply and Foundry Services

Samsung will be the primary HBM4 supplier for AMD's next-gen MI455X GPU, delivering 13Gbps bandwidth memory. The partners will also develop DDR5 solutions for 6th-gen EPYC CPUs and explore Samsung's foundry services for future AMD products.

Google Other Medium Signal 2026-03-18

Google DeepMind Releases AGI Cognitive Assessment Framework and Launches Hackathon

Google DeepMind proposes a cognitive science-based AGI assessment framework defining 10 key cognitive abilities and a three-stage evaluation protocol. It launches a Kaggle hackathon to crowdsource evaluation solutions for five core abilities, aiming to establish standardized AGI assessment systems.

Google Other Medium Signal 2026-03-18

Google Gemini API Streamlines Agent Orchestration Architecture

Gemini API update enables inline custom and built-in tools in single requests, adds context loop between tools, and reduces agent development complexity. Expands Google Maps Basics for Gemini 3 models and introduces unique IDs for better debuggability.

Hewlett Packard Enterprise Other High Signal 2026-03-17

HPE Launches AI Grid with NVIDIA to Unify Distributed Inference Clusters

HPE announced the AI Grid at NVIDIA GTC, an end-to-end solution built on NVIDIA's reference architecture to securely connect distributed AI factories and inference clusters into a single intelligent system. It enables service providers to deploy and operate thousands of edge inference sites, meeting the predictable, low-latency infrastructure requirements of AI-native applications.

OpenAI Other Medium Signal 2026-03-17

OpenAI Releases Compact Models GPT-5.4 mini/nano for Enterprise AI Inference

OpenAI launches GPT-5.4 mini and nano models optimized for coding, multimodal tasks, and high-throughput API workloads. The compact models improve inference speed and reduce deployment costs, reflecting OpenAI's strategy to enhance enterprise AI service competitiveness.

NVIDIA Other High Signal 2026-03-17

NVIDIA AI Grids: AT&T, T-Mobile Building Distributed AI Platform

NVIDIA at GTC 2026 announced AI Grids strategy, as telecom operators transform network infrastructure into geographically distributed AI inference platforms. Major operators including AT&T, T-Mobile, Comcast, and Akamai participating in building distributed edge AI infrastructure.

NVIDIA Other High Signal 2026-03-17

NVIDIA Mass Produces Dynamo 1.0 Inference OS, Strengthening AI Factory Platform Strategy

NVIDIA begins mass production of Dynamo 1.0 inference OS, providing a unified software layer to coordinate AI inference workloads across data centers, cloud and edge. The system simplifies large-scale AI model deployment through standardized runtime and scheduler, abstracting infrastructure management.

Reports

Filter

Ericsson Collaborates on 6G AI Network Sensing and Optimization

Arm Launches Self-Developed AGI CPU for AI Data Center Market

Cisco and Digital Realty Launch Unified AI Infrastructure Solution

NVIDIA Blackwell Architecture Achieves 25x Energy Efficiency Gain

NVIDIA Extends RTX AI Capabilities to Local Agentic AI, Accelerating Gemma 4 Inference

AMD and Celestica Launch Rack-Scale AI Platform Helios

AMD Highlights CPU's Critical Role in Agentic AI Orchestration and Inference

AMD and Upstage Collaborate on Sovereign AI Infrastructure with MI325X

AWS and Cerebras Introduce Decoupled Inference Architecture for AI Performance

Parrot Analytics Deploys Amazon Bedrock AgentCore for High-Throughput Agent Orchestration

Cisco UCS Integrates NVIDIA Blackwell GPU with Dynamic Resource Pooling

NVIDIA and Telecom Operators Build AI Grids to Redistribute AI Inference

NVIDIA Partners with Telecom Operators to Build Distributed AI Inference Grid

Samsung and AMD Deepen AI Hardware Collaboration with HBM4 Supply and Foundry Services

Google DeepMind Releases AGI Cognitive Assessment Framework and Launches Hackathon

Google Gemini API Streamlines Agent Orchestration Architecture

HPE Launches AI Grid with NVIDIA to Unify Distributed Inference Clusters

OpenAI Releases Compact Models GPT-5.4 mini/nano for Enterprise AI Inference

NVIDIA AI Grids: AT&T, T-Mobile Building Distributed AI Platform

NVIDIA Mass Produces Dynamo 1.0 Inference OS, Strengthening AI Factory Platform Strategy