AI model - AI Infrastructure Intelligence Search

Amazon Other 2026-07-29

Amazon Shuts AGI Lab, Pivots to Frontier Model Research, Phases Out Nova Premier and Omni

Amazon confirmed the closure of its AGI Lab in San Francisco after 18 months, forming the Frontier Model Research (FMR) group. It is phasing out Nova Premier, Omni, Reel, and Canvas models, signaling a strategic retreat from broad AI to focused frontier research.

Microsoft Azure Other 2026-07-26

Microsoft Azure Integrates AMD Helios Rack-Scale AI Platform, Breaks NVIDIA GPU Monopoly

Microsoft Azure announces deployment of AMD Helios rack-scale AI platform in H2 2026. Rack integrates 72 MI455X GPUs, 18 Venice CPUs, liquid cooling, delivering 2.9 Exaflops FP4 inference. This signals a major industry shift away from NVIDIA GPU monopoly towards multi-vendor heterogeneous AI infrastructure.

Anthropic Other 2026-07-26

Anthropic Launches Claude Opus 5 at Half Price, Deep AWS Integration Shifts Control

Anthropic releases Claude Opus 5 with pricing unchanged from Opus 4.8 but performance approaching Fable 5, effectively halving cost. AWS announces Claude Platform GA, deeply integrating Anthropic API into AWS IAM/billing/management, shifting control from standalone API to cloud platform.

Microsoft Other 2026-07-24

Microsoft launches MAI-Image-2.5-Pro and MAI-Voice-2-Flash, deepening in-house AI ecosystem

Microsoft introduces MAI-Image-2.5-Pro and MAI-Voice-2-Flash, proprietary models now powering Bing, PowerPoint, OneDrive, and Dynamics 365, replacing third-party models. Claims up to 84% GPU cost reduction and 2x faster voice, signaling a strategic shift to in-house AI.

Hewlett Packard Enterprise Other 2026-07-19

AMD and HPE Launch Helios Open AI Infrastructure to Rival NVIDIA Ecosystem

AMD and HPE expand partnership to launch Helios, an open-stack AI infrastructure platform integrating EPYC CPUs, Instinct MI455X GPUs, Pensando networking, and ROCm software. Each rack delivers up to 2.9 exaFLOPS FP4, built on OCP principles with Juniper switches, targeting simplified deployment and energy efficiency.

NVIDIA Other 2026-07-07

NVIDIA Vera CPU: Max Single-Threaded Performance at Scale for Agentic AI

NVIDIA launches Vera CPU, a max single-threaded CPU at scale for agentic AI. With Olympus cores delivering 1.8x sustained per-core performance over x86, 1.2TB/s LPDDR5X bandwidth, and 3.4TB/s core-to-core bandwidth, Vera integrates into NVIDIA's unified AI factory architecture, aiming to lock users into its ecosystem.

NVIDIA Other 2026-07-07

AI Innovators Adopt NVIDIA Vera — Why Max Single-Threaded CPU at Scale Matters

...

Check Point Other 2026-07-01

When AI Invents the Attack: Browser-Native Ransomware

...

OpenAI Other 2026-06-25

Oracle Defense Ecosystem Cohort 3: Offline AI on Roving Edge Devices Goes Operational

Oracle announced the third cohort of its Defense Ecosystem at the Brussels summit, adding 10 companies. Concurrently, Whitespace's Saga AI system deployed on Oracle Roving Edge Devices during Royal Navy's Operation HIGHMAST, running classified AI workloads completely offline, proving sovereign edge AI is operational.

Huawei Other 2026-06-25

Huawei Unveils AI-Centric Network with Token Monetization, UCM Caching Breaks Long-Context Barriers

At MWC Shanghai 2026, Huawei unveiled an AI-native network architecture integrating service, network, and compute, shifting from traffic-centric to intelligence-centric operations. The Unified Cache Manager (UCM) extends KV cache to petabyte-scale external storage, achieving 372% token throughput gains on GLM-5.1 at 128K sequence lengths. Token monetization frameworks and agentic operations enable carriers to charge for AI inference capacity and personalize services.

Anthropic Other 2026-06-25

Anthropic Accuses Alibaba of Massive Distillation Attack on Claude AI Model

Anthropic accused Alibaba-linked operators of conducting 29 million exchanges via thousands of fraudulent accounts to distill Claude's capabilities, including long-context reasoning and decision-making. This highlights the vulnerability of AI model IP under API access, prompting a redefinition of model security boundaries.

Nokia Other 2026-06-24

Nokia, Amazon Web Services expand collaboration to deliver autonomous networks built for the AI era

...

NVIDIA Other 2026-06-23

NVIDIA's AI Agents and Digital Twins Reshape Telecom Network Control Plane

At DTW Ignite 2026, NVIDIA showcases its AI agent platform integrating NeMo synthetic data, NemoClaw secure runtime, OpenShell sandbox, and RTX PRO 6000-accelerated digital twins, aiming for autonomous telecom operations. Partners include SoftBank, Amdocs, NTT DATA, etc., moving from task automation to full autonomy.

ARM Other 2026-06-23

Arm servers capture >45% data center revenue, x86 ecosystem under AI-driven assault

IDC reports Q1 2026 global server revenue hit a record $122.6B, with Arm-based servers capturing >45% share (x86 at 52%). Accelerated servers (GPU/ASIC/FPGA) generated >70% revenue. Nvidia's Grace CPU (NVL72) and hyperscaler custom Arm chips drive the shift; x86 still leads in unit volume but faces supply constraints.

ASML Other 2026-06-23

ASML CEO Validates Musk's Terafab, Reshaping AI Chip Supply Chain

ASML's CEO publicly acknowledges tracking Elon Musk's planned terawatt-scale AI supercomputer Terafab, comparing it to Korean DRAM megaprojects. This signals that the sole EUV lithography supplier is allocating capacity, potentially transforming AI chip supply chain and vertical integration.

NVIDIA Other 2026-06-23

Nvidia Vera Rubin CPU: 10-Wide Core Redefines CPU for Agentic Computing

At GTC Taipei 2026, Nvidia unveiled the Vera Rubin CPU with a custom 10-wide fetch/decode/execute pipeline, claiming world-leading IPC and bandwidth. Designed for agentic computing, it complements Nvidia GPUs. Nvidia also announced a partnership with Microsoft to reinvent the PC as a Personal AI and committed to returning 50% of free cash flow to shareholders.

Anthropic Other 2026-06-22

Micron-Anthropic Deal: Memory Co-Architecture Locks in AI Supply Chain

Micron and Anthropic sign a strategic agreement covering joint memory/storage architecture design, multi-year supply, Claude adoption, and investment. This ties frontier AI model demands directly to infrastructure design, aiming to optimize token economics and power efficiency, but essentially locks in supply and restructures the ecosystem.

NVIDIA Other 2026-06-22

Dell PowerEdge XE8812: Liquid-Cooled Density Trap with NVIDIA Vera Rubin NVL4

Dell launches PowerEdge XE8812 with NVIDIA Vera Rubin NVL4, delivering 144 GPUs per rack, 300kW+ power, and 100% direct liquid cooling. It offers a generational leap in memory and compute density for HPC and AI, but deeply locks users into Dell's PowerRack, iDRAC, and ORv3 ecosystem from chip to rack.

NVIDIA Other 2026-06-22

NVIDIA JUPITER Validates Grace Hopper: Exascale Science Goes Production

Europe's first exascale supercomputer JUPITER, powered by NVIDIA Grace Hopper Superchips and Quantum-X800 InfiniBand, achieves breakthroughs in brain mapping at cellular scale, 1km-resolution climate simulation, 6G AI, and 50-qubit quantum simulation, proving exascale is production-ready.

Amazon Other 2026-06-17

AWS Trainium Hits 80% MFU on World Models, Reshaping AI Training Economics

AWS claims its Trainium chip achieves 80% Model FLOP Utilization (MFU) on world model training, nearly double the industry average. With a general-purpose instruction set and sustained thermal performance, Trainium is attracting startups like Odyssey and DeCart AI, challenging Nvidia's dominance in AI training infrastructure.

Reports

Filter

Amazon Shuts AGI Lab, Pivots to Frontier Model Research, Phases Out Nova Premier and Omni

Microsoft Azure Integrates AMD Helios Rack-Scale AI Platform, Breaks NVIDIA GPU Monopoly

Anthropic Launches Claude Opus 5 at Half Price, Deep AWS Integration Shifts Control

Microsoft launches MAI-Image-2.5-Pro and MAI-Voice-2-Flash, deepening in-house AI ecosystem

AMD and HPE Launch Helios Open AI Infrastructure to Rival NVIDIA Ecosystem

NVIDIA Vera CPU: Max Single-Threaded Performance at Scale for Agentic AI

AI Innovators Adopt NVIDIA Vera — Why Max Single-Threaded CPU at Scale Matters

When AI Invents the Attack: Browser-Native Ransomware

Oracle Defense Ecosystem Cohort 3: Offline AI on Roving Edge Devices Goes Operational

Huawei Unveils AI-Centric Network with Token Monetization, UCM Caching Breaks Long-Context Barriers

Anthropic Accuses Alibaba of Massive Distillation Attack on Claude AI Model

Nokia, Amazon Web Services expand collaboration to deliver autonomous networks built for the AI era

NVIDIA's AI Agents and Digital Twins Reshape Telecom Network Control Plane

Arm servers capture >45% data center revenue, x86 ecosystem under AI-driven assault

ASML CEO Validates Musk's Terafab, Reshaping AI Chip Supply Chain

Nvidia Vera Rubin CPU: 10-Wide Core Redefines CPU for Agentic Computing

Micron-Anthropic Deal: Memory Co-Architecture Locks in AI Supply Chain

Dell PowerEdge XE8812: Liquid-Cooled Density Trap with NVIDIA Vera Rubin NVL4

NVIDIA JUPITER Validates Grace Hopper: Exascale Science Goes Production

AWS Trainium Hits 80% MFU on World Models, Reshaping AI Training Economics