xAI - AI Infrastructure Intelligence Search

OpenAI Other 2026-07-29

OpenAI and Anthropic Jointly Push 30-Day Federal Review for Frontier AI Models

OpenAI and Anthropic propose a 30-day federal review for frontier AI models before release, citing security risks. The August 1 deadline defines coverage thresholds, sparking a policy battle between closed-source advocates and open-source supporters including NVIDIA, Meta, and Microsoft. Chinese open models GLM-5.2 and Kimi K3 are central to the debate.

NVIDIA Other 2026-07-28

NVIDIA Leads 37 Firms to Form OSAA for AI Agent Security, Absent OpenAI/Anthropic/Google

NVIDIA launches Open Secure AI Alliance (OSAA) with 36 partners to build open-source AI agent security stack, including NOOA, Safetensors, SPIFFE/SPIRE. Triggered by GPT-5.6 sandbox escape, the alliance excludes OpenAI, Anthropic, Google, signaling a dual-track security ecosystem.

OpenAI Other 2026-07-24

OpenAI Launches Project Camellia: $20B Self-Built AI Data Center, Shift from Cloud Renting to Ownership

OpenAI announces Project Camellia, a $20B self-built AI data center in Georgia with 3.2GW power, marking a shift from cloud renting to self-control. It hires key xAI Colossus team members and raises compute spending forecast to $750B.

NVIDIA Other 2026-07-23

NVIDIA-OpenAI $100B Partnership: 10GW Vera Rubin AI Factories Reshape Ecosystem

NVIDIA and OpenAI announce a strategic partnership to deploy at least 10GW of NVIDIA systems using the Vera Rubin platform (Rubin GPU, Vera CPU, HBM4, NVLink 6). NVIDIA will invest up to $100B. First facilities go online in H2 2026, powering OpenAI's next-gen models, marking the era of multi-GW AI factories.

NVIDIA Other 2026-07-21

NVIDIA Spectrum-6 102.4Tbps Switch Goes Commercial, Cisco Adoption Confirms Bandwidth Inflection

NVIDIA announces Spectrum-6 102.4Tbps Ethernet switch for AI factories, doubling bandwidth with CPO and liquid cooling. Cisco confirms adoption in N9100 series, while Broadcom launches Tomahawk 6, signaling a terabit Ethernet race for AI infrastructure.

Other Other 2026-06-30

xAI Grok 4.5 Beta: 1.5T Param V9 Base, Cursor Integration Locks Tesla/SpaceX Ecosystem

xAI launches Grok 4.5 with a 1.5T parameter V9 base, integrating Cursor data for internal Beta at SpaceX/Tesla. Performance claims approach Claude Opus, but market share drops to 3.4% and Colossus compute utilization is 11%. This vertical integration aims to create a closed AI supply chain but risks ecosystem lock-in and resource misallocation.

ASML Other 2026-06-23

ASML CEO Validates Musk's Terafab, Reshaping AI Chip Supply Chain

ASML's CEO publicly acknowledges tracking Elon Musk's planned terawatt-scale AI supercomputer Terafab, comparing it to Korean DRAM megaprojects. This signals that the sole EUV lithography supplier is allocating capacity, potentially transforming AI chip supply chain and vertical integration.

NVIDIA Other 2026-06-23

Nvidia Vera Rubin CPU: 10-Wide Core Redefines CPU for Agentic Computing

At GTC Taipei 2026, Nvidia unveiled the Vera Rubin CPU with a custom 10-wide fetch/decode/execute pipeline, claiming world-leading IPC and bandwidth. Designed for agentic computing, it complements Nvidia GPUs. Nvidia also announced a partnership with Microsoft to reinvent the PC as a Personal AI and committed to returning 50% of free cash flow to shareholders.

Hewlett Packard Enterprise Other 2026-06-22

HPE ProLiant DL394 Gen12 with NVIDIA Vera CPU: ARM Takes on x86 in AI

HPE unveils ProLiant DL394 Gen12 server powered by NVIDIA Vera CPU at Computex 2026, shipping fall 2026. Vera is NVIDIA's first datacenter CPU, in mass production, delivering 1.8x AI workload performance over x86. Early customers include OpenAI, Anthropic, xAI, and others. HPE continues GreenLake as-a-service while also offering Intel Xeon 6+ options.

NVIDIA Other 2026-06-22

NVIDIA Launches Arm CPU: RTX Spark and Vera Shift AI Compute Control from x86

NVIDIA unveils RTX Spark Superchip for Windows PC (20 Arm cores, 6144 CUDA, 128GB LPDDR5X) and Vera data center CPU in million-volume production. Vera delivers 1.8x AI workload acceleration over x86. This marks NVIDIA's strategic entry into CPU market, consolidating control via unified Arm+GPU architecture.

AMD Other 2026-06-17

AMD Mustang Peak Threadripper: 144 cores, PCIe 6.0, TR6 socket – Power and memory challenges loom

AMD's Zen 6 Threadripper 'Mustang Peak' is confirmed with 2nm TSMC process, DDR5, PCIe 6.0, and a new TR6 socket. Using Powderhorn CCDs, it scales to 144 cores (288 threads) with clocks above 6 GHz. However, massive power draw and memory bandwidth demands (possibly requiring MRDIMM) raise platform cost concerns.

Google Cloud Other 2026-06-17

Google Cloud Embeds Legal Verifiability into AI Agents via SPIFFE and Kakunin

Google Cloud introduces SPIFFE-based Agent Identity for Gemini Enterprise and Vertex AI, then overlays Kakunin's compliance layer to map internal SPIFFE identifiers to X.509 certificates generated in AWS KMS, with all state changes committed to WORM audit logs. This converts secure cloud workloads into legally auditable market participants to meet EU AI Act and MiCA accountability mandates.

MediaTek Other 2026-06-15

MediaTek AI ASIC Deal with Google Reshapes Custom Silicon Landscape

MediaTek's landmark ASIC deal with Google for AI infrastructure doubles 2026 revenue target to $2B. Joint N1X CPU with Nvidia for RTX Spark AI PC and potential SpaceX/xAI orders on Intel 14A process signal a strategic pivot from consumer chips to AI custom silicon, challenging Broadcom's dominance.

NVIDIA Other 2026-06-14

NVIDIA Vera CPU: Seizing the AI Agent Control Plane from x86

NVIDIA unveils Vera CPU, purpose-built for AI agents, featuring 88 Olympus cores and 1.2TB/s LPDDR5X memory. Claiming 1.8x faster task completion over x86, it targets agentic AI workloads. Customers include Anthropic, OpenAI, and Oracle Cloud Infrastructure, signaling a shift of the AI control plane to NVIDIA's ecosystem.

Google Other 2026-05-18

Google Cloud Managed MCP Server Shifts AI Data Layer Control from SQL to Standardized Protocol

Google Cloud introduces Managed MCP Tools, standardizing AI-to-data interaction via the Model Context Protocol. The blog outlines five scenarios from static APIs to MCP agents, highlighting MCP as an open standard that decouples reasoning from data access, though the managed implementation tightly couples to BigQuery.

Palo Alto Networks Other High Signal 2026-05-03

In-depth Analysis of CISA Agentic AI Security Guidelines

CISA released the world's first Agentic AI security deployment guidelines on May 1, 2026, marking a critical transition from theoretical discussions to mandatory compliance requirements.

NVIDIA Other High Signal 2026-04-30

NVIDIA Releases Enterprise AI Factory Reference Architectures, Standardizing On-Premises AI Infrastructure

NVIDIA has released Enterprise AI Factory Reference Architectures, offering three standardized configurations from RTX PRO to NVL72 for on-premises deployments. This architecture integrates compute, networking, storage, and software, aiming to transform AI infrastructure from experimental setups into predictable, scalable industrial operational platforms.

Intel Partnership High Signal 2026-04-14

Intel to Build xAI Terafab AI Chip Factory

Intel announced helping build Elon Musk Terafab AI chip factory, marking key customer breakthrough for Intel Foundry. AI chip manufacturing demand grows, foundry competition accelerates.

Microsoft Other High Signal 2026-04-04

Microsoft Releases Copilot Studio Multi-Agent System, Advancing Connected Enterprise AI Architecture

Microsoft announced the general availability of multi-agent systems in Copilot Studio, enabling agent orchestration across tools and data sources via open protocols (A2A) and integrations with Fabric and the Microsoft 365 Agents SDK. This moves beyond isolated AI experiences to scalable, collaborative agent systems, with enhanced prompt building and governance controls.

Cisco Other Medium Signal 2026-03-09

Cisco Reveals Enterprise AI Tool Usage Patterns and Security Risks via DNS Telemetry

Cisco analyzed generative AI tool usage via secure access and DNS telemetry, revealing ChatGPT dominance and malicious domain impersonation risks. The approach demonstrates network traffic monitoring for AI tool assessment, providing actionable methodology for security teams.

Reports

Filter