enterprise AI - AI Infrastructure Intelligence Search

NVIDIA Other 2026-06-08

NVIDIA and LG Build AI Factory: DSX Platform Locks Physical AI Stack

NVIDIA and LG Group jointly build an AI factory leveraging NVIDIA's DSX platform, integrating Isaac Sim/Lab, Cosmos, GR00T frameworks for robotics, autonomous driving, data centers, and sovereign AI. LG subsidiaries align cooling, robotics, and sensor components exclusively with NVIDIA, creating a fortified ecosystem.

NVIDIA Other 2026-06-04

NVIDIA Nemotron 3 Ultra: A MoE-Based Control Plane for Cost-Efficient AI Agent Orchestration

NVIDIA launches Nemotron 3 Ultra, a 550B-parameter MoE model (55B active) purpose-built for AI agent orchestration. Featuring Multi-Teacher On-Policy Distillation (MOPD) and a Hybrid Mamba-Transformer architecture, it achieves 5x throughput and 30% cost savings on tasks like SWE-bench, signaling a shift of reasoning control to a layered agent system.

Cisco Other 2026-06-04

Cisco AI Defense + AppOmni Extends Runtime Guardrails to SaaS AI Agents

Cisco integrates AI Defense with AppOmni, using AgentGuard as a real-time intercept layer inside SaaS environments. Custom guardrails now apply to Microsoft 365 Copilot, ServiceNow Now Assist, and other SaaS agents, monitoring MCP, chat, and agent-to-agent channels to block prompt injection, tool exploitation, and data exfiltration with a unified policy engine.

AMD Other 2026-05-20

AMD Ryzen AI Halo & Max PRO 400: Local 300B Parameter Inference, but Hidden Lock-in and Thermal Limits

AMD launches Ryzen AI Halo developer platform (128GB unified memory, 200B parameter models) and Ryzen AI Max PRO 400 series (first x86 client to run 300B parameter models locally). Unified memory, ROCm optimization, and OEM partnerships aim to shift agentic AI from cloud to local, but shared memory bandwidth and thermal constraints limit real-world throughput.

Cisco Other 2026-05-13

Cisco N9300 Smart Switches Embed Security into AI Data Center Fabric

At ONUG 2026, Cisco unveiled Nexus One architecture and N9300 Smart Switches, embedding L4 segmentation, Hypershield, eBPF-based Live Protect, and DPU-integrated firewall directly into the network fabric. This aims to deliver bottleneck-free security for AI workloads while enabling AI-driven operations via AgenticOps and AI Canvas.

Microsoft Other 2026-05-08

Microsoft Integrates GPT-5.5 Instant into M365 Copilot: Model Choice Becomes the New AI Control Plane

Microsoft integrates GPT-5.5 Instant into M365 Copilot, Copilot Studio, and Foundry, offering model choice between OpenAI and Anthropic Claude. This marks a shift from single-model lock-in to platform-level model orchestration and governance, moving the control point from model capability to routing and policy layers.

AMD Other Medium Signal 2026-05-07

AMD Backs SPEC CPU 2026 Benchmark, Emphasizing Open, Trusted Performance Measurement

AMD published a blog endorsing the upcoming SPEC CPU 2026 industry benchmark, emphasizing the critical role of open, reproducible CPU performance standards for customer infrastructure decisions in the AI era. The new benchmark updates its application suite and strengthens support for bare-metal cloud environments and parallel computing.

AMD Other High Signal 2026-05-06

AMD and OpenAI Contribute MRC Protocol to OCP for Scalable AI Networking

AMD, in collaboration with OpenAI, Microsoft, and others, contributed the MRC (Multipath Reliable Connection) protocol, designed for large-scale AI training, to the Open Compute Project (OCP). AMD co-authored the specification and has already deployed MRC on its programmable Pensando DPU/NIC products, positioning its networking technology as a key enabler for resilient and adaptive AI infrastructure.

AMD Other High Signal 2026-05-06

AMD and OpenAI Introduce MRC, a Next-Gen Transport Protocol for AI Training

AMD, in collaboration with OpenAI, Microsoft, and other industry leaders, has released the specification for the Multipath Reliable Connection (MRC) protocol. MRC addresses performance bottlenecks of RoCEv2 in hyperscale AI training clusters through intelligent packet spraying, selective retransmission, and network-signaled congestion control, aiming to improve bandwidth utilization and job resilience.

Anthropic Other High Signal 2026-05-06

Anthropic Secures Compute Deal with SpaceX, Significantly Boosting Claude Capacity

Anthropic announced a partnership with SpaceX to utilize all compute capacity at the Colossus 1 data center, gaining over 300MW of new capacity. This move aims to directly improve service for Claude Pro and Max subscribers, with immediate increases to Claude Code and API rate limits.

Anthropic Other High Signal 2026-05-04

Anthropic Releases AI Agent Templates for Financial Services, Accelerating Enterprise AI Workflow Deployment

Anthropic has released ten ready-to-run AI agent templates for financial services, covering key scenarios like research, compliance, and finance. Delivered as plugins and managed agents with deep Microsoft 365 integration, they aim to reduce AI deployment cycles from months to days. This signals a shift from general-purpose AI to deep integration into vertical industry workflows.

AMD Other Medium Signal 2026-05-04

AMD Showcases Heterogeneous Computing Strategy for Enterprise AI with Dell

At Dell Technologies World, AMD highlighted its heterogeneous computing portfolio, aiming to match the right compute engine to specific enterprise AI workloads, while emphasizing hardware-based security and manageability. This signals a shift in AI infrastructure from generic solutions to fine-tuned, scenario-specific deployments.

Anthropic Other High Signal 2026-05-04

Anthropic Partners with Top-Tier Capital to Form New AI Services Company for Mid-Market

Anthropic, alongside Blackstone, Hellman & Friedman, Goldman Sachs, and other capital partners, is forming a new AI services company. It aims to provide deep customization and long-term operational support for deploying Claude in mid-market companies, complementing its existing system integrator network for large enterprises.

AMD Other High Signal 2026-04-30

AMD Proposes New AI Infrastructure Networking Paradigm: From Lossless Fabrics to Intelligent Endpoints

AMD published a blog outlining seven key questions for building large-scale AI infrastructure, arguing that traditional lossless Ethernet or InfiniBand architectures face cost and complexity bottlenecks. It advocates shifting network intelligence and reliability functions from expensive, specialized switches to intelligent NICs, enabling reliable transport over standard (potentially lossy) Ethernet to reduce TCO and simplify operations.

NVIDIA Other High Signal 2026-04-30

NVIDIA Releases Enterprise AI Factory Reference Architectures, Standardizing On-Premises AI Infrastructure

NVIDIA has released Enterprise AI Factory Reference Architectures, offering three standardized configurations from RTX PRO to NVL72 for on-premises deployments. This architecture integrates compute, networking, storage, and software, aiming to transform AI infrastructure from experimental setups into predictable, scalable industrial operational platforms.

AMD Other High Signal 2026-04-29

AMD and Liquid AI Discuss Efficient AI Architecture from Silicon to Systems

AMD's CTO and Liquid AI's CEO discuss the evolution of AI architecture, emphasizing efficiency as key to extending AI from the cloud to edge and endpoint devices. They argue that co-design from silicon to systems enables low-power, responsive AI inference, supporting always-on agents and multi-model orchestration.

Google Other 2026-04-29

Google Opens TPU Hardware to On-Prem, 8th-Gen Chips Target Nvidia

Google announces 8th-gen TPUs (8t for training with 3x performance over Ironwood, 8i for inference with 80% better perf/dollar) and plans to deliver TPU hardware directly to customer data centers. Also closed Wiz acquisition to bolster AI security. This marks a strategic pivot from cloud-only to hardware supplier.

Microsoft Other High Signal 2026-04-28

Microsoft Unveils Foundry Platform, Defining New Paradigm for Durable, Stateful AI Agents

Microsoft CEO Satya Nadella demonstrated durable, stateful AI agents built on the Foundry platform. The platform enables agents to run across time boundaries, orchestrate tools and models, and close the loop with evaluation and improvement over long-running workflows, marking a key evolution from conversational assistants to autonomous execution systems.

Microsoft Other High Signal 2026-04-28

Microsoft Announces Largest-Ever Enterprise M365 Copilot Deployment

Microsoft announced that Accenture is deploying Microsoft 365 Copilot to over 740,000 employees, marking the largest public deployment of the product to date. This move signals a shift of generative AI assistants from pilot phases to large-scale enterprise operations, with its success or failure serving as a critical reference for enterprise AI adoption.

AMD Other High Signal 2026-04-27

AMD Extends Edge AI Architecture to Space, Defining Orbital Computing Paradigm

AMD's CTO proposes applying the core principles of 'performance-per-watt' and 'mission-critical reliability' from terrestrial edge AI to space computing. The company is providing a repeatable platform foundation for in-orbit satellite intelligence and future orbital data centers through heterogeneous computing, open software stacks, and modular system design.

Reports

Filter

NVIDIA and LG Build AI Factory: DSX Platform Locks Physical AI Stack

NVIDIA Nemotron 3 Ultra: A MoE-Based Control Plane for Cost-Efficient AI Agent Orchestration

Cisco AI Defense + AppOmni Extends Runtime Guardrails to SaaS AI Agents

AMD Ryzen AI Halo & Max PRO 400: Local 300B Parameter Inference, but Hidden Lock-in and Thermal Limits

Cisco N9300 Smart Switches Embed Security into AI Data Center Fabric

Microsoft Integrates GPT-5.5 Instant into M365 Copilot: Model Choice Becomes the New AI Control Plane

AMD Backs SPEC CPU 2026 Benchmark, Emphasizing Open, Trusted Performance Measurement

AMD and OpenAI Contribute MRC Protocol to OCP for Scalable AI Networking

AMD and OpenAI Introduce MRC, a Next-Gen Transport Protocol for AI Training

Anthropic Secures Compute Deal with SpaceX, Significantly Boosting Claude Capacity

Anthropic Releases AI Agent Templates for Financial Services, Accelerating Enterprise AI Workflow Deployment

AMD Showcases Heterogeneous Computing Strategy for Enterprise AI with Dell

Anthropic Partners with Top-Tier Capital to Form New AI Services Company for Mid-Market

AMD Proposes New AI Infrastructure Networking Paradigm: From Lossless Fabrics to Intelligent Endpoints

NVIDIA Releases Enterprise AI Factory Reference Architectures, Standardizing On-Premises AI Infrastructure

AMD and Liquid AI Discuss Efficient AI Architecture from Silicon to Systems

Google Opens TPU Hardware to On-Prem, 8th-Gen Chips Target Nvidia

Microsoft Unveils Foundry Platform, Defining New Paradigm for Durable, Stateful AI Agents

Microsoft Announces Largest-Ever Enterprise M365 Copilot Deployment

AMD Extends Edge AI Architecture to Space, Defining Orbital Computing Paradigm