GitHub - AI Infrastructure Intelligence Search

Cisco Other 2026-06-03

Cisco Agent Gateway: Zero Trust Evolves from Access to Action Control for AI Agents

Cisco launches Agent Gateway for Secure Access, extending Zero Trust from access control to action-level control for AI agents. Using Duo for agent identity, it enforces policies across LLMs, MCP servers, and SaaS APIs, with server-side credential injection and unified audit—addressing the unique security challenges of autonomous agent workflows.

Microsoft Azure Product Launch 2026-06-03

Microsoft Maia 200 Mass-Produced, Cobalt 200 Previewed: AI Inference Control Shifts to Azure

At Build 2026, Microsoft announced mass production of Maia 200 AI inference chips, preview of Cobalt 200 ARM processors, and the MAI-Thinking-1 reasoning model (35B params). This signals a full-stack vertical integration to reduce NVIDIA dependency and lock Azure AI workloads.

Microsoft Other 2026-06-02

Microsoft Build 2026: Unifying Agent Stack from Chip to Cloud

At Build 2026, Microsoft unveiled a comprehensive agent-era platform: Project Solara (chip-to-cloud), Microsoft IQ (unified grounding), Rayfin (backend generation), Azure HorizonDB, and GPU-accelerated analytics. The goal is to lock developers into Microsoft's ecosystem.

Google Other 2026-06-02

Google's gcs-analytics-core Library Boosts Iceberg and Spark Performance on GCS

Google Cloud announces gcs-analytics-core, an open-source Java library integrated into Iceberg 1.11.0+ GCSFileIO. It uses vectored I/O and smart Parquet prefetching to reduce scan latency. TPC-DS benchmarks show 18%-71% scan time improvement, but execution time gains are modest for large datasets (1.58% at 10TB).

NVIDIA Other 2026-06-01

NVIDIA Alpamayo: Closed-Loop RL Post-Training Bridges AV Sim-to-Real Gap

NVIDIA's Alpamayo platform introduces AlpaGym, an open-source, high-throughput closed-loop RL post-training framework. It integrates AlpaSim simulator, Cosmos-RL distributed training, and Physical AI datasets, enabling AV models to learn from the consequences of their own actions in simulation, significantly reducing the gap between training and deployment.

NVIDIA Other 2026-06-01

NVIDIA Cosmos 3: Open-Source Physical AI Model with MoT for Ecosystem Lock-in

NVIDIA releases Cosmos 3, a unified physical AI foundation model with Mixture-of-Transformers architecture combining reasoning, world generation, and action generation. Open-sourced with training scripts and six synthetic datasets, but deployment optimized for NVIDIA NIM and GPUs, signaling an ecosystem lock-in strategy.

NVIDIA Other 2026-06-01

NVIDIA DSX OS: Open Source Software to Seize AI Factory Control Plane

NVIDIA launches DSX OS, an open-source modular software suite for operating AI factories. Components include DSX Exchange, MaxLPS, NICo, NVSentinel, etc., unifying IT/OT, power optimization, and lifecycle management. Claims 40% more GPUs under fixed power, but core relies on NVIDIA proprietary hardware, aiming to lock users into its ecosystem.

Google Other 2026-05-29

Google Launches A2UI: Open Protocol for Agent-Driven UI in Gemini Enterprise

Google introduces A2UI, an open protocol enabling AI agents to return JSON payloads describing interactive UI components (date pickers, maps) for native rendering in Gemini Enterprise. It integrates with A2A and Flutter, solving the text-only limitation while preventing HTML injection.

Other Other 2026-05-22

BadHost CVE-2026-48710: Starlette Auth Bypass Exposes AI Agent Infrastructure to HTTP Smuggling

BadHost (CVE-2026-48710) exploits Starlette's inconsistent URL reconstruction via Host header injection, bypassing path-based auth. Affecting 400K+ repos including FastAPI, vLLM, and MCP Server, it exposes AI Agent infrastructure to data theft and potential RCE, forcing a security paradigm shift in HTTP parsing.

Google Other 2026-05-19

Google TPU 8t/8i Enables Cross-Datacenter Training, Gemini 3.5 Flash 4x Faster

Google unveils TPU 8t (training) and TPU 8i (inference) with 3x raw compute and 2x perf-per-watt. JAX/Pathways enable distributed training across 1M+ TPUs across sites. Gemini 3.5 Flash delivers 4x output tokens per second vs frontier models. SynthID adopted by OpenAI, Nvidia, Kakao, Eleven Labs.

Amazon Other 2026-05-12

AWS AgentCore Payments: Autonomous AI Agent Spending Unlocks New Lock-in and Threat Surface

AWS previews managed payment capabilities in Bedrock AgentCore, enabling AI agents to autonomously pay for APIs, MCP servers, and web content, integrated with Coinbase and Stripe. Also launches Agent Toolkit for AWS and MCP Server GA. This pushes AI agents toward autonomous execution but introduces new security and lock-in risks.

Amazon Other High Signal 2026-05-06

AWS Upgrades Virtual Desktops to AI Agent Infrastructure Layer

AWS announced Amazon WorkSpaces now enables AI agents to securely operate desktop applications using their own identity and permissions, without requiring API integrations or application modernization. This extends virtual desktops from a human productivity tool to a universal runtime platform for enterprise AI agents, integrating with major agent frameworks via the standard Model Context Protocol (MCP).

Microsoft Other High Signal 2026-05-01

Microsoft Publishes Cybersecurity Responsibility Framework for AI Era, Emphasizing Public-Private Collaboration and Modernized Vulnerability Management

Microsoft published a framework on securing the global digital ecosystem with next-generation AI, arguing that as AI accelerates vulnerability discovery, response and remediation must keep pace. The document outlines five recommendations, emphasizing public-private collaboration, responsible release of AI capabilities, and modernizing vulnerability management processes.

NVIDIA Other High Signal 2026-05-01

NVIDIA Collaborates with OpenClaw via NemoClaw to Drive Secure Enterprise Autonomous AI Agent Deployment

NVIDIA introduces NemoClaw, a reference implementation that bundles OpenClaw with the OpenShell secure runtime and Nemotron open models, providing a blueprint for secure enterprise deployment of long-running autonomous AI agents. This move addresses the 1000x inference demand surge and security governance challenges, shifting the AI infrastructure control point towards local, secure, and auditable architectures.

Cloudflare Other 2026-05-01

Cloudflare Dynamic Workflows: Control Plane Shift to Per-Tenant Durable Execution

Cloudflare launches Dynamic Workflows, a library enabling per-tenant dynamic dispatch of durable execution code at runtime. Built on Dynamic Workers, it allows Worker Loader to route and isolate tenant workflows with zero idle cost. Targets multi-tenant SaaS, AI agents, and CI/CD, but creates ecosystem lock-in around Cloudflare runtime.

Cisco Other High Signal 2026-04-30

Cisco Publishes Model Provenance Constitution, Defining Weight-Level Derivation Standards

Cisco published the 'Model Provenance Constitution' to provide a normative definition for AI model supply chain safety. The standard strictly hinges on the verifiable derivation history of model weights, clearly delineating five types of provenance links (e.g., direct descent, distillation) and eight exclusions (e.g., independent reproduction), aiming to resolve industry inconsistencies in model provenance definitions.

Cisco Other High Signal 2026-04-30

Cisco Open Sources Model Provenance Kit, Targeting AI Supply Chain Security Governance

Cisco released the open-source Model Provenance Kit, which uses a tiered strategy to analyze model metadata, tokenizer structure, and weight-level signals to generate unique fingerprints and verify the lineage and integrity of AI models. This aims to address risks of tampering, forgery, and compliance in the AI model supply chain.

Microsoft Other High Signal 2026-04-30

Microsoft Defines ‘Agentic Computing Era’, Positions AI Infrastructure and Agent Platform as Core Strategy

Microsoft's CEO, post-earnings, explicitly identifies the shift from end-user-driven workloads to those driven by both end-users and agents as a platform shift that will change the entire tech stack. The company's strategy is focused on building leading AI infrastructure and an agent platform, having already grown its AI business to a $37 billion annual run rate.

Amazon Other High Signal 2026-04-29

AWS Platformizes AI Agents and Deepens Cloud Integration with OpenAI

At its annual event, AWS announced the productization of AI agent capabilities, launching the personal AI assistant for work, Amazon Quick, and expanding Amazon Connect into four vertical-specific Agentic AI solutions. Concurrently, AWS and OpenAI expanded their partnership, deeply integrating the latest models, Codex, and managed agent services into the Amazon Bedrock platform.

NVIDIA Other High Signal 2026-04-28

NVIDIA Drives Manufacturing into 'Simulation-First' Era with OpenUSD and Omniverse

NVIDIA introduces a comprehensive physical AI stack centered on the SimReady standard, Omniverse simulation libraries, and the Metropolis VSS Blueprint. This aims to transform manufacturing's traditional 'design-build-test' cycle into a 'simulation-first' paradigm, enabling AI model training and system validation in high-fidelity virtual environments to drastically reduce product cycles and costs.

Reports

Filter

Cisco Agent Gateway: Zero Trust Evolves from Access to Action Control for AI Agents

Microsoft Maia 200 Mass-Produced, Cobalt 200 Previewed: AI Inference Control Shifts to Azure

Microsoft Build 2026: Unifying Agent Stack from Chip to Cloud

Google's gcs-analytics-core Library Boosts Iceberg and Spark Performance on GCS

NVIDIA Alpamayo: Closed-Loop RL Post-Training Bridges AV Sim-to-Real Gap

NVIDIA Cosmos 3: Open-Source Physical AI Model with MoT for Ecosystem Lock-in

NVIDIA DSX OS: Open Source Software to Seize AI Factory Control Plane

Google Launches A2UI: Open Protocol for Agent-Driven UI in Gemini Enterprise

BadHost CVE-2026-48710: Starlette Auth Bypass Exposes AI Agent Infrastructure to HTTP Smuggling

Google TPU 8t/8i Enables Cross-Datacenter Training, Gemini 3.5 Flash 4x Faster

AWS AgentCore Payments: Autonomous AI Agent Spending Unlocks New Lock-in and Threat Surface

AWS Upgrades Virtual Desktops to AI Agent Infrastructure Layer

Microsoft Publishes Cybersecurity Responsibility Framework for AI Era, Emphasizing Public-Private Collaboration and Modernized Vulnerability Management

NVIDIA Collaborates with OpenClaw via NemoClaw to Drive Secure Enterprise Autonomous AI Agent Deployment

Cloudflare Dynamic Workflows: Control Plane Shift to Per-Tenant Durable Execution

Cisco Publishes Model Provenance Constitution, Defining Weight-Level Derivation Standards

Cisco Open Sources Model Provenance Kit, Targeting AI Supply Chain Security Governance

Microsoft Defines ‘Agentic Computing Era’, Positions AI Infrastructure and Agent Platform as Core Strategy

AWS Platformizes AI Agents and Deepens Cloud Integration with OpenAI

NVIDIA Drives Manufacturing into 'Simulation-First' Era with OpenUSD and Omniverse