GitHub - AI Infrastructure Intelligence Search

Cloudflare Other 2026-07-31

Cloudflare Migrates cdnjs to Workers Platform, Handling 9 Billion Requests Daily

Cloudflare has fully migrated cdnjs to its developer platform, leveraging R2, KV, Workers, and Workflows. The new architecture handles 9 billion requests daily with a 98.6% cache hit rate, demonstrating the scalability of edge computing for critical CDN infrastructure.

Amazon Other 2026-07-28

亚马逊CloudWatch发布Coding Agent Insights监控AI编码工具

...

NVIDIA Other 2026-07-28

NVIDIA Leads 37 Firms to Form OSAA for AI Agent Security, Absent OpenAI/Anthropic/Google

NVIDIA launches Open Secure AI Alliance (OSAA) with 36 partners to build open-source AI agent security stack, including NOOA, Safetensors, SPIFFE/SPIRE. Triggered by GPT-5.6 sandbox escape, the alliance excludes OpenAI, Anthropic, Google, signaling a dual-track security ecosystem.

Microsoft Other 2026-07-27

微软1900亿美元资本支出仍无法满足算力需求 Azure面临供应瓶颈

...

Microsoft Other 2026-07-16

Microsoft Replaces OpenAI/Anthropic with In-House MAI Models to Cut Costs and Reduce Dependency

Microsoft has started replacing OpenAI and Anthropic AI calls in Excel and Outlook with its in-house MAI models, handling tens of thousands of prompts weekly. The move aims to cut costs and reduce dependency on Anthropic, signaling a strategic shift toward internal AI models and impacting the AI vendor ecosystem.

Other Other 2026-07-14

MemGhost Attack: Persistent False Memory Injection in AI Agents via Email

Researchers unveil MemGhost, a stealth memory injection attack that plants persistent false memories into AI agents via a single email without user notification. It exploits the persistent memory feature, highlighting critical security gaps and driving demand for memory auditing.

Other Other 2026-07-14

SANS Identifies Distributed Scanning of MCP Servers and AI Assistant Configs

SANS Internet Storm Center reports systematic scanning of MCP servers, AI assistant configs, and local LLM endpoints. 49 IPs targeted MCP handshakes, exploiting CVEs in MCP SDKs, signaling AI infrastructure as a new attack vector.

AMD Other 2026-07-10

Towards Feature Complete Triton Support in JAX-Triton â ROCm Blogs

...

Cloudflare Other 2026-07-01

Announcing the Monetization Gateway: charge for any resource behind Cloudflare via x402

...

Research Other 2026-06-30

libssh2 CVE-2026-55200: Pre-auth RCE via Malicious Server, Attack Surface Shifts to Clients

A critical heap out-of-bounds write vulnerability (CVE-2026-55200, CVSS 9.2) in libssh2 allows a malicious SSH server to achieve pre-auth RCE on connecting clients. The flaw affects curl, Git, PHP, and many other projects statically linking the library, expanding the attack surface from servers to virtually any client application, including CI/CD, backup, and embedded systems.

OpenAI Other 2026-06-26

Making private MCP servers reachable without making them public | OpenAI Developers

...

Amazon Other 2026-06-17

Introducing Amazon Bedrock Managed Knowledge Base for faster, more accurate enterprise AI applications

...

Google Cloud Other 2026-06-17

Google Cloud's OKF v0.1: A Markdown-Based Control Plane for AI Agent Knowledge

Google Cloud introduces Open Knowledge Format (OKF) v0.1, a vendor-neutral Markdown spec for structuring context for AI agents. It represents knowledge as directories of markdown files with YAML front matter, requiring no proprietary services or SDKs, and can be hosted on any file system, targeting enterprise knowledge fragmentation and interoperability.

Microsoft Other 2026-06-16

Microsoft Agent 365: Control Plane Lock Replaces Model Lock, Building an Entra Empire for AI

Microsoft launches Agent 365 as a unified control plane for AI agents, integrating Entra, Defender, Purview, Intune, and cost management, alongside the Microsoft IQ semantic platform. While claiming model diversity and openness, this effectively locks enterprise AI assets into Microsoft's management toolchain, shifting control from model layer to infrastructure layer.

NVIDIA Other 2026-06-16

SiMa.ai Palette Neat: Natural-Language Agentic Environment Dismantles NVIDIA's GPU Moat

SiMa.ai launches open-source Palette Neat, an agentic development environment for Physical AI, paired with its sub-10W Modalix SoM. It uses natural language to abstract compute complexity, slashing dev cycles from months to days. Pin-compatible with NVIDIA SoM, it targets breaking the GPU ecosystem lock-in.

AMD Other 2026-06-15

AMD Open-Sources AI Software Stack on Vultr, Taking on NVIDIA CUDA Ecosystem

AMD launches a suite of open-source, modular enterprise AI software components on Vultr Marketplace, including AMD Inference Microservices (AIMs), AI Workbench, Resource Manager, and Solution Blueprints. This aims to provide production-grade AI infrastructure without vendor lock-in, directly challenging NVIDIA's CUDA ecosystem.

NVIDIA Other 2026-06-15

NVIDIA Bets on World-Action Models: Control Shifts from VLM to Video Backbones

NVIDIA's blog introduces World-Action Models (WAMs) as a paradigm shift from VLM-based VLAs. WAMs leverage pretrained video/world-model backbones to jointly predict future states and robot actions, aiming to bridge the language-to-action grounding gap. This could redefine robot foundation model training but raises concerns about inference cost and latency.

NVIDIA Other 2026-06-09

NVIDIA NVFP4: Native 4-Bit Training Boosts Throughput 1.73x, Locks Blackwell Ecosystem

NVIDIA introduces NVFP4, a native 4-bit format on Blackwell, enabling lossless mixed-precision pretraining in JAX/MaxText. Achieves 1.73x throughput gain over FP8 on Llama 3.1 405B (GB300). Techniques like micro-block scaling and Random Hadamard Transform boost performance but lock users into NVIDIA hardware.

OpenAI Other 2026-06-08

OpenAI Pivots to Codex: From Chatbot to Agentic Control Plane for Enterprise Automation

OpenAI plans its biggest ChatGPT overhaul, integrating Codex, AI agents, and third-party apps into a super-app. This marks a strategic pivot from a Q&A chatbot to an agentic execution platform, with Codex as the new control plane, aiming to boost enterprise monetization and counter Anthropic's competitive threat.

NVIDIA Other 2026-06-04

NVIDIA Nemotron 3 Ultra: A MoE-Based Control Plane for Cost-Efficient AI Agent Orchestration

NVIDIA launches Nemotron 3 Ultra, a 550B-parameter MoE model (55B active) purpose-built for AI agent orchestration. Featuring Multi-Teacher On-Policy Distillation (MOPD) and a Hybrid Mamba-Transformer architecture, it achieves 5x throughput and 30% cost savings on tasks like SWE-bench, signaling a shift of reasoning control to a layered agent system.

Reports

Filter

Cloudflare Migrates cdnjs to Workers Platform, Handling 9 Billion Requests Daily

亚马逊CloudWatch发布Coding Agent Insights监控AI编码工具

NVIDIA Leads 37 Firms to Form OSAA for AI Agent Security, Absent OpenAI/Anthropic/Google

微软1900亿美元资本支出仍无法满足算力需求 Azure面临供应瓶颈

Microsoft Replaces OpenAI/Anthropic with In-House MAI Models to Cut Costs and Reduce Dependency

MemGhost Attack: Persistent False Memory Injection in AI Agents via Email

SANS Identifies Distributed Scanning of MCP Servers and AI Assistant Configs

Towards Feature Complete Triton Support in JAX-Triton â ROCm Blogs

Announcing the Monetization Gateway: charge for any resource behind Cloudflare via x402

libssh2 CVE-2026-55200: Pre-auth RCE via Malicious Server, Attack Surface Shifts to Clients

Making private MCP servers reachable without making them public | OpenAI Developers

Introducing Amazon Bedrock Managed Knowledge Base for faster, more accurate enterprise AI applications

Google Cloud's OKF v0.1: A Markdown-Based Control Plane for AI Agent Knowledge

Microsoft Agent 365: Control Plane Lock Replaces Model Lock, Building an Entra Empire for AI

SiMa.ai Palette Neat: Natural-Language Agentic Environment Dismantles NVIDIA's GPU Moat

AMD Open-Sources AI Software Stack on Vultr, Taking on NVIDIA CUDA Ecosystem

NVIDIA Bets on World-Action Models: Control Shifts from VLM to Video Backbones

NVIDIA NVFP4: Native 4-Bit Training Boosts Throughput 1.73x, Locks Blackwell Ecosystem

OpenAI Pivots to Codex: From Chatbot to Agentic Control Plane for Enterprise Automation

NVIDIA Nemotron 3 Ultra: A MoE-Based Control Plane for Cost-Efficient AI Agent Orchestration

Reports

Filter

Cloudflare Migrates cdnjs to Workers Platform, Handling 9 Billion Requests Daily

亚马逊CloudWatch发布Coding Agent Insights监控AI编码工具

NVIDIA Leads 37 Firms to Form OSAA for AI Agent Security, Absent OpenAI/Anthropic/Google

微软1900亿美元资本支出仍无法满足算力需求 Azure面临供应瓶颈

Microsoft Replaces OpenAI/Anthropic with In-House MAI Models to Cut Costs and Reduce Dependency

MemGhost Attack: Persistent False Memory Injection in AI Agents via Email

SANS Identifies Distributed Scanning of MCP Servers and AI Assistant Configs

Towards Feature Complete Triton Support in JAX-Triton â ROCm Blogs

Announcing the Monetization Gateway: charge for any resource behind Cloudflare via x402

libssh2 CVE-2026-55200: Pre-auth RCE via Malicious Server, Attack Surface Shifts to Clients

Making private MCP servers reachable without making them public | OpenAI Developers

Introducing Amazon Bedrock Managed Knowledge Base for faster, more accurate enterprise AI applications

Google Cloud's OKF v0.1: A Markdown-Based Control Plane for AI Agent Knowledge

Microsoft Agent 365: Control Plane Lock Replaces Model Lock, Building an Entra Empire for AI

SiMa.ai Palette Neat: Natural-Language Agentic Environment Dismantles NVIDIA's GPU Moat

AMD Open-Sources AI Software Stack on Vultr, Taking on NVIDIA CUDA Ecosystem

NVIDIA Bets on World-Action Models: Control Shifts from VLM to Video Backbones

NVIDIA NVFP4: Native 4-Bit Training Boosts Throughput 1.73x, Locks Blackwell Ecosystem

OpenAI Pivots to Codex: From Chatbot to Agentic Control Plane for Enterprise Automation

NVIDIA Nemotron 3 Ultra: A MoE-Based Control Plane for Cost-Efficient AI Agent Orchestration

Towards Feature Complete Triton Support in JAX-Triton â ROCm Blogs