Filter

×
Active Filters Clear All
Keyword: GitHub ×
61 Total Reports
1/4 Page
Google Cloud Other 2026-06-17

Google Cloud's OKF v0.1: A Markdown-Based Control Plane for AI Agent Knowledge

Google Cloud introduces Open Knowledge Format (OKF) v0.1, a vendor-neutral Markdown spec for structuring context for AI agents. It represents knowledge as directories of markdown files with YAML front matter, requiring no proprietary services or SDKs, and can be hosted on any file system, targeting enterprise knowledge fragmentation and interoperability.

Anthropic Other 2026-06-17

Anthropic Agent SDK计费独立,AI编程进入生产级工程化

...

Anthropic Other 2026-06-17

Microsoft Work IQ APIs GA: Semantic Layer Locks AI Agents to M365 Data

Microsoft released Work IQ APIs GA on June 16, 2026, providing a semantic layer for M365 AI agents. It collapses traditional API surfaces into 10 generic MCP tools, claiming 2x faster runtime and 80% fewer tokens, billed via Copilot Credits. This effectively controls the data access gateway for enterprise AI agents.

Microsoft Other 2026-06-16

Microsoft Agent 365: Control Plane Lock Replaces Model Lock, Building an Entra Empire for AI

Microsoft launches Agent 365 as a unified control plane for AI agents, integrating Entra, Defender, Purview, Intune, and cost management, alongside the Microsoft IQ semantic platform. While claiming model diversity and openness, this effectively locks enterprise AI assets into Microsoft's management toolchain, shifting control from model layer to infrastructure layer.

NVIDIA Other 2026-06-16

SiMa.ai Palette Neat: Natural-Language Agentic Environment Dismantles NVIDIA's GPU Moat

SiMa.ai launches open-source Palette Neat, an agentic development environment for Physical AI, paired with its sub-10W Modalix SoM. It uses natural language to abstract compute complexity, slashing dev cycles from months to days. Pin-compatible with NVIDIA SoM, it targets breaking the GPU ecosystem lock-in.

AMD Other 2026-06-15

AMD Open-Sources AI Software Stack on Vultr, Taking on NVIDIA CUDA Ecosystem

AMD launches a suite of open-source, modular enterprise AI software components on Vultr Marketplace, including AMD Inference Microservices (AIMs), AI Workbench, Resource Manager, and Solution Blueprints. This aims to provide production-grade AI infrastructure without vendor lock-in, directly challenging NVIDIA's CUDA ecosystem.

NVIDIA Other 2026-06-15

NVIDIA Bets on World-Action Models: Control Shifts from VLM to Video Backbones

NVIDIA's blog introduces World-Action Models (WAMs) as a paradigm shift from VLM-based VLAs. WAMs leverage pretrained video/world-model backbones to jointly predict future states and robot actions, aiming to bridge the language-to-action grounding gap. This could redefine robot foundation model training but raises concerns about inference cost and latency.

Research Other 2026-06-15

Z.ai GLM-5.2 Ships Usable 1M-Token Context, No Benchmarks, Two Thinking Levels

Z.ai releases GLM-5.2 with a claim of usable 1M-token context and two thinking-effort levels. No standard benchmarks are provided, raising concerns about real-world performance. The model targets replacing chunking-based RAG with native long-context reasoning.

NVIDIA Other 2026-06-09

NVIDIA NVFP4: Native 4-Bit Training Boosts Throughput 1.73x, Locks Blackwell Ecosystem

NVIDIA introduces NVFP4, a native 4-bit format on Blackwell, enabling lossless mixed-precision pretraining in JAX/MaxText. Achieves 1.73x throughput gain over FP8 on Llama 3.1 405B (GB300). Techniques like micro-block scaling and Random Hadamard Transform boost performance but lock users into NVIDIA hardware.

NVIDIA Other 2026-06-04

NVIDIA Nemotron 3 Ultra: A MoE-Based Control Plane for Cost-Efficient AI Agent Orchestration

NVIDIA launches Nemotron 3 Ultra, a 550B-parameter MoE model (55B active) purpose-built for AI agent orchestration. Featuring Multi-Teacher On-Policy Distillation (MOPD) and a Hybrid Mamba-Transformer architecture, it achieves 5x throughput and 30% cost savings on tasks like SWE-bench, signaling a shift of reasoning control to a layered agent system.

Cisco Other 2026-06-03

Cisco Agent Gateway: Zero Trust Evolves from Access to Action Control for AI Agents

Cisco launches Agent Gateway for Secure Access, extending Zero Trust from access control to action-level control for AI agents. Using Duo for agent identity, it enforces policies across LLMs, MCP servers, and SaaS APIs, with server-side credential injection and unified audit—addressing the unique security challenges of autonomous agent workflows.

Microsoft Azure Product Launch 2026-06-03

Microsoft Maia 200 Mass-Produced, Cobalt 200 Previewed: AI Inference Control Shifts to Azure

At Build 2026, Microsoft announced mass production of Maia 200 AI inference chips, preview of Cobalt 200 ARM processors, and the MAI-Thinking-1 reasoning model (35B params). This signals a full-stack vertical integration to reduce NVIDIA dependency and lock Azure AI workloads.

Microsoft Other 2026-06-02

Microsoft Build 2026: Unifying Agent Stack from Chip to Cloud

At Build 2026, Microsoft unveiled a comprehensive agent-era platform: Project Solara (chip-to-cloud), Microsoft IQ (unified grounding), Rayfin (backend generation), Azure HorizonDB, and GPU-accelerated analytics. The goal is to lock developers into Microsoft's ecosystem.

Google Other 2026-06-02

Google's gcs-analytics-core Library Boosts Iceberg and Spark Performance on GCS

Google Cloud announces gcs-analytics-core, an open-source Java library integrated into Iceberg 1.11.0+ GCSFileIO. It uses vectored I/O and smart Parquet prefetching to reduce scan latency. TPC-DS benchmarks show 18%-71% scan time improvement, but execution time gains are modest for large datasets (1.58% at 10TB).

Meta Other High Signal 2026-06-02

Build 2026: Project Polaris Replaces GPT-4 Turbo, GitHub Copilot Decouples from OpenAI

Microsoft unveiled Project Polaris in-house coding model at Build 2026, planning to replace OpenAI GPT-4 Turbo as GitHub Copilot's default inference engine starting August 2026, with a 3-month transition period. This marks Microsoft's first formal decoupling from OpenAI at the model layer. Anthropic Claude has been integrated into Copilot, supporting multi-model draft+review collaborative workflows. Microsoft publicly named Claude as a primary target for the first time. Strategic signal: model self-reliance, distribution and runtime are durable moats.

NVIDIA Other 2026-06-01

NVIDIA Alpamayo: Closed-Loop RL Post-Training Bridges AV Sim-to-Real Gap

NVIDIA's Alpamayo platform introduces AlpaGym, an open-source, high-throughput closed-loop RL post-training framework. It integrates AlpaSim simulator, Cosmos-RL distributed training, and Physical AI datasets, enabling AV models to learn from the consequences of their own actions in simulation, significantly reducing the gap between training and deployment.

NVIDIA Other 2026-06-01

NVIDIA Cosmos 3: Open-Source Physical AI Model with MoT for Ecosystem Lock-in

NVIDIA releases Cosmos 3, a unified physical AI foundation model with Mixture-of-Transformers architecture combining reasoning, world generation, and action generation. Open-sourced with training scripts and six synthetic datasets, but deployment optimized for NVIDIA NIM and GPUs, signaling an ecosystem lock-in strategy.

NVIDIA Other 2026-06-01

NVIDIA DSX OS: Open Source Software to Seize AI Factory Control Plane

NVIDIA launches DSX OS, an open-source modular software suite for operating AI factories. Components include DSX Exchange, MaxLPS, NICo, NVSentinel, etc., unifying IT/OT, power optimization, and lifecycle management. Claims 40% more GPUs under fixed power, but core relies on NVIDIA proprietary hardware, aiming to lock users into its ecosystem.

Google Other 2026-05-29

Google Launches A2UI: Open Protocol for Agent-Driven UI in Gemini Enterprise

Google introduces A2UI, an open protocol enabling AI agents to return JSON payloads describing interactive UI components (date pickers, maps) for native rendering in Gemini Enterprise. It integrates with A2A and Flutter, solving the text-only limitation while preventing HTML injection.

Other Other 2026-05-22

BadHost CVE-2026-48710: Starlette Auth Bypass Exposes AI Agent Infrastructure to HTTP Smuggling

BadHost (CVE-2026-48710) exploits Starlette's inconsistent URL reconstruction via Host header injection, bypassing path-based auth. Affecting 400K+ repos including FastAPI, vLLM, and MCP Server, it exposes AI Agent infrastructure to data theft and potential RCE, forcing a security paradigm shift in HTTP parsing.