Filter

×
Active Filters Clear All
Keyword: GitHub ×
61 Total Reports
2/4 Page
Google Other 2026-05-19

Google TPU 8t/8i Enables Cross-Datacenter Training, Gemini 3.5 Flash 4x Faster

Google unveils TPU 8t (training) and TPU 8i (inference) with 3x raw compute and 2x perf-per-watt. JAX/Pathways enable distributed training across 1M+ TPUs across sites. Gemini 3.5 Flash delivers 4x output tokens per second vs frontier models. SynthID adopted by OpenAI, Nvidia, Kakao, Eleven Labs.

Amazon Other 2026-05-12

AWS AgentCore Payments: Autonomous AI Agent Spending Unlocks New Lock-in and Threat Surface

AWS previews managed payment capabilities in Bedrock AgentCore, enabling AI agents to autonomously pay for APIs, MCP servers, and web content, integrated with Coinbase and Stripe. Also launches Agent Toolkit for AWS and MCP Server GA. This pushes AI agents toward autonomous execution but introduces new security and lock-in risks.

Amazon Other High Signal 2026-05-06

AWS Upgrades Virtual Desktops to AI Agent Infrastructure Layer

AWS announced Amazon WorkSpaces now enables AI agents to securely operate desktop applications using their own identity and permissions, without requiring API integrations or application modernization. This extends virtual desktops from a human productivity tool to a universal runtime platform for enterprise AI agents, integrating with major agent frameworks via the standard Model Context Protocol (MCP).

Microsoft Other High Signal 2026-05-01

Microsoft Publishes Cybersecurity Responsibility Framework for AI Era, Emphasizing Public-Private Collaboration and Modernized Vulnerability Management

Microsoft published a framework on securing the global digital ecosystem with next-generation AI, arguing that as AI accelerates vulnerability discovery, response and remediation must keep pace. The document outlines five recommendations, emphasizing public-private collaboration, responsible release of AI capabilities, and modernizing vulnerability management processes.

NVIDIA Other High Signal 2026-05-01

NVIDIA Collaborates with OpenClaw via NemoClaw to Drive Secure Enterprise Autonomous AI Agent Deployment

NVIDIA introduces NemoClaw, a reference implementation that bundles OpenClaw with the OpenShell secure runtime and Nemotron open models, providing a blueprint for secure enterprise deployment of long-running autonomous AI agents. This move addresses the 1000x inference demand surge and security governance challenges, shifting the AI infrastructure control point towards local, secure, and auditable architectures.

Cloudflare Other 2026-05-01

Cloudflare Dynamic Workflows: Control Plane Shift to Per-Tenant Durable Execution

Cloudflare launches Dynamic Workflows, a library enabling per-tenant dynamic dispatch of durable execution code at runtime. Built on Dynamic Workers, it allows Worker Loader to route and isolate tenant workflows with zero idle cost. Targets multi-tenant SaaS, AI agents, and CI/CD, but creates ecosystem lock-in around Cloudflare runtime.

Cisco Other High Signal 2026-04-30

Cisco Publishes Model Provenance Constitution, Defining Weight-Level Derivation Standards

Cisco published the 'Model Provenance Constitution' to provide a normative definition for AI model supply chain safety. The standard strictly hinges on the verifiable derivation history of model weights, clearly delineating five types of provenance links (e.g., direct descent, distillation) and eight exclusions (e.g., independent reproduction), aiming to resolve industry inconsistencies in model provenance definitions.

Cisco Other High Signal 2026-04-30

Cisco Open Sources Model Provenance Kit, Targeting AI Supply Chain Security Governance

Cisco released the open-source Model Provenance Kit, which uses a tiered strategy to analyze model metadata, tokenizer structure, and weight-level signals to generate unique fingerprints and verify the lineage and integrity of AI models. This aims to address risks of tampering, forgery, and compliance in the AI model supply chain.

Microsoft Other High Signal 2026-04-30

Microsoft Defines ‘Agentic Computing Era’, Positions AI Infrastructure and Agent Platform as Core Strategy

Microsoft's CEO, post-earnings, explicitly identifies the shift from end-user-driven workloads to those driven by both end-users and agents as a platform shift that will change the entire tech stack. The company's strategy is focused on building leading AI infrastructure and an agent platform, having already grown its AI business to a $37 billion annual run rate.

Amazon Other High Signal 2026-04-29

AWS Platformizes AI Agents and Deepens Cloud Integration with OpenAI

At its annual event, AWS announced the productization of AI agent capabilities, launching the personal AI assistant for work, Amazon Quick, and expanding Amazon Connect into four vertical-specific Agentic AI solutions. Concurrently, AWS and OpenAI expanded their partnership, deeply integrating the latest models, Codex, and managed agent services into the Amazon Bedrock platform.

NVIDIA Other High Signal 2026-04-28

NVIDIA Drives Manufacturing into 'Simulation-First' Era with OpenUSD and Omniverse

NVIDIA introduces a comprehensive physical AI stack centered on the SimReady standard, Omniverse simulation libraries, and the Metropolis VSS Blueprint. This aims to transform manufacturing's traditional 'design-build-test' cycle into a 'simulation-first' paradigm, enabling AI model training and system validation in high-fidelity virtual environments to drastically reduce product cycles and costs.

ARM Other High Signal 2026-04-28

Arm Launches Performix Performance Toolkit, Targeting AI Agent Era Optimization

Arm launched Performix, a free performance analysis toolkit designed to provide unified performance insights and optimization across the Arm platform for AI agent development. Integrated into mainstream AI dev environments via the Arm MCP Server, it turns runtime hardware data into actionable optimization guidance, with support from ecosystem partners like Microsoft and MongoDB.

Microsoft Other High Signal 2026-04-25

Microsoft Integrates GPT-5.5 into Enterprise Copilots, Advancing Multi-Model Workflow Orchestration

Microsoft announced the deployment of the GPT-5.5 model across GitHub Copilot, Microsoft 365 Copilot, Copilot Studio, and Foundry. The update emphasizes multi-model orchestration, enabling users to select different models for tasks (e.g., fast scaffolding, deep reasoning, execution, review) and introduces a 'Rubber Duck' agent for multi-model reflection loops.

Microsoft Other High Signal 2026-04-23

Microsoft Makes Copilot Agent Mode Default in Office, Pushing AI-Native Workflows

Microsoft announced the general availability and default setting of "Agent Mode" for Copilot in Word, Excel, and PowerPoint. This mode enables AI to reason and perform multi-step operations directly on the document canvas, signaling a shift from assistive tool to embedded AI collaborator.

Cisco Other High Signal 2026-04-11

Cisco Shares Enterprise AI Assistant Patterns, Emphasizing Deterministic Security and Guided Interaction

Based on 18 months of production experience with its Customer Experience AI Assistant, Cisco identifies non-obvious patterns critical for enterprise AI success. Key insights include enforcing RBAC via deterministic code (not LLM prompts), proactively disambiguating enterprise acronyms, minimizing clarification loops, and providing guided follow-up questions grounded in actual system capabilities.

Cisco Other Medium Signal 2026-04-08

Cisco Integrates AI into MSP Operations via ThousandEyes MCP Server

Cisco announced the ThousandEyes Model Context Protocol (MCP) server. It integrates ThousandEyes' network and digital experience intelligence directly into AI assistants (e.g., Claude, ChatGPT), enabling MSP analysts to perform advanced diagnostics via natural language. This aims to boost operational efficiency and transform the MSP service model.

Microsoft Other High Signal 2026-04-06

Microsoft Partners with Domestic Operators to Build Sovereign AI Infrastructure in Japan

Microsoft announced a $10B investment in Japan over four years, with a key pillar being a collaboration with Sakura Internet and SoftBank. This partnership will offer GPU-based AI compute services through Azure, managed by domestic providers to ensure data residency within Japan. This addresses the demand for sovereign AI infrastructure for sensitive workloads.

Google Other High Signal 2026-04-03

Google Introduces Flex and Priority Inference Tiers for Gemini API

Google adds Flex and Priority service tiers to its Gemini API. Flex is a cost-optimized tier offering a 50% price reduction for latency-tolerant workloads via a synchronous interface. Priority is a high-reliability tier ensuring critical requests are not preempted during peak loads. This provides developers a unified way to balance cost and reliability based on AI task types, such as background agentic workflows versus interactive applications.

Google Other High Signal 2026-04-03

Google Launches Gemma 4 Open Models, Targeting Edge Inference and AI Agent Architecture

Google introduces the Gemma 4 open model family, with four sizes from 2B to 31B parameters, emphasizing breakthrough intelligence-per-parameter and native support for agentic workflows, multimodality, and long context. The small models are engineered for edge devices, aiming to bring frontier reasoning to mobile and IoT scenarios.

Google Other Medium Signal 2026-04-03

Google Introduces Flex and Priority Tiers for Gemini API

Google adds Flex and Priority service tiers to Gemini API, enabling developers to optimize cost and reliability through a single interface. Flex offers 50% cost savings for latency-tolerant workloads, while Priority ensures highest reliability for critical apps. This change simplifies management of synchronous/asynchronous tasks in AI agent architectures.