Filter

×
Active Filters Clear All
Keyword: OpenAI ×
169 Total Reports
1/9 Page
NVIDIA Other 2026-06-14

NVIDIA Vera CPU: Seizing the AI Agent Control Plane from x86

NVIDIA unveils Vera CPU, purpose-built for AI agents, featuring 88 Olympus cores and 1.2TB/s LPDDR5X memory. Claiming 1.8x faster task completion over x86, it targets agentic AI workloads. Customers include Anthropic, OpenAI, and Oracle Cloud Infrastructure, signaling a shift of the AI control plane to NVIDIA's ecosystem.

Cisco Other 2026-06-11

Cisco AI Defense Policy Studio: Meta-Prompting Unwritten Policy into Auditable Guardrails

Cisco introduces AI Defense Policy Studio, an AI assistant that guides policy owners through authoring custom guardrails via a chat-and-review UI. It uses meta-prompting to translate informal guidance into human- and model-readable policy documents, directly deployable to Cisco AI Defense for runtime enforcement across models and applications.

OpenAI Other 2026-06-06

OpenAI Releases Dreaming V3: Background Auto-Memory for ChatGPT, 5x Compute Efficiency Gain

OpenAI released Dreaming V3 on June 4 for Plus/Pro—ChatGPT's background auto-memory system. Unlike legacy "explicitly ask to remember" model, it runs post-conversation process synthesizing preferences, constraints, active projects and time context. Key gains: ~5x efficiency enabling free user access; temporal awareness; 2x storage for Plus/Pro. Controversy: Feb arXiv study found 96% of memories system-created without prompting—potential EU AI Act scrutiny effective August.

Amazon Other 2026-06-06

AWS Bedrock New Console Embraces OpenAI/Anthropic APIs, Shifting Control to Inference Layer

AWS launches a new Bedrock console powered by the bedrock-mantle endpoint, natively supporting OpenAI and Anthropic API protocols. Users can seamlessly switch between GPT, Claude, and open-weight models. This move standardizes model access, aiming to lock users into AWS's unified inference plane while weakening individual model provider API lock-in.

Cloudflare Other 2026-06-05

Cloudflare AI Gateway Adds Identity-Driven Budgets, Seizing AI Traffic Control

Cloudflare launches spend limits and identity-driven budgets (closed beta) in AI Gateway, integrating with Cloudflare Access. It enables per-user, per-team dollar budgets with fallback routing, shifting AI cost governance from model providers to the gateway control plane.

Cisco Product Launch 2026-06-03

Cisco Cloud Control & AI Canvas: The Control Point Shifts from Hardware to the AI Decision Plane

At Cisco Live 2026, Cisco launched Cloud Control, an AI-ops platform with agentic workflows, and AI Canvas for human-agent collaboration. The platform leverages Splunk's data fabric and proprietary models trained on 40 years of Cisco data. The Silicon One architecture now unifies campus and cloud switches. This marks a strategic pivot from hardware vendor to AI platform, shifting the control point to the AI decision plane.

Microsoft Azure Product Launch 2026-06-03

Microsoft Maia 200 Mass-Produced, Cobalt 200 Previewed: AI Inference Control Shifts to Azure

At Build 2026, Microsoft announced mass production of Maia 200 AI inference chips, preview of Cobalt 200 ARM processors, and the MAI-Thinking-1 reasoning model (35B params). This signals a full-stack vertical integration to reduce NVIDIA dependency and lock Azure AI workloads.

Microsoft Other 2026-06-02

Microsoft Build 2026: Unifying Agent Stack from Chip to Cloud

At Build 2026, Microsoft unveiled a comprehensive agent-era platform: Project Solara (chip-to-cloud), Microsoft IQ (unified grounding), Rayfin (backend generation), Azure HorizonDB, and GPU-accelerated analytics. The goal is to lock developers into Microsoft's ecosystem.

NVIDIA Other High Signal 2026-06-02

GTC Taipei 2026: Vera 88-Core CPU Designed for Agents, 1.8x x86 Performance

NVIDIA launched first standalone data center microprocessor Vera at GTC Taipei 2026, directly competing with Intel Xeon and AMD EPYC for the first time. 88 custom Olympus Arm cores, monolithic mesh (not chiplet), 50% faster inter-core communication. LPDDR5X 1.2TB/s bandwidth, PCIe Gen6. Agent sandbox 1.8x x86. First customers: OpenAI, Anthropic, SpaceX. Q3 2026 production, FY CPU revenue target $20B. Marks NVIDIA's strategic leap from GPU accelerator vendor to full-stack data center platform vendor.

Meta Other High Signal 2026-06-02

Build 2026: Project Polaris Replaces GPT-4 Turbo, GitHub Copilot Decouples from OpenAI

Microsoft unveiled Project Polaris in-house coding model at Build 2026, planning to replace OpenAI GPT-4 Turbo as GitHub Copilot's default inference engine starting August 2026, with a 3-month transition period. This marks Microsoft's first formal decoupling from OpenAI at the model layer. Anthropic Claude has been integrated into Copilot, supporting multi-model draft+review collaborative workflows. Microsoft publicly named Claude as a primary target for the first time. Strategic signal: model self-reliance, distribution and runtime are durable moats.

Amazon Other 2026-06-02

AWS Hosts OpenAI GPT-5.5 & Codex: Control Shifts from Model to Cloud

AWS launches OpenAI GPT-5.5, GPT-5.4, and Codex on Bedrock via the Responses API. This integrates frontier models into AWS infrastructure for data residency and capacity management, but locks users into Bedrock's ecosystem.

Cisco Other 2026-06-01

Cisco Talos Threat Hunting Expands Across Endpoint, Network, and Identity Domains

Cisco Talos expands threat hunting to network (Cisco Firewall) and identity (Cisco Duo) domains, using an AI-driven engine for hypothesis-based searches. Findings are delivered via Cisco Security Cloud Control, targeting stealthy threats that evade alert-based detection.

NVIDIA Other 2026-06-01

NVIDIA Vera 88-Core Arm CPU: Control Plane Shifts from x86 to NVIDIA for AI Agent Workloads

NVIDIA unveils Vera, its first standalone datacenter CPU with 88 custom Arm Olympus cores, monolithic mesh, 1.2TB/s LPDDR5X bandwidth, achieving 1.8x x86 performance in agent workloads. Tightly coupled with GPUs via NVLink-C2C, Vera shifts the control plane from Intel/AMD to NVIDIA. First customers: OpenAI, Anthropic. Production Q3 2026.

NVIDIA Product Launch 2026-05-29

NVIDIA's Triple Play: Vera CPU, N1X Laptop Chip, and $6.5B Silicon Photonics Reshape AI Infra Control

NVIDIA delivers first agent-specific Vera CPU (88 Arm v9.2 cores, 1.2TB/s memory bandwidth), teases consumer N1X laptop chip, and invests $6.5B in silicon photonics. This shifts AI orchestration control from x86 to NVIDIA's Arm ecosystem, while CPO addresses memory wall, but volume production remains challenging until post-2028.

Check Point Other 2026-05-28

Check Point Agentic Exposure Validation: AI Agents Counter Autonomous Exploitation

Check Point launches Agentic Exposure Validation (AEV), using AI agents that reason like attackers. It correlates exposure data, asset context, and live threat intelligence to safely prove what is exploitable. Part of CTEM, it enables evidence-based reduction before AI-driven adversaries act.

Microsoft Other 2026-05-23

Microsoft Fara1.5 Browser Agent Open-Weight, 72% Success Rate Beats Closed-Source Rivals

Microsoft releases Fara1.5 (4B/9B/27B) browser Computer-Use Agent fine-tuned on Qwen3.5, achieving 72% success rate on Online-Mind2Web, surpassing OpenAI Operator (58.3%) and Gemini 2.5 CU (57.3%). Open-weight with MagenticLite sandbox, but suffers from visual prompt injection and credential exposure risks.

Zscaler Other 2026-05-20

Zscaler's AI-Guardian Shifts Zero Trust Control Plane to Non-Human AI Identities

Zscaler launches Project AI-Guardian with six GSIs to extend Zero Trust to AI agents, introducing AI Protect suite. The core shift treats non-human identities as first-class security principals, enabling granular access control and continuous red-teaming for AI agent ecosystems.

Google Other 2026-05-19

Google TPU 8t/8i Enables Cross-Datacenter Training, Gemini 3.5 Flash 4x Faster

Google unveils TPU 8t (training) and TPU 8i (inference) with 3x raw compute and 2x perf-per-watt. JAX/Pathways enable distributed training across 1M+ TPUs across sites. Gemini 3.5 Flash delivers 4x output tokens per second vs frontier models. SynthID adopted by OpenAI, Nvidia, Kakao, Eleven Labs.

Intel Other 2026-05-16

AI Agent Workloads Trigger Structural CPU Shortage, Arm and AMD Reshape Server Value Chain

AI inference and agent orchestration surge CPU demand, shifting CPU-GPU ratio from 1:8 to 1:1. AMD EPYC lead time 8-12 weeks, Intel Xeon up to 6 months; Arm's 3nm 136-core AGI processor co-developed with Meta/Cerebras/Cloudflare/OpenAI sees demand exceeding 200 billion USD. CPU replaces GPU as the new AI infrastructure bottleneck, with Arm and AMD reshaping the value chain.

Cisco Other 2026-05-14

Cisco Uses MRC to Push SRv6: A Stealth Power Grab in AI Networking

Cisco claims MRC protocol is built on its SRv6 architecture, highlighting application-driven networking, static routing reliability, and deterministic visibility. This is a strategic move to lock AI networking into Cisco's SRv6 ecosystem, countering NVIDIA's Spectrum-X and Arista's alternatives.