NVIDIA Other 2026-05-25

NVIDIA Vera CPU Threatens x86: 1.5x Performance, 4x Density, Full-Stack AI Lock-In

Rumors indicate NVIDIA will unveil Vera, its first general-purpose CPU, at Computex 2026, claiming 1.5x the performance of x86, 2x throughput, and 4x rack density. Shipment targets are 1.2M units in FY2027 and 4.2M in FY2028. Vera targets the shift in AI inference from a 1:8 to a 1:1 CPU-to-GPU ratio, complementing Grace to create a full GPU+CPU stack.
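
To make the ratio shift concrete, here is a minimal arithmetic sketch. Only the 1:8 and 1:1 ratios come from the summary above; the fleet size of 8,000 GPUs and the helper name `cpus_needed` are hypothetical and chosen purely for illustration.

```python
from fractions import Fraction


def cpus_needed(gpus: int, cpus_per_gpu: Fraction) -> int:
    """Return the CPUs required for `gpus` at a given CPU-per-GPU ratio, rounded up."""
    needed = gpus * cpus_per_gpu
    # Ceiling division on an exact fraction (avoids float rounding).
    return -(-needed.numerator // needed.denominator)


GPUS = 8_000  # hypothetical fleet size, not a figure from the report

before = cpus_needed(GPUS, Fraction(1, 8))  # 1 CPU per 8 GPUs
after = cpus_needed(GPUS, Fraction(1, 1))   # 1 CPU per GPU

print(before, after, after // before)  # prints "1000 8000 8"
```

Under these assumptions, the same GPU fleet would need eight times as many CPUs, which is the demand shift the shipment targets appear to be betting on.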

Microsoft Other 2026-05-23

Microsoft Fara1.5 Browser Agent Open-Weight, 72% Success Rate Beats Closed-Source Rivals

Microsoft releases Fara1.5 (4B/9B/27B), a browser computer-use agent fine-tuned on Qwen3.5 that achieves a 72% success rate on Online-Mind2Web, surpassing OpenAI Operator (58.3%) and Gemini 2.5 CU (57.3%). It is open-weight with a MagenticLite sandbox, but suffers from visual prompt injection and credential exposure risks.

Google Other 2026-05-21

Google Antigravity Control Plane Redefines AI Development, Locks Agent Orchestration

At I/O 2026, Google launched the Antigravity 2.0 desktop app and CLI/SDK as a unified agent control plane, alongside Gemini 3.5 Flash/Omni models, a Managed Agents API, and native Android support in AI Studio. The launch aims to streamline AI development from prototype to production, but effectively locks developers into Google's ecosystem and cloud services.

Cisco Other 2026-05-20

Cisco G300 Intelligent Packet Flow: Hardware-Accelerated AI Networking Breakthrough

Cisco launches Intelligent Packet Flow on Silicon One G300, transforming the fabric into an intelligent system with hardware-accelerated adaptive routing, collective congestion awareness, and telemetry. In 8K-16K GPU clusters, it reduces CCT by 87% vs ECMP, improves JCT by 82%, and unlocks 28% more GPU efficiency.
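
As a rough aid to reading these percentages, the sketch below converts a percentage reduction in a completion time into an equivalent speedup factor. It assumes both the 87% and the 82% figures are reductions in time relative to the ECMP baseline; the summary's wording ("improves JCT by 82%") does not confirm that interpretation, and `speedup_from_reduction` is a hypothetical helper name.

```python
def speedup_from_reduction(reduction_pct: float) -> float:
    """Convert a percentage reduction in time into a speedup factor.

    A reduction of r% leaves (1 - r/100) of the original time,
    so the speedup is 1 / (1 - r/100).
    """
    if not 0 <= reduction_pct < 100:
        raise ValueError("reduction_pct must be in [0, 100)")
    remaining_fraction = 1 - reduction_pct / 100
    return 1 / remaining_fraction


# Figures quoted in the summary, read as time reductions (an assumption).
print(round(speedup_from_reduction(87), 2))  # CCT
print(round(speedup_from_reduction(82), 2))  # JCT
```

Read this way, an 87% reduction corresponds to roughly a 7.7x speedup and an 82% reduction to roughly 5.6x.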

Intel Other 2026-05-20

Intel Core Ultra 3 SoC Replaces Discrete GPUs in Edge Robotics, Slashing TCO

Intel Core Ultra Series 3 SoC integrates CPU, GPU, and NPU to power edge robotics, replacing discrete GPUs. Partners like Sensory AI run multi-agent AI (vision, language, motion) locally, cutting TCO and eliminating cloud latency. This shifts the cost-performance curve for service robots.

AMD Other 2026-05-20

AMD Ryzen AI Halo & Max PRO 400: Local 300B Parameter Inference, but Hidden Lock-in and Thermal Limits

AMD launches Ryzen AI Halo developer platform (128GB unified memory, 200B parameter models) and Ryzen AI Max PRO 400 series (first x86 client to run 300B parameter models locally). Unified memory, ROCm optimization, and OEM partnerships aim to shift agentic AI from cloud to local, but shared memory bandwidth and thermal constraints limit real-world throughput.

Google Other 2026-05-19

Google TPU 8t/8i Enables Cross-Datacenter Training, Gemini 3.5 Flash 4x Faster

Google unveils TPU 8t (training) and TPU 8i (inference) with 3x raw compute and 2x perf-per-watt. JAX/Pathways enable distributed training on 1M+ TPUs across sites. Gemini 3.5 Flash delivers 4x output tokens per second vs frontier models. SynthID has been adopted by OpenAI, Nvidia, Kakao, and Eleven Labs.

Cisco Other 2026-05-14

Cisco Unified Edge: Service Providers' New Ecosystem Bundle for Edge AI Services

Cisco launches the Unified Edge platform, which integrates compute, networking, storage, and security and is managed via Intersight. It targets service providers deploying AI inference at thousands of edge sites. Verizon, an early adopter, plans to bundle edge capabilities into its enterprise connectivity offerings.

Cisco Other 2026-05-14

Cisco Uses MRC to Push SRv6: A Stealth Power Grab in AI Networking

Cisco claims the MRC protocol is built on its SRv6 architecture, highlighting application-driven networking, static routing reliability, and deterministic visibility. This is a strategic move to lock AI networking into Cisco's SRv6 ecosystem, countering NVIDIA's Spectrum-X and Arista's alternatives.

Microsoft Other 2026-05-14

Microsoft's DQI at WinHEC 2026: Shifting Driver Control from IHVs to Microsoft

At WinHEC 2026, Microsoft announced the Driver Quality Initiative (DQI), centered on transitioning third-party kernel-mode drivers to user-mode or Microsoft-authored class drivers, alongside enhanced trust verification, lifecycle management, and quality metrics. This aims to systematically improve Windows driver quality but effectively consolidates Microsoft's control over the driver ecosystem.

Cisco Other 2026-05-13

Cisco N9300 Smart Switches Embed Security into AI Data Center Fabric

At ONUG 2026, Cisco unveiled the Nexus One architecture and N9300 Smart Switches, embedding L4 segmentation, Hypershield, eBPF-based Live Protect, and a DPU-integrated firewall directly into the network fabric. This aims to deliver bottleneck-free security for AI workloads while enabling AI-driven operations via AgenticOps and AI Canvas.

Amazon Other 2026-05-12

AWS AgentCore Payments: Autonomous AI Agent Spending Unlocks New Lock-in and Threat Surface

AWS previews managed payment capabilities in Bedrock AgentCore, enabling AI agents to autonomously pay for APIs, MCP servers, and web content, with Coinbase and Stripe integrations. AWS also launches Agent Toolkit for AWS and brings its MCP Server to general availability. This pushes AI agents toward autonomous execution but introduces new security and lock-in risks.

Cisco Other 2026-05-07

Cisco-AMD Benchmark Shifts AI Fabric Control from GPU to SmartNIC and Switch

Cisco and AMD jointly release benchmarks for AI scale-out fabrics using N9000 800G switches, Pensando Pollara 400 smartNICs, and MI300X GPUs. IBPerf and MLPerf tests show P01/P99 bandwidth near 400Gbps line rate under incast congestion, proving deterministic performance that eliminates GPU stalls.

ARM Other High Signal 2026-05-07

Arm Reports Record Results, AGI CPU Emerges as New AI Infrastructure Focal Point

Arm reported record FY2026 results with $4.92B revenue and over 20% growth for three consecutive years. The core highlight is the Arm AGI CPU designed for agentic AI, securing over $2B in customer demand and backing from Meta, AWS, Google, and others.

AMD Other Medium Signal 2026-05-07

AMD Backs SPEC CPU 2026 Benchmark, Emphasizing Open, Trusted Performance Measurement

AMD published a blog endorsing the upcoming SPEC CPU 2026 industry benchmark, emphasizing the critical role of open, reproducible CPU performance standards for customer infrastructure decisions in the AI era. The new benchmark updates its application suite and strengthens support for bare-metal cloud environments and parallel computing.

Amazon Other High Signal 2026-05-06

AWS Releases Managed MCP Server for Secure AI Agent Access to AWS APIs

AWS announced the general availability of its managed Model Context Protocol (MCP) server, providing authenticated and secure access to AWS services for AI coding agents like Claude Code and Kiro. The server offers a fixed set of tools to call AWS APIs and retrieve real-time documentation. It also introduces sandboxed script execution and curated 'Skills' to address production challenges such as outdated knowledge and overly broad IAM policies generated by agents.

AMD Other High Signal 2026-05-06

AMD and OpenAI Contribute MRC Protocol to OCP for Scalable AI Networking

AMD, in collaboration with OpenAI, Microsoft, and others, contributed the MRC (Multipath Reliable Connection) protocol, designed for large-scale AI training, to the Open Compute Project (OCP). AMD co-authored the specification and has already deployed MRC on its programmable Pensando DPU/NIC products, positioning its networking technology as a key enabler for resilient and adaptive AI infrastructure.

NVIDIA Other High Signal 2026-05-06

NVIDIA Opens MRC Protocol via OCP, Pushing Standardization of AI Ethernet Fabrics

NVIDIA announced the opening of its MRC (Multipath Reliable Connection) RDMA transport protocol via the Open Compute Project (OCP). The protocol, proven on Spectrum-X Ethernet hardware, aims to enhance throughput, resilience, and GPU utilization for large-scale AI training clusters through multi-path load balancing and hardware-level failure bypass.

AMD Other High Signal 2026-05-06

AMD and OpenAI Introduce MRC, a Next-Gen Transport Protocol for AI Training

AMD, in collaboration with OpenAI, Microsoft, and other industry leaders, has released the specification for the Multipath Reliable Connection (MRC) protocol. MRC addresses performance bottlenecks of RoCEv2 in hyperscale AI training clusters through intelligent packet spraying, selective retransmission, and network-signaled congestion control, aiming to improve bandwidth utilization and job resilience.
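
The combination of packet spraying and selective retransmission can be illustrated with a toy model. This is a sketch under stated assumptions and not the MRC specification: the round-robin path choice, the per-path loss probabilities, and the `transfer` function are all hypothetical, and network-signaled congestion control is not modeled at all. Packets are sprayed across every path, and only the sequence numbers that failed to arrive are resent, shifted onto different paths in each round.

```python
import random


def transfer(num_packets, loss_by_path, seed=0, max_rounds=50):
    """Toy multipath transfer: spray packets, then resend only the missing ones.

    loss_by_path[i] is the drop probability on path i (1.0 models a dead link).
    Returns (received sequence numbers, total packets sent, rounds used).
    """
    rng = random.Random(seed)
    num_paths = len(loss_by_path)
    received = set()
    pending = list(range(num_packets))
    sent = 0
    rounds = 0
    while pending and rounds < max_rounds:
        for i, seq in enumerate(pending):
            # Spraying: spread packets round-robin over all paths, rotating
            # the starting path each round so retries avoid the same link.
            path = (i + rounds) % num_paths
            sent += 1
            if rng.random() >= loss_by_path[path]:
                received.add(seq)
        # Selective retransmission: keep only what is still missing.
        pending = [seq for seq in pending if seq not in received]
        rounds += 1
    return received, sent, rounds


# Four paths, one of which has failed completely.
received, sent, rounds = transfer(64, [0.0, 1.0, 0.0, 0.0])
print(len(received), sent, rounds)  # prints "64 85 4"
```

In this run all 64 packets arrive after 85 transmissions, whereas resending the whole window each round would have cost 64 transmissions per round. That gap is the intuition behind retransmitting selectively rather than going back to the first lost packet.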

Anthropic Other High Signal 2026-05-06

Anthropic Secures Compute Deal with SpaceX, Significantly Boosting Claude Capacity

Anthropic announced a partnership with SpaceX to utilize all compute capacity at the Colossus 1 data center, gaining over 300MW of new capacity. This move aims to directly improve service for Claude Pro and Max subscribers, with immediate increases to Claude Code and API rate limits.