Filter

×
Active Filters Clear All
Keyword: Pricing ×
32 Total Reports
1/2 Page
MediaTek Other 2026-06-16

HBM Bottleneck Reshapes AI Infrastructure: Asian Memory Makers Gain Leverage Over Nvidia

SK Hynix, Samsung, and Micron have crossed $1 trillion market cap as HBM becomes the hard limit in AI infrastructure. Asian suppliers now account for 90% of Nvidia's production costs, shifting the bottleneck from GPU compute to stacked memory and advanced packaging.

AMD Other 2026-06-16

AMD and Rackspace Deploy 30MW Governed AI Stack: Ecosystem Restructuring from Silicon to Outcomes

AMD and Rackspace sign a definitive agreement to deploy 30MW of AMD AI compute (Instinct GPUs including MI355X, EPYC CPUs) across Rackspace's data centers, creating a governed enterprise AI stack with single accountability from silicon to outcomes, targeting regulated industries.

NVIDIA Other 2026-06-16

AMD Ryzen 10000 Series to Swap iGPU for NPU: AI Boost at Cost of Basic Display

Leaks suggest AMD's next-gen Zen 6 desktop CPU 'Olympic Ridge' will replace the integrated GPU with an NPU, targeting >40 TOPS for Copilot+ AI PC certification. It also upgrades the client I/O die to support CUDIMM/CAMM and EXPO 1.2 for faster DDR5. The trade-off boosts local AI but forces nearly all users to rely on a discrete GPU for basic display.

Research Other 2026-06-15

Z.ai GLM-5.2 Ships Usable 1M-Token Context, No Benchmarks, Two Thinking Levels

Z.ai releases GLM-5.2 with a claim of usable 1M-token context and two thinking-effort levels. No standard benchmarks are provided, raising concerns about real-world performance. The model targets replacing chunking-based RAG with native long-context reasoning.

MediaTek Other 2026-06-15

Compute Futures Market: Financializing GPU Capacity Could Reshape AI Infrastructure Procurement

Carmen Li is building a GPU pricing index and spot marketplace via Silicon Data and Compute Exchange, aiming to launch compute futures. Backed by DRW, this initiative targets GPU price volatility by standardizing compute trading, potentially creating a trillion-dollar asset class and transforming AI compute procurement.

Amazon Other 2026-06-10

Graviton5 + Nitro Formal Verification: AWS Locks AI CPU Control with ARM and Math

AWS launches Graviton5-based M9g/M9gd instances with 25% compute gain, PCIe Gen6, DDR5-8800, and the first formally verified cloud hypervisor (Nitro Isolation Engine). Meta deploys tens of millions of cores for agentic AI, marking a decisive ARM victory in cloud CPU.

Amazon Other 2026-06-10

Anthropic Claude Fable 5 on AWS: Data Retention Policy Breaches Cloud Security Boundary, Erodes Enterprise Data Sovereignty

AWS and Anthropic launch Claude Fable 5 with long-running async execution, advanced vision, and proactive self-verification. Access requires 30-day data retention and sharing with Anthropic, moving inference data outside AWS security boundary. Harmful prompts fall back to Opus 4.8, introducing complex pricing and governance risks.

Amazon Other 2026-06-06

AWS Bedrock New Console Embraces OpenAI/Anthropic APIs, Shifting Control to Inference Layer

AWS launches a new Bedrock console powered by the bedrock-mantle endpoint, natively supporting OpenAI and Anthropic API protocols. Users can seamlessly switch between GPT, Claude, and open-weight models. This move standardizes model access, aiming to lock users into AWS's unified inference plane while weakening individual model provider API lock-in.

Cloudflare Other 2026-06-05

Cloudflare AI Gateway Adds Identity-Driven Budgets, Seizing AI Traffic Control

Cloudflare launches spend limits and identity-driven budgets (closed beta) in AI Gateway, integrating with Cloudflare Access. It enables per-user, per-team dollar budgets with fallback routing, shifting AI cost governance from model providers to the gateway control plane.

Samsung Electronics Other 2026-06-02

HBM Profitability Falls Below DDR5, TrendForce Warns of Multi-Fold Price Surge in 2027

TrendForce reports that HBM per-wafer revenue fell below DDR5 64GB RDIMM in Q1 2026, making HBM less profitable. Suppliers will reallocate capacity, leading to multi-fold HBM4 contract price increases in 2027. Demand from NVIDIA Rubin Ultra and AI ASICs will further tighten supply.

Amazon Other 2026-06-02

AWS Hosts OpenAI GPT-5.5 & Codex: Control Shifts from Model to Cloud

AWS launches OpenAI GPT-5.5, GPT-5.4, and Codex on Bedrock via the Responses API. This integrates frontier models into AWS infrastructure for data residency and capacity management, but locks users into Bedrock's ecosystem.

Intel Other 2026-06-01

Intel Reclaims AI Control Plane: Xeon 6+ and E835 Target Agentic Orchestration

Intel launches Xeon 6+ (288 E-cores on 18A), E835 200GbE controllers, and Crescent Island GPU. The strategy repositions the CPU as the control plane for agentic AI orchestration and data movement, while using E835 Ethernet to standardize AI data center networking.

Samsung Electronics Other 2026-05-23

Micron Partners TSMC for Custom HBM4E Logic Dies, Targets 2027 Ramp with 1-gamma DRAM

Micron plans to ramp HBM4E in 2027, transitioning to 1-gamma DRAM and using TSMC for both standard and custom logic dies. This marks a shift from standardized HBM to customized solutions, positioning memory as a strategic asset for AI inference workloads.

Amazon Other High Signal 2026-05-06

AWS Releases Managed MCP Server for Secure AI Agent Access to AWS APIs

AWS announced the general availability of its managed Model Context Protocol (MCP) server, providing authenticated and secure access to AWS services for AI coding agents like Claude Code and Kiro. The server offers a fixed set of tools to call AWS APIs, retrieve real-time documentation, and introduces sandboxed script execution and curated 'Skills' to address production challenges such as outdated knowledge and overly broad IAM policies generated by agents.

AMD Other High Signal 2026-04-30

AMD Proposes New AI Infrastructure Networking Paradigm: From Lossless Fabrics to Intelligent Endpoints

AMD published a blog outlining seven key questions for building large-scale AI infrastructure, arguing that traditional lossless Ethernet or InfiniBand architectures face cost and complexity bottlenecks. It advocates shifting network intelligence and reliability functions from expensive, specialized switches to intelligent NICs, enabling reliable transport over standard (potentially lossy) Ethernet to reduce TCO and simplify operations.

Intel Other High Signal 2026-04-30

Intel Collaborates with ChatPPT to Launch Hybrid AI PC Edition, Driving AI Workload Localization

Intel partnered with AI app ChatPPT to launch a hybrid AI PC edition using Intel's AI Super Builder technology. This version offloads certain AI workloads (e.g., formatting) from the cloud to the local PC, reducing cloud token costs by over 50%, boosting usage duration by 32%, and enhancing data privacy.

Cisco Other High Signal 2026-04-29

Cisco Reshapes MSSP Operations with Unified Console and Agentic AI

Cisco released a strategic guide for MSSPs, focusing on driving partner adoption of its unified Security Cloud Control console and AI agent-integrated AIOps. The goal is to enable cross-vendor device management, achieve up to 70% operational efficiency gains, and guide MSSPs towards value-based service tiering and business model transformation.

Amazon Other High Signal 2026-04-29

AWS Platformizes AI Agents and Deepens Cloud Integration with OpenAI

At its annual event, AWS announced the productization of AI agent capabilities, launching the personal AI assistant for work, Amazon Quick, and expanding Amazon Connect into four vertical-specific Agentic AI solutions. Concurrently, AWS and OpenAI expanded their partnership, deeply integrating the latest models, Codex, and managed agent services into the Amazon Bedrock platform.

Cisco Other High Signal 2026-04-28

Cisco Leverages Industrial Network Refresh Cycles to Drive Native OT Security Integration

Cisco outlines its OT security strategy, advocating for embedding security features (e.g., asset discovery, network segmentation) into industrial network switches during refresh cycles, rather than deploying parallel monitoring stacks. This aims to transform security from an add-on cost into an inherent property of infrastructure, preparing for data and connectivity demands from industrial AI and automation.

Microsoft Other High Signal 2026-04-28

Microsoft Unveils Foundry Platform, Defining New Paradigm for Durable, Stateful AI Agents

Microsoft CEO Satya Nadella demonstrated durable, stateful AI agents built on the Foundry platform. The platform enables agents to run across time boundaries, orchestrate tools and models, and close the loop with evaluation and improvement over long-running workflows, marking a key evolution from conversational assistants to autonomous execution systems.