Filter

×
Active Filters Clear All
Keyword: Arm ×
103 Total Reports
1/6 Page
ARM Other 2026-06-16

ARM AGI CPU Enters Mass Production with $2B Pre-Orders, Shifting AI Inference to ARM

ARM's self-developed AGI CPU has entered mass production with TSMC, securing $2B in pre-orders. Partnering with Red Hat, ARM aims to bring enterprise software stacks to its CPU, signaling a strategic shift from IP licensing to chip manufacturing and challenging x86 in AI inference.

NVIDIA Other 2026-06-16

NVIDIA RTX Spark SoC Invades Windows PC: Arm CPU + GPU with 128GB Unified Memory Reshapes AI PC

At HPE Discover 2026, NVIDIA unveiled the RTX Spark SoC for Windows PCs, built on TSMC 3nm with a MediaTek-designed Arm CPU, 70B transistors, and up to 128GB unified memory. This marks NVIDIA's official entry into the PC SoC market, directly challenging Intel, AMD, and Qualcomm in the AI PC segment.

MediaTek Other 2026-06-16

MediaTek Doubles AI ASIC Target to $2B, Challenges Broadcom in Data Center Custom Silicon

MediaTek doubles its 2026 AI ASIC revenue target to $2B, leveraging Google hyperscaler deals and the NVIDIA RTX Spark chip (featuring MediaTek's N1X Arm CPU). It aims for 10-15% of the $70-80B custom AI chip market by 2027, directly challenging Broadcom's dominance.

NVIDIA Other 2026-06-16

AMD Ryzen 10000 Series to Swap iGPU for NPU: AI Boost at Cost of Basic Display

Leaks suggest AMD's next-gen Zen 6 desktop CPU 'Olympic Ridge' will replace the integrated GPU with an NPU, targeting >40 TOPS for Copilot+ AI PC certification. It also upgrades the client I/O die to support CUDIMM/CAMM and EXPO 1.2 for faster DDR5. The trade-off boosts local AI but forces nearly all users to rely on a discrete GPU for basic display.

ARM Other 2026-06-15

ARM's Pivot to Direct AI Chip Sales: From IP Licensor to Silicon Competitor

ARM accelerates its $15B chip revenue goal by shifting from pure IP licensing to direct AI chip sales, disrupting relationships with Qualcomm and Apple, and challenging Nvidia/Intel, signaling a fundamental ecosystem restructuring.

Research Other 2026-06-15

Z.ai GLM-5.2 Ships Usable 1M-Token Context, No Benchmarks, Two Thinking Levels

Z.ai releases GLM-5.2 with a claim of usable 1M-token context and two thinking-effort levels. No standard benchmarks are provided, raising concerns about real-world performance. The model targets replacing chunking-based RAG with native long-context reasoning.

MediaTek Other 2026-06-15

Compute Futures Market: Financializing GPU Capacity Could Reshape AI Infrastructure Procurement

Carmen Li is building a GPU pricing index and spot marketplace via Silicon Data and Compute Exchange, aiming to launch compute futures. Backed by DRW, this initiative targets GPU price volatility by standardizing compute trading, potentially creating a trillion-dollar asset class and transforming AI compute procurement.

AMD Other 2026-06-12

AMD Zen 6 Venice 256-Core EPYC Claims 3.3x Rack Performance Over NVIDIA Vera, But Estimates Raise Questions

AMD unveils first estimated performance of Zen 6 Venice EPYC (2nm, 256 cores), claiming 3.3x rack-level integer throughput over NVIDIA Vera at 100kW total power. A direct counter to NVIDIA's Arm push, but based on projected estimates, not silicon.

Microsoft Other 2026-06-11

Microsoft & NVIDIA RTX Spark Brings 1 Petaflop AI to Windows, Reshaping Local Inference

At Computex 2026, Microsoft unveiled RTX Spark, an Arm-based AI superchip co-developed with NVIDIA and MediaTek, delivering up to 1 petaflop AI performance and 128GB unified memory for local 120B parameter models. Intel Arc G3 and Qualcomm Snapdragon X2 series also launched, accelerating the Windows AI PC ecosystem.

Amazon Other 2026-06-10

Graviton5 + Nitro Formal Verification: AWS Locks AI CPU Control with ARM and Math

AWS launches Graviton5-based M9g/M9gd instances with 25% compute gain, PCIe Gen6, DDR5-8800, and the first formally verified cloud hypervisor (Nitro Isolation Engine). Meta deploys tens of millions of cores for agentic AI, marking a decisive ARM victory in cloud CPU.

NVIDIA Other 2026-06-10

NVIDIA Integrates BESS into AI Factory Power Architecture: Control Plane Shifts to Smart Storage

NVIDIA integrates Battery Energy Storage Systems (BESS) as a system-level component within its DSX platform for AI factories, shifting power infrastructure from passive backup to active control. BESS combines inverters, real-time telemetry, and dynamic control for load smoothing, ride-through, and faster grid interconnection, with self-qualification guidelines setting new validation standards.

ARM Other 2026-06-10

Arm's Neural Dawn: Dedicated Neural Accelerators Redefine Mobile GPU Roadmap

Arm and Sumo Digital unveil Neural Dawn, the first mobile game to use Unreal Engine MegaLights. By integrating dedicated neural accelerators into next-gen Mali GPUs, it delivers desktop-class ray-traced lighting within mobile power limits, signaling a shift from traditional to AI-native graphics pipelines.

Amazon Other 2026-06-10

Anthropic Claude Fable 5 on AWS: Data Retention Policy Breaches Cloud Security Boundary, Erodes Enterprise Data Sovereignty

AWS and Anthropic launch Claude Fable 5 with long-running async execution, advanced vision, and proactive self-verification. Access requires 30-day data retention and sharing with Anthropic, moving inference data outside AWS security boundary. Harmful prompts fall back to Opus 4.8, introducing complex pricing and governance risks.

Google Other 2026-06-09

GKE Inference Gateway Prefix Caching: 92% Faster AI Inference with Hidden Lock-in

Google Cloud launches GKE Inference Gateway with prefix caching and model-aware routing, achieving 92.8% lower TTFT and 15.7% higher throughput on Llama 3.1 8B. Snap reports 75-80% cache hit rates. However, deep integration with GKE Gateway API risks lock-in, limiting multi-cloud portability.

NVIDIA Other 2026-06-08

NVIDIA's UK Sovereign AI Play: From Chip Vendor to National Infrastructure Controller

NVIDIA partners with the UK government to deploy sovereign AI infrastructure via Isambard-AI (5,400 GH200 superchips) and the Sovereign AI Fund, backing local startups. This move establishes a national AI control plane, locking compute into NVIDIA's ecosystem and bypassing traditional hyperscalers like AWS and Azure.

NVIDIA Other 2026-06-08

NVIDIA and Doosan: Full-Stack Physical AI Platform Restructures Industrial Automation

NVIDIA expands collaboration with Doosan Group to integrate its physical AI stack (Isaac Sim, Cosmos, Jetson Thor) into Doosan Robotics' Agentic Robot OS, explore AI factory power (SMR, hydrogen fuel cells), and MGX ecosystem PCB materials. This move transforms NVIDIA from a GPU vendor into the central platform for physical AI and AI factory infrastructure, deeply locking industrial automation partners.

NVIDIA Other 2026-06-04

NVIDIA Nemotron 3 Ultra: A MoE-Based Control Plane for Cost-Efficient AI Agent Orchestration

NVIDIA launches Nemotron 3 Ultra, a 550B-parameter MoE model (55B active) purpose-built for AI agent orchestration. Featuring Multi-Teacher On-Policy Distillation (MOPD) and a Hybrid Mamba-Transformer architecture, it achieves 5x throughput and 30% cost savings on tasks like SWE-bench, signaling a shift of reasoning control to a layered agent system.

Cisco Other 2026-06-04

Cisco AI Defense + AppOmni Extends Runtime Guardrails to SaaS AI Agents

Cisco integrates AI Defense with AppOmni, using AgentGuard as a real-time intercept layer inside SaaS environments. Custom guardrails now apply to Microsoft 365 Copilot, ServiceNow Now Assist, and other SaaS agents, monitoring MCP, chat, and agent-to-agent channels to block prompt injection, tool exploitation, and data exfiltration with a unified policy engine.

Microsoft Azure Product Launch 2026-06-03

Microsoft Maia 200 Mass-Produced, Cobalt 200 Previewed: AI Inference Control Shifts to Azure

At Build 2026, Microsoft announced mass production of Maia 200 AI inference chips, preview of Cobalt 200 ARM processors, and the MAI-Thinking-1 reasoning model (35B params). This signals a full-stack vertical integration to reduce NVIDIA dependency and lock Azure AI workloads.

Microsoft Other 2026-06-02

Microsoft Build 2026: Unifying Agent Stack from Chip to Cloud

At Build 2026, Microsoft unveiled a comprehensive agent-era platform: Project Solara (chip-to-cloud), Microsoft IQ (unified grounding), Rayfin (backend generation), Azure HorizonDB, and GPU-accelerated analytics. The goal is to lock developers into Microsoft's ecosystem.