Reports
AI-generated structured vendor updates
Lexar Offloads AI Models to SSD: DRAM Cut 40%, Latency Remains Hurdle
Lexar unveils AI Storage Core SSD with a custom SPU DRAM-less controller and software stack, offloading LLMs to NAND Flash. It runs Qwen 3.5 122B on 32GB DRAM at 15.6 tokens/s (3x improvement), but TTFM latency of 2-8 seconds hinders real-time use.
NVIDIA Blackwell Sweeps MLPerf: NVLink and NVFP4 Redefine AI Training Economics
NVIDIA Blackwell dominates MLPerf Training 6.0, submitting across all seven benchmarks including MoE workloads. GB300 NVL72 delivers up to 1.6x faster training than GB200, with fifth-gen NVLink unifying 72 GPUs as one giant GPU. NVFP4 low-precision training and massive scale (8,192 GPUs) set new industry standards.
SiMa.ai Palette Neat: Natural-Language Agentic Environment Dismantles NVIDIA's GPU Moat
SiMa.ai launches open-source Palette Neat, an agentic development environment for Physical AI, paired with its sub-10W Modalix SoM. It uses natural language to abstract compute complexity, slashing dev cycles from months to days. Pin-compatible with NVIDIA SoM, it targets breaking the GPU ecosystem lock-in.
ARM AGI CPU Enters Mass Production with $2B Pre-Orders, Shifting AI Inference to ARM
ARM's self-developed AGI CPU has entered mass production with TSMC, securing $2B in pre-orders. Partnering with Red Hat, ARM aims to bring enterprise software stacks to its CPU, signaling a strategic shift from IP licensing to chip manufacturing and challenging x86 in AI inference.
NVIDIA RTX Spark SoC Invades Windows PC: Arm CPU + GPU with 128GB Unified Memory Reshapes AI PC
At HPE Discover 2026, NVIDIA unveiled the RTX Spark SoC for Windows PCs, built on TSMC 3nm with a MediaTek-designed Arm CPU, 70B transistors, and up to 128GB unified memory. This marks NVIDIA's official entry into the PC SoC market, directly challenging Intel, AMD, and Qualcomm in the AI PC segment.
AMD Critical RCE Vulnerability Disclosed After 124 Days, Sparks AI Infrastructure Security Crisis
Security researcher mr.bruh publicly disclosed a critical remote code execution (RCE) vulnerability in AMD processors after 124 days without a fix, with AMD refusing a $10,000 bounty. The flaw affects AI servers running AMD EPYC and Instinct, likened to a Log4j moment for AI infrastructure, forcing enterprises to reassess chip-level security response and supply chain risk.
Microsoft Work IQ Agent-First Platform Shifts Enterprise Integration Control from Developers to AI Runtime
Microsoft launched Work IQ, an agent-first enterprise platform replacing traditional app connections. AI agents dynamically discover data structures at runtime without manual coding. Alongside Copilot super app, Scout personal assistant, and Project Solara, Microsoft pivots to agent-centric architecture.
MediaTek Doubles AI ASIC Target to $2B, Challenges Broadcom in Data Center Custom Silicon
MediaTek doubles its 2026 AI ASIC revenue target to $2B, leveraging Google hyperscaler deals and the NVIDIA RTX Spark chip (featuring MediaTek's N1X Arm CPU). It aims for 10-15% of the $70-80B custom AI chip market by 2027, directly challenging Broadcom's dominance.
CrowdStrike Reimagines AI Agent Security with SPIFFE-Based Continuous Authorization
CrowdStrike launches Continuous Identity for AI Agents, using SPIFFE to issue verifiable identities to each agent. It enforces real-time authorization based on owner, caller, and device risk, eliminates standing privileges, and maintains context across delegation. Falcon AI monitors prompts for intent abuse.
TSMC Discloses Glass Substrate Pilot, Packaging Paradigm Shifts
TSMC, with Ibiden and Innolux, publicly discloses glass substrate integration into CoWoS for advanced packaging. Glass offers superior electrical and thermal properties over organic substrates, enabling larger dies and higher density. Mass production is distant; CoPoS remains near-term priority.
MediaTek Pivots from Chip Design to System-Level Integration, Targeting Google TPU and Musk AI Racks
MediaTek elevates its AI strategy from chip design to system-level integration, targeting Google TPU v10 PCBA and Musk-affiliated AI rack assembly. Using an asset-light model and Taiwan's supply chain, it aims for 40-50% gross margin in system integration.
Qualcomm's $8B Tenstorrent Bet: A RISC-V Chiplet Lock-in Play
Qualcomm is in talks to acquire AI chip startup Tenstorrent for $8-10 billion, targeting its RISC-V-based AI accelerators and chiplet technology. This move aims to reduce Arm dependency and bolster data center AI inference capabilities, marking a strategic pivot from mobile to infrastructure.
OpenAI Faces Multi-State AG Probe: Pre-IPO Regulatory Wave Redefines AI Compliance
OpenAI faces multi-state AG investigations ahead of its IPO, targeting consumer protection, data management, minors' safety, and sensitive info handling. This forces the AI industry to overhaul compliance standards, pushing enterprises to reassess data sovereignty and legal exposure.
HBM Bottleneck Reshapes AI Infrastructure: Asian Memory Makers Gain Leverage Over Nvidia
SK Hynix, Samsung, and Micron have crossed $1 trillion market cap as HBM becomes the hard limit in AI infrastructure. Asian suppliers now account for 90% of Nvidia's production costs, shifting the bottleneck from GPU compute to stacked memory and advanced packaging.
AMD and Rackspace Deploy 30MW Governed AI Stack: Ecosystem Restructuring from Silicon to Outcomes
AMD and Rackspace sign a definitive agreement to deploy 30MW of AMD AI compute (Instinct GPUs including MI355X, EPYC CPUs) across Rackspace's data centers, creating a governed enterprise AI stack with single accountability from silicon to outcomes, targeting regulated industries.
Apple Rebuilds Siri with Google Gemini, Cuts Legacy Hardware Support
Apple rebuilds Siri using Google Gemini-derived capabilities, introducing five new AFM 3 foundation models (including a 20B-parameter multimodal on-device model). The move is paired with the sharpest hardware support cut in watchOS 27, limiting to S9/S10 chips, signaling a strategic shift from vertical integration to hybrid AI partnerships and accelerated hardware refresh cycles.
AMD Ryzen 10000 Series to Swap iGPU for NPU: AI Boost at Cost of Basic Display
Leaks suggest AMD's next-gen Zen 6 desktop CPU 'Olympic Ridge' will replace the integrated GPU with an NPU, targeting >40 TOPS for Copilot+ AI PC certification. It also upgrades the client I/O die to support CUDIMM/CAMM and EXPO 1.2 for faster DDR5. The trade-off boosts local AI but forces nearly all users to rely on a discrete GPU for basic display.
D-Wave's Dual-Platform Quantum Push: Annealing and Gate-Model Convergence Challenges IBM
D-Wave reported $33.4M Q1 bookings (up 2000% YoY), with 73% commercial revenue. Its dual-platform strategy (annealing + gate-model) targets 100 logical qubits by 2032. CEO challenges industry hype, urging focus on real customers and published results.
SailPoint Acquires Entro to Dominate Non-Human Identity and AI Agent Security
SailPoint announces acquisition of Entro to integrate non-human identity discovery, credential security, and NHIDR technology into its Agentic Fabric framework. The move aims to provide unified visibility and governance for AI agents, machine identities, and other non-human entities.
CrowdStrike Continuous Identity for AI Agents Shifts Control Plane
At Identiverse 2026, CrowdStrike launched Continuous Identity for AI Agents, a Falcon Next-Gen Identity Security capability. Using SPIFFE for verifiable agent identity, it dynamically grants/revokes access based on real-time risk, eliminates standing privileges, and integrates with Falcon AIDR to detect privilege misuse, shifting the identity control plane from static policies to continuous risk assessment.