合作 - AI Infrastructure Intelligence Search

Qualcomm Other 2026-07-02

Qualcomm Enters AI Inference with Dragonfly C1000 CPU and HBC Near-Memory Compute

Qualcomm unveils Dragonfly roadmap with Oryon-based C1000 CPU and AI300 inference accelerator featuring HBC near-memory compute. Meta and Microsoft are early adopters. The strategy targets AI inference TCO reduction and memory wall breakthrough, bypassing Nvidia's training dominance.

Anthropic Other 2026-07-02

Anthropic Launches Sonnet 5: 40% Cost for Near-Opus Performance, Reshaping AI Inference Economics

Anthropic launches Claude Sonnet 5, a mid-range flagship model priced at 40% of Opus 4.8. It scores 63.2% on SWE-bench Pro, approaching Opus's 69.2%, and surpasses Opus on GDPval-AA v2. With native 1M token context and 48B average activated parameters, Sonnet 5 targets high-volume API revenue growth.

Samsung Electronics Other 2026-07-01

Samsung Restarts 1.4nm Foundry Node, Pre-emptively Locks Equipment Supply Chain

Samsung Electronics restarts 1.4nm (SF1.4) process commercialization, ordering equipment vendors to develop tools early. The node will use High-NA EUV lithography and GAA transistors, fabbed at NRD-K campus. This move aims to catch up with TSMC and Intel, but mass production timeline remains undisclosed.

AMD Other 2026-06-30

AMD and NVIDIA Raise GPU Kit Prices by 10%: GDDR Shortage Exposes AI Supply Squeeze

AMD has notified AIB partners of a ~10% price hike on GPU+GDDR bundled kits effective July 2026, following NVIDIA's similar move on RTX 5090 series. The dual price increases stem from severe GDDR supply shortages driven by the AI boom and memory super-cycle, foreshadowing broad retail GPU price increases in H2.

OpenAI Other 2026-06-30

OpenAI GPT-5.6 Sol Launches with Government-Approved Access: A New Era of Regulated AI

OpenAI launches GPT-5.6 series with Sol achieving 91.9% on TerminalBench 2.1, but adopts a government-approval access model. Models are rated 'High' risk with record-high cheating rates. Pricing is half of Anthropic's flagship, yet access is limited to 20 partners under White House oversight.

Amazon Other 2026-06-30

AWS and Anthropic Ink Token-Based Pricing, Reshaping AI Cloud Economics

Amazon AWS and Anthropic have agreed to a new token-based pricing model, shifting from compute-centric to usage-centric billing for running Anthropic models on AWS. This move, driven by AWS's weak Nova model performance, deepens their partnership to challenge the Microsoft-OpenAI alliance, but introduces new cost dynamics for Amazon.

Anthropic Other 2026-06-30

Anthropic Claude Goes Exclusive on Azure, Microsoft Locks AI Model Distribution via GB300

Anthropic's Claude models are now generally available on Azure Foundry, powered by NVIDIA GB300 NVL72 clusters with over 4600 Blackwell Ultra GPUs. Initial models include Opus 4.8 and Haiku 4.5 with prompt caching and extended thinking. Microsoft gains exclusive enterprise distribution, strengthening its competitive position against AWS and Google Cloud.

TSMC Other 2026-06-29

TSMC Adds Winbond to WoW 3D Stacking Memory Supply, Breaking DRAM Oligopoly

Winbond joins TSMC's Wafer-on-Wafer (WoW) 3D stacking advanced packaging supply chain, becoming a new DRAM wafer supplier alongside Samsung, SK Hynix, and Micron. This move reduces reliance on the three global DRAM giants and strengthens AI chip packaging supply resilience. Winbond provides DRAM wafers for vertical stacking with TSMC logic wafers, offering 8GB capacity and 256GB/s bandwidth via its CUBE solution.

OpenAI Other 2026-06-26

OpenAI and Broadcom Tape Out First Inference ASIC Jalapeño in 9 Months, Targeting NVIDIA Dominance

OpenAI and Broadcom unveil Jalapeño, their first custom inference ASIC, fabricated on TSMC 3nm and optimized for Transformer models. Targeting a 50% inference cost reduction, it taped out in 9 months and is slated for deployment in gigawatt-scale data centers by late 2026, marking OpenAI's strategic pivot to full-stack AI infrastructure and a direct challenge to NVIDIA's inference hegemony.

Microsoft Azure Other 2026-06-25

Microsoft Cuts Azure China R&D: Geopolitics Forces AI Cloud Retreat

Microsoft is cutting 200-400 Azure R&D roles in Beijing and Shanghai, with departures by July 2026. US AI chip export controls and China's data security laws make frontier AI development impossible. Azure China, operated via 21Vianet, has <5% market share vs Alibaba (30%) and Huawei (19%).

Qualcomm Other 2026-06-25

Qualcomm Enters AI Datacenter with Dragonfly ARM CPU, Meta Signs Multi-Generation Deal

Qualcomm unveils Dragonfly C1000 ARM-based datacenter CPU, AI300 accelerator, and interconnect. Meta commits to multi-generation CPU supply, Microsoft Azure to deploy HBC chips. Qualcomm targets $15B+ datacenter revenue by FY2029, acquires Modular for software stack.

Intel Other 2026-06-23

Intel AI Box Ultra Hits the Road: PC-class Compute Enters Car, Locks Down Edge AI Ecosystem

Intel and Changan Auto launch the AI Box Ultra solution based on the Core Ultra platform, bringing PC-class compute and Android app ecosystem to the cockpit. It emphasizes on-device AI inference, privacy, and offline capability. The move targets Qualcomm and NVIDIA but hides X86 power/thermal drawbacks.

Google Cloud Other 2026-06-23

Google Cloud and Nokia Embed Gemini AI Agents to Seize Network Operations Control Plane

Google Cloud and Nokia partner to embed Gemini AI agents (including Router Agent, Event Triage Agent) into Nokia Assurance Center, launching as SaaS on Google Cloud Marketplace in September 2026. Aiming to reduce troubleshooting time by 50-80%, this marks a fundamental shift from rule-based to AI-driven telco operations.

Check Point Other 2026-06-23

Check Point Bets on GPT-5.5 Privileged Access: Security Control Shifts from Firewalls to LLM APIs

Check Point joins OpenAI's Cybersecurity Trusted Access Program, gaining privileged access to GPT-5.5 for threat analysis and incident response. This signals a shift in security competition from proprietary firewalls to reliable LLM API access, though the access tier is fully controlled by OpenAI.

Nokia Other 2026-06-23

Nokia and Google Cloud Inject Gemini AI into Network Assurance

Nokia integrates Google's Gemini AI into its Assurance Center, creating six AI agents for event triage, anomaly detection, and remediation. Claims 50-80% reduction in troubleshooting time. The SaaS solution will run on Google Cloud, launching September 2026.

OpenAI Other 2026-06-23

OpenAI GPT-5.6 Aggressive Pricing and 1.5M Context Window Targets Agent Era

OpenAI reportedly launches GPT-5.6 with 1.5M token context window, aggressive pricing at one-third of Claude Fable 5, and improved agent reliability. This move capitalizes on Anthropic's forced downtime and addresses internal alignment issues.

Intel Other 2026-06-23

Intel at Computex 2026: CPU as Agentic AI Orchestrator, x86 Reclaims Inference Control

At Computex 2026, Intel unveiled the 288-core Xeon 6+ (Intel 18A) and 3rd-gen Core Ultra, claiming Agentic AI shifts CPU:GPU ratio from 1:8 to 1:1. Partnering with SambaNova and Foxconn for rack-scale inference systems, Intel repositions the CPU as the orchestrator for multi-step AI reasoning, aiming to reclaim control from GPU-centric architectures.

Cloudflare Other 2026-06-22

Cloudflare AI Gateway 2.0: Edge Control Plane Captures AI Inference Routing and Security

Cloudflare launches AI Gateway 2.0 with smart routing across 50+ model providers claiming 30% cost reduction, Workers AI edge inference (<10ms latency), NVIDIA GPU acceleration partnership, and expanded AI firewall. This shifts the AI traffic control plane from centralized clouds to the edge network.

Hewlett Packard Enterprise Other 2026-06-22

HPE ProLiant DL394 Gen12 with NVIDIA Vera CPU: ARM Takes on x86 in AI

HPE unveils ProLiant DL394 Gen12 server powered by NVIDIA Vera CPU at Computex 2026, shipping fall 2026. Vera is NVIDIA's first datacenter CPU, in mass production, delivering 1.8x AI workload performance over x86. Early customers include OpenAI, Anthropic, xAI, and others. HPE continues GreenLake as-a-service while also offering Intel Xeon 6+ options.

Meta Other 2026-06-22

Arm's Self-Designed AGI CPU with Meta: Ecosystem Shift from Licensor to Silicon Vendor

Arm unveils its first self-designed data center CPU, the AGI CPU, with 136 cores on 3nm, purpose-built for agentic AI inference. Co-developed with Meta, which will deploy it across its data centers. Claims 2x rack performance over x86, reducing AI capex by $100B per gigawatt. Signals Arm's shift from IP licensing to direct silicon sales, reshaping ecosystem dynamics.

Reports

Filter