Filter

×
Active Filters Clear All
Keyword: 算力 ×
71 Total Reports
1/4 Page
AMD Other 2026-06-12

AMD Backs All-Instinct GPU Cloud: TensorWave's $350M Series B Signals NVIDIA Ecosystem Breakout

TensorWave closes $350M Series B led by Magnetar and AMD Ventures at $1.55B valuation. The cloud is exclusively built on AMD Instinct GPUs (MI300X to MI455X), targeting memory-intensive AI workloads to offer a viable alternative to NVIDIA CUDA lock-in and validate ROCm software stack maturity in production.

Huawei Product Launch 2026-06-05

Huawei Cloud Launches AICS: Control Plane Shift in the Token Industrialization Era

Huawei Cloud unveils four Agentic Infra products, led by the AICS cluster (100K cards/200 EFLOPS). It integrates NPU-direct CMS memory, CCE VolcanoNext unified scheduling, and AgentSphere security sandbox to create a unified control plane for LLM training and Agent inference, aiming to lock in the full-stack AI infrastructure.

NVIDIA Other High Signal 2026-06-02

GTC Taipei 2026: DSX Open Source Data Center Platform, 40% More Chips Under Same Power

NVIDIA launched open-source data center software platform DSX at GTC Taipei 2026, providing planning, deployment, and monitoring tool suite. Key advantage: deploy up to 40% more accelerator chips under same power budget. Huang claims zero-cost factory digital twins. Also launched DGX Station for Windows, 748GB unified memory, 20 petaflops FP4, Q4 2026 availability.

NVIDIA Other 2026-06-01

NVIDIA DSX: Open-Source Power Orchestration Steals AI DC Control Plane

NVIDIA unveils DSX, an open-source DC platform that enables 40% more accelerators under the same power budget via software-defined power orchestration and digital twin validation. It shifts DC control from hardware to NVIDIA's software stack.

NVIDIA Other 2026-06-01

NVIDIA RTX Spark: SoC Seizes PC Control, AI Compute Revolution with Ecosystem Lock-in

NVIDIA launches RTX Spark SoC, integrating Blackwell GPU with 20-core Grace CPU (MediaTek co-designed), NVLink-C2C at 600GB/s, up to 128GB unified memory, 1 petaflop FP4 AI, and local 120B-parameter LLM support. This marks a shift from GPU vendor to platform provider, directly challenging Apple M, Qualcomm, and x86 incumbents.

NVIDIA Product Launch 2026-05-29

NVIDIA Blackwell Ultra GB300 NVL72: 1.44 EFLOPS FP4, 50x AI Factory Boost

NVIDIA launches Blackwell Ultra GB300 NVL72 rack system with 72 Blackwell Ultra GPUs and 36 Grace CPUs, delivering 1,440 PFLOPS FP4 sparse, 20TB HBM3e, 130TB/s NVLink. Claims 50x AI factory output over Hopper. Available now.

Apple Other 2026-05-25

Apple Registers genai.apple.com, Siri Standalone App and Extensions System Open Third-Party AI Gateway

Apple registers genai.apple.com before WWDC 2026, signaling generative AI as a platform pillar. Siri becomes a standalone app with personal context, on-screen understanding, and deep app actions. Powered by Google Gemini on Private Cloud Compute. Extensions system lets third-party AI (Claude, Gemini) plug in, with Apple taking a cut.

Intel Other 2026-05-25

Intel CEO: AI Inference Flips CPU/GPU Ratio, Multi-Agent Pushes CPU Back to Core

Intel CEO Lip-Bu Tan forecasts AI inference driving CPU/GPU ratio from 1:8 to 1:1 or even 4:1, with Multi-Agent demands (OS scheduling, KV Cache offload, high-concurrency tool calls) elevating CPU from supporting role to lead. NVIDIA Vera, AMD Venice, and Intel 18A CPU mass production confirm a CPU demand super-cycle.

Anthropic Other 2026-04-29

Behind Anthropics 900B Valuation: How Cross-Cloud Compute Reshapes Vendor Lock-in Risks in Enterprise AI Procurement

Anthropics 900B valuation funding is underpinned by a tri-cloud compute strategy. Enterprises using Claude simultaneously bind to AWS Google and NVIDIA escalating vendor lock-in from single-cloud to cross-cloud architectural lock-in

Google Other 2026-04-22

Google Cloud Next 26 Opens: Agentic Cloud Strategy Announced

Google Cloud Next 26 opens with enterprise Agentic AI full-stack.

Google Other 2026-04-22

Google Global Compute Pooling: Resource Utilization Jumps from 35% to 85%

Google launches global compute pooling technology, boosting resource utilization from 35% to 85%+, reducing costs by 40%+.

Google Product Launch 2026-04-22

Google TPU v8 Launches: Single Cluster Breaks 40 ExaFLOPS

Google launches TPU v8 chip with 40+ ExaFLOPS single cluster capacity, supporting millions of concurrent agents, 3x compute density and 2x energy efficiency improvement.

Meta Financial News Medium Signal 2026-04-19

Meta's 2026 Strategy: Labor-to-Compute Reallocation at Extreme Scale

Meta's strategic choice represents 'endgame thinking' in AI infrastructure arms race—not how to profit but how to survive. When capex reaches 50%+ of revenue, this is no longer a business decision but survival bet. The 'relative value' of labor costs has undergone fundamental revaluation in the AI era.

NVIDIA Product Launch High Signal 2026-04-15

NVIDIA Rubin Era: 1.8kW GPU TDP and Mandatory Liquid Cooling Reshape Data Centers

NVIDIA's mandatory liquid cooling is a landmark event in AI infrastructure 'qualitative change' of physical form. When chip power exceeds 1.8kW, air cooling physical limits are breached, the entire data center industry chain—from power architecture, cooling systems to building structure—must be redesigned. This isn't technology upgrade but paradigm shift.

Anthropic Partnership High Signal 2026-04-14

Anthropic GW-Scale TPU Deal: Compute Enters Nuclear Era

Anthropic secured multi-gigawatt next-gen TPU compute from Google and Broadcom, expected online in 2027 for frontier Claude model training. ARR exceeded $30B (3x in 3 months), AI infrastructure investment threshold enters nuclear power plant level.

Microsoft Other High Signal 2026-04-06

Microsoft Partners with Domestic Operators to Build Sovereign AI Infrastructure in Japan

Microsoft announced a $10B investment in Japan over four years, with a key pillar being a collaboration with Sakura Internet and SoftBank. This partnership will offer GPU-based AI compute services through Azure, managed by domestic providers to ensure data residency within Japan. This addresses the demand for sovereign AI infrastructure for sensitive workloads.

Anthropic Other High Signal 2026-04-06

Anthropic Locks in Multi-Gigawatt Next-Gen TPU Capacity with Google and Broadcom

Anthropic has signed a new agreement with Google and Broadcom to secure multiple gigawatts of next-generation TPU capacity, expected online starting 2027. This expansion aims to power frontier Claude models and meet surging global customer demand. The partnership significantly expands Anthropic's $50 billion U.S. compute infrastructure commitment.

Amazon Other Medium Signal 2026-03-30

AWS and TGS Strategic Partnership for Energy AI and HPC Transformation

TGS selected AWS as preferred cloud provider, leveraging AWS HPC and generative AI for energy exploration solutions. Collaboration includes modernizing TGS Imaging AnyWare platform and deploying multimodal Subsurface Foundation Model with AWS Nitro security.

NVIDIA Other High Signal 2026-03-26

NVIDIA Forms Nemotron Coalition to Advance Open Frontier Models

NVIDIA announced the Nemotron Coalition at GTC, a collaboration with model builders and AI labs like Mistral AI to advance open, frontier-level foundation models. The initiative aims to foster the open model ecosystem by sharing expertise, data, and compute, emphasizing a future where AI is powered by a system of both open and proprietary models.

NVIDIA Other High Signal 2026-03-26

NVIDIA Forms Open Model Alliance to Advance Nemotron Ecosystem

NVIDIA launched the first open frontier model alliance with Mistral AI to co-develop foundation models. Members share data, compute and expertise for post-training support, with Nemotron models exceeding 45M downloads. This move aims to advance open model innovation against closed ecosystems.