69 Total Reports
AMD Other 2026-06-11

AMD, Dell, Cambridge Launch UK Sovereign AI Lab to Challenge NVIDIA's CUDA Dominance with Open ROCm

AMD, Dell, and the University of Cambridge launch the Sovereign AI Innovation Lab (SAIL) in the UK, deploying the Zenith supercomputer with 5th Gen EPYC CPUs and Instinct MI355X GPUs, plus the Sunrise fusion AI system. The lab promotes open, interoperable AI infrastructure based on AMD ROCm, challenging NVIDIA's CUDA lock-in and offering long-term technology choice for national AI initiatives.

AMD Other 2026-06-10

AMD EPYC Challenges Rack-Scale Density for Agentic AI Control

AMD claims its EPYC processors lead in rack-scale performance for the CPU-intensive services behind agentic AI (orchestration, caching, databases). Under a 100kW rack model, AMD says the EPYC 9965 'Turin' delivers 2.37x the throughput of NVIDIA Vera, with next-gen 'Venice' projected at 3.30x. AMD emphasizes deployability on current x86 platforms, avoiding dependency on future architectures.

Cisco Other 2026-06-04

Cisco AI Defense + AppOmni Extends Runtime Guardrails to SaaS AI Agents

Cisco integrates AI Defense with AppOmni, using AgentGuard as a real-time intercept layer inside SaaS environments. Custom guardrails now apply to Microsoft 365 Copilot, ServiceNow Now Assist, and other SaaS agents, monitoring MCP, chat, and agent-to-agent channels to block prompt injection, tool exploitation, and data exfiltration with a unified policy engine.

Microsoft Other 2026-06-02

Microsoft Build 2026: Unifying Agent Stack from Chip to Cloud

At Build 2026, Microsoft unveiled a comprehensive agent-era platform: Project Solara (chip-to-cloud), Microsoft IQ (unified grounding), Rayfin (backend generation), Azure HorizonDB, and GPU-accelerated analytics. The goal is to lock developers into Microsoft's ecosystem.

Intel Other 2026-06-02

Intel at Computex 2026: 18A, Rackscale, and the Shift to CPU-Centric AI Orchestration

Intel unveils Core Ultra Series 3 on 18A, Xeon 6+ with 288 E-cores, a hybrid local inference orchestrator with Perplexity, rackscale AI infrastructure with Foxconn, and a disaggregated inference cloud with SambaNova. The keynote positions the CPU as the central orchestrator for agentic AI, signaling a control plane shift from GPU to x86.

ARM Other 2026-06-02

Arm-NVIDIA RTX Spark: Tightly Coupled CPU-GPU for Agentic AI PCs

The Arm-based NVIDIA RTX Spark integrates an Arm Grace CPU with an NVIDIA Blackwell RTX GPU via unified memory, enabling ultra-low-latency on-device AI inference for the agentic era. The platform marks a major milestone for Windows on Arm, targeting developers, creators, and gamers.

ARM Other 2026-06-02

Arm and NVIDIA RTX Spark: Unified Memory PC Architecture Targets Agentic AI, Encircles x86

Arm and NVIDIA unveil RTX Spark, a platform that pairs an Arm-based Grace CPU with a Blackwell RTX GPU over unified memory, targeting Windows on Arm for agentic AI inference. It delivers 1 petaflop, reduces token cost, and signals a PC paradigm shift from app-driven to agent-driven computing, backed by Microsoft.

NVIDIA Other 2026-06-02

NVIDIA DGX Spark Update: One-Click Local AI Agents, Multi-Node Cluster for 400B Models

At Computex 2026, NVIDIA updates DGX Spark with NemoClaw for one-click local AI agent setup, a 2.6x throughput boost for Qwen3.6-35B via vLLM optimizations, and a Sync cluster assistant that connects 2-4 nodes over ConnectX-7 200Gbps RoCE, enabling local deployment of large models and multi-agent pipelines.

NVIDIA Other 2026-06-01

NVIDIA FOX Blueprint Shifts Factory Control from PLCs to AI Agents on DGX

NVIDIA unveiled the Factory Operations Blueprint (FOX), a reference design for autonomous factory manager agents using NemoClaw, AI-Q Blueprint, and DGX Station (GB300 with 20 PFLOPS FP4, 748GB coherent memory). It unifies live machine signals, quality systems, and robot fleets under an AI decision layer. Foxconn, Pegatron, Advantech, and Wistron are early adopters, projecting 80% faster root cause analysis and 15% labor productivity gains.

NVIDIA Other 2026-06-01

NVIDIA Locks Taiwan Supply Chain with AI Factory Stack, Vera Rubin Production Tied to Proprietary Software

NVIDIA partners with TSMC, Foxconn, and others to embed its proprietary AI software (cuLitho, Omniverse, Isaac) into semiconductor manufacturing and server assembly, while ramping Vera Rubin NVL72 production. The move uses efficiency gains (e.g., 20-50% cycle time reduction) as bait to lock the supply chain into a full-stack ecosystem, increasing switching costs for partners.

NVIDIA Other 2026-06-01

NVIDIA BlueField DPU In-Silicon Security Shifts AI Factory Control from Software to Hardware

NVIDIA unveils the DOCA security stack (Argus, Vault, Flow) on the BlueField-4 DPU, enabling hardware-isolated runtime threat detection via zero-copy memory analysis, zero-trust file access, and 800 Gb/s network enforcement. This shifts security control from the host OS to DPU silicon, delivering distributed full-stack protection without compromising AI throughput. However, the stack is tightly tied to the Vera Rubin platform, creating ecosystem lock-in.

Google Other 2026-05-21

Google Antigravity Control Plane Redefines AI Development, Locks Agent Orchestration

At I/O 2026, Google launched the Antigravity 2.0 desktop app and CLI/SDK as a unified agent control plane, alongside the Gemini 3.5 Flash/Omni models, the Managed Agents API, and native Android support in AI Studio. The launch aims to streamline AI development from prototype to production, but effectively locks developers into Google's ecosystem and cloud services.

Intel Other 2026-05-20

Intel Core Ultra 3 SoC Replaces Discrete GPUs in Edge Robotics, Slashing TCO

Intel Core Ultra Series 3 SoC integrates CPU, GPU, and NPU to power edge robotics, replacing discrete GPUs. Partners like Sensory AI run multi-agent AI (vision, language, motion) locally, cutting TCO and eliminating cloud latency. This shifts the cost-performance curve for service robots.

AMD Other 2026-05-20

AMD Ryzen AI Halo & Max PRO 400: Local 300B Parameter Inference, but Hidden Lock-in and Thermal Limits

AMD launches Ryzen AI Halo developer platform (128GB unified memory, 200B parameter models) and Ryzen AI Max PRO 400 series (first x86 client to run 300B parameter models locally). Unified memory, ROCm optimization, and OEM partnerships aim to shift agentic AI from cloud to local, but shared memory bandwidth and thermal constraints limit real-world throughput.

Google Other 2026-05-19

Google Cloud I/O '26: A2A Protocol and Managed Agents API Shift Agent Control Plane

At Google I/O '26, Google Cloud unveiled a unified agent development toolkit featuring Antigravity 2.0, Managed Agents API, ADK 2.0, and the A2A protocol. The platform evolves Vertex AI into Gemini Enterprise Agent Platform, offering a four-rung ladder from low-code to code-first. It aims to bridge local prototyping and secure cloud deployment via a shared protocol layer, but effectively centralizes agent lifecycle control onto Google Cloud's managed plane.

Google Other 2026-05-19

Google TPU 8t/8i Enables Cross-Datacenter Training, Gemini 3.5 Flash 4x Faster

Google unveils TPU 8t (training) and TPU 8i (inference) with 3x raw compute and 2x perf-per-watt. JAX and Pathways enable distributed training on 1M+ TPUs spanning multiple sites. Gemini 3.5 Flash delivers 4x the output tokens per second of frontier models. SynthID has been adopted by OpenAI, NVIDIA, Kakao, and Eleven Labs.

Google Other 2026-05-19

Google Antigravity 2.0 Shifts Control from Model API to Agent Orchestration

Google launches the Antigravity 2.0 desktop app, the Managed Agents API, and AI Studio mobile, creating an agent-first development platform. Powered by Gemini 3.5 Flash (4x faster), it integrates deeply with Android, Firebase, and Workspace, aiming to lock developers into Google's orchestration layer.

Cisco Other 2026-05-12

Cisco Replaces Human Annotators with LLM Constitutional Definitions for AI Safety Consistency

Cisco introduces Single-Source Safety Definitions, replacing human annotators with LLMs that re-read 300+ line constitutional documents for each classification. This AI-first approach achieves a 57x reduction in inter-model disagreement, adds dual-axis scoring of intent and content, and becomes the default safety taxonomy for Cisco AI Defense, shifting control from humans to machine-readable specifications.

AMD Other Medium Signal 2026-05-07

AMD Backs SPEC CPU 2026 Benchmark, Emphasizing Open, Trusted Performance Measurement

AMD published a blog endorsing the upcoming SPEC CPU 2026 industry benchmark, emphasizing the critical role of open, reproducible CPU performance standards for customer infrastructure decisions in the AI era. The new benchmark updates its application suite and strengthens support for bare-metal cloud environments and parallel computing.

Google Other High Signal 2026-05-06

Google Launches Gemma 4 Open Models, Accelerating Local AI Agent Deployment

Google released the Gemma 4 open model family under Apache 2.0 license, introducing MoE architecture for the first time. It aims to deliver high-performance AI agent capabilities directly to mobile and edge hardware, reducing reliance on cloud clusters and enabling new local, private AI applications.