Filter

×
Active Filters Clear All
Keyword: Search ×
210 Total Reports
2/11 Page
NVIDIA Other 2026-06-11

NVIDIA Optimizes Google's DiffusionGemma for 1,000 tok/s Parallel Text Generation

NVIDIA optimizes Google DeepMind's DiffusionGemma, a diffusion-based text model generating 256 tokens per step in parallel. On a single H100, it achieves 1,000 tok/s, with deployment via NIM and NeMo. This breaks the sequential token bottleneck, slashing serving costs and latency for real-time AI.

NVIDIA Other 2026-06-11

NVIDIA Locks Local AI Inference Control with DiffusionGemma Parallel Generation

NVIDIA optimizes Google DeepMind's DiffusionGemma open model, which generates 256 tokens in parallel for 4x speedup over autoregressive models. Achieves 1000 tokens/sec on H100, 150 tokens/sec on DGX Spark, running fully locally with no cloud cost. This reinforces NVIDIA GPU's centrality in compute-bound local AI inference.

AMD Other 2026-06-11

AMD, Dell, Cambridge Launch UK Sovereign AI Lab to Challenge NVIDIA's CUDA Dominance with Open ROCm

AMD, Dell, and the University of Cambridge launch the Sovereign AI Innovation Lab (SAIL) in the UK, deploying Zenith supercomputer with 5th Gen EPYC and Instinct MI355X GPUs, plus the Sunrise fusion AI system. The lab promotes open, interoperable AI infrastructure based on AMD ROCm, challenging NVIDIA's CUDA lock-in and offering long-term technology choice for national AI initiatives.

ARM Other 2026-06-10

Arm's Neural Dawn: Dedicated Neural Accelerators Redefine Mobile GPU Roadmap

Arm and Sumo Digital unveil Neural Dawn, the first mobile game to use Unreal Engine MegaLights. By integrating dedicated neural accelerators into next-gen Mali GPUs, it delivers desktop-class ray-traced lighting within mobile power limits, signaling a shift from traditional to AI-native graphics pipelines.

Amazon Other 2026-06-10

Anthropic Claude Fable 5 on AWS: Data Retention Policy Breaches Cloud Security Boundary, Erodes Enterprise Data Sovereignty

AWS and Anthropic launch Claude Fable 5 with long-running async execution, advanced vision, and proactive self-verification. Access requires 30-day data retention and sharing with Anthropic, moving inference data outside AWS security boundary. Harmful prompts fall back to Opus 4.8, introducing complex pricing and governance risks.

Cloudflare Other 2026-06-09

Cloudflare as Customer Zero: Layered Defense Architecture Against Frontier AI Threats

Cloudflare reveals its production defense architecture against frontier AI models, using itself as customer zero. Combines WAF Attack Score, API Shield, Bot Management, Zero Trust, and MCP Server Portal. Core insight: architecture around the vulnerability matters more than patch speed, using ML scoring and positive security models to block attack variants before they hit, and contain lateral movement after a breach.

NVIDIA Other 2026-06-08

NVIDIA's UK Sovereign AI Play: From Chip Vendor to National Infrastructure Controller

NVIDIA partners with the UK government to deploy sovereign AI infrastructure via Isambard-AI (5,400 GH200 superchips) and the Sovereign AI Fund, backing local startups. This move establishes a national AI control plane, locking compute into NVIDIA's ecosystem and bypassing traditional hyperscalers like AWS and Azure.

NVIDIA Other 2026-06-08

NVIDIA and LG Build AI Factory: DSX Platform Locks Physical AI Stack

NVIDIA and LG Group jointly build an AI factory leveraging NVIDIA's DSX platform, integrating Isaac Sim/Lab, Cosmos, GR00T frameworks for robotics, autonomous driving, data centers, and sovereign AI. LG subsidiaries align cooling, robotics, and sensor components exclusively with NVIDIA, creating a fortified ecosystem.

NVIDIA Other 2026-06-08

NVIDIA and Doosan: Full-Stack Physical AI Platform Restructures Industrial Automation

NVIDIA expands collaboration with Doosan Group to integrate its physical AI stack (Isaac Sim, Cosmos, Jetson Thor) into Doosan Robotics' Agentic Robot OS, explore AI factory power (SMR, hydrogen fuel cells), and MGX ecosystem PCB materials. This move transforms NVIDIA from a GPU vendor into the central platform for physical AI and AI factory infrastructure, deeply locking industrial automation partners.

Cloudflare Other 2026-06-08

Cloudflare Embeds Live Threat Intel into WAF, Shifting Control from Manual Rules to Automated Engine

Cloudflare announces integration of real-time threat intelligence (from Cloudforce One) into its WAF engine, enabling proactive rules based on IP, attacker names, target industries, etc. Uses always-on detection with O(1) constant-time lookup for negligible latency. Currently IP-based, with plans for JA3 and domain matching.

AMD Other 2026-06-07

Обозреватели проверили Dell XPS 14 2026: автономность впечатлила, клавиатура — опять нет

Обозреватели проверили Dell XPS 14 2026: автономность впечатлила, клавиатура — опять нет2026-06-07T17:37:54+03:00Обозреватели проверили Dell XPS 14 2026: автономность впечатлила, клавиатура — опять не...

NVIDIA Other 2026-06-07

NVIDIA RTX Spark Superchip: Local AI Agents and AAA Gaming Converge in Ultra-Thin Laptops

NVIDIA unveils RTX Spark, a superchip integrating GPU, CPU, and AI acceleration for Windows PCs, delivering 1440p >100fps ray-traced gaming and local AI agent inference. Partnering with KRAFTON, NC, Riot Games, and T1, it debuts in Korean PC Bangs. This marks NVIDIA's strategic pivot from discrete GPUs to personal computing SoCs, targeting the era of personal AI.

NVIDIA Other 2026-06-04

NVIDIA Nemotron 3 Ultra: A MoE-Based Control Plane for Cost-Efficient AI Agent Orchestration

NVIDIA launches Nemotron 3 Ultra, a 550B-parameter MoE model (55B active) purpose-built for AI agent orchestration. Featuring Multi-Teacher On-Policy Distillation (MOPD) and a Hybrid Mamba-Transformer architecture, it achieves 5x throughput and 30% cost savings on tasks like SWE-bench, signaling a shift of reasoning control to a layered agent system.

Cisco Other 2026-06-04

Cisco AI Defense + AppOmni Extends Runtime Guardrails to SaaS AI Agents

Cisco integrates AI Defense with AppOmni, using AgentGuard as a real-time intercept layer inside SaaS environments. Custom guardrails now apply to Microsoft 365 Copilot, ServiceNow Now Assist, and other SaaS agents, monitoring MCP, chat, and agent-to-agent channels to block prompt injection, tool exploitation, and data exfiltration with a unified policy engine.

Cisco Other 2026-06-03

Cisco Embeds OT Security Control into Switch ASIC: From Visibility to Enforced Segmentation

At Cisco Live 2026, Cisco launches Cyber Vision updates that embed auto-policy recommendation, simulation, and line-rate enforcement directly into IE3500/IE9300 Industrial Ethernet switches using its own ASICs. Secure remote access is also integrated. This shifts OT security control from appliances to the network fabric, creating a closed loop from visibility to prevention, but locks users into Cisco's full stack.

Microsoft Other 2026-06-02

Microsoft Build 2026: Unifying Agent Stack from Chip to Cloud

At Build 2026, Microsoft unveiled a comprehensive agent-era platform: Project Solara (chip-to-cloud), Microsoft IQ (unified grounding), Rayfin (backend generation), Azure HorizonDB, and GPU-accelerated analytics. The goal is to lock developers into Microsoft's ecosystem.

Intel Other 2026-06-02

Intel at Computex 2026: 18A, Rackscale, and the Shift to CPU-Centric AI Orchestration

Intel unveils Core Ultra Series 3 on 18A, Xeon 6+ with 288 e-cores, a hybrid local inference orchestrator with Perplexity, rackscale AI infrastructure with Foxconn, and disaggregated inference cloud with SambaNova. The keynote positions the CPU as the central orchestrator for agentic AI, signaling a control plane shift from GPU to x86.

Samsung Electronics Other 2026-06-02

HBM Profitability Falls Below DDR5, TrendForce Warns of Multi-Fold Price Surge in 2027

TrendForce reports that HBM per-wafer revenue fell below DDR5 64GB RDIMM in Q1 2026, making HBM less profitable. Suppliers will reallocate capacity, leading to multi-fold HBM4 contract price increases in 2027. Demand from NVIDIA Rubin Ultra and AI ASICs will further tighten supply.

Google Other 2026-06-01

Google AlloyDB Remote MCP Server GA: Standardizing AI Agent Data Access with Open Protocol

Google Cloud announces GA of AlloyDB Remote MCP Server, enabling AI agents to securely access operational data via HTTP endpoints. Built on open MCP protocol, it offers IAM fine-grained authorization, Model Armor protection, and audit logging, integrated with AlloyDB’s ScaNN vector index (10B+ vectors, 6x speed) and AI functions, positioning AlloyDB as the single source of truth for enterprise agentic workloads.

NVIDIA Other 2026-06-01

NVIDIA FOX Blueprint Shifts Factory Control from PLCs to AI Agents on DGX

NVIDIA unveiled the Factory Operations Blueprint (FOX), a reference design for autonomous factory manager agents using NemoClaw, AI-Q Blueprint, and DGX Station (GB300 with 20 PFLOPS FP4, 748GB coherent memory). It unifies live machine signals, quality systems, and robot fleets under an AI decision layer. Foxconn, Pegatron, Advantech, and Wistron are early adopters, projecting 80% faster root cause analysis and 15% labor productivity gains.