Reports
AI-generated structured vendor updates
AMD Announces Breakthrough MLPerf Inference 6.0 Results, Showcasing Multinode Scaling and Multimodal Capabilities
AMD's MLPerf Inference 6.0 submission, powered by Instinct MI355X GPUs, surpassed 1 million tokens per second for the first time on models like Llama 2 70B and GPT-OSS-120B. The results highlight efficient multinode scaling, rapid enablement of new workloads (e.g., text-to-video model Wan-2.2-t2v), and reproducible performance across a broad partner ecosystem.
Check Point Launches AI Defense Plane to Shift Security Control from Models to Runtime
Check Point launched the 'AI Defense Plane', aiming to provide unified security control for AI-driven enterprises. Its core is an AI-native security engine that extends protection from model safety guardrails to runtime behavior control of AI in live environments, covering employee usage, AI applications, and autonomous agentic systems.
Cisco Discloses Memory Poisoning Attack Method in AI Coding Assistants
Cisco's security team discovered and validated a persistent memory poisoning attack method targeting AI coding assistants like Claude Code, demonstrating how tampering with MEMORY.md system files can persistently manipulate AI behavior. This vulnerability prompted Anthropic to remove user memory files' system prompt privileges in v2.1.50.
Fortinet to Announce First Quarter 2026 Financial Results
Fortinet will host a conference call on May 6, 2026, at 1:30 p.m. Eastern Time to discuss its first quarter 2026 financial results. A live webcast and replay will be available on the company's investor relations website.
ARM Launches AGI CPU Silicon, Extends AI Infrastructure Reach
ARM debuts its first self-designed AGI CPU silicon, moving beyond IP licensing to offer full-stack solutions from custom silicon to integrated platforms. This shift redefines control points in AI infrastructure supply chains, enabling enterprises to optimize AI workload deployment at hardware layer.
Intel Demonstrates AI Performance with Xeon 6 and Arc Pro GPUs in MLPerf Inference
Intel showcased the performance of its Xeon 6 CPUs and Arc Pro B-Series GPUs in the MLPerf Inference v6.0 benchmarks, particularly in handling large language models (LLMs). The results indicate that a system with four Arc Pro B70 GPUs can process 120B parameter models, delivering up to 1.8x higher inference performance in multi-GPU setups.
Cisco Implements Preventive IT Operations Through Unified Observability Platform
Cisco IT has built a unified observability platform by integrating Splunk, ThousandEyes and AppDynamics, shifting focus from MTTR to incident prevention. The AI-powered platform enables data correlation analysis, reducing major incidents by 25% and improving resolution speed by 45% over 18 months.
Google Launches Gemini API Docs MCP & Agent Skills for AI Coding Agents
Google introduces Gemini API Docs MCP protocol and Agent Skills toolkit, enabling real-time access to updated API documentation and injecting best-practice patterns to resolve outdated code generation. Combined usage achieves 96.3% pass rate with 63% fewer tokens per correct answer.
AWS Collaborates with Flagship to Accelerate Life Sciences AI Innovation
AWS announced a strategic collaboration with Flagship Pioneering, becoming the preferred cloud provider for Flagship's portfolio companies, offering cloud resources, technical support, and AI capabilities to accelerate drug discovery and scientific platform development. Flagship's early-stage companies will receive AWS cloud credits, technical support, and go-to-market resources, while internal teams gain specialized support to enhance company creation and scaling.
Qualcomm Launches NPU-Integrated Wearable Platform to Advance On-Device AI and Personal AI Ecosystem
Qualcomm unveiled the Snapdragon Wear Elite platform, its first wearable platform with an integrated NPU designed for on-device AI, capable of supporting up to two-billion-parameter models. It marks a strategic shift from smartphone-centric to agent-centric computing, leveraging wearables for continuous context and enabling intelligence to flow across a user's device ecosystem.
Cisco Proposes Unified AI Fabric Architecture for Training/Inference Traffic
Cisco introduces unified AI fabric architecture using N9000 switches to intelligently route both training and inference traffic, addressing resource inefficiencies in dual-fabric setups. The solution features silicon-level low latency, real-time telemetry and automated policy tuning, targeting neocloud providers' platform transformation.
NVIDIA Collaborates with Energy Leaders to Position AI Factories as Smart Grid Assets
NVIDIA, in collaboration with Emerald AI, proposes treating large-scale AI data centers (AI factories) as flexible, intelligent grid assets rather than static power loads. This architecture integrates accelerated computing, power networking, and control to enhance grid reliability and optimize energy efficiency. Several major energy companies plan to collaborate on this architecture to support AI workloads and accelerate power connection.
NVIDIA Collaborates with Energy Leaders on AI Factory-Grid Integration Architecture
NVIDIA and Emerald AI introduced a new architecture treating AI factories as intelligent grid assets, combining accelerated computing, real-time energy orchestration and reference designs. The Vera Rubin DSX-based approach enables dynamic grid response and has gained support from multiple energy providers.
Cisco Open Sources DefenseClaw for AI Agent Security Governance
Cisco launched open-source DefenseClaw, providing three-layer security architecture for AI agents like OpenClaw: supply chain scanning, runtime inspection, and system boundary control. The solution integrates NVIDIA's OpenShell sandbox for end-to-end automated governance.
Google Advocates for Privacy by Innovation, Shaping Data Protection for the AI Assistant Era
Google's President of Global Affairs outlined a 'privacy by innovation' vision at the IAPP summit, arguing that data protection frameworks must evolve alongside AI assistant technologies. He emphasized moving beyond traditional consent models towards context-aware controls, granular agent access management, and built-in safeguards. This represents a systemic shift in thinking about privacy and security governance in the AI era.
Google Proposes Privacy Innovation Framework for AI Assistants
Google's President of Global Affairs Kent Walker outlined a new privacy framework for the AI era at IAPP Global Summit 2026, emphasizing 'privacy as quality' through technological innovation, while demonstrating how its personalized AI assistant integrates multi-app data for proactive services.
AWS and TGS Strategic Partnership for Energy AI and HPC Transformation
TGS selected AWS as preferred cloud provider, leveraging AWS HPC and generative AI for energy exploration solutions. Collaboration includes modernizing TGS Imaging AnyWare platform and deploying multimodal Subsurface Foundation Model with AWS Nitro security.
Samsung Highlights Smart Connectivity in Consumer Microwave Ovens, but Focus Remains Outside Core Enterprise AI Infrastructure or Networking Evolution.
Samsung Electronics announced its 11th consecutive year as the top-selling microwave brand in Europe, highlighting smart connectivity features such as remote monitoring via the SmartThings platform and voice control through Bixby to enhance kitchen convenience.
Arm Expands into Silicon Products with First Self-Designed AGI CPU
Arm is expanding its compute platform into production silicon for the first time, launching the self-designed Arm AGI CPU for AI data centers and agentic workloads. It targets over 2x performance per rack versus x86 platforms and is backed by lead partner Meta, customers like OpenAI, and a broad OEM/ODM ecosystem.
Nokia and Stelia Collaborate to Integrate Open Networking with AI Platform for Distributed AI
Nokia has partnered with AI platform company Stelia to deeply integrate open-standards-based networking technology with an enterprise AI platform. This move aims to address performance, governance, and security challenges in deploying production-grade AI across distributed environments, ensuring high-throughput, low-latency data flow.