Filter

×
Active Filters Clear All
Keyword: LLM ×
105 Total Reports
4/6 Page
Microsoft Other High Signal 2026-04-06

Microsoft Partners with Domestic Operators to Build Sovereign AI Infrastructure in Japan

Microsoft announced a $10B investment in Japan over four years, with a key pillar being a collaboration with Sakura Internet and SoftBank. This partnership will offer GPU-based AI compute services through Azure, managed by domestic providers to ensure data residency within Japan. This addresses the demand for sovereign AI infrastructure for sensitive workloads.

Anthropic Other High Signal 2026-04-06

Anthropic Partners with Mozilla, AI Models Independently Discover High-Severity Firefox Vulnerabilities

Anthropic's Claude Opus 4.6 model discovered 22 vulnerabilities in Mozilla Firefox over two weeks, with 14 classified as high-severity. This demonstrates AI's ability to independently identify unknown vulnerabilities in complex software and its nascent capability to generate exploits, signaling a new phase in AI-powered cybersecurity offense and defense.

Google Other High Signal 2026-04-03

Google Launches Gemma 4 Open Models, Targeting Edge Inference and AI Agent Architecture

Google introduces the Gemma 4 open model family, with four sizes from 2B to 31B parameters, emphasizing breakthrough intelligence-per-parameter and native support for agentic workflows, multimodality, and long context. The small models are engineered for edge devices, aiming to bring frontier reasoning to mobile and IoT scenarios.

Google Other Medium Signal 2026-04-03

Google Launches Gemma 4 Open Model Family

Google introduces Gemma 4 open model family with four size variants, optimized for edge and mobile devices. The series supports multimodal processing, long context windows and 140+ languages under Apache 2.0 license.

AMD Other High Signal 2026-04-02

AMD Announces Breakthrough MLPerf Inference 6.0 Results, Showcasing Multinode Scaling and Multimodal Capabilities

AMD's MLPerf Inference 6.0 submission, powered by Instinct MI355X GPUs, surpassed 1 million tokens per second for the first time on models like Llama 2 70B and GPT-OSS-120B. The results highlight efficient multinode scaling, rapid enablement of new workloads (e.g., text-to-video model Wan-2.2-t2v), and reproducible performance across a broad partner ecosystem.

Intel Other Medium Signal 2026-04-01

Intel Demonstrates AI Performance with Xeon 6 and Arc Pro GPUs in MLPerf Inference

Intel showcased the performance of its Xeon 6 CPUs and Arc Pro B-Series GPUs in the MLPerf Inference v6.0 benchmarks, particularly in handling large language models (LLMs). The results indicate that a system with four Arc Pro B70 GPUs can process 120B parameter models, delivering up to 1.8x higher inference performance in multi-GPU setups.

Cisco Other Medium Signal 2026-03-31

Cisco Open Sources DefenseClaw for AI Agent Security Governance

Cisco launched open-source DefenseClaw, providing three-layer security architecture for AI agents like OpenClaw: supply chain scanning, runtime inspection, and system boundary control. The solution integrates NVIDIA's OpenShell sandbox for end-to-end automated governance.

Cisco Other Medium Signal 2026-03-28

Cisco DevNet Integrates Managed LLM Access to Lower AI Security Practice Barriers

Cisco introduces managed LLM access on its DevNet Learning Labs platform, offering a single OpenAI-compatible API endpoint supporting backends like Azure OpenAI and AWS Bedrock. This keyless, pre-configured environment enables direct LLM invocation for practicing AI security workflows including A2A protocol security and AI defense.

Cisco Other Medium Signal 2026-03-25

Cisco Validates Rapid Fine-tuning on Private AI Infrastructure with NVIDIA

Cisco IT partnered with NVIDIA to achieve 2-5 hour end-to-end embedding model fine-tuning using Nemotron RAG recipe on a single H200 GPU. The solution uses 120B parameter local LLM for synthetic data generation without manual labeling, improving NDCG@1 by 7.3 absolute points. Validates rapid domain-specific retrieval optimization on private AI infrastructure.

Cisco Other High Signal 2026-03-25

Cisco Unifies AI Agent Security Policy Enforcement via LangChain Middleware

Cisco integrates AI Defense Runtime Protection with LangChain as middleware, providing monitoring and enforcement modes for unified AI agent security policy execution. The solution generates runtime contracts with decisions, classifications, and request IDs, supporting multiple integration paths. Cisco plans to contribute this integration to LangChain upstream and expand to other AI environments.

NVIDIA Other High Signal 2026-03-24

NVIDIA Donates GPU Dynamic Resource Allocation Driver to Kubernetes Community

NVIDIA donated its GPU Dynamic Resource Allocation (DRA) driver to the CNCF, making it an upstream Kubernetes project. This move aims to shift the core control point of GPU orchestration from proprietary vendor layers to the open-source community, and drive standardization in collaboration with major cloud providers.

Amazon Other 2026-03-24

Amazon Deploys 2,500 Robots in 108k㎡ Nagareyama FC, Expanding AI-Driven Automated Fulfillment Network

Amazon announced a large-scale fulfillment center in Nagareyama, Japan, to open in March 2026, featuring 10.8万㎡ floor space and deploying ~2,500 ‘Amazon Robotics’ drive units with 26,000 specialized pods. The robotics automation system increases storage capacity by ~40% versus static shelving, handling over 500k items daily. This represents continued scaling of AI-integrated logistics infrastructure.

Cisco Other Medium Signal 2026-03-23

Cisco Launches LLM Security Leaderboard, Standardizing Model Security Evaluation

Cisco introduces an LLM security leaderboard providing objective rankings based on single and multi-round attack testing. The tool uses a standardized evaluation framework mapping attack data to Cisco's AI security taxonomy, with public rankings and methodology. It aims to provide security risk assessment for enterprise AI deployment, filling a gap in model security benchmarking.

Cisco Other High Signal 2026-03-23

Cisco Extends Zero Trust Security to AI Agent Ecosystem

At RSA 2026, Cisco introduced security innovations for AI agents, extending Zero Trust Access with agent discovery in Identity Intelligence, agentic IAM in Duo, and MCP enforcement in Secure Access SSE. It launched AI Defense: Explorer Edition for self-serve testing and DefenseClaw open source framework to automate security deployment.

Check Point Other 2026-03-23

Check Point AI Factory Blueprint: Security Control Shifts to NVIDIA DPU and LLM Layer

Check Point unveils AI Factory Security Blueprint, tightly integrating its firewall with NVIDIA BlueField DPU via DOCA. The architecture enforces security at four layers: LLM, AI infrastructure, perimeter, and workload. The new AI Factory Firewall delivers hardware-accelerated threat prevention without consuming CPU/GPU cycles, aiming to embed security into the AI fabric.

Check Point Other High Signal 2026-03-23

Check Point Releases AI Factory Security Blueprint Covering GPU to LLM Protection

Check Point introduces an AI Factory security architecture blueprint, establishing full-stack protection from GPU hardware layer to LLM prompt layer through a zero-trust framework.

AMD Other Medium Signal 2026-03-19

AMD and Upstage Collaborate on Sovereign AI Infrastructure with MI325X

AMD expands partnership with Upstage to deliver sovereign AI infrastructure using Instinct MI325X accelerators. The solution integrates Solar LLM with optimized ROCm software stack to enhance AI training and inference efficiency, addressing Korea's data sovereignty requirements.

Cisco Other High Signal 2026-03-18

Cisco Advances WLAN Autonomy with Proprietary LLM and AgenticOps

Cisco ranked as leader in ABI Research's WLAN competitiveness assessment, leveraging its proprietary LLM trained on CCIE expert data and AgenticOps capabilities like AI-RRM, config recommendations, and packet analysis to shift from analytics to autonomous operations.

AMD Other High Signal 2026-03-18

AMD and NAVER Cloud Collaborate on Sovereign AI Infrastructure in Korea

AMD and NAVER Cloud announced a strategic collaboration to accelerate sovereign AI infrastructure in Korea. NAVER Cloud will expand deployment of AMD EPYC "Venice" CPUs and gain early access to next-gen Instinct MI455X GPUs, with joint optimization of AI services and software stacks on AMD platforms.

NVIDIA Other High Signal 2026-03-14

NVIDIA Releases Cosmos World Model Suite, Enhancing Synthetic Data and Reasoning for Physical AI

NVIDIA has released significant updates to its Cosmos World Foundation Models (WFM) suite, including Transfer 2.5, Predict 2.5, and Reason 2. These models are designed to accelerate the generation of high-fidelity, physics-aware synthetic data and support downstream fine-tuning and reasoning for physical AI systems like robotics and autonomous vehicles, addressing the bottleneck of real-world data scarcity.