Filter

×
Active Filters Clear All
Keyword: AI model ×
91 Total Reports
1/5 Page
Google Cloud Other 2026-06-16

Apple Rebuilds Siri with Google Gemini, Cuts Legacy Hardware Support

Apple rebuilds Siri using Google Gemini-derived capabilities, introducing five new AFM 3 foundation models (including a 20B-parameter multimodal on-device model). The move is paired with the sharpest hardware support cut in watchOS 27, limiting to S9/S10 chips, signaling a strategic shift from vertical integration to hybrid AI partnerships and accelerated hardware refresh cycles.

AMD Other 2026-06-15

AMD Acquires MEXT: AI-Predicted Flash Nears DRAM Performance to Cut AI Memory TCO

AMD acquires MEXT, an AI-driven memory optimization startup. MEXT's predictive technology makes NAND Flash behave like DRAM, expanding effective memory capacity for AI workloads and lowering TCO. The tech will be integrated across AMD's data center portfolio (EPYC, Instinct) to address memory bottlenecks in large models.

NVIDIA Other 2026-06-15

NVIDIA's Desktop DGX Station with GB300 Shifts Control from Cloud to Local Hardware

ASUS launches ExpertCenter Pro ET900N G3, built on NVIDIA DGX Station GB300 architecture with GB300 Grace Blackwell Ultra chip, 748GB coherent memory, and 20 PFLOPS AI performance. This deskside AI supercomputer enables local LLM fine-tuning, inference, and agentic AI workflows via NVLink-C2C and the full NVIDIA AI software stack including NemoClaw.

Research Other 2026-06-15

Z.ai GLM-5.2 Ships Usable 1M-Token Context, No Benchmarks, Two Thinking Levels

Z.ai releases GLM-5.2 with a claim of usable 1M-token context and two thinking-effort levels. No standard benchmarks are provided, raising concerns about real-world performance. The model targets replacing chunking-based RAG with native long-context reasoning.

Fortinet Other 2026-06-15

DXC and Anthropic Forge Multi-Year Alliance: Claude-Certified Engineers for Mission-Critical AI

DXC Technology and Anthropic announce a multi-year global partnership, making DXC a Global Premier partner in the Claude Partner Network. They will train tens of thousands of Claude-certified engineers to deploy Claude models in mission-critical environments via the DXC OASIS platform, using a 'Customer Zero' internal validation approach.

Cloudflare Other 2026-06-15

Cloudflare Absorbs Ensemble AI: Architectural Model Compression Reshapes Edge Inference Economics

Cloudflare integrates key Ensemble AI talent, bringing NdLinear and NdLinear-LoRA—architectural model compression techniques that preserve multidimensional activations to reduce parameters and compute. This aims to slash inference costs on Workers AI, boost GPU utilization, and accelerate global edge AI deployment.

NVIDIA Other 2026-06-11

NVIDIA Halos OS: A Certified Safety OS That Seizes Control of Autonomous Driving

NVIDIA introduces Halos OS, a full-stack safety system comprising ASIL D certified Halos Core, standardized Halos SDK, AI guardrails in Halos Applications, and cloud-based Safety Evaluation Framework. Built on DRIVE Hyperion, it aims to embed safety into L4 robotaxis from the ground up.

Microsoft Other 2026-06-11

Microsoft & NVIDIA RTX Spark Brings 1 Petaflop AI to Windows, Reshaping Local Inference

At Computex 2026, Microsoft unveiled RTX Spark, an Arm-based AI superchip co-developed with NVIDIA and MediaTek, delivering up to 1 petaflop AI performance and 128GB unified memory for local 120B parameter models. Intel Arc G3 and Qualcomm Snapdragon X2 series also launched, accelerating the Windows AI PC ecosystem.

AMD Other 2026-06-11

AMD, Dell, Cambridge Launch UK Sovereign AI Lab to Challenge NVIDIA's CUDA Dominance with Open ROCm

AMD, Dell, and the University of Cambridge launch the Sovereign AI Innovation Lab (SAIL) in the UK, deploying Zenith supercomputer with 5th Gen EPYC and Instinct MI355X GPUs, plus the Sunrise fusion AI system. The lab promotes open, interoperable AI infrastructure based on AMD ROCm, challenging NVIDIA's CUDA lock-in and offering long-term technology choice for national AI initiatives.

Amazon Other 2026-06-10

Anthropic Claude Fable 5 on AWS: Data Retention Policy Breaches Cloud Security Boundary, Erodes Enterprise Data Sovereignty

AWS and Anthropic launch Claude Fable 5 with long-running async execution, advanced vision, and proactive self-verification. Access requires 30-day data retention and sharing with Anthropic, moving inference data outside AWS security boundary. Harmful prompts fall back to Opus 4.8, introducing complex pricing and governance risks.

Cloudflare Other 2026-06-09

Cloudflare as Customer Zero: Layered Defense Architecture Against Frontier AI Threats

Cloudflare reveals its production defense architecture against frontier AI models, using itself as customer zero. Combines WAF Attack Score, API Shield, Bot Management, Zero Trust, and MCP Server Portal. Core insight: architecture around the vulnerability matters more than patch speed, using ML scoring and positive security models to block attack variants before they hit, and contain lateral movement after a breach.

NVIDIA Other 2026-06-08

NVIDIA and LG Build AI Factory: DSX Platform Locks Physical AI Stack

NVIDIA and LG Group jointly build an AI factory leveraging NVIDIA's DSX platform, integrating Isaac Sim/Lab, Cosmos, GR00T frameworks for robotics, autonomous driving, data centers, and sovereign AI. LG subsidiaries align cooling, robotics, and sensor components exclusively with NVIDIA, creating a fortified ecosystem.

Amazon Other 2026-06-06

AWS Bedrock New Console Embraces OpenAI/Anthropic APIs, Shifting Control to Inference Layer

AWS launches a new Bedrock console powered by the bedrock-mantle endpoint, natively supporting OpenAI and Anthropic API protocols. Users can seamlessly switch between GPT, Claude, and open-weight models. This move standardizes model access, aiming to lock users into AWS's unified inference plane while weakening individual model provider API lock-in.

NVIDIA Other 2026-06-04

NVIDIA Nemotron 3 Ultra: A MoE-Based Control Plane for Cost-Efficient AI Agent Orchestration

NVIDIA launches Nemotron 3 Ultra, a 550B-parameter MoE model (55B active) purpose-built for AI agent orchestration. Featuring Multi-Teacher On-Policy Distillation (MOPD) and a Hybrid Mamba-Transformer architecture, it achieves 5x throughput and 30% cost savings on tasks like SWE-bench, signaling a shift of reasoning control to a layered agent system.

Cisco Other 2026-06-03

Cisco Embeds OT Security Control into Switch ASIC: From Visibility to Enforced Segmentation

At Cisco Live 2026, Cisco launches Cyber Vision updates that embed auto-policy recommendation, simulation, and line-rate enforcement directly into IE3500/IE9300 Industrial Ethernet switches using its own ASICs. Secure remote access is also integrated. This shifts OT security control from appliances to the network fabric, creating a closed loop from visibility to prevention, but locks users into Cisco's full stack.

Microsoft Other 2026-06-02

Microsoft Build 2026: Unifying Agent Stack from Chip to Cloud

At Build 2026, Microsoft unveiled a comprehensive agent-era platform: Project Solara (chip-to-cloud), Microsoft IQ (unified grounding), Rayfin (backend generation), Azure HorizonDB, and GPU-accelerated analytics. The goal is to lock developers into Microsoft's ecosystem.

Cisco Other 2026-06-02

Cisco Shifts AI Network Control from K8s Black Box to Unified Fabric via Isovalent and VXLAN ESG

Cisco integrates Isovalent's eBPF into Nexus One for pod-to-fabric visibility and introduces VXLAN ESG-based AI job segmentation, embedding security and multi-tenancy into the network fabric. This targets the Kubernetes 'black box' bottleneck in AI inference, unifying control and troubleshooting.

Intel Other 2026-06-02

Intel and SambaNova Rackscale AI: CPU Regains Inference Control Plane

At Computex 2026, Intel unveiled rack-scale AI infrastructure combining Xeon 6+ with SambaNova SN-50 RDUs, plus a fully disaggregated inference cloud (prefill on NVIDIA Blackwell, decode on RDUs) by Vector Core Compute. This aims to reposition the CPU as the central orchestrator for inference, challenging GPU dominance.

NVIDIA Other 2026-06-02

NVIDIA Transaction Foundation Models Shift Financial AI Control to Unified GPU Stack

NVIDIA launches a developer example for transaction foundation models, partnering with Revolut, Mastercard, and others to replace siloed ML models with unified transformer-based systems. Leveraging Hopper GPUs, cuDF, and Nemotron, it shifts financial data processing from feature engineering to unified embeddings, effectively moving control to NVIDIA's hardware ecosystem.

Amazon Other 2026-06-02

AWS Hosts OpenAI GPT-5.5 & Codex: Control Shifts from Model to Cloud

AWS launches OpenAI GPT-5.5, GPT-5.4, and Codex on Bedrock via the Responses API. This integrates frontier models into AWS infrastructure for data residency and capacity management, but locks users into Bedrock's ecosystem.