Qualcomm Technologies has unveiled its latest AI inference-focused solutions for data centres in the form of the AI200 and AI250 accelerator cards and corresponding rack systems.
The company targets commercial availability of the AI200 in 2026, followed by the AI250 in 2027.
Both systems are intended for large language model (LLM) and multimodal inference as well as broader AI workloads.
The AI200 system features a rack-level design and utilises individual cards, each supporting 768GB of low-power double data rate synchronous dynamic random-access memory (LPDDR), designed to enable larger models and increased workload flexibility.
Qualcomm reported that the AI250 will incorporate near-memory computing, offering more than ten times the effective memory bandwidth compared to prior generations while lowering power draw.
Both platforms employ direct liquid cooling, PCIe-based scale-up, and Ethernet-based scale-out connectivity.
The racks are specified to operate at up to 160kW. Confidential computing is also included to meet secure workload requirements.
Qualcomm’s software stack is engineered to support integration with standard machine learning (ML) frameworks and inference engines, as well as generative AI (gen AI) and large language model serving methods such as disaggregated inference.
Developers can onboard models from sources such as Hugging Face and deploy them using Qualcomm’s tools, including its Efficient Transformers Library and Inference Suite.
Qualcomm senior vice president and general manager of technology planning, edge solutions and data centre Durga Malladi said: “With Qualcomm AI200 and AI250, we’re redefining what’s possible for rack-scale AI inference.
“These innovative new AI infrastructure solutions empower customers to deploy generative AI at unprecedented TCO, while maintaining the flexibility and security modern data centers demand.”
In parallel with these product launches, Qualcomm Technologies has announced a partnership with HUMAIN to deploy its data centre hardware in Saudi Arabia.
The project aims for an initial deployment of 200 megawatts of Qualcomm AI200 and AI250 rack capacity from 2026 onwards.
According to both firms, this effort is intended to establish edge-to-cloud hybrid AI inference infrastructure ahead of the ninth Future Investment Initiative conference and follows a May 2025 announcement made at the US–Saudi Investment Forum.
The collaboration combines HUMAIN’s regional infrastructure capabilities with Qualcomm’s semiconductor technology stack to deliver inference services for enterprises and government agencies within Saudi Arabia and globally.
HUMAIN CEO Tareq Amin said: “With Qualcomm’s world-class AI infrastructure solutions, we’re shaping the foundation of the Kingdom’s AI future.
“This collaboration unites HUMAIN’s deep regional insight and unique full AI stack capabilities with Qualcomm’s unmatched semiconductor technology and product leadership.”
