NPU Accelerator

Atlas 300i Duo 96GB — Quad Bundle

4× Atlas 300i Duo cards: 384GB combined LPDDR4X ECC · 1,120 TOPS INT8. Full-scale private inference cluster in a single server chassis.

€0

Technical Specifications

Total memory	384GB LPDDR4X ECC (8× 48GB)
Architecture	8× Ascend 310P NPU (2 per card)
Compute	1,120 TOPS INT8 · 560 TFLOPS FP16
Interface	4× PCIe Gen4 x16
Power	4× 150W TDP
Condition	Brand new and unused
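
The bundle totals above follow from per-card figures. The sketch below reproduces that arithmetic in Python, assuming each card carries two NPUs with 48GB each and that per-card compute is simply the bundle figure divided by four (280 TOPS INT8, 140 TFLOPS FP16). The `Card` class and `bundle_totals` function are illustrative names, not part of any Huawei tooling.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Card:
    """Per-card figures, derived from the bundle totals in the spec table."""
    npus: int = 2              # assumption: 8 NPUs across 4 cards
    mem_per_npu_gb: int = 48   # from "8x 48GB"
    tops_int8: int = 280       # assumption: 1,120 / 4
    tflops_fp16: int = 140     # assumption: 560 / 4
    tdp_w: int = 150           # from "4x 150W TDP"


def bundle_totals(card: Card, count: int) -> dict:
    """Scale per-card figures linearly to a bundle of `count` cards."""
    return {
        "npus": card.npus * count,
        "memory_gb": card.npus * card.mem_per_npu_gb * count,
        "tops_int8": card.tops_int8 * count,
        "tflops_fp16": card.tflops_fp16 * count,
        "tdp_w": card.tdp_w * count,
    }


totals = bundle_totals(Card(), count=4)
print(totals)
```

The combined card TDP comes to 600W before the host system is counted, which is worth factoring into power supply and cooling choices. Note that the 384GB is split into eight separate 48GB pools, one per NPU, rather than one contiguous address space.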

Software ecosystem

The Atlas 300i Duo runs on Huawei's CANN / Ascend stack. Two actively maintained inference paths work today:

llama.cpp (CANN backend) — widely tested for GGUF-format LLM inference
PyTorch via torch-npu — full Python ML stack; supports training and fine-tuning

Not CUDA-compatible. Buyer is responsible for verifying software compatibility before purchase. The CANN / Ascend stack is installed and configured separately from any NVIDIA toolchain, so existing CUDA pipelines will not run unmodified.
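
One quick way to check the PyTorch path on a given host is to probe for the NPU backend and fall back to CPU when it is missing. This is a minimal sketch, assuming the `torch` and `torch_npu` packages and a working CANN driver are already installed; `pick_device` is a hypothetical helper name, and the exact failure mode on a misconfigured host may differ from what is handled here.

```python
def pick_device() -> str:
    """Return 'npu:0' if an Ascend NPU is usable through torch_npu, else 'cpu'."""
    try:
        import torch
        import torch_npu  # noqa: F401  (importing registers the torch.npu backend)
    except Exception:
        # Packages missing, or the CANN runtime libraries failed to load.
        return "cpu"
    try:
        if torch.npu.is_available() and torch.npu.device_count() > 0:
            return "npu:0"
    except Exception:
        # Backend imported but the driver is not responding.
        return "cpu"
    return "cpu"


device = pick_device()
print(f"Using device: {device}")
```

If this prints `cpu` on a machine with the cards installed, the driver, firmware, and CANN toolkit versions are the first things to check.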

What runs on it

DeepSeek V4 was co-developed with Huawei specifically for the Ascend platform and runs natively. Other validated models include:

DeepSeek V4-Flash, V3, and R1
Qwen 2.5 72B
Llama 3 70B / Llama 3.1 70B
Mistral 7B / Mixtral 8×7B
ChatGLM3-6B and other Chinese-language models
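
Whether a given model fits depends mostly on parameter count and weight precision. The sketch below gives a rough, weights-only estimate in Python. It assumes decimal gigabytes, ignores KV cache, activations, and runtime overhead, and treats each NPU as a separate 48GB pool, so real requirements will be higher. The function names are illustrative.

```python
import math

NPU_MEM_GB = 48       # per-NPU memory, from the spec table
BUNDLE_NPUS = 8       # 4 cards x 2 NPUs


def weight_footprint_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight size: 1e9 params x (bits / 8) bytes = GB."""
    return params_billion * bits_per_weight / 8


def min_npus_for_weights(params_billion: float, bits_per_weight: float) -> int:
    """Smallest number of 48GB NPUs whose pooled memory holds the weights."""
    return math.ceil(weight_footprint_gb(params_billion, bits_per_weight) / NPU_MEM_GB)


# Parameter counts taken from the model names in the list above.
for name, params_b in [("Qwen 2.5 72B", 72), ("Llama 3 70B", 70), ("Mistral 7B", 7)]:
    for bits in (16, 8, 4):
        need = min_npus_for_weights(params_b, bits)
        fits = "fits" if need <= BUNDLE_NPUS else "does not fit"
        print(f"{name} @ {bits}-bit: {weight_footprint_gb(params_b, bits):.0f}GB, "
              f"at least {need} NPU(s), {fits}")
```

Any model that needs more than one NPU has to be sharded across devices, which depends on the inference framework supporting multi-device splits on Ascend.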

Compatible Platforms

✓ Taishan 200 2280 V2 (ARM) — recommended
✓ Atlas 800 3000 — supported
✓ Huawei RH2288H V5 (x86) — requires driver recompile