Skip to content

Enabled high-performance Automatic Tensor Parallelism (auto TP) for the Qwen2-MoE and DeepSeek-V2 models on multiple GPUs/HPUs #11513

Enabled high-performance Automatic Tensor Parallelism (auto TP) for the Qwen2-MoE and DeepSeek-V2 models on multiple GPUs/HPUs

Enabled high-performance Automatic Tensor Parallelism (auto TP) for the Qwen2-MoE and DeepSeek-V2 models on multiple GPUs/HPUs #11513

Re-run triggered January 24, 2025 18:01
Status Success
Total duration 2m 27s
Artifacts

python.yml

on: pull_request
Matrix: unit-tests
Fit to window
Zoom out
Zoom in