Qwen #537

xiezipeng-ML · 2024-03-29T08:22:31Z

No description provided.

* update * format * readme * update README * fix * update readme * config -> configs * update sft * update * update * update * black format * format and isort * fix imports * rm usless files * format

ShawnXuan · 2024-09-20T08:40:46Z

Inference

cuda PASS

python projects/Qwen/pipeline.py --model_path=/root/models/Qwen1.5-7B-Chat --mode=huggingface

npu PASS

python projects/Qwen/pipeline.py --model_path=/data0/hf_models/qwen2/Qwen1.5-7B-Chat --mode=huggingface --device=npu

xpu PASS

python projects/Qwen/pipeline.py --model_path=/root/models/Qwen1.5-7B-Chat --mode=huggingface --device=xpu
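The three inference commands above differ only in the model path and the `--device` flag. A minimal sketch of a helper that assembles them is shown below. The function name `build_pipeline_cmd` is hypothetical and not part of the repository. It assumes that leaving out `--device` selects cuda, which is inferred only from the cuda command above omitting the flag. It uses only the flags that appear in this comment.

```python
import shlex


def build_pipeline_cmd(model_path, device="cuda", mode="huggingface"):
    """Build the argument list for projects/Qwen/pipeline.py (hypothetical helper).

    Only flags shown in the PR comment are used: --model_path, --mode, --device.
    """
    cmd = [
        "python",
        "projects/Qwen/pipeline.py",
        f"--model_path={model_path}",
        f"--mode={mode}",
    ]
    # Assumption: the cuda command in the comment omits --device,
    # so the flag is only added for other devices.
    if device != "cuda":
        cmd.append(f"--device={device}")
    return cmd


if __name__ == "__main__":
    # Print a shell-ready command line for each device mentioned in the comment.
    for dev in ("cuda", "npu", "xpu"):
        print(shlex.join(build_pipeline_cmd("/root/models/Qwen1.5-7B-Chat", dev)))
```

The returned list can be passed to `subprocess.run(cmd, check=True)` from the repository root. The model path in the usage loop is the one from the cuda and xpu commands and should be replaced with a local path.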

Training

data preparation

python projects/Qwen/utils/data_prepare.py

cuda PASS

export NUM_GPUS=8
python3 -m oneflow.distributed.launch \
    --nproc_per_node ${NUM_GPUS} \
    --nnodes 1 \
    --node_rank 0 \
    --master_addr 127.0.0.1 \
    --master_port 12345 \
        tools/train_net.py --config-file=projects/Qwen/configs/qwen_sft.py \
            graph.enabled=True \
            train.input_placement_device="cuda" \
            train.dist.device_type="cuda" \
            train.dist.pipeline_parallel_size=${NUM_GPUS}

A100-PCIE-40GB x 4 OOM

xpu OOM

export NUM_GPUS=1
python3 -m oneflow.distributed.launch \
    --nproc_per_node ${NUM_GPUS} \
    --nnodes 1 \
    --node_rank 0 \
    --master_addr 127.0.0.1 \
    --master_port 12345 \
        tools/train_net.py --config-file=projects/Qwen/configs/qwen_sft.py \
            graph.enabled=False \
            train.input_placement_device="xpu" \
            train.dist.device_type="xpu" \
            train.dist.pipeline_parallel_size=${NUM_GPUS}

npu: not tested; probably does not work
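The cuda and xpu training commands above share the same launcher arguments and differ only in device count, device type, and `graph.enabled`. A minimal sketch that builds either command is shown below. The helper name `build_train_cmd` and its parameters are hypothetical; the launcher options and config overrides are copied from the commands in this comment and nothing else is assumed about `tools/train_net.py`.

```python
import shlex


def build_train_cmd(num_devices, device, graph_enabled,
                    config_file="projects/Qwen/configs/qwen_sft.py",
                    master_addr="127.0.0.1", master_port=12345):
    """Build the oneflow.distributed.launch argument list (hypothetical helper)."""
    return [
        "python3", "-m", "oneflow.distributed.launch",
        "--nproc_per_node", str(num_devices),
        "--nnodes", "1",
        "--node_rank", "0",
        "--master_addr", master_addr,
        "--master_port", str(master_port),
        "tools/train_net.py", f"--config-file={config_file}",
        # Config overrides, as in the comment. The shell strips the double
        # quotes around "cuda"/"xpu", so the bare value is passed here.
        f"graph.enabled={graph_enabled}",
        f"train.input_placement_device={device}",
        f"train.dist.device_type={device}",
        f"train.dist.pipeline_parallel_size={num_devices}",
    ]


if __name__ == "__main__":
    # The two configurations reported in the comment.
    print(shlex.join(build_train_cmd(8, "cuda", True)))
    print(shlex.join(build_train_cmd(1, "xpu", False)))
```

As in the original commands, `pipeline_parallel_size` is set to the number of devices. This sketch only reproduces the invocation and does not address the OOM results reported above.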

xiezipeng-ML added 5 commits March 21, 2024 12:11

add qwen2

aac20df

update

820cad6

fix

5a51027

refine

400ea76

refine

e42b8d0

ShawnXuan force-pushed the qwen branch from ef8ce79 to e42b8d0 Compare September 18, 2024 08:09

xiezipeng-ML and others added 2 commits September 18, 2024 16:20

Merge branch 'main' into qwen

4c803dc

Qwen devices (#553)

7b280a3

ShawnXuan requested review from fpzh2011, 0x404, Flowingsun007 and oneflow-ci-bot September 20, 2024 02:56

fpzh2011 approved these changes Sep 20, 2024

ShawnXuan merged commit 169be08 into main Sep 20, 2024
10 checks passed

ShawnXuan deleted the qwen branch September 20, 2024 13:25
