Instructions to train SmolLM-Instruct

We build the SmolLM-Instruct (v0.2) models (135M, 360M and 1.7B) by running supervised fine-tuning (SFT) on a mix of these datasets:

Setup

Follow the installation instructions in the alignment-handbook README: https://github.com/huggingface/alignment-handbook/tree/main?tab=readme-ov-file#installation-instructions

Training

We train the models on 8 GPUs using the following command:

ACCELERATE_LOG_LEVEL=info accelerate launch --config_file recipes/accelerate_configs/deepspeed_zero3.yaml scripts/run_sft.py recipes/smollm/sft/config.yaml
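
If fewer than 8 GPUs are available, one possible adjustment is to pass `--num_processes` to `accelerate launch` and raise the gradient accumulation steps so the global batch size stays roughly the same. The sketch below is a dry run that only prints the adjusted command. `NUM_GPUS` and `REF_GRAD_ACCUM` are hypothetical placeholders (the real accumulation value should be read from `recipes/smollm/sft/config.yaml`), and it assumes the training script accepts `--gradient_accumulation_steps=...` as a command-line override of the YAML config.

```shell
# Dry run: print the adjusted launch command instead of executing it.
NUM_GPUS=4          # GPUs available locally (hypothetical value)
REF_GPUS=8          # GPU count used by the command above
REF_GRAD_ACCUM=1    # placeholder; read the real value from recipes/smollm/sft/config.yaml

# Scale accumulation steps inversely with the GPU count
# so that the global batch size stays constant.
GRAD_ACCUM=$(( REF_GRAD_ACCUM * REF_GPUS / NUM_GPUS ))

CMD="ACCELERATE_LOG_LEVEL=info accelerate launch --config_file recipes/accelerate_configs/deepspeed_zero3.yaml --num_processes=${NUM_GPUS} scripts/run_sft.py recipes/smollm/sft/config.yaml --gradient_accumulation_steps=${GRAD_ACCUM}"
echo "$CMD"
```

Once the printed command looks right, it can be copied and run from the root of the alignment-handbook checkout.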