You can train Anole on your custom data. Note that the current training code has not been fully verified; we will continue to update it.
- Modify modeling_chameleon.py
# Modify line 1628 and line 1629 of modeling_chameleon.py
# Original Code:
image_tokens = self.model.vocabulary_mapping.image_tokens
logits[:, :, image_tokens] = torch.finfo(logits.dtype).min
# Modified Code:
# image_tokens = self.model.vocabulary_mapping.image_tokens
# logits[:, :, image_tokens] = torch.finfo(logits.dtype).min
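# Note: these two lines mask image-token logits to negative infinity so the model
# never generates image tokens; commenting them out allows image tokens to be
# produced during fine-tuning and inference.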
- Prepare your raw finetuning data like this
Note: the current code only supports fine-tuning on a single text segment and a single image per sample; support for multiple interleaved text segments and images is coming soon.
# Example samples
{"text": "Give me an image of Orange juice in a mason glass with an orange cut in half and wooden orange squeezer.", "image": "/path/to/image/1.png"}
{"text": "Give me an image of Chibi_Yukata_Disney_Princesses_by_vulpixfairy-picture", "image": "/path/to/image/2.png"}
- Set the constants in constants_training.py
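A rough sketch of the kind of values to set: ANOLE_PATH_TORCH, ANOLE_PATH_HF, and ANOLE_PATH_HF_TRAINED are referenced in the steps below, while the dataset constants and all path values here are illustrative assumptions and may differ from the actual constants_training.py.
# Checkpoint paths referenced in the steps below (values are placeholders).
ANOLE_PATH_TORCH = "/path/to/anole/torch/checkpoint"     # original PyTorch checkpoint
ANOLE_PATH_HF = "/path/to/anole/hf/converted"            # converted Hugging Face checkpoint
ANOLE_PATH_HF_TRAINED = "/path/to/anole/hf/finetuned"    # fine-tuned Hugging Face checkpoint

# Illustrative assumptions; names may differ in the actual file.
DATASET_RAW_PATH = "/path/to/raw_finetune_data.jsonl"    # raw finetuning data (JSONL)
DATASET_TOKENIZED_PATH = "/path/to/tokenized_data"       # output of prepare_data.sh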
- Convert raw finetuning data to tokenized data
bash prepare_data.sh
- Convert the PyTorch model to a Hugging Face model
cd ../transformers/src/transformers/models/chameleon/
python convert_chameleon_weights_to_hf.py --model_size 7B --input_dir ANOLE_PATH_TORCH --output_dir ANOLE_PATH_HF
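To sanity-check the conversion, the output directory should load with the Chameleon classes from the bundled transformers fork. A minimal sketch, assuming the fork exposes the same ChameleonForConditionalGeneration API as upstream transformers; replace ANOLE_PATH_HF with the actual output path.
import torch
from transformers import ChameleonForConditionalGeneration

# ANOLE_PATH_HF is the --output_dir used in the conversion command above.
model = ChameleonForConditionalGeneration.from_pretrained(
    "ANOLE_PATH_HF", torch_dtype=torch.bfloat16
)
print(model.config.vocab_size)  # quick check that the config and weights loaded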
- Train the model using the Hugging Face Trainer
bash train.sh
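train.sh drives a Hugging Face Trainer run. The sketch below only illustrates the general shape of such a fine-tuning loop under that assumption; the dataset loading, hyperparameters, and paths are illustrative and not taken from the repository's actual script.
import torch
from datasets import load_from_disk
from transformers import ChameleonForConditionalGeneration, Trainer, TrainingArguments

# Assumption: the tokenized dataset from prepare_data.sh yields input_ids/labels.
tokenized_dataset = load_from_disk("/path/to/tokenized_data")

model = ChameleonForConditionalGeneration.from_pretrained(
    "ANOLE_PATH_HF", torch_dtype=torch.bfloat16
)

args = TrainingArguments(
    output_dir="ANOLE_PATH_HF_TRAINED",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,
    learning_rate=1e-5,
    num_train_epochs=1,
    bf16=True,
    save_strategy="epoch",
    logging_steps=10,
)

trainer = Trainer(model=model, args=args, train_dataset=tokenized_dataset)
trainer.train()
trainer.save_model("ANOLE_PATH_HF_TRAINED")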
- Convert the Hugging Face model back to the PyTorch model for inference
# specify `ANOLE_PATH_HF_TRAINED` and `ANOLE_PATH_TORCH` in constants_training.py
python bin_to_pth.py