Turning LLaMA into LLaVA #1543

yukiarimo · 2025-01-15T08:40:45Z

Hello community! I’ve trained a custom LLaMA 3.1 16B model with custom tokens from the base model. It works great.

Now, I would like to create a LLaVA from it (a mmproj that I can use in kobold.cpp (based on llama.cpp)). Can you please help me out? How can I do that?

The pseudo-code:

llama_model.load(“my-model”)
llama_model.create_vision(config)

# dataset is like:
# <yuki>What is this?<data>{image_tokens_here}</data></yuki>\n<yuna>It is an Apple.</yuna>\n<yuki>What is this?<data>{image_tokens_here}</data></yuki>\n<yuna>It is a banana.</yuna>
# Note: all <> here are custom tokens!

dataset = “JSONL file”
llama_model.vision.train(dataset)
llama_model.save_projector()

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Turning LLaMA into LLaVA #1543

Turning LLaMA into LLaVA #1543

yukiarimo commented Jan 15, 2025

Turning LLaMA into LLaVA #1543

Turning LLaMA into LLaVA #1543

Comments

yukiarimo commented Jan 15, 2025