
Add Example for training ColBERT using Pylate in terms of contrastive way #164

Open · wants to merge 9 commits into main

Conversation

sigridjineth (Author)

Changes

  • Add an example for training ColBERT with PyLate using a contrastive objective; the existing script only demonstrates training with the teacher-distillation strategy. (A sketch of the contrastive setup is included below.)
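
For reference, a minimal sketch of the kind of contrastive boilerplate this PR adds, following PyLate's documented API. The dataset choice, checkpoint name, and hyperparameters here are illustrative assumptions, not necessarily the exact values in the PR's script:

from datasets import load_dataset
from sentence_transformers import (
    SentenceTransformerTrainer,
    SentenceTransformerTrainingArguments,
)
from pylate import losses, models, utils

# Initialize a ColBERT model from a base checkpoint (checkpoint name is illustrative)
model = models.ColBERT(model_name_or_path="answerdotai/ModernBERT-base")

# A (query, positive, negative) triplet dataset; dataset name is an assumption
dataset = load_dataset("sentence-transformers/msmarco-bm25", "triplet", split="train")
splits = dataset.train_test_split(test_size=0.01)

# Contrastive loss over ColBERT's late-interaction scores
train_loss = losses.Contrastive(model=model)

args = SentenceTransformerTrainingArguments(
    output_dir="colbert-contrastive",
    num_train_epochs=1,
    per_device_train_batch_size=32,
    bf16=True,
    learning_rate=8e-6,  # illustrative; see the LR discussion later in this thread
)

trainer = SentenceTransformerTrainer(
    model=model,
    args=args,
    train_dataset=splits["train"],
    eval_dataset=splits["test"],
    loss=train_loss,
    data_collator=utils.ColBERTCollator(model.tokenize),
)

trainer.train()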

sigridjineth changed the title from "add train_pylate_contrastive.py" to "Add Example for training ColBERT using Pylate in terms of contrastive way" on Dec 26, 2024
@NohTow (Collaborator) commented Dec 26, 2024

Hello,

I don't have my computer right now, but I also trained models with a contrastive loss during the experiments for the paper, and IIRC the PyLate boilerplates work out of the box if you just do not compile explicitly (we compile internally, as I specified in the other issue).

So I would rather we do that, as I am not sure about the side effects the inductor parameters can have, and we just don't need to compile explicitly. (Actually, now that the parameter to compile models in ST has been fixed, we should offload compilation through that parameter even for the other models.)

@sigridjineth (Author)

@NohTow Yes, I used the PyLate boilerplate here and it worked pretty well out of the box without explicit compilation. Do you have any concerns about not compiling the model during training, as in my script?

@NohTow (Collaborator) commented Dec 29, 2024

Do you have any concerns about not compiling the model during training, as in my script?

Not at all, especially as ModernBERT is compiled by default (even if you do not call model = torch.compile(model)). My comment was rather to remove these two lines:

torch._inductor.config.fallback_random = True
torch._inductor.config.triton.unique_kernel_names = True

As they are not needed if you do not have the model = torch.compile(model) line, and I am not sure about side effects they could silently induce!
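
For context, the before/after looks roughly like this. The torch_compile flag comes from the transformers TrainingArguments that SentenceTransformerTrainingArguments inherits; using it here is one reading of the suggestion above to offload compilation via the ST parameter, not code from the PR:

# Before (in the example script): inductor settings with no explicit compile call
torch._inductor.config.fallback_random = True
torch._inductor.config.triton.unique_kernel_names = True

# After: drop the inductor tweaks entirely. ModernBERT already compiles
# internally; for other backbones, compilation can be delegated to the trainer:
args = SentenceTransformerTrainingArguments(
    output_dir="colbert-contrastive",
    torch_compile=True,  # inherited from transformers TrainingArguments
)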

examples/train_pylate_contrastive.py — six review comments (outdated, resolved)
sigridjineth requested a review from NohTow on December 30, 2024 11:24
@sigridjineth (Author)

@NohTow okay, requested a review again

@NohTow (Collaborator) commented Jan 2, 2025

I took the liberty of modifying the script a bit to bring it closer to the other boilerplates in this repository.
The script works, so it can be merged. I just don't know about the batch size and learning rate (as I did not really explore training ModernColBERT with contrastive learning), but I guess it is fine for a boilerplate not to have optimal hyperparameters.
That said, the LR seems very small compared to what we are used to with ModernBERT; did you run some sweeps?
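
(For illustration, the kind of sweep meant here, reusing the setup from the sketch earlier in this thread; the learning-rate grid is hypothetical, not values anyone in this conversation reported running:)

for lr in (3e-6, 1e-5, 3e-5, 8e-5):
    # Fresh model per run so earlier updates do not leak into the next trial
    model = models.ColBERT(model_name_or_path="answerdotai/ModernBERT-base")
    args = SentenceTransformerTrainingArguments(
        output_dir=f"colbert-contrastive-lr{lr}",
        learning_rate=lr,
        num_train_epochs=1,
        per_device_train_batch_size=32,
        bf16=True,
    )
    trainer = SentenceTransformerTrainer(
        model=model,
        args=args,
        train_dataset=splits["train"],
        eval_dataset=splits["test"],
        loss=losses.Contrastive(model=model),
        data_collator=utils.ColBERTCollator(model.tokenize),
    )
    trainer.train()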
