SigLIP impl #634
Conversation
Can't comment on the distributed part of the code as I don't know that part of PyTorch, but the rest (loss details, bias/temp/inits) LGTM.
@lucasb-eyer thanks for taking a look. Yeah, the dist part is where most of the risk is, but it seems to be behaving on local cc12m runs comparing single-GPU to 4x GPU.
FYI: in our code, Basil implemented a small unit test checking the chunked and non-chunked formulations for "almost equalness"; this gave us good reassurance in the implementation (plus looking at the profiler for memory use).
I've tested
Will merge shortly to prevent this getting stale.
* Initial SigLIP impl
* Add logit_bias to custom text clip
* non-dict model output wrong way around wrt logit_bias
* Disable dividing loss by world size, better without
* A bit of cleanup
* Add bidirectional exchange option, more cleanup
* Add reference in siglip docstring
* Remove some comments after further verification
* bidir exchange by default
* Proper bidir default
Re #618