`logit_bias` init for non siglip pretrained models #717

rsomani95 · 2023-10-27T17:04:45Z

Fixes #712

There's one awkward bit about this currently:

    import open_clip, torch

    m = open_clip.create_model("convnext_base_w", "laion_aesthetic_s13b_b82k", init_logit_bias=100)
    m.logit_bias == torch.tensor(0.)   # Maybe this is ok, but this feels awkward?

rwightman · 2023-10-28T22:39:17Z

@rsomani95 I think it should be

    if 'logit_bias' not in state_dict and model.logit_bias is not None:
        state_dict["logit_bias"] = torch.tensor(0.)

EDIT: actually, maybe `state_dict["logit_bias"] = torch.zeros_like(state_dict["logit_scale"])`, in case we allow loading a state dict onto a specific device or in a different precision in the future.
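For readers following along, here is a minimal, self-contained sketch of the suggestion above. `TinyModel` and `patch_missing_logit_bias` are hypothetical names used only for illustration; they are not part of open_clip, and the actual change in this PR may be structured differently. The sketch assumes the checkpoint contains a `logit_scale` entry to copy dtype and device from.

```python
import torch
from torch import nn


class TinyModel(nn.Module):
    """Hypothetical stand-in for a CLIP-style model with an optional logit_bias."""

    def __init__(self, init_logit_bias=None):
        super().__init__()
        self.logit_scale = nn.Parameter(torch.ones([]) * 2.6593)
        if init_logit_bias is not None:
            self.logit_bias = nn.Parameter(torch.ones([]) * init_logit_bias)
        else:
            self.logit_bias = None


def patch_missing_logit_bias(model, state_dict):
    """Add a zero logit_bias when the checkpoint lacks one but the model expects it."""
    if "logit_bias" not in state_dict and model.logit_bias is not None:
        # zeros_like inherits dtype and device from the checkpoint's logit_scale.
        state_dict["logit_bias"] = torch.zeros_like(state_dict["logit_scale"])
    return state_dict


# A checkpoint saved without logit_bias, stored here in half precision.
checkpoint = {"logit_scale": torch.tensor(2.6593, dtype=torch.float16)}

model = TinyModel(init_logit_bias=-10.0)
state_dict = patch_missing_logit_bias(model, dict(checkpoint))
model.load_state_dict(state_dict)  # strict load succeeds: both keys are present
```

Note that after the load, `model.logit_bias` is zero regardless of the `init_logit_bias` value passed at construction, which is the awkward behavior mentioned in the PR description.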

init logit_bias for non siglip pretrained models

8d1a852

rsomani95 mentioned this pull request Oct 27, 2023

Error When Loading Non SigLIP Pre-Trained Checkpoint To Train With Sigmoid Loss #712

Closed

rwightman merged commit 5e6114e into mlfoundations:main Oct 31, 2023
1 of 5 checks passed
