compiled model state_dict() workaround #639
Merged
PyTorch 2.0 adds an `_orig_mod.` prefix to the keys of `state_dict()` of compiled models. For compatibility, we save the `state_dict()` of the original model, which shares the weights but carries no prefix. I have verified this workaround by resuming model training from a checkpoint created after a few epochs of compiled-model training; the resumed run stays within the expected randomness of the original trajectory. See also the PyTorch forum topic "How to save/load a model with torch.compile".
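A minimal sketch of the workaround, using a toy `torch.nn.Linear` module (hypothetical; any `nn.Module` behaves the same way):

```python
import torch

model = torch.nn.Linear(4, 2)
compiled = torch.compile(model)

# The compiled wrapper prefixes every key with "_orig_mod.":
print(list(compiled.state_dict().keys()))  # ['_orig_mod.weight', '_orig_mod.bias']

# The original module shares the same weight tensors, so saving its
# state_dict() produces a checkpoint with clean, unprefixed keys:
torch.save(model.state_dict(), "checkpoint.pt")
print(list(model.state_dict().keys()))  # ['weight', 'bias']

# Resuming: load into the original module; the compiled wrapper picks
# up the restored weights automatically because they are shared.
model.load_state_dict(torch.load("checkpoint.pt"))
```

Saving the original module's `state_dict()` keeps checkpoints loadable by both compiled and uncompiled code paths, which is what makes resuming training interchangeable.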