Skip to content

Commit

Permalink
hf config
Browse files Browse the repository at this point in the history
  • Loading branch information
ahmeda14960 committed Oct 2, 2024
1 parent 436f985 commit 594c0c2
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/levanter/compat/torch_serialization.py
Original file line number Diff line number Diff line change
Expand Up @@ -168,7 +168,7 @@ def default_eqx_module_from_state_dict(mod: Mod, state_dict: StateDict, prefix:
# Hack to get around the fact we're using llama code for
# olmo model and something weird w layernorm
if prefix is not None:
if 'layernorm' in prefix.lower():
if 'layernorm' in prefix.lower() or 'model.norm.weight' in prefix.lower():
continue
new = jax_tree_from_state_dict(value, state_dict, apply_prefix(prefix, key))
# Do not try to update parameters that are never defined
Expand Down

0 comments on commit 594c0c2

Please sign in to comment.