Update: ONNX conversion doc #2686
Conversation
Actually, @b8zhong, you know what would be helpful? A snippet of code that compares the encoded output of the HuggingFace model vs. the ONNX model: take the two vectors and compute the L1 norm or something, to confirm that the conversion worked. Would you mind adding this?
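A minimal sketch of the check described above. The reusable part is just the L1 comparison; the model name, ONNX file path, and query in the commented wiring are illustrative assumptions, not taken from the PR.

```python
# Sketch: compare HuggingFace vs. ONNX encoder outputs via L1 norm.
# Assumption: model name, ONNX path, and query below are illustrative only.
import numpy as np

def l1_diff(a, b):
    """Sum of absolute element-wise differences between two output tensors."""
    a = np.asarray(a, dtype=np.float64)
    b = np.asarray(b, dtype=np.float64)
    return float(np.abs(a - b).sum())

# Wiring it up (not run here; requires transformers + onnxruntime + torch):
#
#   from transformers import AutoModelForMaskedLM, AutoTokenizer
#   import onnxruntime as ort, torch
#
#   tokenizer = AutoTokenizer.from_pretrained("naver/splade-cocondenser-ensembledistil")
#   hf_model = AutoModelForMaskedLM.from_pretrained(
#       "naver/splade-cocondenser-ensembledistil").eval()
#   inputs = tokenizer("what is onnx?", return_tensors="pt")
#   with torch.no_grad():
#       hf_out = hf_model(**inputs).logits.numpy()
#   sess = ort.InferenceSession("splade-pp-ed.onnx")  # hypothetical path
#   onnx_out = sess.run(None, {k: v.numpy() for k, v in inputs.items()})[0]
#   print(l1_diff(hf_out, onnx_out))  # should be near zero for a good conversion

print(l1_diff([[1.0, 2.0]], [[1.0, 2.5]]))  # 0.5
```

A small L1 difference (relative to the magnitude of the outputs) indicates the exported graph reproduces the original model up to floating-point noise.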
@lintool any idea why such a large L1 diff?
Maybe
Ty for the hint, I don't think I would have ever thought of that 😛 There we go. Acceptable, I think?
Yup, that's good. The page as written is specific to SPLADE++; can you make it generic, using SPLADE++ as an example?
Thoughts?
minor nits, I'll fix and then I'll merge.
Update onnx + onnxruntime
Don't think `quantize_onnx_model.py` supports a model name flag? Did not work w/ SPLADE++ Ensemble Distil earlier. https://github.com/castorini/anserini/blob/master/src/main/python/onnx/quantize_onnx_model.py#L19-L24
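One way the missing flag could look; this is a hypothetical sketch of an argparse interface for `quantize_onnx_model.py` (flag names and the output-naming convention are my assumptions, not the script's actual API):

```python
# Hypothetical CLI sketch for a quantization script that accepts a model path.
# Assumption: flag names and "-quantized.onnx" suffix are illustrative only.
import argparse
from pathlib import Path

def build_parser():
    parser = argparse.ArgumentParser(
        description="Dynamically quantize an ONNX model.")
    parser.add_argument("--model-path", required=True,
                        help="Path to the input .onnx file")
    parser.add_argument("--output-path", default=None,
                        help="Where to write the quantized model "
                             "(default: <model>-quantized.onnx)")
    return parser

args = build_parser().parse_args(["--model-path", "splade-pp-ed.onnx"])
# Derive a default output name next to the input model.
output = args.output_path or str(Path(args.model_path).with_suffix("")) + "-quantized.onnx"
print(output)  # splade-pp-ed-quantized.onnx
```

The quantization call itself (e.g. ONNX Runtime's `quantize_dynamic`) would then read `args.model_path` and write to `output`, rather than a hard-coded filename.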