ONNX conversion issues #10
Hello, I faced these errors while converting to ONNX:

TracerWarning: Converting a tensor to a Python boolean might cause the trace to be incorrect. We can't record the data flow of Python values, so this value will be treated as a constant in the future. This means that the trace might not generalize to other inputs!
assert (discriminant >= 0).all()

Warning: Constant folding - Only steps=1 can be constant folded for opset >= 10 onnx::Slice op. Constant folding not applied.

WARNING: The shape inference of prim::Constant type is missing, so it may result in wrong shape inference for the exported graph. Please consider adding it in symbolic function.

What may be wrong? Thanks

Comments
They're just normal warnings; it's OK to ignore them.
Thank you for your answer. So they don't affect inference quality?
Yes, none of these warnings affect quality. The code flagged by the TracerWarning only checks whether variables meet requirements; it is not part of the inference path, so it can be ignored. Constant folding is a method ONNX uses to optimize the Slice operation; that optimization is not applicable at the location where the warning occurred, and skipping constant folding there matches our expectations. As for the last warning, I believe it is some kind of PyTorch bug that prevents ONNX from recognizing the type, but it has no effect as long as your model infers correctly.
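For illustration, here is a minimal sketch (a toy module, not code from this repository) of the pattern behind the first warning:

```python
import torch

class Check(torch.nn.Module):
    def forward(self, x):
        discriminant = x * x
        # `assert` forces the tensor (discriminant >= 0).all() into a Python
        # bool, so the tracer emits the TracerWarning; the check itself is
        # dropped from the exported graph and does not affect inference.
        assert (discriminant >= 0).all()
        return torch.sqrt(discriminant)

torch.onnx.export(Check(), torch.rand(4), "check.onnx", opset_version=11)
```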
Hello! A model converted to ONNX with your scripts has very poor performance in NVIDIA Triton Inference Server. Have you faced such a problem? Thanks.
Very sorry! When writing the ONNX Runtime code, I hard-coded CPU inference; please wait for a fix.
Line 157 in c114751
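For anyone hitting this before the fix lands, a hedged sketch of the usual remedy (standard onnxruntime API; the model path is a placeholder):

```python
import onnxruntime

# Prefer the GPU provider and fall back to CPU if CUDA is unavailable.
session = onnxruntime.InferenceSession(
    "model.onnx",
    providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
)
print(session.get_providers())  # shows which providers are actually in use
```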
Thanks, I'll give it a try. I use the converted model in NVIDIA Triton Inference Server. I'll test pure PyTorch inference, pure ONNX inference, and Triton inference with both the PyTorch model and the ONNX model.
ok
Please wait a while for the svc branch.
I have done 50 test inferences for each model with the same input text. For some reason ONNX Runtime in Triton makes execution slower; I'm trying to find the bottleneck.
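Roughly the kind of timing loop described above (a sketch; run_pytorch and run_onnx are hypothetical wrappers around the two inference calls):

```python
import time

def bench(run, n=50):
    # Average wall-clock latency over n runs with the same input.
    start = time.perf_counter()
    for _ in range(n):
        run()
    return (time.perf_counter() - start) / n

# avg_pt  = bench(lambda: run_pytorch(text))  # hypothetical PyTorch wrapper
# avg_ort = bench(lambda: run_onnx(text))     # hypothetical ONNX wrapper
```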
The server converts the .pth checkpoint to ONNX before loading, rather than using a pre-converted ONNX file directly.
The ONNX model performs some initialization during the first inference after the session is loaded, and this factor should also be considered.
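A sketch of the warm-up this implies, assuming a generic onnxruntime session (the dummy feed assumes float32 inputs; adjust for the real model):

```python
import numpy as np
import onnxruntime

session = onnxruntime.InferenceSession("model.onnx",
                                       providers=["CPUExecutionProvider"])
# Build a dummy feed from the model's declared inputs, using 1 for any
# dynamic (non-integer) dimension.
feed = {
    i.name: np.zeros([d if isinstance(d, int) else 1 for d in i.shape],
                     dtype=np.float32)
    for i in session.get_inputs()
}
session.run(None, feed)  # warm-up: the first run pays one-time initialization
# Only time session.run calls made after this point.
```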