
Exclude Conv op from quantization in ort-quantize.py #550

Merged (1 commit) on Jan 25, 2025

@robertknight (Owner) commented Jan 25, 2025

By default, `quantize_dynamic` replaces `Conv` with `ConvInteger`. RTen doesn't support this operator yet, and ONNX Runtime doesn't support it with the particular combination of data types that `quantize_dynamic` generates (u8 input, i8 weights); see the issues mentioned in the comments. Hence, omit `Conv` from the quantized op types for now.
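For illustration, here is a minimal sketch of how `Conv` could be left out of the op types passed to `quantize_dynamic`. This is not necessarily how `ort-quantize.py` does it: the helper names `op_types_without_conv` and `quantize_without_conv` are hypothetical, and taking the default op types from `IntegerOpsRegistry` is an assumption about ONNX Runtime internals that may differ between versions.

```python
def op_types_without_conv(op_types):
    """Return the given op types with `Conv` removed, preserving order."""
    return [op for op in op_types if op != "Conv"]


def quantize_without_conv(input_path, output_path):
    """Dynamically quantize a model, leaving `Conv` nodes unquantized.

    Hypothetical helper, not taken from ort-quantize.py.
    """
    # Imported lazily so the filter above can be used without onnxruntime.
    from onnxruntime.quantization import quantize_dynamic

    # Assumption: when no op types are given, quantize_dynamic falls back to
    # the keys of this registry, so it is used here as the starting list.
    from onnxruntime.quantization.registry import IntegerOpsRegistry

    op_types = op_types_without_conv(IntegerOpsRegistry.keys())
    quantize_dynamic(input_path, output_path, op_types_to_quantize=op_types)
```

With this approach `Conv` nodes stay in floating point, while the remaining op types (for example `MatMul`) are still quantized.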

By default `quantize_dynamic` will replace `Conv` with `ConvInteger`.  RTen
doesn't support this operator yet and ONNX Runtime doesn't support the operator
with the particular combination of data types that `quantize_dynamic` generates
(u8 input, i8 weights). Hence omit this from the quantized op types for now.
@robertknight force-pushed the ort-quantize-exclude-conv branch from a5fff2a to 8e888d6 on January 25, 2025 11:27
@robertknight merged commit 8fd710a into main on Jan 25, 2025
2 checks passed
@robertknight deleted the ort-quantize-exclude-conv branch on January 25, 2025 11:32