Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement baseline int8 kernel for WASM #556

Merged
merged 3 commits into from
Jan 26, 2025
Merged

Conversation

robertknight
Copy link
Owner

This initial implementation performs about the same as f32 or slightly slower.

Faster kernels should be possible via one of:

  • Upconverting to i16 during packing and using i32x4_dot_i16x8
  • Using i32x4.dot_i8x16_i7x16_add_s if the relaxed-simd target feature is
    enabled

This is needed for downstream libraries wanting to use operations on these types
that are not available in rten-simd's traits.
This initial implementation performs about the same as f32 or slightly slower.

Faster kernels should be possible via one of:

 - Upconverting to i16 during packing and using `i32x4_dot_i16x8`
 - Using `i32x4.dot_i8x16_i7x16_add_s` if the `relaxed-simd` target feature is
   enabled
@robertknight robertknight marked this pull request as ready for review January 26, 2025 11:36
@robertknight robertknight merged commit e2704a9 into main Jan 26, 2025
2 checks passed
@robertknight robertknight deleted the wasm-int8-kernel branch January 26, 2025 11:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant