Implement baseline int8 kernel for WASM #556

robertknight · 2025-01-26T11:30:38Z

This initial implementation performs about the same as f32 or slightly slower.

Faster kernels should be possible via one of:

Upconverting to i16 during packing and using i32x4_dot_i16x8
Using i32x4.dot_i8x16_i7x16_add_s if the relaxed-simd target feature is
enabled

This is needed for downstream libraries wanting to use operations on these types that are not available in rten-simd's traits.

This initial implementation performs about the same as f32 or slightly slower. Faster kernels should be possible via one of: - Upconverting to i16 during packing and using `i32x4_dot_i16x8` - Using `i32x4.dot_i8x16_i7x16_add_s` if the `relaxed-simd` target feature is enabled

robertknight added 3 commits January 26, 2025 10:17

Suppress warning about unnecessary mut on non-macOS platforms

0bc8593

Expose v128 value inside SIMD wrapper types

832a6a6

This is needed for downstream libraries wanting to use operations on these types that are not available in rten-simd's traits.

robertknight marked this pull request as ready for review January 26, 2025 11:36

robertknight merged commit e2704a9 into main Jan 26, 2025
2 checks passed

robertknight deleted the wasm-int8-kernel branch January 26, 2025 11:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement baseline int8 kernel for WASM #556

Implement baseline int8 kernel for WASM #556

robertknight commented Jan 26, 2025

Implement baseline int8 kernel for WASM #556

Implement baseline int8 kernel for WASM #556

Conversation

robertknight commented Jan 26, 2025