
Implement CuBlas/MKL int8, float mixed precision gemm_batch #506

Open
AidanBeltonS opened this issue May 31, 2024 · 0 comments
Labels
feature A request to add a new feature

AidanBeltonS (Contributor) commented May 31, 2024

Summary

The CuBlas backend currently does not support the gemm_batch precision combination (int8, int8, float, float).
This should be implemented, since CuBlas provides an equivalent batched GEMM operation that can be used.

The MKLCPU/GPU backends also have this combination marked as unsupported, due to the same precision issues.
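
For reference, the cuBLAS operation that supports this precision combination is `cublasGemmStridedBatchedEx`, which accepts int8 inputs with a float output when the compute type is `CUBLAS_COMPUTE_32F`. A minimal sketch follows; the function name, dimensions, and strides are illustrative only, the buffers are assumed to be valid pre-populated device memory, and some cuBLAS versions impose alignment restrictions on int8 leading dimensions:

```cpp
#include <cstdint>
#include <cublas_v2.h>

// Sketch: (int8, int8, float, float) strided batched GEMM via cuBLAS.
// d_a, d_b are int8 device buffers; d_c is a float device buffer.
void int8_float_gemm_batch(cublasHandle_t handle,
                           const std::int8_t *d_a, const std::int8_t *d_b,
                           float *d_c, int m, int n, int k, int batch_count) {
    // Scaling factors are float because the compute type is CUBLAS_COMPUTE_32F.
    const float alpha = 1.0f, beta = 0.0f;
    cublasGemmStridedBatchedEx(
        handle, CUBLAS_OP_N, CUBLAS_OP_N, m, n, k,
        &alpha,
        d_a, CUDA_R_8I, /*lda=*/m, /*strideA=*/(long long)m * k,
        d_b, CUDA_R_8I, /*ldb=*/k, /*strideB=*/(long long)k * n,
        &beta,
        d_c, CUDA_R_32F, /*ldc=*/m, /*strideC=*/(long long)m * n,
        batch_count,
        CUBLAS_COMPUTE_32F, CUBLAS_GEMM_DEFAULT);
}
```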

Problem statement

This was not added in #466 due to precision issues when using AdaptiveCpp; the tests passed locally with DPC++.
See #466 for details on how to implement this.
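
On the interface side, the call that would exercise this combination looks roughly like the following. This is a hypothetical usage sketch following the oneMKL spec's strided USM gemm_batch; the function name `run_gemm_batch`, the column-major choice, and the dimension/stride values are illustrative:

```cpp
#include <cstdint>
#include <sycl/sycl.hpp>
#include "oneapi/mkl.hpp"

// Hypothetical caller of the (int8, int8, float, float) strided gemm_batch
// that the CuBlas and MKLCPU/GPU backends currently reject as unsupported.
sycl::event run_gemm_batch(sycl::queue &q,
                           const std::int8_t *a, const std::int8_t *b, float *c,
                           std::int64_t m, std::int64_t n, std::int64_t k,
                           std::int64_t batch_size) {
    const float alpha = 1.0f, beta = 0.0f;  // float scaling, int8 inputs
    return oneapi::mkl::blas::column_major::gemm_batch(
        q, oneapi::mkl::transpose::nontrans, oneapi::mkl::transpose::nontrans,
        m, n, k, alpha,
        a, /*lda=*/m, /*stride_a=*/m * k,
        b, /*ldb=*/k, /*stride_b=*/k * n,
        beta,
        c, /*ldc=*/m, /*stride_c=*/m * n,
        batch_size);
}
```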

@AidanBeltonS AidanBeltonS changed the title Implement CuBlas int8, float mixed precision gemm_batch Implement CuBlas/MKL int8, float mixed precision gemm_batch Jun 5, 2024
@Rbiessy Rbiessy added the feature A request to add a new feature label Jul 11, 2024