Lower precision for inference #191

kairukuma · 2024-06-10T13:52:04Z

Hi all, thanks for the great work! Is there a way to run chromBPnet in mixed precision formats like BF16 and utilize tensor cores? I'm using an RTX 6000 and its tensor performance on paper is ~8 times faster than its single-precision performance. I'm curious to see if inference speed for slower functions (e.g. contribs_bw) would scale proportionally. Any help is appreciated!

panushri25 · 2024-06-12T01:08:26Z

Ah that is good to know. We dont have this capability currently, contribs_bw is primarily using the deeplift algorithm, if deeplift is faster at lower precision, it can translate o this function being fast as well. Will take a note to explore this in the upcoming releases. Thank you!

panushri25 closed this as completed Jun 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lower precision for inference #191

Lower precision for inference #191

kairukuma commented Jun 10, 2024 •

edited

Loading

panushri25 commented Jun 12, 2024

Lower precision for inference #191

Lower precision for inference #191

Comments

kairukuma commented Jun 10, 2024 • edited Loading

panushri25 commented Jun 12, 2024

kairukuma commented Jun 10, 2024 •

edited

Loading