
Allow fp16 input tensor to apply with fp16 saved model #36

Closed
deepkyu opened this issue Sep 1, 2023 · 1 comment
Labels
enhancement New feature or request

deepkyu commented Sep 1, 2023

Problem

From @illian01

Compressor

  • The current API takes the input tensor via input_shape
  • Some models are saved with half() (i.e. fp16) and cannot run inference with a float32 input tensor
    • Such a model needs an example input tensor whose dtype matches the model's (float16)
  • We need an additional option that lets users select the dtype of the example input tensor
  • Our back end already supports different input tensor dtypes, and this does not affect model compression
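The mismatch can be reproduced with a small PyTorch sketch (the Linear module here is an illustrative stand-in, not the actual model):

```python
import torch

# Illustrative stand-in for a model saved with .half() (fp16 weights).
model = torch.nn.Linear(4, 2).half()

x_fp32 = torch.randn(1, 4)  # fp32 example input: dtype mismatch
try:
    model(x_fp32)
except RuntimeError as err:
    print(f"fp32 input rejected: {err}")

# An example input whose dtype matches the model is required instead.
x_fp16 = x_fp32.half()
print(x_fp16.dtype)  # torch.float16
```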

Related Link

Slack: https://nota-workspace.slack.com/archives/C040F65LSAJ/p1693546735327899

@deepkyu deepkyu added the enhancement New feature or request label Sep 1, 2023
@deepkyu deepkyu assigned deepkyu and Only-bottle and unassigned deepkyu Sep 1, 2023
@Only-bottle
Member

@deepkyu @illian01, this issue has been handled in the Compressor Backend.

The resolution is as follows:

  • We confirmed that half cannot be used on CPU (related issue), so we convert the model to float, compress it, and then convert it back to half before returning it.
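A minimal sketch of that workaround, assuming a `compress_fn` callable as a placeholder for the backend's actual compression step:

```python
import torch

def compress_fp16_model(model: torch.nn.Module, compress_fn):
    """Sketch of the described fix: fp16 ops are unavailable on CPU,
    so cast the model to fp32, compress, then cast back to fp16.
    `compress_fn` is a hypothetical stand-in for the backend compressor."""
    compressed = compress_fn(model.float())  # fp16 -> fp32 for CPU-side work
    return compressed.half()                 # hand back an fp16 model

# Usage (identity "compression" just to show the dtype round trip):
model = torch.nn.Linear(4, 2).half()
out = compress_fp16_model(model, lambda m: m)
print(next(out.parameters()).dtype)  # torch.float16
```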
