[Feature] Enable AMX FP16 on next generation CPU #456
src/utils/matmul_helper.h
Outdated
TimeLine t("onednn_amx_sgemm_f32bf16f32_compute");
TimeLine t1("onednn_amx_sgemm_f32bf16f32_compute.create_primitive");
template <typename Tin, typename Tweight, typename Tout>
void onednn_amx_sgemm_f32x16f32_compute(bool transA, int M, int N, int K, float alpha, const Tin *A, int lda,
Rename to onednn_amx_gemm_compute?
src/utils/matmul_helper.h
Outdated
void onednn_amx_sgemm_f32bf16f32_compute_residential(bool transA, int M, int N, int K, float alpha, const Tin *A,
int lda, const bfloat16_t *packedB, float beta, Tout *C, int ldc, const float *bias, const Tin *res,
template <typename Tin, typename Tweight, typename Tout>
void onednn_amx_sgemm_f32x16f32_compute_residential(bool transA, int M, int N, int K, float alpha, const Tin *A,
Rename to onednn_amx_gemm_compute_residential?
Besides compute and compute_resident, the other kernels also need to support AMX FP16.
ad11ced to ee2c349
Revised and updated.
@wenhuanh We will need to check the ISA to select AMX_FP16 or AVX512_FP16, but this is OK for now.
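The ISA check mentioned above could look roughly like the sketch below. It is not code from this PR: `Fp16Path`, `CpuFeatures`, `select_fp16_path` and `detect_cpu_features` are hypothetical names, the preference for AMX FP16 over AVX512 FP16 is an assumption, and the detection relies on `__get_cpuid_count` from `<cpuid.h>`, which is only available with GCC/Clang on x86. It reads the CPUID feature bits only and does not check whether the OS has enabled AMX tile state.

```cpp
#if defined(__x86_64__) || defined(__i386__)
#include <cpuid.h>  // __get_cpuid_count (GCC/Clang on x86)
#endif

// Hypothetical names for illustration; none of these exist in the PR.
enum class Fp16Path { AmxFp16, Avx512Fp16, None };

struct CpuFeatures {
    bool amx_fp16 = false;
    bool avx512_fp16 = false;
};

// Pure selection rule (assumed policy): prefer AMX FP16, then AVX512 FP16.
inline Fp16Path select_fp16_path(const CpuFeatures &f) {
    if (f.amx_fp16) return Fp16Path::AmxFp16;
    if (f.avx512_fp16) return Fp16Path::Avx512Fp16;
    return Fp16Path::None;
}

// Reads the CPUID feature bits. On non-x86 builds both flags stay false.
inline CpuFeatures detect_cpu_features() {
    CpuFeatures f;
#if defined(__x86_64__) || defined(__i386__)
    unsigned eax = 0, ebx = 0, ecx = 0, edx = 0;
    // CPUID.(EAX=7, ECX=0):EDX[23] reports AVX512-FP16.
    if (__get_cpuid_count(7, 0, &eax, &ebx, &ecx, &edx))
        f.avx512_fp16 = ((edx >> 23) & 1u) != 0;
    // CPUID.(EAX=7, ECX=1):EAX[21] reports AMX-FP16.
    if (__get_cpuid_count(7, 1, &eax, &ebx, &ecx, &edx))
        f.amx_fp16 = ((eax >> 21) & 1u) != 0;
#endif
    return f;
}
```

Keeping the selection rule separate from the CPUID query lets the rule be unit-tested on machines without either extension; a caller would use `select_fp16_path(detect_cpu_features())`.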
Need to export the environment variable DNNL_MAX_CPU_ISA=AVX512_CORE_AMX_FP16 when running.
Signed-off-by: Wenhuan Huang <wenhuan.huang@intel.com>
ee2c349 to 2190d1f
Perf status was updated in the related JIRA.
Need to export the environment variable DNNL_MAX_CPU_ISA=AVX512_CORE_AMX_FP16 when running.