[06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: ampere_scudnn_128x128_relu_exp_interior_nhwc_tn_v1 Tactic: 0x17173deba0b64484 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x17173deba0b64484 Time: 0.0430446 [06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.0446183 [06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x35f26f9c09557d86 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x35f26f9c09557d86 Time: 0.0450194 [06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x3e191488237fab8f [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.065536 [06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.052224 [06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.0447634 [06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.0430446 [06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x55d80c17b1cd982d [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x55d80c17b1cd982d Time: 0.0448366 [06/27/2024-06:27:01] [V] [TRT] 
Conv_69 || Conv_94 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x7bc32c782b800c48 Time: 0.0410331 [06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x90898977fc8ce537 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x90898977fc8ce537 Time: 0.0838217 [06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.0362789 [06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xae0c89d047932ba3 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0xae0c89d047932ba3 Time: 0.0421303 [06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xbc0bba0ff1a92939 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0xbc0bba0ff1a92939 Time: 0.0420937 [06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xc7b3afceb5fb03c0 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0xc7b3afceb5fb03c0 Time: 0.0441051 [06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xc7feb33970feefa7 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0xc7feb33970feefa7 Time: 0.0511512 [06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xd55ee6fd0b56f808 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0xd55ee6fd0b56f808 Time: 0.0436663 [06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 Tactic: 0xd9031472c05adf51 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.0437394 [06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xe47307053a42b3e4 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0xe47307053a42b3e4 Time: 0.04096 [06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.0411794 [06/27/2024-06:27:01] [V] [TRT] Fastest Tactic: 0x130df49cb195156b Time: 0.0357669 [06/27/2024-06:27:01] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x130df49cb195156b [06/27/2024-06:27:01] [V] [TRT] *************** Autotuning format combination: Float(204800,1:4,2560,32) -> Float(204800,1:4,2560,32) *************** [06/27/2024-06:27:01] [V] [TRT] --------------- Timing Runner: Conv_69 || Conv_94 (CublasConvolution) [06/27/2024-06:27:01] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:01] [V] [TRT] --------------- Timing Runner: Conv_69 || Conv_94 (CaskConvolution) [06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0360229 [06/27/2024-06:27:01] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.0543097 [06/27/2024-06:27:01] [V] [TRT] Fastest Tactic: 0x130df49cb195156b Time: 0.0360229 [06/27/2024-06:27:01] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x130df49cb195156b [06/27/2024-06:27:01] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:01] [V] [TRT] *************** Autotuning format combination: Float(819200,6400,80,1) -> Float(409600,6400,80,1) *************** [06/27/2024-06:27:01] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_70), Mul_71) (PointWise) [06/27/2024-06:27:01] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:01] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_70), Mul_71) (PointWiseV2) [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0141506 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0158757 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0136046 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0147611 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0117929 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0117816 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0143095 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0118164 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.012045 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0165059 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0141374 [06/27/2024-06:27:01] [V] [TRT] Fastest Tactic: 0x0000000000000005 Time: 0.0117816 [06/27/2024-06:27:01] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000005 [06/27/2024-06:27:01] [V] [TRT] 
*************** Autotuning format combination: Float(819200,1,10240,128) -> Float(409600,1,5120,64) *************** [06/27/2024-06:27:01] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_70), Mul_71) (PointWise) [06/27/2024-06:27:01] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:01] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_70), Mul_71) (PointWiseV2) [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.014336 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0138174 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0111631 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0253074 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0126659 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.012325 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0144549 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0135913 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0131129 [06/27/2024-06:27:01] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0131173 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0160919 [06/27/2024-06:27:02] [V] [TRT] Fastest Tactic: 0x0000000000000002 Time: 0.0111631 [06/27/2024-06:27:02] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000002 [06/27/2024-06:27:02] [V] [TRT] *************** Autotuning format combination: Float(204800,1:4,2560,32) -> Float(102400,1:4,1280,16) *************** [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_70), Mul_71) (PointWise) [06/27/2024-06:27:02] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_70), Mul_71) (PointWiseV2) [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.012861 [06/27/2024-06:27:02] [V] [TRT] Tactic: 
0x0000000000000001 Time: 0.0132601 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0216665 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0132322 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0145161 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0165542 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0186011 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0159614 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0311003 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0248442 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000000a Time: 0.0121051 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000000b Time: 0.0121661 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000000c Time: 0.012045 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000000d Time: 0.0128122 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000000e Time: 0.0138868 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000000f Time: 0.0123745 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.0140035 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.0146286 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.0205598 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.0143767 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.0145993 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.0137916 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.0144672 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.0157262 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0140168 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0112415 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0142828 [06/27/2024-06:27:02] [V] [TRT] Fastest Tactic: 0x000000000000001d Time: 0.0112415 
[06/27/2024-06:27:02] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001d [06/27/2024-06:27:02] [V] [TRT] *************** Autotuning format combination: Float(25600,6400:32,80,1) -> Float(12800,6400:32,80,1) *************** [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_70), Mul_71) (PointWise) [06/27/2024-06:27:02] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_70), Mul_71) (PointWiseV2) [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.0110389 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.0166875 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000001a Time: 0.0115341 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000001b Time: 0.0128987 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000001f Time: 0.0110389 [06/27/2024-06:27:02] [V] [TRT] Fastest Tactic: 0x0000000000000018 Time: 0.0110389 [06/27/2024-06:27:02] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000018 [06/27/2024-06:27:02] [V] [TRT] *************** Autotuning format combination: Float(1:4,6400,80,1) -> Float(1:4,6400,80,1) *************** [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_70), Mul_71) (PointWiseV2) [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.354597 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.422327 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.506441 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.546962 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.743717 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.735378 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.670281 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.863378 [06/27/2024-06:27:02] [V] [TRT] Tactic: 
0x0000000000000008 Time: 0.994889 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000009 Time: 1.40142 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000000a Time: 0.349769 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000000b Time: 0.404041 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000000c Time: 0.433152 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000000d Time: 0.55808 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000000e Time: 0.531456 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000000f Time: 0.393655 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.804279 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.668233 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.686226 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.537454 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.264046 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.488887 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.453778 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.658144 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000001c Time: 0.059648 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0436674 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0594895 [06/27/2024-06:27:02] [V] [TRT] Fastest Tactic: 0x000000000000001d Time: 0.0436674 [06/27/2024-06:27:02] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001d [06/27/2024-06:27:02] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:02] [V] [TRT] *************** Autotuning format combination: Float(409600,6400,80,1) -> Float(409600,6400,80,1) *************** [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: Conv_72 (CudaDepthwiseConvolution) [06/27/2024-06:27:02] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:02] [V] 
[TRT] --------------- Timing Runner: Conv_72 (FusedConvActConvolution) [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000008ffff Time: 0.0855771 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000009ffff Time: 0.101742 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000000bffff Time: 0.0538819 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000000cffff Time: 0.0610011 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000011ffff Time: 0.0851383 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000018ffff Time: 0.0558811 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000001bffff Time: 0.0603185 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000001fffff Time: 0.0544183 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000002effff Time: 0.0743863 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000035ffff Time: 0.0531017 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000003cffff Time: 0.0560762 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000003dffff Time: 0.0591482 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000040ffff Time: 0.0547154 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000041ffff Time: 0.0531505 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000045ffff Time: 0.0524663 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000046ffff Time: 0.0743863 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000004affff Time: 0.0540282 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000004effff Time: 0.056515 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000051ffff Time: 0.0606598 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000055ffff Time: 0.083456 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000059ffff Time: 0.0533455 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000005effff Time: 0.0881371 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000005fffff Time: 0.054272 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000061ffff Time: 0.0717531 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000066ffff Time: 0.0588069 [06/27/2024-06:27:02] [V] [TRT] Tactic: 
0x00000000006affff Time: 0.0544152 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000006bffff Time: 0.0775802 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000006fffff Time: 0.0560762 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000070ffff Time: 0.0538331 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000073ffff Time: 0.0549547 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000075ffff Time: 0.0822126 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000007cffff Time: 0.0587093 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000007dffff Time: 0.0558324 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000007effff Time: 0.0760686 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000083ffff Time: 0.0592457 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000008affff Time: 0.0566613 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000091ffff Time: 0.0541257 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000009dffff Time: 0.053539 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000a3ffff Time: 0.0531017 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000a6ffff Time: 0.0545158 [06/27/2024-06:27:02] [V] [TRT] Fastest Tactic: 0x000000000045ffff Time: 0.0524663 [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: Conv_72 (CudnnConvolution) [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0351963 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.049152 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.06656 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000004 Time: 5.65029 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.153015 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.0349623 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.0403383 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000003a Time: 0.121856 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000003c Time: 5.73323 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000003d Time: 0.213138 
[06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.0352549 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.0348763 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.066656 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000074 Time: 5.70383 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000075 Time: 0.163401 [06/27/2024-06:27:02] [V] [TRT] Fastest Tactic: 0x0000000000000071 Time: 0.0348763 [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: Conv_72 (CublasConvolution) [06/27/2024-06:27:02] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: Conv_72 (CaskConvolution) [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x1fc87d7eb370bb7a [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x1fc87d7eb370bb7a Time: 0.0207543 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x2ee10e11d6651675 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x2ee10e11d6651675 Time: 0.0610987 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.0181577 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.0257463 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1_aligna4_alignc4 Tactic: 0x7f0145cb49517338 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x7f0145cb49517338 
Time: 0.0346405 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x865894c4635db7fd Time: 0.0257672 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x8e3884f0eaec3ecd [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x8e3884f0eaec3ecd Time: 0.0193097 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: 0x9808072e706def96 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x9808072e706def96 Time: 0.02048 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9cd5cdc35441c505 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x9cd5cdc35441c505 Time: 0.0308078 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9de226a0c44627c4 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x9de226a0c44627c4 Time: 0.0795131 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1_aligna4_alignc4 Tactic: 0xa419b3b68f2da07b [06/27/2024-06:27:02] [V] [TRT] Tactic: 0xa419b3b68f2da07b Time: 0.0287314 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.0212323 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 
Tactic: 0xa8ef60e712f8ad24 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0xa8ef60e712f8ad24 Time: 0.0256 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xc0b05b61d128e46e [06/27/2024-06:27:02] [V] [TRT] Tactic: 0xc0b05b61d128e46e Time: 0.031744 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0xc3cf6e1d1c6aff27 Time: 0.0172455 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xe5603263b7f00303 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0xe5603263b7f00303 Time: 0.0200594 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:02] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.0272823 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.0184137 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xfff46c7893896eb1 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0xfff46c7893896eb1 Time: 0.0299301 [06/27/2024-06:27:02] [V] [TRT] Fastest Tactic: 0xc3cf6e1d1c6aff27 Time: 0.0172455 [06/27/2024-06:27:02] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:27:02] [V] [TRT] *************** Autotuning format combination: Float(409600,1,5120,64) -> Float(409600,1,5120,64) *************** [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: Conv_72 
(CublasConvolution) [06/27/2024-06:27:02] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: Conv_72 (CaskConvolution) [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x1022069e6f8d9aeb [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x1022069e6f8d9aeb Time: 0.0290231 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0221309 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x128_relu_exp_interior_nhwc_tn_v1 Tactic: 0x17173deba0b64484 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x17173deba0b64484 Time: 0.025405 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.0190537 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x35f26f9c09557d86 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x35f26f9c09557d86 Time: 0.0186514 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x3e191488237fab8f [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.0230713 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.0229251 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic 
Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.0192537 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.0257219 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x55d80c17b1cd982d [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x55d80c17b1cd982d Time: 0.018708 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x7bc32c782b800c48 Time: 0.0263314 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x90898977fc8ce537 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x90898977fc8ce537 Time: 0.0192006 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.02211 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xae0c89d047932ba3 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0xae0c89d047932ba3 Time: 0.0177006 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xbc0bba0ff1a92939 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0xbc0bba0ff1a92939 Time: 0.0232385 
[06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xc7b3afceb5fb03c0 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0xc7b3afceb5fb03c0 Time: 0.02816 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xc7feb33970feefa7 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0xc7feb33970feefa7 Time: 0.0224653 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xd55ee6fd0b56f808 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0xd55ee6fd0b56f808 Time: 0.0187429 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 Tactic: 0xd9031472c05adf51 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.376759 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xe47307053a42b3e4 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0xe47307053a42b3e4 Time: 0.0250659 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.0251611 [06/27/2024-06:27:02] [V] [TRT] Fastest Tactic: 0xae0c89d047932ba3 Time: 0.0177006 [06/27/2024-06:27:02] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xae0c89d047932ba3 [06/27/2024-06:27:02] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,1280,16) -> Float(102400,1:4,1280,16) *************** [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: Conv_72 (CublasConvolution) [06/27/2024-06:27:02] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping 
[06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: Conv_72 (CaskConvolution) [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0268008 [06/27/2024-06:27:02] [V] [TRT] Conv_72 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.023343 [06/27/2024-06:27:02] [V] [TRT] Fastest Tactic: 0x9dece0dc37e90462 Time: 0.023343 [06/27/2024-06:27:02] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:02] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:02] [V] [TRT] *************** Autotuning format combination: Float(409600,6400,80,1) -> Float(409600,6400,80,1) *************** [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_73), Mul_74) (PointWise) [06/27/2024-06:27:02] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_73), Mul_74) (PointWiseV2) [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0222772 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0187051 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.025795 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0141365 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0169214 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0108669 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0139237 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0205775 [06/27/2024-06:27:02] [V] 
[TRT] Tactic: 0x0000000000000008 Time: 0.0118607 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0129608 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0150976 [06/27/2024-06:27:02] [V] [TRT] Fastest Tactic: 0x0000000000000005 Time: 0.0108669 [06/27/2024-06:27:02] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000005 [06/27/2024-06:27:02] [V] [TRT] *************** Autotuning format combination: Float(409600,1,5120,64) -> Float(409600,1,5120,64) *************** [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_73), Mul_74) (PointWise) [06/27/2024-06:27:02] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_73), Mul_74) (PointWiseV2) [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0161625 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.024947 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0110055 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0139782 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0111638 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0115573 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0207935 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0137093 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0134595 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0123025 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0133648 [06/27/2024-06:27:02] [V] [TRT] Fastest Tactic: 0x0000000000000002 Time: 0.0110055 [06/27/2024-06:27:02] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000002 [06/27/2024-06:27:02] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,1280,16) -> Float(102400,1:4,1280,16) *************** 
[06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_73), Mul_74) (PointWise) [06/27/2024-06:27:02] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_73), Mul_74) (PointWiseV2) [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0115791 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0114898 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0126545 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.012411 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0123002 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0124107 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0127029 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0135273 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0120194 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0145115 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000000a Time: 0.0108882 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000000b Time: 0.0116128 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000000c Time: 0.0111852 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000000d Time: 0.0123611 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000000e Time: 0.0115566 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000000f Time: 0.0109192 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.0121417 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.0111515 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.0115678 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.0122248 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.0148782 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.0131923 [06/27/2024-06:27:02] [V] [TRT] Tactic: 
0x0000000000000016 Time: 0.0157851 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.0154039 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0120686 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0168391 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0133918 [06/27/2024-06:27:02] [V] [TRT] Fastest Tactic: 0x000000000000000a Time: 0.0108882 [06/27/2024-06:27:02] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000000a [06/27/2024-06:27:02] [V] [TRT] *************** Autotuning format combination: Float(12800,6400:32,80,1) -> Float(12800,6400:32,80,1) *************** [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_73), Mul_74) (PointWise) [06/27/2024-06:27:02] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_73), Mul_74) (PointWiseV2) [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.0112865 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.0127756 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000001a Time: 0.0110615 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000001b Time: 0.0135514 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000001f Time: 0.0112415 [06/27/2024-06:27:02] [V] [TRT] Fastest Tactic: 0x000000000000001a Time: 0.0110615 [06/27/2024-06:27:02] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001a [06/27/2024-06:27:02] [V] [TRT] *************** Autotuning format combination: Float(1:4,6400,80,1) -> Float(1:4,6400,80,1) *************** [06/27/2024-06:27:02] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:02] [V] [TRT] *************** Autotuning format combination: Float(409600,6400,80,1) -> Float(409600,6400,80,1) *************** [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: Conv_75 (CudaDepthwiseConvolution) 
[06/27/2024-06:27:02] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: Conv_75 (FusedConvActConvolution) [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000007ffff Time: 0.117394 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000000affff Time: 0.0917211 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000000effff Time: 0.0903314 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000000fffff Time: 0.0954514 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000019ffff Time: 0.0893074 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000001affff Time: 0.183296 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000024ffff Time: 0.120907 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000027ffff Time: 0.0901852 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000002dffff Time: 0.146725 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000036ffff Time: 0.0964023 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000004cffff Time: 0.0928914 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000062ffff Time: 0.101742 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000006effff Time: 0.103058 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000077ffff Time: 0.126245 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000086ffff Time: 0.120466 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000089ffff Time: 0.0895269 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000097ffff Time: 0.219136 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000098ffff Time: 0.111835 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x00000000009fffff Time: 0.0846263 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000a2ffff Time: 0.0888686 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000a4ffff Time: 0.0914286 [06/27/2024-06:27:02] [V] [TRT] Fastest Tactic: 0x00000000009fffff Time: 0.0846263 [06/27/2024-06:27:02] [V] [TRT] --------------- Timing Runner: Conv_75 (CudnnConvolution) [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000000 
Time: 0.138533 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.122441 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.19339 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000004 Time: 5.65409 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.773705 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.172032 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.158939 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.122587 [06/27/2024-06:27:02] [V] [TRT] Tactic: 0x000000000000003a Time: 0.193536 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000003c Time: 6.10333 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000003d Time: 0.759954 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000003e Time: 0.102254 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.13824 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.1792 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.192658 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000074 Time: 5.78589 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000075 Time: 0.77195 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000076 Time: 0.0685592 [06/27/2024-06:27:03] [V] [TRT] Fastest Tactic: 0x0000000000000076 Time: 0.0685592 [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: Conv_75 (CaskConvolution) [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x01cf8ce2da913006 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x01cf8ce2da913006 Time: 0.1536 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x12dbf7d94ee3696d [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x12dbf7d94ee3696d Time: 
0.108398 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.105691 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x4727434768e46395 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x4727434768e46395 Time: 0.0951589 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x4efce38acc876f5c [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x4efce38acc876f5c Time: 0.211237 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.137874 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: ampere_scudnn_128x128_relu_xregs_large_nn_v1 Tactic: 0x5403ad713f811a18 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x5403ad713f811a18 Time: 0.144384 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x5aa723e0481da855 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x5aa723e0481da855 Time: 0.153454 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: ampere_scudnn_128x64_relu_xregs_large_nn_v1 Tactic: 0x5deb29b7a8e275f7 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x5deb29b7a8e275f7 Time: 0.0817737 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: ampere_scudnn_winograd_128x128_ldg1_ldg4_relu_tile148t_nt_v1 Tactic: 0x94119b4c514b211a [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x94119b4c514b211a Time: 0.0724587 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0xa31d27de74b895ff [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xa31d27de74b895ff Time: 0.0955223 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.13312 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_aligna4_alignc4 Tactic: 0xbb8c3889c7eacd30 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xbb8c3889c7eacd30 Time: 0.151113 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0xd828f024626fa982 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xd828f024626fa982 Time: 0.143214 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.15872 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.0869669 [06/27/2024-06:27:03] [V] [TRT] Fastest Tactic: 0x94119b4c514b211a Time: 0.0724587 [06/27/2024-06:27:03] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CudnnConvolution Tactic: 0x0000000000000076 [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,1,5120,64) -> Float(409600,1,5120,64) *************** [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: Conv_75 (CaskConvolution) [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x19b688348f983aa0 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x19b688348f983aa0 Time: 0.090336 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x1da91d865428f237 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x1da91d865428f237 Time: 0.0719726 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.100059 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x3e191488237fab8f [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.0890149 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.0878446 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x3f0c846d6379bc98 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x3f0c846d6379bc98 Time: 0.341723 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.148334 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.138825 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x62835fce994f06dd [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x62835fce994f06dd Time: 0.0817737 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_large_nhwc_tn_v1 Tactic: 0x634e99502974e4da [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x634e99502974e4da Time: 0.135461 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0x65e41d81f093b482 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x65e41d81f093b482 Time: 0.0991086 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x7bc32c782b800c48 Time: 0.133413 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x8014228ec08b4d49 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x8014228ec08b4d49 Time: 0.0740206 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x94a7db94ba744c45 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x94a7db94ba744c45 Time: 0.083968 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: 0x999e005e3b016ea6 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x999e005e3b016ea6 Time: 0.0947931 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0xb443c221fcb1565b 
[06/27/2024-06:27:03] [V] [TRT] Tactic: 0xb443c221fcb1565b Time: 0.151259 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_large_nhwc_tn_v1 Tactic: 0xbdfdef6b84f7ccc9 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xbdfdef6b84f7ccc9 Time: 0.0815543 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: ampere_scudnn_128x128_relu_exp_large_nhwc_tn_v1 Tactic: 0xca7eeb8d9143d738 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xca7eeb8d9143d738 Time: 0.139995 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0xd15dd11d64344e83 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xd15dd11d64344e83 Time: 0.210213 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 Tactic: 0xd9031472c05adf51 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.139118 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0xf48db81f02eca9ee [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xf48db81f02eca9ee Time: 0.0708998 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.133632 [06/27/2024-06:27:03] [V] [TRT] Fastest Tactic: 0xf48db81f02eca9ee Time: 0.0708998 [06/27/2024-06:27:03] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xf48db81f02eca9ee [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,1280,16) -> Float(102400,1:4,1280,16) *************** [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: Conv_75 (CaskConvolution) [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0x65e41d81f093b482 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x65e41d81f093b482 Time: 0.177737 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: 0x999e005e3b016ea6 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x999e005e3b016ea6 Time: 0.122734 [06/27/2024-06:27:03] [V] [TRT] Conv_75 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0xb443c221fcb1565b [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xb443c221fcb1565b Time: 0.0966949 [06/27/2024-06:27:03] [V] [TRT] Fastest Tactic: 0xb443c221fcb1565b Time: 0.0966949 [06/27/2024-06:27:03] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xb443c221fcb1565b [06/27/2024-06:27:03] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,6400,80,1), Float(409600,6400,80,1) -> Float(409600,6400,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: PWN(PWN(PWN(Sigmoid_76), Mul_77), Add_78) (PointWise) [06/27/2024-06:27:03] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: PWN(PWN(PWN(Sigmoid_76), Mul_77), Add_78) (PointWiseV2) [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0171815 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0176366 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0171007 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0185971 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0172627 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.017247 
[06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0188714 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0179017 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0283794 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0179931 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0176691 [06/27/2024-06:27:03] [V] [TRT] Fastest Tactic: 0x0000000000000002 Time: 0.0171007 [06/27/2024-06:27:03] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000002 [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,1,5120,64), Float(409600,1,5120,64) -> Float(409600,1,5120,64) *************** [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: PWN(PWN(PWN(Sigmoid_76), Mul_77), Add_78) (PointWise) [06/27/2024-06:27:03] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: PWN(PWN(PWN(Sigmoid_76), Mul_77), Add_78) (PointWiseV2) [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0175543 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0182857 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0173283 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0185051 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0174243 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0169691 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0188526 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0179017 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0312759 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0177818 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000001c Time: 0.017538 [06/27/2024-06:27:03] [V] [TRT] Fastest Tactic: 0x0000000000000005 Time: 0.0169691 [06/27/2024-06:27:03] [V] [TRT] 
>>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000005 [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,1280,16), Float(102400,1:4,1280,16) -> Float(102400,1:4,1280,16) *************** [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: PWN(PWN(PWN(Sigmoid_76), Mul_77), Add_78) (PointWise) [06/27/2024-06:27:03] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: PWN(PWN(PWN(Sigmoid_76), Mul_77), Add_78) (PointWiseV2) [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0171967 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0174258 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0307493 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0180114 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0181949 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0320951 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0175543 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.018396 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0805303 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0350793 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000000a Time: 0.0178663 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000000b Time: 0.0174243 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000000c Time: 0.0176691 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000000d Time: 0.0178846 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000000e Time: 0.0314542 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000000f Time: 0.0185051 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.0175055 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.0175721 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.0178306 
[06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.0237192 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.0336724 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.0182491 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.0178306 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.0190006 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0177656 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0170844 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0183589 [06/27/2024-06:27:03] [V] [TRT] Fastest Tactic: 0x000000000000001d Time: 0.0170844 [06/27/2024-06:27:03] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001d [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(12800,6400:32,80,1), Float(12800,6400:32,80,1) -> Float(12800,6400:32,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: PWN(PWN(PWN(Sigmoid_76), Mul_77), Add_78) (PointWise) [06/27/2024-06:27:03] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: PWN(PWN(PWN(Sigmoid_76), Mul_77), Add_78) (PointWiseV2) [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.0172942 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.017473 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000001a Time: 0.023678 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000001b Time: 0.0183589 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000001f Time: 0.0179749 [06/27/2024-06:27:03] [V] [TRT] Fastest Tactic: 0x0000000000000018 Time: 0.0172942 [06/27/2024-06:27:03] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000018 [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(1:4,6400,80,1), Float(1:4,6400,80,1) -> Float(1:4,6400,80,1) 
*************** [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: PWN(PWN(PWN(Sigmoid_76), Mul_77), Add_78) (PointWiseV2) [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.555301 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.86923 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.980261 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.963145 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000004 Time: 1.14381 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.667648 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000006 Time: 1.07286 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000007 Time: 1.44808 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.888101 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.902144 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000000a Time: 0.358693 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000000b Time: 0.485376 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000000c Time: 0.454071 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000000d Time: 0.634149 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000000e Time: 0.667794 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000000f Time: 0.632978 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.958464 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.763173 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.945006 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000013 Time: 1.1128 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.284965 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.412233 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.55179 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.88693 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0619276 [06/27/2024-06:27:03] 
[V] [TRT] Tactic: 0x000000000000001d Time: 0.0619368 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0618789 [06/27/2024-06:27:03] [V] [TRT] Fastest Tactic: 0x000000000000001e Time: 0.0618789 [06/27/2024-06:27:03] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001e [06/27/2024-06:27:03] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,6400,80,1) -> Float(409600,6400,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,1,5120,64) -> Float(409600,1,5120,64) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,1280,16) -> Float(102400,1:4,1280,16) *************** [06/27/2024-06:27:03] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,6400,80,1) -> Float(409600,6400,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,1,5120,64) -> Float(409600,1,5120,64) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,1280,16) -> Float(102400,1:4,1280,16) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(12800,6400:32,80,1) -> Float(12800,6400:32,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(1:4,6400,80,1) -> Float(1:4,6400,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,6400,80,1) -> Float(409600,6400,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,1,5120,64) -> Float(409600,1,5120,64) *************** 
[06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,1280,16) -> Float(102400,1:4,1280,16) *************** [06/27/2024-06:27:03] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,6400,80,1), Float(409600,6400,80,1) -> Float(409600,6400,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,1,5120,64), Float(409600,1,5120,64) -> Float(409600,1,5120,64) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,1280,16), Float(102400,1:4,1280,16) -> Float(102400,1:4,1280,16) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(12800,6400:32,80,1), Float(12800,6400:32,80,1) -> Float(12800,6400:32,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(1:4,6400,80,1), Float(1:4,6400,80,1) -> Float(1:4,6400,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,6400,80,1) -> Float(409600,6400,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,1,5120,64) -> Float(409600,1,5120,64) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,1280,16) -> Float(102400,1:4,1280,16) *************** [06/27/2024-06:27:03] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,6400,80,1) -> Float(409600,6400,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,1,5120,64) -> Float(409600,1,5120,64) *************** [06/27/2024-06:27:03] [V] [TRT] *************** 
Autotuning format combination: Float(102400,1:4,1280,16) -> Float(102400,1:4,1280,16) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(12800,6400:32,80,1) -> Float(12800,6400:32,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(1:4,6400,80,1) -> Float(1:4,6400,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,6400,80,1) -> Float(409600,6400,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,1,5120,64) -> Float(409600,1,5120,64) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,1280,16) -> Float(102400,1:4,1280,16) *************** [06/27/2024-06:27:03] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,6400,80,1), Float(409600,6400,80,1) -> Float(409600,6400,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,1,5120,64), Float(409600,1,5120,64) -> Float(409600,1,5120,64) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,1280,16), Float(102400,1:4,1280,16) -> Float(102400,1:4,1280,16) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(12800,6400:32,80,1), Float(12800,6400:32,80,1) -> Float(12800,6400:32,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(1:4,6400,80,1), Float(1:4,6400,80,1) -> Float(1:4,6400,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,6400,80,1) 
-> Float(819200,6400,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: Conv_93 (CudaDepthwiseConvolution) [06/27/2024-06:27:03] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: Conv_93 (FusedConvActConvolution) [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000008ffff Time: 0.0842606 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000009ffff Time: 0.0903558 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x00000000000bffff Time: 0.0691444 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x00000000000cffff Time: 0.0613912 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000011ffff Time: 0.0538819 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000018ffff Time: 0.0559787 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x00000000001bffff Time: 0.08704 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x00000000001fffff Time: 0.0545646 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x00000000002effff Time: 0.072899 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000035ffff Time: 0.0660236 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x00000000003cffff Time: 0.0561737 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x00000000003dffff Time: 0.0595413 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000040ffff Time: 1.28015 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000041ffff Time: 0.0530057 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000045ffff Time: 0.0910674 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000046ffff Time: 0.0539794 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x00000000004affff Time: 0.053443 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x00000000004effff Time: 0.0562225 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000051ffff Time: 0.0601234 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000055ffff Time: 0.0592945 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000059ffff Time: 0.0529067 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x00000000005effff Time: 0.0539794 
[06/27/2024-06:27:03] [V] [TRT] Tactic: 0x00000000005fffff Time: 0.0537356 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000061ffff Time: 0.0588556 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000066ffff Time: 0.0589531 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x00000000006affff Time: 0.0545158 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x00000000006bffff Time: 0.0531992 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x00000000006fffff Time: 0.0741669 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000070ffff Time: 0.0533943 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000073ffff Time: 0.0549059 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000075ffff Time: 0.0531992 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x00000000007cffff Time: 0.0583192 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x00000000007dffff Time: 0.0560259 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x00000000007effff Time: 0.0583162 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000083ffff Time: 0.0591954 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x00000000008affff Time: 0.0566613 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000091ffff Time: 0.0535893 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x00000000009dffff Time: 0.0538331 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000a3ffff Time: 0.0521265 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000a6ffff Time: 0.0537356 [06/27/2024-06:27:03] [V] [TRT] Fastest Tactic: 0x0000000000a3ffff Time: 0.0521265 [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: Conv_93 (CudnnConvolution) [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0482743 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0401189 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0568076 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000004 Time: 5.75722 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.122734 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.0256 [06/27/2024-06:27:03] [V] [TRT] 
Tactic: 0x0000000000000039 Time: 0.0335863 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000003a Time: 0.0567101 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000003c Time: 5.6478 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x000000000000003d Time: 0.124562 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.0256 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.0268678 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.0571977 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000074 Time: 5.61371 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000075 Time: 0.138334 [06/27/2024-06:27:03] [V] [TRT] Fastest Tactic: 0x0000000000000070 Time: 0.0256 [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: Conv_93 (CublasConvolution) [06/27/2024-06:27:03] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: Conv_93 (CaskConvolution) [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x2ee10e11d6651675 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x2ee10e11d6651675 Time: 0.0608549 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.0179931 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.0256244 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1_aligna4_alignc4 Tactic: 0x7f0145cb49517338 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x7f0145cb49517338 Time: 0.0342016 [06/27/2024-06:27:03] 
[V] [TRT] Conv_93 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x865894c4635db7fd Time: 0.0209189 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: 0x9808072e706def96 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x9808072e706def96 Time: 0.020224 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.0211278 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: 0xa8ef60e712f8ad24 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xa8ef60e712f8ad24 Time: 0.0255269 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xc0b05b61d128e46e [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xc0b05b61d128e46e Time: 0.0313051 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xc3cf6e1d1c6aff27 Time: 0.0172455 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xe5603263b7f00303 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xe5603263b7f00303 Time: 0.0199497 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.0256503 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:03] [V] [TRT] 
Tactic: 0xf64396b97c889179 Time: 0.0182674 [06/27/2024-06:27:03] [V] [TRT] Fastest Tactic: 0xc3cf6e1d1c6aff27 Time: 0.0172455 [06/27/2024-06:27:03] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(409600,1,5120,64) -> Float(819200,1,10240,128) *************** [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: Conv_93 (CublasConvolution) [06/27/2024-06:27:03] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: Conv_93 (CaskConvolution) [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x1022069e6f8d9aeb [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x1022069e6f8d9aeb Time: 0.0286427 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0222165 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x128_relu_exp_interior_nhwc_tn_v1 Tactic: 0x17173deba0b64484 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x17173deba0b64484 Time: 0.0437638 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.0188343 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x35f26f9c09557d86 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x35f26f9c09557d86 Time: 0.0184686 [06/27/2024-06:27:03] [V] [TRT] Conv_93 
Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x3e191488237fab8f [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.0226331 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.0224653 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.018652 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.0255756 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x55d80c17b1cd982d [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x55d80c17b1cd982d Time: 0.0356882 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x7bc32c782b800c48 Time: 0.0248686 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xae0c89d047932ba3 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xae0c89d047932ba3 Time: 0.017602 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xc7feb33970feefa7 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xc7feb33970feefa7 Time: 0.0221525 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 Tactic: 0xd9031472c05adf51 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.0256975 
[06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xe47307053a42b3e4 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xe47307053a42b3e4 Time: 0.0246011 [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.0247467 [06/27/2024-06:27:03] [V] [TRT] Fastest Tactic: 0xae0c89d047932ba3 Time: 0.017602 [06/27/2024-06:27:03] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xae0c89d047932ba3 [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,1280,16) -> Float(204800,1:4,2560,32) *************** [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: Conv_93 (CublasConvolution) [06/27/2024-06:27:03] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: Conv_93 (CaskConvolution) [06/27/2024-06:27:03] [V] [TRT] Conv_93 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0221518 [06/27/2024-06:27:03] [V] [TRT] Fastest Tactic: 0x130df49cb195156b Time: 0.0221518 [06/27/2024-06:27:03] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x130df49cb195156b [06/27/2024-06:27:03] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(819200,6400,80,1) -> Float(819200,6400,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: BatchNormalization_96 (Scale) [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0235102 [06/27/2024-06:27:03] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0235102 
[06/27/2024-06:27:03] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Scale Tactic: 0x0000000000000000 [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(819200,1,10240,128) -> Float(819200,1,10240,128) *************** [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: BatchNormalization_96 (Scale) [06/27/2024-06:27:03] [V] [TRT] Scale has no valid tactics for this config, skipping [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(204800,1:4,2560,32) -> Float(204800,1:4,2560,32) *************** [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: BatchNormalization_96 (Scale) [06/27/2024-06:27:03] [V] [TRT] Scale has no valid tactics for this config, skipping [06/27/2024-06:27:03] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(819200,6400,80,1) -> Float(819200,6400,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(819200,1,10240,128) -> Float(819200,1,10240,128) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(204800,1:4,2560,32) -> Float(204800,1:4,2560,32) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(25600,6400:32,80,1) -> Float(25600,6400:32,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(1:4,6400,80,1) -> Float(1:4,6400,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(819200,6400,80,1) -> Float(819200,6400,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(819200,1,10240,128) -> Float(819200,1,10240,128) *************** [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format 
combination: Float(204800,1:4,2560,32) -> Float(204800,1:4,2560,32) *************** [06/27/2024-06:27:03] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:03] [V] [TRT] *************** Autotuning format combination: Float(819200,6400,80,1) -> Float(1638400,6400,80,1) *************** [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_100), Mul_101) (PointWise) [06/27/2024-06:27:03] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:03] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_100), Mul_101) (PointWiseV2) [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0238202 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0232803 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0238028 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0233012 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.023134 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0230504 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0238933 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0376832 [06/27/2024-06:27:03] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0237819 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0231131 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0240152 [06/27/2024-06:27:04] [V] [TRT] Fastest Tactic: 0x0000000000000005 Time: 0.0230504 [06/27/2024-06:27:04] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000005 [06/27/2024-06:27:04] [V] [TRT] *************** Autotuning format combination: Float(819200,1,10240,128) -> Float(1638400,1,20480,256) *************** [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_100), Mul_101) (PointWise) [06/27/2024-06:27:04] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:04] [V] [TRT] 
--------------- Timing Runner: PWN(PWN(Sigmoid_100), Mul_101) (PointWiseV2) [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.023869 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0239924 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0236774 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0241143 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0230296 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0228624 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0246004 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0232816 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0240396 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0230296 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0237192 [06/27/2024-06:27:04] [V] [TRT] Fastest Tactic: 0x0000000000000005 Time: 0.0228624 [06/27/2024-06:27:04] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000005 [06/27/2024-06:27:04] [V] [TRT] *************** Autotuning format combination: Float(204800,1:4,2560,32) -> Float(409600,1:4,5120,64) *************** [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_100), Mul_101) (PointWise) [06/27/2024-06:27:04] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_100), Mul_101) (PointWiseV2) [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.022946 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0229473 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0496396 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0233443 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0237192 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0232392 
[06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0232196 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0235729 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0242598 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0261608 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000000a Time: 0.0229486 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000000b Time: 0.0239073 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000000c Time: 0.0230504 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000000d Time: 0.0233241 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000000e Time: 0.0238941 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000000f Time: 0.0230504 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.0241128 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.0233848 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.0232176 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.0495421 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.0246004 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.0241859 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.0235102 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.0242347 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0228833 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0339627 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000001e Time: 0.024771 [06/27/2024-06:27:04] [V] [TRT] Fastest Tactic: 0x000000000000001c Time: 0.0228833 [06/27/2024-06:27:04] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001c [06/27/2024-06:27:04] [V] [TRT] *************** Autotuning format combination: Float(25600,6400:32,80,1) -> Float(51200,6400:32,80,1) *************** [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_100), 
Mul_101) (PointWise) [06/27/2024-06:27:04] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_100), Mul_101) (PointWiseV2) [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.0229042 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.0239909 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000001a Time: 0.023426 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000001b Time: 0.0240327 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000001f Time: 0.0398994 [06/27/2024-06:27:04] [V] [TRT] Fastest Tactic: 0x0000000000000018 Time: 0.0229042 [06/27/2024-06:27:04] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000018 [06/27/2024-06:27:04] [V] [TRT] *************** Autotuning format combination: Float(1:4,6400,80,1) -> Float(1:4,6400,80,1) *************** [06/27/2024-06:27:04] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:04] [V] [TRT] *************** Autotuning format combination: Float(1638400,6400,80,1) -> Float(409600,1600,40,1) *************** [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: Conv_102 (CudaDepthwiseConvolution) [06/27/2024-06:27:04] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: Conv_102 (FusedConvActConvolution) [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000006ffff Time: 0.295205 [06/27/2024-06:27:04] [V] [TRT] Fastest Tactic: 0x000000000006ffff Time: 0.295205 [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: Conv_102 (CudnnConvolution) [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.288037 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.146725 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.296082 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000005 Time: 4.89457 
[06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.282331 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.146871 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000003a Time: 0.365861 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000003d Time: 4.84293 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.251758 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.16501 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.294912 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000075 Time: 4.80958 [06/27/2024-06:27:04] [V] [TRT] Fastest Tactic: 0x0000000000000001 Time: 0.146725 [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: Conv_102 (CaskConvolution) [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x01cf8ce2da913006 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x01cf8ce2da913006 Time: 0.149065 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x12dbf7d94ee3696d [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x12dbf7d94ee3696d Time: 0.16501 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.158135 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x4727434768e46395 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x4727434768e46395 Time: 0.180078 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x4efce38acc876f5c [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x4efce38acc876f5c Time: 0.328558 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.137728 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: ampere_scudnn_128x128_relu_xregs_large_nn_v1 Tactic: 0x5403ad713f811a18 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x5403ad713f811a18 Time: 0.17547 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x5aa723e0481da855 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x5aa723e0481da855 Time: 0.149211 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: ampere_scudnn_128x64_relu_xregs_large_nn_v1 Tactic: 0x5deb29b7a8e275f7 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x5deb29b7a8e275f7 Time: 0.153605 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0xa31d27de74b895ff [06/27/2024-06:27:04] [V] [TRT] Tactic: 0xa31d27de74b895ff Time: 0.180224 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.250002 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_aligna4_alignc4 Tactic: 0xbb8c3889c7eacd30 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0xbb8c3889c7eacd30 Time: 0.278967 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0xd828f024626fa982 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0xd828f024626fa982 Time: 0.27531 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:04] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.138025 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.164133 [06/27/2024-06:27:04] [V] [TRT] Fastest Tactic: 0x503619c69ae500ff Time: 0.137728 [06/27/2024-06:27:04] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x503619c69ae500ff [06/27/2024-06:27:04] [V] [TRT] *************** Autotuning format combination: Float(1638400,1,20480,256) -> Float(409600,1,10240,256) *************** [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: Conv_102 (CaskConvolution) [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x19b688348f983aa0 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x19b688348f983aa0 Time: 0.170277 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x1da91d865428f237 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x1da91d865428f237 Time: 0.130487 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.153307 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 
0x3e191488237fab8f [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.164864 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.206263 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x3f0c846d6379bc98 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x3f0c846d6379bc98 Time: 0.150674 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.229522 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.148626 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x62835fce994f06dd [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x62835fce994f06dd Time: 0.152869 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_large_nhwc_tn_v1 Tactic: 0x634e99502974e4da [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x634e99502974e4da Time: 0.139191 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0x65e41d81f093b482 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x65e41d81f093b482 Time: 0.0950857 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:04] [V] [TRT] Tactic: 
0x7bc32c782b800c48 Time: 0.152503 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x8014228ec08b4d49 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x8014228ec08b4d49 Time: 0.126464 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x94a7db94ba744c45 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x94a7db94ba744c45 Time: 0.159305 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: 0x999e005e3b016ea6 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x999e005e3b016ea6 Time: 0.107813 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0xb443c221fcb1565b [06/27/2024-06:27:04] [V] [TRT] Tactic: 0xb443c221fcb1565b Time: 0.0920869 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_large_nhwc_tn_v1 Tactic: 0xbdfdef6b84f7ccc9 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0xbdfdef6b84f7ccc9 Time: 0.220453 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: ampere_scudnn_128x128_relu_exp_large_nhwc_tn_v1 Tactic: 0xca7eeb8d9143d738 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0xca7eeb8d9143d738 Time: 0.221047 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0xd15dd11d64344e83 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0xd15dd11d64344e83 Time: 0.157403 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 
Tactic: 0xd9031472c05adf51 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.146871 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0xf48db81f02eca9ee [06/27/2024-06:27:04] [V] [TRT] Tactic: 0xf48db81f02eca9ee Time: 0.129317 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.137435 [06/27/2024-06:27:04] [V] [TRT] Fastest Tactic: 0xb443c221fcb1565b Time: 0.0920869 [06/27/2024-06:27:04] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xb443c221fcb1565b [06/27/2024-06:27:04] [V] [TRT] *************** Autotuning format combination: Float(409600,1:4,5120,64) -> Float(102400,1:4,2560,64) *************** [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: Conv_102 (CaskConvolution) [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0x65e41d81f093b482 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x65e41d81f093b482 Time: 0.0950857 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: 0x999e005e3b016ea6 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x999e005e3b016ea6 Time: 0.0906971 [06/27/2024-06:27:04] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0xb443c221fcb1565b [06/27/2024-06:27:04] [V] [TRT] Tactic: 0xb443c221fcb1565b Time: 0.0923063 [06/27/2024-06:27:04] [V] [TRT] Fastest Tactic: 0x999e005e3b016ea6 Time: 0.0906971 [06/27/2024-06:27:04] [V] 
[TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x999e005e3b016ea6 [06/27/2024-06:27:04] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:04] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(409600,1600,40,1) *************** [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_103), Mul_104) (PointWise) [06/27/2024-06:27:04] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_103), Mul_104) (PointWiseV2) [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0138365 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0211383 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0110389 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0143529 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.011129 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0107938 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0144823 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0217136 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0115228 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0107833 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0148827 [06/27/2024-06:27:04] [V] [TRT] Fastest Tactic: 0x0000000000000009 Time: 0.0107833 [06/27/2024-06:27:04] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000009 [06/27/2024-06:27:04] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(409600,1,10240,256) *************** [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_103), Mul_104) (PointWise) [06/27/2024-06:27:04] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:04] [V] [TRT] 
--------------- Timing Runner: PWN(PWN(Sigmoid_103), Mul_104) (PointWiseV2) [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0155671 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0134217 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0110615 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.014576 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0110618 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0108366 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0144623 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0118836 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0118604 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0109714 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0140725 [06/27/2024-06:27:04] [V] [TRT] Fastest Tactic: 0x0000000000000005 Time: 0.0108366 [06/27/2024-06:27:04] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000005 [06/27/2024-06:27:04] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(102400,1:4,2560,64) *************** [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_103), Mul_104) (PointWise) [06/27/2024-06:27:04] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_103), Mul_104) (PointWiseV2) [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0109401 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0114219 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0137642 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0110396 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0113435 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0117141 
[06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0127272 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0110952 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0125082 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0149536 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000000a Time: 0.0107109 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000000b Time: 0.011174 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000000c Time: 0.0110611 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000000d Time: 0.012032 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000000e Time: 0.0119101 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000000f Time: 0.0109505 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.0116691 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.012301 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.0114785 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.0210651 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.014341 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.0141802 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.0145089 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.0149696 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0109407 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0111508 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0144969 [06/27/2024-06:27:04] [V] [TRT] Fastest Tactic: 0x000000000000000a Time: 0.0107109 [06/27/2024-06:27:04] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000000a [06/27/2024-06:27:04] [V] [TRT] *************** Autotuning format combination: Float(12800,1600:32,40,1) -> Float(12800,1600:32,40,1) *************** [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_103), 
Mul_104) (PointWise) [06/27/2024-06:27:04] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_103), Mul_104) (PointWiseV2) [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.0109401 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.0112426 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000001a Time: 0.011692 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000001b Time: 0.012752 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000001f Time: 0.0108983 [06/27/2024-06:27:04] [V] [TRT] Fastest Tactic: 0x000000000000001f Time: 0.0108983 [06/27/2024-06:27:04] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001f [06/27/2024-06:27:04] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_103), Mul_104) (PointWiseV2) [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.355182 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.466505 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.462555 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.491081 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.624201 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.737719 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.626103 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000007 Time: 1.50045 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.99445 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000009 Time: 1.36368 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000000a Time: 0.420425 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000000b Time: 0.403749 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000000c Time: 0.353573 
[06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000000d Time: 0.670427 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000000e Time: 0.53131 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000000f Time: 0.437979 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.885467 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.665893 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.699392 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.579145 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.264192 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.411502 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.494885 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.71563 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0426057 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0853623 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000001e Time: 0.044984 [06/27/2024-06:27:04] [V] [TRT] Fastest Tactic: 0x000000000000001c Time: 0.0426057 [06/27/2024-06:27:04] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001c [06/27/2024-06:27:04] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:04] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(409600,1600,40,1) *************** [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: Conv_105 || Conv_130 (CudaDepthwiseConvolution) [06/27/2024-06:27:04] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: Conv_105 || Conv_130 (FusedConvActConvolution) [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000008ffff Time: 0.24181 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000009ffff Time: 0.0981577 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x00000000000bffff Time: 0.0863817 
[06/27/2024-06:27:04] [V] [TRT] Tactic: 0x00000000000cffff Time: 0.0864571 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000011ffff Time: 0.129829 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000018ffff Time: 0.0931109 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x00000000001bffff Time: 0.119881 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x00000000001fffff Time: 0.095744 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x00000000002effff Time: 0.098304 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000035ffff Time: 0.112567 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x00000000003cffff Time: 0.0945006 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x00000000003dffff Time: 0.0838949 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000040ffff Time: 0.0846994 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000041ffff Time: 0.0858789 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000045ffff Time: 0.0846994 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000046ffff Time: 0.0787017 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x00000000004affff Time: 0.0902583 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x00000000004effff Time: 0.0970606 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000051ffff Time: 0.13963 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000055ffff Time: 0.0838949 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000059ffff Time: 0.0806766 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x00000000005effff Time: 0.094208 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x00000000005fffff Time: 0.0833097 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000061ffff Time: 0.0950126 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000066ffff Time: 0.119881 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x00000000006affff Time: 0.095744 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x00000000006bffff Time: 0.122002 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x00000000006fffff Time: 0.084992 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000070ffff Time: 0.0864549 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000073ffff 
Time: 0.0918674 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000075ffff Time: 0.0879177 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x00000000007cffff Time: 0.22645 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x00000000007dffff Time: 0.0917943 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x00000000007effff Time: 0.104375 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000083ffff Time: 0.10123 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x00000000008affff Time: 0.0968411 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000091ffff Time: 0.0904777 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x00000000009dffff Time: 0.083088 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000a3ffff Time: 0.0851383 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000a6ffff Time: 0.0969874 [06/27/2024-06:27:04] [V] [TRT] Fastest Tactic: 0x000000000046ffff Time: 0.0787017 [06/27/2024-06:27:04] [V] [TRT] --------------- Timing Runner: Conv_105 || Conv_130 (CudnnConvolution) [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0580267 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0692876 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.183589 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000004 Time: 15.2858 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.342601 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.0580754 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.0649874 [06/27/2024-06:27:04] [V] [TRT] Tactic: 0x000000000000003a Time: 0.25483 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000003c Time: 15.255 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000003d Time: 0.291109 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.0577829 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.0812617 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.18315 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000074 Time: 15.2776 
[06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000075 Time: 0.289938 [06/27/2024-06:27:05] [V] [TRT] Fastest Tactic: 0x0000000000000070 Time: 0.0577829 [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: Conv_105 || Conv_130 (CublasConvolution) [06/27/2024-06:27:05] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: Conv_105 || Conv_130 (CaskConvolution) [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x1fc87d7eb370bb7a [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x1fc87d7eb370bb7a Time: 0.0474697 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x2ee10e11d6651675 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x2ee10e11d6651675 Time: 0.087552 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.0425691 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.0382171 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1_aligna4_alignc4 Tactic: 0x7f0145cb49517338 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x7f0145cb49517338 Time: 0.0358034 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x865894c4635db7fd Time: 0.0474331 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x8e3884f0eaec3ecd [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x8e3884f0eaec3ecd Time: 0.0444731 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: 0x9808072e706def96 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x9808072e706def96 Time: 0.051395 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9cd5cdc35441c505 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x9cd5cdc35441c505 Time: 0.0413623 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9de226a0c44627c4 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x9de226a0c44627c4 Time: 0.052035 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1_aligna4_alignc4 Tactic: 0xa419b3b68f2da07b [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xa419b3b68f2da07b Time: 0.0319781 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.0539307 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: 
ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: 0xa8ef60e712f8ad24 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xa8ef60e712f8ad24 Time: 0.0379246 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xc0b05b61d128e46e [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xc0b05b61d128e46e Time: 0.0410331 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xc3cf6e1d1c6aff27 Time: 0.0391314 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xe5603263b7f00303 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xe5603263b7f00303 Time: 0.0449829 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.0379977 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.0435566 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xfff46c7893896eb1 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xfff46c7893896eb1 Time: 0.07168 [06/27/2024-06:27:05] [V] [TRT] Fastest Tactic: 0xa419b3b68f2da07b Time: 0.0319781 [06/27/2024-06:27:05] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xa419b3b68f2da07b [06/27/2024-06:27:05] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> 
Float(409600,1,10240,256) *************** [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: Conv_105 || Conv_130 (CublasConvolution) [06/27/2024-06:27:05] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: Conv_105 || Conv_130 (CaskConvolution) [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x1022069e6f8d9aeb [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x1022069e6f8d9aeb Time: 0.0366446 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0270636 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: ampere_scudnn_128x128_relu_exp_interior_nhwc_tn_v1 Tactic: 0x17173deba0b64484 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x17173deba0b64484 Time: 0.0402286 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.0415086 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x35f26f9c09557d86 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x35f26f9c09557d86 Time: 0.0367543 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x3e191488237fab8f [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.0460069 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set 
Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.0835291 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.0415817 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.0409234 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x55d80c17b1cd982d [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x55d80c17b1cd982d Time: 0.0418743 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x7bc32c782b800c48 Time: 0.0380343 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x90898977fc8ce537 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x90898977fc8ce537 Time: 0.0415451 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.0268434 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xae0c89d047932ba3 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xae0c89d047932ba3 
Time: 0.0731429 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xbc0bba0ff1a92939 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xbc0bba0ff1a92939 Time: 0.0397897 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xc7b3afceb5fb03c0 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xc7b3afceb5fb03c0 Time: 0.0364617 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xc7feb33970feefa7 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xc7feb33970feefa7 Time: 0.0445451 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xd55ee6fd0b56f808 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xd55ee6fd0b56f808 Time: 0.0365349 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 Tactic: 0xd9031472c05adf51 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.0403383 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xe47307053a42b3e4 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xe47307053a42b3e4 Time: 0.0376686 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.0382903 [06/27/2024-06:27:05] [V] [TRT] Fastest Tactic: 0x9dece0dc37e90462 Time: 0.0268434 [06/27/2024-06:27:05] [V] [TRT] >>>>>>>>>>>>>>> Chose 
Runner Type: CaskConvolution Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:05] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(102400,1:4,2560,64) *************** [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: Conv_105 || Conv_130 (CublasConvolution) [06/27/2024-06:27:05] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: Conv_105 || Conv_130 (CaskConvolution) [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0270621 [06/27/2024-06:27:05] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.0268678 [06/27/2024-06:27:05] [V] [TRT] Fastest Tactic: 0x9dece0dc37e90462 Time: 0.0268678 [06/27/2024-06:27:05] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:05] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:05] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_106), Mul_107) (PointWise) [06/27/2024-06:27:05] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_106), Mul_107) (PointWiseV2) [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00688261 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00618057 [06/27/2024-06:27:05] [V] 
[TRT] Tactic: 0x0000000000000002 Time: 0.00743131 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00642405 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00906057 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00658286 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00632884 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00686106 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00666265 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00676239 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00664312 [06/27/2024-06:27:05] [V] [TRT] Fastest Tactic: 0x0000000000000001 Time: 0.00618057 [06/27/2024-06:27:05] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000001 [06/27/2024-06:27:05] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_106), Mul_107) (PointWise) [06/27/2024-06:27:05] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_106), Mul_107) (PointWiseV2) [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0116937 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00654317 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00625212 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00636661 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00917086 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00685431 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00582802 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00608305 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00641133 
[06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00686171 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00730011 [06/27/2024-06:27:05] [V] [TRT] Fastest Tactic: 0x0000000000000006 Time: 0.00582802 [06/27/2024-06:27:05] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000006 [06/27/2024-06:27:05] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_106), Mul_107) (PointWise) [06/27/2024-06:27:05] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_106), Mul_107) (PointWiseV2) [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0103063 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00638609 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00610152 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00609505 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00634753 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.011129 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00575195 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00694857 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0067759 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.009108 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000000a Time: 0.0135262 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000000b Time: 0.00641789 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000000c Time: 0.00676239 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000000d Time: 0.0061379 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000000e Time: 0.00634753 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000000f Time: 
0.00841708 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.00625332 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.0056211 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.00637317 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.00611429 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.00719337 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.00956709 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.00550277 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.00646837 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00690329 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000001d Time: 0.00605962 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000001e Time: 0.00904371 [06/27/2024-06:27:05] [V] [TRT] Fastest Tactic: 0x0000000000000016 Time: 0.00550277 [06/27/2024-06:27:05] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000016 [06/27/2024-06:27:05] [V] [TRT] *************** Autotuning format combination: Float(12800,1600:32,40,1) -> Float(6400,1600:32,40,1) *************** [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_106), Mul_107) (PointWise) [06/27/2024-06:27:05] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_106), Mul_107) (PointWiseV2) [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.00841897 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.00660945 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000001a Time: 0.00674909 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000001b Time: 0.00623304 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000001f Time: 0.00622688 [06/27/2024-06:27:05] [V] [TRT] Fastest Tactic: 0x000000000000001f Time: 0.00622688 [06/27/2024-06:27:05] [V] [TRT] >>>>>>>>>>>>>>> 
Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001f [06/27/2024-06:27:05] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_106), Mul_107) (PointWiseV2) [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.1792 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.212699 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.233033 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.247223 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.313929 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.340699 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.314807 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.533797 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.496933 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.701733 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000000a Time: 0.156526 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000000b Time: 0.262144 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000000c Time: 0.17861 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000000d Time: 0.259803 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000000e Time: 0.237861 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000000f Time: 0.198363 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.374491 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.312325 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.326802 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.270629 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.156233 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.178761 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000016 Time: 
0.228791 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.471186 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0228833 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0231131 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0240404 [06/27/2024-06:27:05] [V] [TRT] Fastest Tactic: 0x000000000000001c Time: 0.0228833 [06/27/2024-06:27:05] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001c [06/27/2024-06:27:05] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:05] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: Conv_108 (CudaDepthwiseConvolution) [06/27/2024-06:27:05] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: Conv_108 (FusedConvActConvolution) [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000008ffff Time: 0.0477867 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000009ffff Time: 0.0555554 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x00000000000bffff Time: 0.0343186 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x00000000000cffff Time: 0.0347575 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000011ffff Time: 0.0336165 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000018ffff Time: 0.0340846 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x00000000001bffff Time: 0.0404846 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x00000000001fffff Time: 0.0356645 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x00000000002effff Time: 0.0330889 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000035ffff Time: 0.0720457 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x00000000003cffff Time: 0.0346112 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x00000000003dffff Time: 0.0358766 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000040ffff Time: 0.0333824 
[06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000041ffff Time: 0.0332361 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000045ffff Time: 0.0330606 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000046ffff Time: 0.0331776 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x00000000004affff Time: 0.0344357 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x00000000004effff Time: 0.0370469 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000051ffff Time: 0.0407771 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000055ffff Time: 0.0358107 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000059ffff Time: 0.0336475 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x00000000005effff Time: 0.0356352 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x00000000005fffff Time: 0.0335872 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000061ffff Time: 0.0325339 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000066ffff Time: 0.0407406 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x00000000006affff Time: 0.0746789 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x00000000006bffff Time: 0.0337042 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x00000000006fffff Time: 0.0352841 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000070ffff Time: 0.0340261 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000073ffff Time: 0.0349915 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000075ffff Time: 0.0359863 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x00000000007cffff Time: 0.0370834 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x00000000007dffff Time: 0.0353719 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x00000000007effff Time: 0.038512 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000083ffff Time: 0.0378514 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x00000000008affff Time: 0.03712 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000091ffff Time: 0.0346112 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x00000000009dffff Time: 0.0327387 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000a3ffff Time: 0.0356352 [06/27/2024-06:27:05] [V] [TRT] Tactic: 
0x0000000000a6ffff Time: 0.0329435 [06/27/2024-06:27:05] [V] [TRT] Fastest Tactic: 0x000000000061ffff Time: 0.0325339 [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: Conv_108 (CudnnConvolution) [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0409234 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0526994 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0844069 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000004 Time: 4.28866 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.139776 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.0312466 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.0405577 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000003a Time: 0.0846994 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000003c Time: 3.99594 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x000000000000003d Time: 0.16267 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.031861 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.0344357 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.0843337 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000074 Time: 3.8618 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x0000000000000075 Time: 0.160329 [06/27/2024-06:27:05] [V] [TRT] Fastest Tactic: 0x0000000000000038 Time: 0.0312466 [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: Conv_108 (CublasConvolution) [06/27/2024-06:27:05] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: Conv_108 (CaskConvolution) [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x1fc87d7eb370bb7a [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x1fc87d7eb370bb7a Time: 0.0165465 
[06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x2ee10e11d6651675 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x2ee10e11d6651675 Time: 0.0304567 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.0192366 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.0224862 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1_aligna4_alignc4 Tactic: 0x7f0145cb49517338 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x7f0145cb49517338 Time: 0.0212114 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x865894c4635db7fd Time: 0.0163515 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x8e3884f0eaec3ecd [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x8e3884f0eaec3ecd Time: 0.0169686 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: 0x9808072e706def96 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x9808072e706def96 Time: 0.0214204 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 
0x9cd5cdc35441c505 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x9cd5cdc35441c505 Time: 0.0207105 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9de226a0c44627c4 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x9de226a0c44627c4 Time: 0.0290523 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1_aligna4_alignc4 Tactic: 0xa419b3b68f2da07b [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xa419b3b68f2da07b Time: 0.0206263 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.0226325 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: 0xa8ef60e712f8ad24 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xa8ef60e712f8ad24 Time: 0.0363374 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xc0b05b61d128e46e [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xc0b05b61d128e46e Time: 0.0186697 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xc3cf6e1d1c6aff27 Time: 0.0183589 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xe5603263b7f00303 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xe5603263b7f00303 Time: 0.0174405 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 
0xf067e6205da31c2e [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.0224235 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.0194206 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xfff46c7893896eb1 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xfff46c7893896eb1 Time: 0.0776046 [06/27/2024-06:27:05] [V] [TRT] Fastest Tactic: 0x865894c4635db7fd Time: 0.0163515 [06/27/2024-06:27:05] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x865894c4635db7fd [06/27/2024-06:27:05] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: Conv_108 (CublasConvolution) [06/27/2024-06:27:05] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:05] [V] [TRT] --------------- Timing Runner: Conv_108 (CaskConvolution) [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x1022069e6f8d9aeb [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x1022069e6f8d9aeb Time: 0.0163688 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0156965 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x128_relu_exp_interior_nhwc_tn_v1 Tactic: 0x17173deba0b64484 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x17173deba0b64484 Time: 0.0218175 
[06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.0156233 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x35f26f9c09557d86 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x35f26f9c09557d86 Time: 0.0153746 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x3e191488237fab8f [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.0174243 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.017279 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.0157563 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.0220069 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x55d80c17b1cd982d [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x55d80c17b1cd982d Time: 0.0153746 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x7bc32c782b800c48 Time: 0.020898 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x90898977fc8ce537 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x90898977fc8ce537 Time: 0.0156965 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.0155648 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xae0c89d047932ba3 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xae0c89d047932ba3 Time: 0.0159451 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xbc0bba0ff1a92939 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xbc0bba0ff1a92939 Time: 0.0236147 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xc7b3afceb5fb03c0 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xc7b3afceb5fb03c0 Time: 0.0155941 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xc7feb33970feefa7 [06/27/2024-06:27:05] [V] [TRT] Tactic: 0xc7feb33970feefa7 Time: 0.017117 [06/27/2024-06:27:05] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xd55ee6fd0b56f808 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0xd55ee6fd0b56f808 Time: 0.0147163 [06/27/2024-06:27:06] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 Tactic: 
0xd9031472c05adf51 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.0219226 [06/27/2024-06:27:06] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xe47307053a42b3e4 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0xe47307053a42b3e4 Time: 0.0209398 [06/27/2024-06:27:06] [V] [TRT] Conv_108 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.020898 [06/27/2024-06:27:06] [V] [TRT] Fastest Tactic: 0xd55ee6fd0b56f808 Time: 0.0147163 [06/27/2024-06:27:06] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xd55ee6fd0b56f808 [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: Conv_108 (CublasConvolution) [06/27/2024-06:27:06] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: Conv_108 (CaskConvolution) [06/27/2024-06:27:06] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0158801 [06/27/2024-06:27:06] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.0176564 [06/27/2024-06:27:06] [V] [TRT] Fastest Tactic: 0x130df49cb195156b Time: 0.0158801 [06/27/2024-06:27:06] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x130df49cb195156b [06/27/2024-06:27:06] [V] [TRT] =============== Computing costs for 
[06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_109), Mul_110) (PointWise) [06/27/2024-06:27:06] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_109), Mul_110) (PointWiseV2) [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00625518 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00828952 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0068837 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00927971 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00795759 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0073216 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00879677 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00850151 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00563218 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00921624 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0071889 [06/27/2024-06:27:06] [V] [TRT] Fastest Tactic: 0x0000000000000008 Time: 0.00563218 [06/27/2024-06:27:06] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000008 [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_109), Mul_110) (PointWise) [06/27/2024-06:27:06] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_109), Mul_110) (PointWiseV2) [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00740457 
[06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0143237 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0077926 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00650327 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00862743 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00691026 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00652925 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00661735 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00649662 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00733486 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00667034 [06/27/2024-06:27:06] [V] [TRT] Fastest Tactic: 0x0000000000000008 Time: 0.00649662 [06/27/2024-06:27:06] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000008 [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_109), Mul_110) (PointWise) [06/27/2024-06:27:06] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_109), Mul_110) (PointWiseV2) [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0133382 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0062901 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00733806 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00694509 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00966674 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0068615 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00656291 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000007 Time: 
0.00606572 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00688936 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00975329 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000000a Time: 0.00692218 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000000b Time: 0.0068197 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000000c Time: 0.00716023 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000000d Time: 0.00704958 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000000e Time: 0.00684757 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000000f Time: 0.0086322 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.00802794 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.0094464 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.00677527 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.00607695 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.00616267 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.00641848 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.0067227 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.00898877 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00767591 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000001d Time: 0.00666327 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000001e Time: 0.00608914 [06/27/2024-06:27:06] [V] [TRT] Fastest Tactic: 0x0000000000000007 Time: 0.00606572 [06/27/2024-06:27:06] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000007 [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(6400,1600:32,40,1) -> Float(6400,1600:32,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_109), Mul_110) (PointWise) [06/27/2024-06:27:06] [V] [TRT] PointWise has no valid tactics for this config, skipping 
[06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_109), Mul_110) (PointWiseV2) [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.00818895 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.010449 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000001a Time: 0.00745989 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000001b Time: 0.00779188 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000001f Time: 0.00672229 [06/27/2024-06:27:06] [V] [TRT] Fastest Tactic: 0x000000000000001f Time: 0.00672229 [06/27/2024-06:27:06] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001f [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: Conv_111 (CudaDepthwiseConvolution) [06/27/2024-06:27:06] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: Conv_111 (FusedConvActConvolution) [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000007ffff Time: 0.108105 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000000affff Time: 0.0799451 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000000effff Time: 0.0937691 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000000fffff Time: 0.0842606 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000019ffff Time: 0.0767977 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000001affff Time: 0.098816 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000024ffff Time: 0.0950857 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000027ffff Time: 0.079872 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000002dffff Time: 
0.0974263 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000036ffff Time: 0.0880617 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000004cffff Time: 0.0931109 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000062ffff Time: 0.125369 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000006effff Time: 0.0947931 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000077ffff Time: 0.16267 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000086ffff Time: 0.133047 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000089ffff Time: 0.0841143 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000097ffff Time: 0.104958 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000098ffff Time: 0.108032 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000009fffff Time: 0.0754103 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000a2ffff Time: 0.0781166 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000a4ffff Time: 0.187246 [06/27/2024-06:27:06] [V] [TRT] Fastest Tactic: 0x00000000009fffff Time: 0.0754103 [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: Conv_111 (CudnnConvolution) [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.135826 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.109202 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.296375 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000004 Time: 3.91314 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000005 Time: 1.25074 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0828709 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.135975 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.0910629 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000003a Time: 0.178469 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000003c Time: 3.9304 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000003d Time: 1.33018 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000003e Time: 0.105618 [06/27/2024-06:27:06] [V] [TRT] 
Tactic: 0x0000000000000070 Time: 0.135461 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.135758 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.178469 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000074 Time: 3.81674 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000075 Time: 1.34144 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000076 Time: 0.105179 [06/27/2024-06:27:06] [V] [TRT] Fastest Tactic: 0x0000000000000006 Time: 0.0828709 [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: Conv_111 (CaskConvolution) [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x01cf8ce2da913006 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x01cf8ce2da913006 Time: 0.0811886 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x12dbf7d94ee3696d [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x12dbf7d94ee3696d Time: 0.0906971 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.134875 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x4727434768e46395 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x4727434768e46395 Time: 0.0913623 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x4efce38acc876f5c [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x4efce38acc876f5c Time: 0.311881 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: 
ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.135826 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: ampere_scudnn_128x128_relu_xregs_large_nn_v1 Tactic: 0x5403ad713f811a18 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x5403ad713f811a18 Time: 0.167205 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x5aa723e0481da855 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x5aa723e0481da855 Time: 0.0808274 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: ampere_scudnn_128x64_relu_xregs_large_nn_v1 Tactic: 0x5deb29b7a8e275f7 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x5deb29b7a8e275f7 Time: 0.128293 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: ampere_scudnn_winograd_128x128_ldg1_ldg4_relu_tile148t_nt_v1 Tactic: 0x94119b4c514b211a [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x94119b4c514b211a Time: 0.069437 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0xa31d27de74b895ff [06/27/2024-06:27:06] [V] [TRT] Tactic: 0xa31d27de74b895ff Time: 0.0913554 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.262583 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_aligna4_alignc4 Tactic: 0xbb8c3889c7eacd30 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0xbb8c3889c7eacd30 Time: 0.278382 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0xd828f024626fa982 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0xd828f024626fa982 Time: 0.142921 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:06] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.138021 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.147529 [06/27/2024-06:27:06] [V] [TRT] Fastest Tactic: 0x94119b4c514b211a Time: 0.069437 [06/27/2024-06:27:06] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x94119b4c514b211a [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: Conv_111 (CaskConvolution) [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x19b688348f983aa0 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x19b688348f983aa0 Time: 0.0878446 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x1da91d865428f237 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x1da91d865428f237 Time: 0.0739474 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.0813349 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 
0x3e191488237fab8f [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.0853577 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.0845531 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x3f0c846d6379bc98 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x3f0c846d6379bc98 Time: 0.196023 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.0814834 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.140507 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x62835fce994f06dd [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x62835fce994f06dd Time: 0.0803086 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_large_nhwc_tn_v1 Tactic: 0x634e99502974e4da [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x634e99502974e4da Time: 0.133193 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0x65e41d81f093b482 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x65e41d81f093b482 Time: 0.0928914 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:06] [V] [TRT] Tactic: 
0x7bc32c782b800c48 Time: 0.148919 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x8014228ec08b4d49 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x8014228ec08b4d49 Time: 0.121271 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x94a7db94ba744c45 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x94a7db94ba744c45 Time: 0.079872 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: 0x999e005e3b016ea6 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x999e005e3b016ea6 Time: 0.114615 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0xb443c221fcb1565b [06/27/2024-06:27:06] [V] [TRT] Tactic: 0xb443c221fcb1565b Time: 0.0900389 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_large_nhwc_tn_v1 Tactic: 0xbdfdef6b84f7ccc9 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0xbdfdef6b84f7ccc9 Time: 0.149065 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: ampere_scudnn_128x128_relu_exp_large_nhwc_tn_v1 Tactic: 0xca7eeb8d9143d738 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0xca7eeb8d9143d738 Time: 0.141751 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0xd15dd11d64344e83 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0xd15dd11d64344e83 Time: 0.07424 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 
Tactic: 0xd9031472c05adf51 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.138313 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0xf48db81f02eca9ee [06/27/2024-06:27:06] [V] [TRT] Tactic: 0xf48db81f02eca9ee Time: 0.0730697 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.131145 [06/27/2024-06:27:06] [V] [TRT] Fastest Tactic: 0xf48db81f02eca9ee Time: 0.0730697 [06/27/2024-06:27:06] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xf48db81f02eca9ee [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: Conv_111 (CaskConvolution) [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0x65e41d81f093b482 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x65e41d81f093b482 Time: 0.111911 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: 0x999e005e3b016ea6 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x999e005e3b016ea6 Time: 0.088576 [06/27/2024-06:27:06] [V] [TRT] Conv_111 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0xb443c221fcb1565b [06/27/2024-06:27:06] [V] [TRT] Tactic: 0xb443c221fcb1565b Time: 0.0900389 [06/27/2024-06:27:06] [V] [TRT] Fastest Tactic: 0x999e005e3b016ea6 Time: 0.088576 [06/27/2024-06:27:06] [V] [TRT] 
>>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x999e005e3b016ea6 [06/27/2024-06:27:06] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1), Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(PWN(Sigmoid_112), Mul_113), Add_114) (PointWise) [06/27/2024-06:27:06] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(PWN(Sigmoid_112), Mul_113), Add_114) (PointWiseV2) [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0104176 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0169253 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00992792 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0100937 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00929829 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0104699 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00888067 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00804572 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0146028 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00928857 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00836064 [06/27/2024-06:27:06] [V] [TRT] Fastest Tactic: 0x0000000000000007 Time: 0.00804572 [06/27/2024-06:27:06] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000007 [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128), Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(PWN(Sigmoid_112), Mul_113), Add_114) (PointWise) [06/27/2024-06:27:06] [V] [TRT] 
PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(PWN(Sigmoid_112), Mul_113), Add_114) (PointWiseV2) [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0085276 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00920657 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00968411 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00934429 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00897855 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00930829 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00875133 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0081026 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00905143 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00973349 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00927143 [06/27/2024-06:27:06] [V] [TRT] Fastest Tactic: 0x0000000000000007 Time: 0.0081026 [06/27/2024-06:27:06] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000007 [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32), Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(PWN(Sigmoid_112), Mul_113), Add_114) (PointWise) [06/27/2024-06:27:06] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(PWN(Sigmoid_112), Mul_113), Add_114) (PointWiseV2) [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0103236 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0100358 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0162865 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000003 Time: 
0.0126897 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00839518 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0170342 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00942657 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0108356 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0141232 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0189989 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000000a Time: 0.0133253 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000000b Time: 0.00918829 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000000c Time: 0.009134 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000000d Time: 0.0138838 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000000e Time: 0.00901943 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000000f Time: 0.00896 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.00886373 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.0104176 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.0136882 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.0161402 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.00812699 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.00985966 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.00890649 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.00995779 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000001c Time: 0.009024 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000001d Time: 0.00955733 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000001e Time: 0.00867388 [06/27/2024-06:27:06] [V] [TRT] Fastest Tactic: 0x0000000000000014 Time: 0.00812699 [06/27/2024-06:27:06] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000014 [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format 
combination: Float(6400,1600:32,40,1), Float(6400,1600:32,40,1) -> Float(6400,1600:32,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(PWN(Sigmoid_112), Mul_113), Add_114) (PointWise) [06/27/2024-06:27:06] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(PWN(Sigmoid_112), Mul_113), Add_114) (PointWiseV2) [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.0131882 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.0131111 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000001a Time: 0.00872551 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000001b Time: 0.00882877 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000001f Time: 0.00995718 [06/27/2024-06:27:06] [V] [TRT] Fastest Tactic: 0x000000000000001a Time: 0.00872551 [06/27/2024-06:27:06] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001a [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1), Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(PWN(Sigmoid_112), Mul_113), Add_114) (PointWiseV2) [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.278967 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.53643 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.531163 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.393801 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.666331 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.270043 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.567296 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.779113 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.406674 [06/27/2024-06:27:06] [V] [TRT] Tactic: 
0x0000000000000009 Time: 0.432128 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000000a Time: 0.181394 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000000b Time: 0.244443 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000000c Time: 0.272823 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000000d Time: 0.297399 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000000e Time: 0.369079 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000000f Time: 0.296375 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.40565 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.447927 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.413111 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.527799 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.2048 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.207584 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.321243 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.423205 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000001c Time: 0.032651 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0326217 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000000001e Time: 0.033675 [06/27/2024-06:27:06] [V] [TRT] Fastest Tactic: 0x000000000000001d Time: 0.0326217 [06/27/2024-06:27:06] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001d [06/27/2024-06:27:06] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** 
[06/27/2024-06:27:06] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(6400,1600:32,40,1) -> Float(6400,1600:32,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:06] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1), Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128), Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32), Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(6400,1600:32,40,1), 
Float(6400,1600:32,40,1) -> Float(6400,1600:32,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1), Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:06] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(6400,1600:32,40,1) -> Float(6400,1600:32,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: 
Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:06] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1), Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128), Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32), Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(6400,1600:32,40,1), Float(6400,1600:32,40,1) -> Float(6400,1600:32,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1), Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:06] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(409600,1600,40,1) *************** [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: Conv_129 (CudaDepthwiseConvolution) [06/27/2024-06:27:06] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: Conv_129 (FusedConvActConvolution) [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000008ffff Time: 0.0471783 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000009ffff Time: 0.0700724 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000000bffff Time: 0.0345527 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000000cffff Time: 0.0341138 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000011ffff Time: 0.033675 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000018ffff Time: 0.0338789 [06/27/2024-06:27:06] [V] [TRT] Tactic: 
0x00000000001bffff Time: 0.0399726 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000001fffff Time: 0.0356672 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000002effff Time: 0.032651 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000035ffff Time: 0.0348453 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000003cffff Time: 0.0347575 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000003dffff Time: 0.0353134 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000040ffff Time: 0.0334117 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000041ffff Time: 0.0332654 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000045ffff Time: 0.0325339 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000046ffff Time: 0.0332654 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000004affff Time: 0.0347575 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000004effff Time: 0.0590507 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000051ffff Time: 0.0409966 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000055ffff Time: 0.0353445 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000059ffff Time: 0.0338213 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000005effff Time: 0.0359863 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000005fffff Time: 0.033675 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000061ffff Time: 0.0322999 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000066ffff Time: 0.0412891 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000006affff Time: 0.0357669 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000006bffff Time: 0.0728503 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000006fffff Time: 0.0354889 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000070ffff Time: 0.0338798 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000073ffff Time: 0.0345819 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000075ffff Time: 0.0354597 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000007cffff Time: 0.0369006 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000007dffff Time: 0.0353417 
[06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000007effff Time: 0.037888 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000083ffff Time: 0.037008 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000008affff Time: 0.0596846 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x000000000091ffff Time: 0.0350501 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x00000000009dffff Time: 0.0325047 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000a3ffff Time: 0.0352549 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000a6ffff Time: 0.0329435 [06/27/2024-06:27:06] [V] [TRT] Fastest Tactic: 0x000000000061ffff Time: 0.0322999 [06/27/2024-06:27:06] [V] [TRT] --------------- Timing Runner: Conv_129 (CudnnConvolution) [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0244305 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0334994 [06/27/2024-06:27:06] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.1024 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000004 Time: 4.09615 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.125897 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.0239909 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.0382537 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000003a Time: 0.0710949 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000003c Time: 3.8912 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000003d Time: 0.127285 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.0236565 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.0275261 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.0709973 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000074 Time: 3.93874 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000075 Time: 0.165815 [06/27/2024-06:27:07] [V] [TRT] Fastest Tactic: 0x0000000000000070 Time: 0.0236565 [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: Conv_129 (CublasConvolution) 
[06/27/2024-06:27:07] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: Conv_129 (CaskConvolution) [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x2ee10e11d6651675 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x2ee10e11d6651675 Time: 0.0490434 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.0187977 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.0223817 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1_aligna4_alignc4 Tactic: 0x7f0145cb49517338 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x7f0145cb49517338 Time: 0.0212323 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x865894c4635db7fd Time: 0.0163352 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: 0x9808072e706def96 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x9808072e706def96 Time: 0.0213786 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.0225698 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: 0xa8ef60e712f8ad24 
[06/27/2024-06:27:07] [V] [TRT] Tactic: 0xa8ef60e712f8ad24 Time: 0.0363081 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xc0b05b61d128e46e [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xc0b05b61d128e46e Time: 0.0162702 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xc3cf6e1d1c6aff27 Time: 0.0178469 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xe5603263b7f00303 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xe5603263b7f00303 Time: 0.0185246 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.0223817 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.0193829 [06/27/2024-06:27:07] [V] [TRT] Fastest Tactic: 0xc0b05b61d128e46e Time: 0.0162702 [06/27/2024-06:27:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xc0b05b61d128e46e [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(409600,1,10240,256) *************** [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: Conv_129 (CublasConvolution) [06/27/2024-06:27:07] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: Conv_129 (CaskConvolution) [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x1022069e6f8d9aeb [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x1022069e6f8d9aeb Time: 0.0157111 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0157842 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: ampere_scudnn_128x128_relu_exp_interior_nhwc_tn_v1 Tactic: 0x17173deba0b64484 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x17173deba0b64484 Time: 0.0215458 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.0159599 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x35f26f9c09557d86 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x35f26f9c09557d86 Time: 0.0151872 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x3e191488237fab8f [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.0171967 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.0171149 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.0152722 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: 
ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.0217972 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x55d80c17b1cd982d [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x55d80c17b1cd982d Time: 0.015275 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x7bc32c782b800c48 Time: 0.0205851 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xae0c89d047932ba3 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xae0c89d047932ba3 Time: 0.0158606 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xc7feb33970feefa7 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xc7feb33970feefa7 Time: 0.0313051 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 Tactic: 0xd9031472c05adf51 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.0217966 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xe47307053a42b3e4 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xe47307053a42b3e4 Time: 0.0206054 [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.0206472 [06/27/2024-06:27:07] [V] [TRT] Fastest Tactic: 0x35f26f9c09557d86 Time: 0.0151872 [06/27/2024-06:27:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x35f26f9c09557d86 [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format 
combination: Float(51200,1:4,1280,32) -> Float(102400,1:4,2560,64) *************** [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: Conv_129 (CublasConvolution) [06/27/2024-06:27:07] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: Conv_129 (CaskConvolution) [06/27/2024-06:27:07] [V] [TRT] Conv_129 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0155218 [06/27/2024-06:27:07] [V] [TRT] Fastest Tactic: 0x130df49cb195156b Time: 0.0155218 [06/27/2024-06:27:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x130df49cb195156b [06/27/2024-06:27:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(409600,1600,40,1) *************** [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: BatchNormalization_132 (Scale) [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0136977 [06/27/2024-06:27:07] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0136977 [06/27/2024-06:27:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Scale Tactic: 0x0000000000000000 [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(409600,1,10240,256) *************** [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: BatchNormalization_132 (Scale) [06/27/2024-06:27:07] [V] [TRT] Scale has no valid tactics for this config, skipping [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(102400,1:4,2560,64) *************** [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: BatchNormalization_132 (Scale) 
[06/27/2024-06:27:07] [V] [TRT] Scale has no valid tactics for this config, skipping [06/27/2024-06:27:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(409600,1600,40,1) *************** [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(409600,1,10240,256) *************** [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(102400,1:4,2560,64) *************** [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(12800,1600:32,40,1) -> Float(12800,1600:32,40,1) *************** [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(409600,1600,40,1) *************** [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(409600,1,10240,256) *************** [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(102400,1:4,2560,64) *************** [06/27/2024-06:27:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(819200,1600,40,1) *************** [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_136), Mul_137) (PointWise) [06/27/2024-06:27:07] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_136), Mul_137) (PointWiseV2) [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0140438 
[06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0146034 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0111402 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0147173 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0206282 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0113315 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0143543 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0118607 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0115341 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0108153 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0139237 [06/27/2024-06:27:07] [V] [TRT] Fastest Tactic: 0x0000000000000009 Time: 0.0108153 [06/27/2024-06:27:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000009 [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(819200,1,20480,512) *************** [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_136), Mul_137) (PointWise) [06/27/2024-06:27:07] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_136), Mul_137) (PointWiseV2) [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0167558 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0137213 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0117479 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0150592 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0215667 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0120571 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0156233 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.012509 
[06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.012544 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0117032 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0137509 [06/27/2024-06:27:07] [V] [TRT] Fastest Tactic: 0x0000000000000009 Time: 0.0117032 [06/27/2024-06:27:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000009 [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(204800,1:4,5120,128) *************** [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_136), Mul_137) (PointWise) [06/27/2024-06:27:07] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_136), Mul_137) (PointWiseV2) [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0115453 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0127265 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0116578 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0120564 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0117929 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0128122 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0131923 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0119467 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0124587 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0258377 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000000a Time: 0.0127398 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000000b Time: 0.0121661 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000000c Time: 0.0116132 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000000d Time: 0.0136985 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000000e Time: 0.0123855 
[06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000000f Time: 0.0127269 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.0204251 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.0124709 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.0118154 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.0138331 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.0258933 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.0144465 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.0150967 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.0161407 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0115242 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0117591 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0144091 [06/27/2024-06:27:07] [V] [TRT] Fastest Tactic: 0x000000000000001c Time: 0.0115242 [06/27/2024-06:27:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001c [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(12800,1600:32,40,1) -> Float(25600,1600:32,40,1) *************** [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_136), Mul_137) (PointWise) [06/27/2024-06:27:07] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_136), Mul_137) (PointWiseV2) [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.0112872 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.0114782 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000001a Time: 0.0110615 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000001b Time: 0.0137908 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000001f Time: 0.011501 [06/27/2024-06:27:07] [V] [TRT] Fastest Tactic: 0x000000000000001a Time: 0.0110615 
[06/27/2024-06:27:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001a [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(819200,1600,40,1) -> Float(204800,400,20,1) *************** [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: Conv_138 (CudaDepthwiseConvolution) [06/27/2024-06:27:07] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: Conv_138 (FusedConvActConvolution) [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000006ffff Time: 0.293742 [06/27/2024-06:27:07] [V] [TRT] Fastest Tactic: 0x000000000006ffff Time: 0.293742 [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: Conv_138 (CudnnConvolution) [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.410624 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.233326 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.306322 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000005 Time: 9.43748 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.279845 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.233033 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000003a Time: 0.305591 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000003d Time: 9.31825 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.280283 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.279991 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.372882 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000075 Time: 10.5212 [06/27/2024-06:27:07] [V] [TRT] Fastest Tactic: 0x0000000000000039 Time: 0.233033 [06/27/2024-06:27:07] [V] 
[TRT] --------------- Timing Runner: Conv_138 (CaskConvolution) [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x01cf8ce2da913006 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x01cf8ce2da913006 Time: 0.207872 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x12dbf7d94ee3696d [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x12dbf7d94ee3696d Time: 0.39307 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.309394 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x4727434768e46395 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x4727434768e46395 Time: 0.175543 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x4efce38acc876f5c [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x4efce38acc876f5c Time: 0.531895 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.264485 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: ampere_scudnn_128x128_relu_xregs_large_nn_v1 Tactic: 0x5403ad713f811a18 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x5403ad713f811a18 Time: 0.268434 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 
Tactic: 0x5aa723e0481da855 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x5aa723e0481da855 Time: 0.191195 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: ampere_scudnn_128x64_relu_xregs_large_nn_v1 Tactic: 0x5deb29b7a8e275f7 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x5deb29b7a8e275f7 Time: 0.29813 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0xa31d27de74b895ff [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xa31d27de74b895ff Time: 0.175104 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.353134 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_aligna4_alignc4 Tactic: 0xbb8c3889c7eacd30 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xbb8c3889c7eacd30 Time: 0.536722 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0xd828f024626fa982 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xd828f024626fa982 Time: 0.353426 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.300471 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.37888 [06/27/2024-06:27:07] [V] [TRT] Fastest Tactic: 0xa31d27de74b895ff Time: 0.175104 [06/27/2024-06:27:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xa31d27de74b895ff [06/27/2024-06:27:07] [V] [TRT] *************** 
Autotuning format combination: Float(819200,1,20480,512) -> Float(204800,1,10240,512) *************** [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: Conv_138 (CaskConvolution) [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x19b688348f983aa0 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x19b688348f983aa0 Time: 0.168814 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x1da91d865428f237 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x1da91d865428f237 Time: 0.251026 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.298715 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x3e191488237fab8f [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.337774 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.237275 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x3f0c846d6379bc98 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x3f0c846d6379bc98 Time: 0.263168 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.353134 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: 
ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.282176 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x62835fce994f06dd [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x62835fce994f06dd Time: 0.155502 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_large_nhwc_tn_v1 Tactic: 0x634e99502974e4da [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x634e99502974e4da Time: 0.314514 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0x65e41d81f093b482 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x65e41d81f093b482 Time: 0.179785 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x7bc32c782b800c48 Time: 0.254245 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x8014228ec08b4d49 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x8014228ec08b4d49 Time: 0.2304 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x94a7db94ba744c45 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x94a7db94ba744c45 Time: 0.155643 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: 0x999e005e3b016ea6 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x999e005e3b016ea6 Time: 0.171008 
[06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0xb443c221fcb1565b [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xb443c221fcb1565b Time: 0.245467 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_large_nhwc_tn_v1 Tactic: 0xbdfdef6b84f7ccc9 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xbdfdef6b84f7ccc9 Time: 0.347867 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: ampere_scudnn_128x128_relu_exp_large_nhwc_tn_v1 Tactic: 0xca7eeb8d9143d738 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xca7eeb8d9143d738 Time: 0.283355 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0xd15dd11d64344e83 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xd15dd11d64344e83 Time: 0.141531 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 Tactic: 0xd9031472c05adf51 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.332215 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0xf48db81f02eca9ee [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xf48db81f02eca9ee Time: 0.249417 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.256585 [06/27/2024-06:27:07] [V] [TRT] Fastest Tactic: 0xd15dd11d64344e83 Time: 0.141531 [06/27/2024-06:27:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xd15dd11d64344e83 [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: 
Float(204800,1:4,5120,128) -> Float(51200,1:4,2560,128) *************** [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: Conv_138 (CaskConvolution) [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0x65e41d81f093b482 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x65e41d81f093b482 Time: 0.179785 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: 0x999e005e3b016ea6 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x999e005e3b016ea6 Time: 0.226889 [06/27/2024-06:27:07] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0xb443c221fcb1565b [06/27/2024-06:27:07] [V] [TRT] Tactic: 0xb443c221fcb1565b Time: 0.174373 [06/27/2024-06:27:07] [V] [TRT] Fastest Tactic: 0xb443c221fcb1565b Time: 0.174373 [06/27/2024-06:27:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xb443c221fcb1565b [06/27/2024-06:27:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(204800,400,20,1) -> Float(204800,400,20,1) *************** [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_139), Mul_140) (PointWise) [06/27/2024-06:27:07] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_139), Mul_140) (PointWiseV2) [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00559279 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00662275 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00641749 [06/27/2024-06:27:07] [V] [TRT] Tactic: 
0x0000000000000003 Time: 0.00678234 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00649381 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00668945 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00956135 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00622032 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00687584 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00645565 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00567156 [06/27/2024-06:27:07] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.00559279 [06/27/2024-06:27:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000000 [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(204800,1,10240,512) -> Float(204800,1,10240,512) *************** [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_139), Mul_140) (PointWise) [06/27/2024-06:27:07] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_139), Mul_140) (PointWiseV2) [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00565468 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0068062 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00695924 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00661652 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00785932 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00682888 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0130992 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.01024 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0073216 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00706329 [06/27/2024-06:27:07] [V] [TRT] 
Tactic: 0x000000000000001c Time: 0.00554198 [06/27/2024-06:27:07] [V] [TRT] Fastest Tactic: 0x000000000000001c Time: 0.00554198 [06/27/2024-06:27:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001c [06/27/2024-06:27:07] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,2560,128) -> Float(51200,1:4,2560,128) *************** [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_139), Mul_140) (PointWise) [06/27/2024-06:27:07] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_139), Mul_140) (PointWiseV2) [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0088185 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0062402 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00689633 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00654982 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00635349 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00583424 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00879804 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0065226 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00656291 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00883738 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000000a Time: 0.0066693 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000000b Time: 0.00642365 [06/27/2024-06:27:07] [V] [TRT] Tactic: 0x000000000000000c Time: 0.00957989 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000d Time: 0.00669569 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000e Time: 0.00699385 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000f Time: 0.00648745 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.00557046 [06/27/2024-06:27:08] 
[V] [TRT] Tactic: 0x0000000000000011 Time: 0.00676925 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.00682079 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.00572855 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.00598674 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.00605219 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.00596919 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.0740937 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00743817 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000001d Time: 0.00649642 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000001e Time: 0.00580462 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0x0000000000000010 Time: 0.00557046 [06/27/2024-06:27:08] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000010 [06/27/2024-06:27:08] [V] [TRT] *************** Autotuning format combination: Float(6400,400:32,20,1) -> Float(6400,400:32,20,1) *************** [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_139), Mul_140) (PointWise) [06/27/2024-06:27:08] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_139), Mul_140) (PointWiseV2) [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.00690525 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.00585728 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000001a Time: 0.00668239 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000001b Time: 0.00575854 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000001f Time: 0.00686171 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0x000000000000001b Time: 0.00575854 [06/27/2024-06:27:08] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001b [06/27/2024-06:27:08] [V] [TRT] *************** 
Autotuning format combination: Float(1:4,400,20,1) -> Float(1:4,400,20,1) *************** [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_139), Mul_140) (PointWiseV2) [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.179054 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.291401 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.233033 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.247369 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.37888 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.405943 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.314807 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.467822 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.437979 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.739328 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000a Time: 0.156233 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000b Time: 0.203483 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000c Time: 0.221623 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000d Time: 0.336896 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000e Time: 0.301641 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000f Time: 0.19851 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.374491 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.31232 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.360741 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.270775 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.133851 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.178615 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.228498 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.330606 [06/27/2024-06:27:08] [V] 
[TRT] Tactic: 0x000000000000001c Time: 0.0228212 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0229042 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0241859 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0x000000000000001c Time: 0.0228212 [06/27/2024-06:27:08] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001c [06/27/2024-06:27:08] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:08] [V] [TRT] *************** Autotuning format combination: Float(204800,400,20,1) -> Float(102400,400,20,1) *************** [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: Conv_141 (CudaDepthwiseConvolution) [06/27/2024-06:27:08] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: Conv_141 (FusedConvActConvolution) [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000008ffff Time: 0.11637 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000009ffff Time: 0.10635 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x00000000000bffff Time: 0.0435931 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x00000000000cffff Time: 0.034933 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000011ffff Time: 0.0439234 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000018ffff Time: 0.039168 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x00000000001bffff Time: 0.0465554 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x00000000001fffff Time: 0.0441783 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x00000000002effff Time: 0.0833097 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000035ffff Time: 0.0428983 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x00000000003cffff Time: 0.039424 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x00000000003dffff Time: 0.0351671 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000040ffff Time: 0.0420571 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000041ffff Time: 0.0368286 [06/27/2024-06:27:08] [V] [TRT] Tactic: 
0x000000000045ffff Time: 0.0443246 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000046ffff Time: 0.0749714 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x00000000004affff Time: 0.0465554 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x00000000004effff Time: 0.0452389 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000051ffff Time: 0.0506149 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000055ffff Time: 0.0351689 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000059ffff Time: 0.0422766 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x00000000005effff Time: 0.0418011 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x00000000005fffff Time: 0.044032 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000061ffff Time: 0.0469943 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000066ffff Time: 0.0493943 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x00000000006affff Time: 0.0441783 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x00000000006bffff Time: 0.0415086 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x00000000006fffff Time: 0.0432286 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000070ffff Time: 0.0365006 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000073ffff Time: 0.0653897 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000075ffff Time: 0.0404846 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x00000000007cffff Time: 0.0643657 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x00000000007dffff Time: 0.0456046 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x00000000007effff Time: 0.0468491 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000083ffff Time: 0.0433006 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x00000000008affff Time: 0.0433371 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000091ffff Time: 0.0831634 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x00000000009dffff Time: 0.0412891 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000a3ffff Time: 0.0380709 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000a6ffff Time: 0.048128 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0x00000000000cffff Time: 0.034933 
[06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: Conv_141 (CudnnConvolution) [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0405211 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0416549 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.147602 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000004 Time: 3.96961 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.353426 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.0406309 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.0416183 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000003a Time: 0.214455 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000003c Time: 4.06543 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000003d Time: 0.353426 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.0406309 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.0431543 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.147749 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000074 Time: 3.88564 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000075 Time: 0.379758 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0405211 [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: Conv_141 (CublasConvolution) [06/27/2024-06:27:08] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: Conv_141 (CaskConvolution) [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x1fc87d7eb370bb7a [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x1fc87d7eb370bb7a Time: 0.0269661 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x2ee10e11d6651675 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x2ee10e11d6651675 Time: 0.0763611 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.051395 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.0648046 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1_aligna4_alignc4 Tactic: 0x7f0145cb49517338 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x7f0145cb49517338 Time: 0.0525653 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x865894c4635db7fd Time: 0.0279406 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x8e3884f0eaec3ecd [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x8e3884f0eaec3ecd Time: 0.0434834 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: 0x9808072e706def96 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x9808072e706def96 Time: 0.0620251 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9cd5cdc35441c505 [06/27/2024-06:27:08] [V] [TRT] Tactic: 
0x9cd5cdc35441c505 Time: 0.0393143 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9de226a0c44627c4 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x9de226a0c44627c4 Time: 0.0754171 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1_aligna4_alignc4 Tactic: 0xa419b3b68f2da07b [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xa419b3b68f2da07b Time: 0.0503223 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.066365 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: 0xa8ef60e712f8ad24 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xa8ef60e712f8ad24 Time: 0.0637318 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xc0b05b61d128e46e [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xc0b05b61d128e46e Time: 0.0393874 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xc3cf6e1d1c6aff27 Time: 0.0474697 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xe5603263b7f00303 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xe5603263b7f00303 Time: 0.0441417 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:08] [V] [TRT] Tactic: 
0xf067e6205da31c2e Time: 0.0639756 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.0537844 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xfff46c7893896eb1 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xfff46c7893896eb1 Time: 0.126171 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0x1fc87d7eb370bb7a Time: 0.0269661 [06/27/2024-06:27:08] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x1fc87d7eb370bb7a [06/27/2024-06:27:08] [V] [TRT] *************** Autotuning format combination: Float(204800,1,10240,512) -> Float(102400,1,5120,256) *************** [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: Conv_141 (CublasConvolution) [06/27/2024-06:27:08] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: Conv_141 (CaskConvolution) [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x1022069e6f8d9aeb [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x1022069e6f8d9aeb Time: 0.0366811 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0424594 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: ampere_scudnn_128x128_relu_exp_interior_nhwc_tn_v1 Tactic: 0x17173deba0b64484 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x17173deba0b64484 Time: 0.0656823 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: 
ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.0397143 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x35f26f9c09557d86 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x35f26f9c09557d86 Time: 0.0361691 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x3e191488237fab8f [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.0431543 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.061184 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.0392423 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.0665128 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x55d80c17b1cd982d [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x55d80c17b1cd982d Time: 0.0238949 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x7bc32c782b800c48 Time: 0.0883566 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x90898977fc8ce537 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x90898977fc8ce537 Time: 0.0237401 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.0423863 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xae0c89d047932ba3 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xae0c89d047932ba3 Time: 0.0356352 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xbc0bba0ff1a92939 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xbc0bba0ff1a92939 Time: 0.0603672 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xc7b3afceb5fb03c0 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xc7b3afceb5fb03c0 Time: 0.0361691 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xc7feb33970feefa7 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xc7feb33970feefa7 Time: 0.0421303 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xd55ee6fd0b56f808 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xd55ee6fd0b56f808 Time: 0.0358034 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 Tactic: 
0xd9031472c05adf51 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.0656335 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xe47307053a42b3e4 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xe47307053a42b3e4 Time: 0.084797 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.0621714 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0x90898977fc8ce537 Time: 0.0237401 [06/27/2024-06:27:08] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x90898977fc8ce537 [06/27/2024-06:27:08] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,2560,128) -> Float(25600,1:4,1280,64) *************** [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: Conv_141 (CublasConvolution) [06/27/2024-06:27:08] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: Conv_141 (CaskConvolution) [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0425326 [06/27/2024-06:27:08] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.0423863 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0x9dece0dc37e90462 Time: 0.0423863 [06/27/2024-06:27:08] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:08] [V] [TRT] =============== Computing costs for 
[06/27/2024-06:27:08] [V] [TRT] *************** Autotuning format combination: Float(102400,400,20,1) -> Float(102400,400,20,1) *************** [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_142), Mul_143) (PointWise) [06/27/2024-06:27:08] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_142), Mul_143) (PointWiseV2) [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0068081 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00622748 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0066294 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00672333 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0066161 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00722373 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0122514 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00654296 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00654151 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00677569 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00682623 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0x0000000000000001 Time: 0.00622748 [06/27/2024-06:27:08] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000001 [06/27/2024-06:27:08] [V] [TRT] *************** Autotuning format combination: Float(102400,1,5120,256) -> Float(102400,1,5120,256) *************** [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_142), Mul_143) (PointWise) [06/27/2024-06:27:08] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_142), Mul_143) (PointWiseV2) [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.010336 
[06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00727652 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00612457 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00691069 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00973288 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00712185 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.006656 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00585728 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00633481 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0104278 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0068406 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0x0000000000000007 Time: 0.00585728 [06/27/2024-06:27:08] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000007 [06/27/2024-06:27:08] [V] [TRT] *************** Autotuning format combination: Float(25600,1:4,1280,64) -> Float(25600,1:4,1280,64) *************** [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_142), Mul_143) (PointWise) [06/27/2024-06:27:08] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_142), Mul_143) (PointWiseV2) [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00938681 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00655881 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00913371 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00684931 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00651574 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00642961 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00962743 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000007 Time: 
0.0058635 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00563833 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00670919 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000a Time: 0.00547759 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000b Time: 0.00996743 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000c Time: 0.0056 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000d Time: 0.00618667 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000e Time: 0.00649766 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000f Time: 0.00649704 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.00655065 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.0113653 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.00928889 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.00681948 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.00706351 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.00673849 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.00673621 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.0111399 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00677527 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000001d Time: 0.00668925 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000001e Time: 0.00672956 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0x000000000000000a Time: 0.00547759 [06/27/2024-06:27:08] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000000a [06/27/2024-06:27:08] [V] [TRT] *************** Autotuning format combination: Float(3200,400:32,20,1) -> Float(3200,400:32,20,1) *************** [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_142), Mul_143) (PointWise) [06/27/2024-06:27:08] [V] [TRT] PointWise has no valid tactics for this config, skipping 
[06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_142), Mul_143) (PointWiseV2) [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.0068541 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.00670899 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000001a Time: 0.00627524 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000001b Time: 0.00628989 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000001f Time: 0.00720846 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0x000000000000001a Time: 0.00627524 [06/27/2024-06:27:08] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001a [06/27/2024-06:27:08] [V] [TRT] *************** Autotuning format combination: Float(1:4,400,20,1) -> Float(1:4,400,20,1) *************** [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_142), Mul_143) (PointWiseV2) [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.091136 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.107959 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.150455 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.125294 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.203922 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.172032 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.159013 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.34816 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.220891 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.333536 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000a Time: 0.0797257 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000b Time: 0.165303 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000c Time: 0.0908434 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000d Time: 0.153893 [06/27/2024-06:27:08] [V] [TRT] Tactic: 
0x000000000000000e Time: 0.120539 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000000f Time: 0.100937 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.189001 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.157842 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.143214 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.136923 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.0898926 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.0909897 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.115931 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.244736 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0112647 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000001d Time: 0.011444 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0151753 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0x000000000000001c Time: 0.0112647 [06/27/2024-06:27:08] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001c [06/27/2024-06:27:08] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:08] [V] [TRT] *************** Autotuning format combination: Float(102400,400,20,1) -> Float(409600,400,20,1) *************** [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: MaxPool_144 (TiledPooling) [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760101 Time: 0.162377 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760102 Time: 0.0827246 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760104 Time: 0.0736549 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760107 Time: 0.0364251 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760108 Time: 0.0363886 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000076010f Time: 0.0250636 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760110 Time: 0.0296375 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760114 
Time: 0.0167736 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760201 Time: 0.0827977 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760202 Time: 0.058563 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760204 Time: 0.030752 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760207 Time: 0.0194194 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760208 Time: 0.0196571 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000076020f Time: 0.0167578 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760210 Time: 0.0168061 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760214 Time: 0.013578 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760301 Time: 0.0806034 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760302 Time: 0.0418731 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760304 Time: 0.022528 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760307 Time: 0.0147456 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760308 Time: 0.0303104 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000076030f Time: 0.0167583 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760310 Time: 0.0166278 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760314 Time: 0.010846 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760401 Time: 0.0586118 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760402 Time: 0.0308078 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760404 Time: 0.0170667 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760407 Time: 0.0115003 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760408 Time: 0.0114666 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000076040f Time: 0.0130203 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760410 Time: 0.0129592 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760414 Time: 0.00997699 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760501 Time: 0.0474331 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760502 Time: 0.0253318 [06/27/2024-06:27:08] [V] [TRT] 
Tactic: 0x0000000000760504 Time: 0.0171347 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760507 Time: 0.0140172 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760508 Time: 0.0203998 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000076050f Time: 0.0150254 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760510 Time: 0.0152581 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760514 Time: 0.00950857 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760601 Time: 0.0475417 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760602 Time: 0.0253562 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760604 Time: 0.0194072 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760607 Time: 0.0140434 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760608 Time: 0.0133519 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000076060f Time: 0.0141631 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760610 Time: 0.0152635 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760614 Time: 0.00988891 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760701 Time: 0.0364251 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760702 Time: 0.019384 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760704 Time: 0.0114447 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760707 Time: 0.0106067 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760708 Time: 0.0119467 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000076070f Time: 0.012544 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760710 Time: 0.0217339 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760714 Time: 0.00967467 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760801 Time: 0.0364617 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760802 Time: 0.0195109 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760804 Time: 0.0114229 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760807 Time: 0.0119345 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760808 Time: 0.0119589 
[06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000076080f Time: 0.0143063 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760810 Time: 0.0144384 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760814 Time: 0.00965486 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760901 Time: 0.0352549 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760902 Time: 0.0194194 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760904 Time: 0.0153015 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760907 Time: 0.011971 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760908 Time: 0.0148626 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x000000000076090f Time: 0.0152142 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760910 Time: 0.0155803 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760914 Time: 0.0480305 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760a01 Time: 0.0249653 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760a02 Time: 0.0160398 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760a04 Time: 0.0114219 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760a07 Time: 0.0106472 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760a08 Time: 0.0114328 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760a0f Time: 0.0117704 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760a10 Time: 0.0121661 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760a14 Time: 0.00949029 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760b01 Time: 0.0414354 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760b02 Time: 0.0144823 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760b04 Time: 0.0114666 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760b07 Time: 0.0113769 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760b08 Time: 0.0114215 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760b0f Time: 0.0128126 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760b10 Time: 0.0143365 [06/27/2024-06:27:08] [V] [TRT] Tactic: 
0x0000000000760b14 Time: 0.0096451 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760c01 Time: 0.0249905 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760c02 Time: 0.0142828 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760c04 Time: 0.0114328 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760c07 Time: 0.0113315 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760c08 Time: 0.0113765 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760c0f Time: 0.0134587 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760c10 Time: 0.013445 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x0000000000760c14 Time: 0.01024 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0x0000000000760a14 Time: 0.00949029 [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: MaxPool_144 (CudnnPooling) [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xffffffffffffffff Time: 0.0126903 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0xffffffffffffffff Time: 0.0126903 [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: MaxPool_144 (CaskPooling) [06/27/2024-06:27:08] [V] [TRT] MaxPool_144 Set Tactic Name: sm50_xmma_pooling_fw_4d_FP32FP32NCHW_Max Tactic: 0xb59f9cfb90407c92 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xb59f9cfb90407c92 Time: 0.0152795 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0xb59f9cfb90407c92 Time: 0.0152795 [06/27/2024-06:27:08] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: TiledPooling Tactic: 0x0000000000760a14 [06/27/2024-06:27:08] [V] [TRT] *************** Autotuning format combination: Float(25600,1:4,1280,64) -> Float(102400,1:4,5120,256) *************** [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: MaxPool_144 (CaskPooling) [06/27/2024-06:27:08] [V] [TRT] MaxPool_144 Set Tactic Name: sm50_xmma_pooling_fw_4d_FP32FP32NHWC_Max_CAlign4 Tactic: 0x22fb1bb4a70e340d [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x22fb1bb4a70e340d Time: 0.0148187 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0x22fb1bb4a70e340d Time: 0.0148187 
[06/27/2024-06:27:08] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskPooling Tactic: 0x22fb1bb4a70e340d [06/27/2024-06:27:08] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:08] [V] [TRT] *************** Autotuning format combination: Float(102400,400,20,1) -> Float(409600,400,20,1) *************** [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: MaxPool_145 (TiledPooling) [06/27/2024-06:27:08] [V] [TRT] TiledPooling has no valid tactics for this config, skipping [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: MaxPool_145 (CudnnPooling) [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xffffffffffffffff Time: 0.0279893 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0xffffffffffffffff Time: 0.0279893 [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: MaxPool_145 (CaskPooling) [06/27/2024-06:27:08] [V] [TRT] MaxPool_145 Set Tactic Name: sm50_xmma_pooling_fw_4d_FP32FP32NCHW_Max Tactic: 0xb59f9cfb90407c92 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xb59f9cfb90407c92 Time: 0.0284533 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0xb59f9cfb90407c92 Time: 0.0284533 [06/27/2024-06:27:08] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CudnnPooling Tactic: 0xffffffffffffffff [06/27/2024-06:27:08] [V] [TRT] *************** Autotuning format combination: Float(25600,1:4,1280,64) -> Float(102400,1:4,5120,256) *************** [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: MaxPool_145 (CaskPooling) [06/27/2024-06:27:08] [V] [TRT] MaxPool_145 Set Tactic Name: sm50_xmma_pooling_fw_4d_FP32FP32NHWC_Max_CAlign4 Tactic: 0x22fb1bb4a70e340d [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x22fb1bb4a70e340d Time: 0.0469577 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0x22fb1bb4a70e340d Time: 0.0469577 [06/27/2024-06:27:08] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskPooling Tactic: 0x22fb1bb4a70e340d [06/27/2024-06:27:08] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:08] [V] [TRT] *************** Autotuning 
format combination: Float(102400,400,20,1) -> Float(409600,400,20,1) *************** [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: MaxPool_146 (TiledPooling) [06/27/2024-06:27:08] [V] [TRT] TiledPooling has no valid tactics for this config, skipping [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: MaxPool_146 (CudnnPooling) [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xffffffffffffffff Time: 0.0497341 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0xffffffffffffffff Time: 0.0497341 [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: MaxPool_146 (CaskPooling) [06/27/2024-06:27:08] [V] [TRT] MaxPool_146 Set Tactic Name: sm50_xmma_pooling_fw_4d_FP32FP32NCHW_Max Tactic: 0xb59f9cfb90407c92 [06/27/2024-06:27:08] [V] [TRT] Tactic: 0xb59f9cfb90407c92 Time: 0.0503223 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0xb59f9cfb90407c92 Time: 0.0503223 [06/27/2024-06:27:08] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CudnnPooling Tactic: 0xffffffffffffffff [06/27/2024-06:27:08] [V] [TRT] *************** Autotuning format combination: Float(25600,1:4,1280,64) -> Float(102400,1:4,5120,256) *************** [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: MaxPool_146 (CaskPooling) [06/27/2024-06:27:08] [V] [TRT] MaxPool_146 Set Tactic Name: sm50_xmma_pooling_fw_4d_FP32FP32NHWC_Max_CAlign4 Tactic: 0x22fb1bb4a70e340d [06/27/2024-06:27:08] [V] [TRT] Tactic: 0x22fb1bb4a70e340d Time: 0.0841874 [06/27/2024-06:27:08] [V] [TRT] Fastest Tactic: 0x22fb1bb4a70e340d Time: 0.0841874 [06/27/2024-06:27:08] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskPooling Tactic: 0x22fb1bb4a70e340d [06/27/2024-06:27:08] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:08] [V] [TRT] *************** Autotuning format combination: Float(409600,400,20,1) -> Float(204800,400,20,1) *************** [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: Conv_148 (CudaDepthwiseConvolution) [06/27/2024-06:27:08] [V] [TRT] 
CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:08] [V] [TRT] --------------- Timing Runner: Conv_148 (FusedConvActConvolution) [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000008ffff Time: 0.398043 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000009ffff Time: 0.244736 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000000bffff Time: 0.126025 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000000cffff Time: 0.143799 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000011ffff Time: 0.183881 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000018ffff Time: 0.134875 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000001bffff Time: 0.18549 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000001fffff Time: 0.148919 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000002effff Time: 0.133925 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000035ffff Time: 0.149065 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000003cffff Time: 0.128073 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000003dffff Time: 0.144969 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000040ffff Time: 0.164937 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000041ffff Time: 0.150967 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000045ffff Time: 0.147749 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000046ffff Time: 0.125367 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000004affff Time: 0.14965 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000004effff Time: 0.14848 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000051ffff Time: 0.184905 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000055ffff Time: 0.14453 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000059ffff Time: 0.129321 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000005effff Time: 0.160329 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000005fffff Time: 0.163109 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000061ffff Time: 0.140507 [06/27/2024-06:27:09] [V] [TRT] Tactic: 
0x000000000066ffff Time: 0.183003 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000006affff Time: 0.147323 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000006bffff Time: 0.150382 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000006fffff Time: 0.198656 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000070ffff Time: 0.134656 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000073ffff Time: 0.170277 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000075ffff Time: 0.160183 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000007cffff Time: 0.237422 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000007dffff Time: 0.210066 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000007effff Time: 0.170423 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000083ffff Time: 0.164864 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000008affff Time: 0.143653 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000091ffff Time: 0.150382 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000009dffff Time: 0.155136 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x0000000000a3ffff Time: 0.156233 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x0000000000a6ffff Time: 0.143799 [06/27/2024-06:27:09] [V] [TRT] Fastest Tactic: 0x000000000046ffff Time: 0.125367 [06/27/2024-06:27:09] [V] [TRT] --------------- Timing Runner: Conv_148 (CudnnConvolution) [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.126025 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0507124 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.299008 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x0000000000000004 Time: 15.3495 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x0000000000000005 Time: 1.05165 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.126025 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.0502735 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000000003a Time: 0.231424 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000000003c Time: 15.3063 
[06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000000003d Time: 1.14132 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.126025 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.121051 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.232009 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x0000000000000074 Time: 16.3193 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x0000000000000075 Time: 1.06584 [06/27/2024-06:27:09] [V] [TRT] Fastest Tactic: 0x0000000000000039 Time: 0.0502735 [06/27/2024-06:27:09] [V] [TRT] --------------- Timing Runner: Conv_148 (CublasConvolution) [06/27/2024-06:27:09] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:09] [V] [TRT] --------------- Timing Runner: Conv_148 (CaskConvolution) [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x1fc87d7eb370bb7a [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x1fc87d7eb370bb7a Time: 0.082432 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x2ee10e11d6651675 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x2ee10e11d6651675 Time: 0.1408 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.217527 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.122441 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1_aligna4_alignc4 Tactic: 
0x7f0145cb49517338 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x7f0145cb49517338 Time: 0.0963291 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x865894c4635db7fd Time: 0.081408 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x8e3884f0eaec3ecd [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x8e3884f0eaec3ecd Time: 0.219575 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: 0x9808072e706def96 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x9808072e706def96 Time: 0.200265 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9cd5cdc35441c505 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x9cd5cdc35441c505 Time: 0.0739474 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9de226a0c44627c4 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x9de226a0c44627c4 Time: 0.141312 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1_aligna4_alignc4 Tactic: 0xa419b3b68f2da07b [06/27/2024-06:27:09] [V] [TRT] Tactic: 0xa419b3b68f2da07b Time: 0.129097 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.170569 
[06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: 0xa8ef60e712f8ad24 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0xa8ef60e712f8ad24 Time: 0.175397 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xc0b05b61d128e46e [06/27/2024-06:27:09] [V] [TRT] Tactic: 0xc0b05b61d128e46e Time: 0.0733623 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0xc3cf6e1d1c6aff27 Time: 0.197778 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xe5603263b7f00303 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0xe5603263b7f00303 Time: 0.148626 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:09] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.120832 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.147163 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xfff46c7893896eb1 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0xfff46c7893896eb1 Time: 0.313929 [06/27/2024-06:27:09] [V] [TRT] Fastest Tactic: 0xc0b05b61d128e46e Time: 0.0733623 [06/27/2024-06:27:09] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CudnnConvolution Tactic: 0x0000000000000039 [06/27/2024-06:27:09] [V] [TRT] *************** Autotuning format combination: Float(409600,1,20480,1024) -> 
Float(204800,1,10240,512) *************** [06/27/2024-06:27:09] [V] [TRT] --------------- Timing Runner: Conv_148 (CublasConvolution) [06/27/2024-06:27:09] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:09] [V] [TRT] --------------- Timing Runner: Conv_148 (CaskConvolution) [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x1022069e6f8d9aeb [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x1022069e6f8d9aeb Time: 0.068413 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0797257 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x128_relu_exp_interior_nhwc_tn_v1 Tactic: 0x17173deba0b64484 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x17173deba0b64484 Time: 0.190757 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.13941 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x35f26f9c09557d86 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x35f26f9c09557d86 Time: 0.116078 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x3e191488237fab8f [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.113225 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d 
[06/27/2024-06:27:09] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.155941 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.138679 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.131438 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x55d80c17b1cd982d [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x55d80c17b1cd982d Time: 0.0735086 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x7bc32c782b800c48 Time: 0.136558 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x90898977fc8ce537 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x90898977fc8ce537 Time: 0.0732183 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.0796526 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xae0c89d047932ba3 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0xae0c89d047932ba3 Time: 0.122661 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xbc0bba0ff1a92939 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0xbc0bba0ff1a92939 Time: 0.114544 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xc7b3afceb5fb03c0 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0xc7b3afceb5fb03c0 Time: 0.0679741 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xc7feb33970feefa7 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0xc7feb33970feefa7 Time: 0.110153 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xd55ee6fd0b56f808 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0xd55ee6fd0b56f808 Time: 0.115931 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 Tactic: 0xd9031472c05adf51 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.129243 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xe47307053a42b3e4 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0xe47307053a42b3e4 Time: 0.132535 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.119442 [06/27/2024-06:27:09] [V] [TRT] Fastest Tactic: 0xc7b3afceb5fb03c0 Time: 0.0679741 [06/27/2024-06:27:09] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xc7b3afceb5fb03c0 [06/27/2024-06:27:09] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,5120,256) -> 
Float(51200,1:4,2560,128) *************** [06/27/2024-06:27:09] [V] [TRT] --------------- Timing Runner: Conv_148 (CublasConvolution) [06/27/2024-06:27:09] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:09] [V] [TRT] --------------- Timing Runner: Conv_148 (CaskConvolution) [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0797989 [06/27/2024-06:27:09] [V] [TRT] Conv_148 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.0795063 [06/27/2024-06:27:09] [V] [TRT] Fastest Tactic: 0x9dece0dc37e90462 Time: 0.0795063 [06/27/2024-06:27:09] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:09] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:09] [V] [TRT] *************** Autotuning format combination: Float(204800,400,20,1) -> Float(204800,400,20,1) *************** [06/27/2024-06:27:09] [V] [TRT] *************** Autotuning format combination: Float(204800,1,10240,512) -> Float(204800,1,10240,512) *************** [06/27/2024-06:27:09] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,2560,128) -> Float(51200,1:4,2560,128) *************** [06/27/2024-06:27:09] [V] [TRT] *************** Autotuning format combination: Float(6400,400:32,20,1) -> Float(6400,400:32,20,1) *************** [06/27/2024-06:27:09] [V] [TRT] *************** Autotuning format combination: Float(1:4,400,20,1) -> Float(1:4,400,20,1) *************** [06/27/2024-06:27:09] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:09] [V] [TRT] 
*************** Autotuning format combination: Float(204800,400,20,1) -> Float(204800,400,20,1) *************** [06/27/2024-06:27:09] [V] [TRT] --------------- Timing Runner: Conv_151 || Conv_161 (CudaDepthwiseConvolution) [06/27/2024-06:27:09] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:09] [V] [TRT] --------------- Timing Runner: Conv_151 || Conv_161 (FusedConvActConvolution) [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000008ffff Time: 0.210944 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000009ffff Time: 0.198437 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000000bffff Time: 0.0730034 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000000cffff Time: 0.0776046 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000011ffff Time: 0.0799451 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000018ffff Time: 0.0732891 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000001bffff Time: 0.102107 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000001fffff Time: 0.156672 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000002effff Time: 0.07936 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000035ffff Time: 0.0827246 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000003cffff Time: 0.0729943 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000003dffff Time: 0.0781166 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000040ffff Time: 0.0748983 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000041ffff Time: 0.156379 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000045ffff Time: 0.0820663 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000046ffff Time: 0.0721189 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000004affff Time: 0.0792846 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000004effff Time: 0.0820663 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000051ffff Time: 0.100937 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000055ffff Time: 0.080384 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000059ffff Time: 
0.0699733 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000005effff Time: 0.0809691 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000005fffff Time: 0.0799451 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000061ffff Time: 0.0814103 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000066ffff Time: 0.0983771 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000006affff Time: 0.082432 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000006bffff Time: 0.078336 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000006fffff Time: 0.0731429 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000070ffff Time: 0.0735063 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000073ffff Time: 0.0950857 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x000000000075ffff Time: 0.0863817 [06/27/2024-06:27:09] [V] [TRT] Tactic: 0x00000000007cffff Time: 0.121051 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000007dffff Time: 0.0782629 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000007effff Time: 0.0925989 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000083ffff Time: 0.0888686 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000008affff Time: 0.0792869 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000091ffff Time: 0.0794331 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000009dffff Time: 0.0768 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000a3ffff Time: 0.0999131 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000a6ffff Time: 0.0826514 [06/27/2024-06:27:10] [V] [TRT] Fastest Tactic: 0x000000000059ffff Time: 0.0699733 [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: Conv_151 || Conv_161 (CudnnConvolution) [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.094208 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0402651 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.174665 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000004 Time: 7.86417 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.545499 
[06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.069437 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.0400457 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000003a Time: 0.174958 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000003c Time: 7.8984 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000003d Time: 0.545938 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.0695832 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.0683642 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.174665 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000074 Time: 7.84472 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000075 Time: 0.547255 [06/27/2024-06:27:10] [V] [TRT] Fastest Tactic: 0x0000000000000039 Time: 0.0400457 [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: Conv_151 || Conv_161 (CublasConvolution) [06/27/2024-06:27:10] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: Conv_151 || Conv_161 (CaskConvolution) [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x1fc87d7eb370bb7a [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x1fc87d7eb370bb7a Time: 0.0450926 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x2ee10e11d6651675 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x2ee10e11d6651675 Time: 0.0981577 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.0758491 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || 
Conv_161 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.065731 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1_aligna4_alignc4 Tactic: 0x7f0145cb49517338 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x7f0145cb49517338 Time: 0.0549547 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x865894c4635db7fd Time: 0.0447269 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x8e3884f0eaec3ecd [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x8e3884f0eaec3ecd Time: 0.0791406 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: 0x9808072e706def96 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x9808072e706def96 Time: 0.0863086 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9cd5cdc35441c505 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x9cd5cdc35441c505 Time: 0.0413257 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9de226a0c44627c4 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x9de226a0c44627c4 Time: 0.0799451 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1_aligna4_alignc4 Tactic: 0xa419b3b68f2da07b [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xa419b3b68f2da07b Time: 0.0513493 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.159305 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: 0xa8ef60e712f8ad24 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xa8ef60e712f8ad24 Time: 0.0648533 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xc0b05b61d128e46e [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xc0b05b61d128e46e Time: 0.0412526 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xc3cf6e1d1c6aff27 Time: 0.0681204 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xe5603263b7f00303 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xe5603263b7f00303 Time: 0.0792137 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.100425 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.0782629 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xfff46c7893896eb1 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xfff46c7893896eb1 Time: 0.156891 [06/27/2024-06:27:10] [V] [TRT] Fastest Tactic: 0xc0b05b61d128e46e Time: 0.0412526 [06/27/2024-06:27:10] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CudnnConvolution Tactic: 0x0000000000000039 [06/27/2024-06:27:10] [V] [TRT] *************** Autotuning format combination: Float(204800,1,10240,512) -> Float(204800,1,10240,512) *************** [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: Conv_151 || Conv_161 (CublasConvolution) [06/27/2024-06:27:10] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: Conv_151 || Conv_161 (CaskConvolution) [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x1022069e6f8d9aeb [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x1022069e6f8d9aeb Time: 0.0748251 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0434834 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: ampere_scudnn_128x128_relu_exp_interior_nhwc_tn_v1 Tactic: 0x17173deba0b64484 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x17173deba0b64484 Time: 0.0685592 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.0739474 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 
Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x35f26f9c09557d86 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x35f26f9c09557d86 Time: 0.0822126 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x3e191488237fab8f [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.0613425 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.093696 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.0735086 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.0695345 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x55d80c17b1cd982d [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x55d80c17b1cd982d Time: 0.0405577 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x7bc32c782b800c48 Time: 0.0631467 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x90898977fc8ce537 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x90898977fc8ce537 Time: 
0.0401189 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.0791406 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xae0c89d047932ba3 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xae0c89d047932ba3 Time: 0.0656823 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xbc0bba0ff1a92939 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xbc0bba0ff1a92939 Time: 0.0823101 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xc7b3afceb5fb03c0 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xc7b3afceb5fb03c0 Time: 0.0380011 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xc7feb33970feefa7 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xc7feb33970feefa7 Time: 0.0597333 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xd55ee6fd0b56f808 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xd55ee6fd0b56f808 Time: 0.0621714 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 Tactic: 0xd9031472c05adf51 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.101522 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set 
Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xe47307053a42b3e4 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xe47307053a42b3e4 Time: 0.0631467 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.0795794 [06/27/2024-06:27:10] [V] [TRT] Fastest Tactic: 0xc7b3afceb5fb03c0 Time: 0.0380011 [06/27/2024-06:27:10] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xc7b3afceb5fb03c0 [06/27/2024-06:27:10] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,2560,128) -> Float(51200,1:4,2560,128) *************** [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: Conv_151 || Conv_161 (CublasConvolution) [06/27/2024-06:27:10] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: Conv_151 || Conv_161 (CaskConvolution) [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0434103 [06/27/2024-06:27:10] [V] [TRT] Conv_151 || Conv_161 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.043264 [06/27/2024-06:27:10] [V] [TRT] Fastest Tactic: 0x9dece0dc37e90462 Time: 0.043264 [06/27/2024-06:27:10] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:10] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:10] [V] [TRT] *************** Autotuning format combination: 
Float(204800,400,20,1) -> Float(102400,400,20,1) *************** [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_152), Mul_153) (PointWise) [06/27/2024-06:27:10] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_152), Mul_153) (PointWiseV2) [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00702868 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00665538 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00657642 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00652862 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00644929 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00633262 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00673787 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00680229 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00625193 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00663543 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0101474 [06/27/2024-06:27:10] [V] [TRT] Fastest Tactic: 0x0000000000000008 Time: 0.00625193 [06/27/2024-06:27:10] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000008 [06/27/2024-06:27:10] [V] [TRT] *************** Autotuning format combination: Float(204800,1,10240,512) -> Float(102400,1,5120,256) *************** [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_152), Mul_153) (PointWise) [06/27/2024-06:27:10] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_152), Mul_153) (PointWiseV2) [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00706373 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00653049 
[06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00666265 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00622533 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00644313 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00597219 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00682689 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0066826 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00663564 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00637317 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0068062 [06/27/2024-06:27:10] [V] [TRT] Fastest Tactic: 0x0000000000000005 Time: 0.00597219 [06/27/2024-06:27:10] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000005 [06/27/2024-06:27:10] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,2560,128) -> Float(25600,1:4,1280,64) *************** [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_152), Mul_153) (PointWise) [06/27/2024-06:27:10] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_152), Mul_153) (PointWiseV2) [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00803759 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00696599 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0063678 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00658971 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00660925 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00693834 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0065494 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00690329 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000008 Time: 
0.00678899 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00629029 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000000a Time: 0.00630937 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000000b Time: 0.00654047 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000000c Time: 0.00648109 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000000d Time: 0.00685453 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000000e Time: 0.00682732 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000000f Time: 0.00658868 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.00778442 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.00761456 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.00662234 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.00660883 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.00663605 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.00664249 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.0109699 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.00631513 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00651636 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000001d Time: 0.00646201 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000001e Time: 0.00687543 [06/27/2024-06:27:10] [V] [TRT] Fastest Tactic: 0x0000000000000009 Time: 0.00629029 [06/27/2024-06:27:10] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000009 [06/27/2024-06:27:10] [V] [TRT] *************** Autotuning format combination: Float(6400,400:32,20,1) -> Float(3200,400:32,20,1) *************** [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_152), Mul_153) (PointWise) [06/27/2024-06:27:10] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_152), 
Mul_153) (PointWiseV2) [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.0078299 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.0120076 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000001a Time: 0.00796174 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000001b Time: 0.00608914 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000001f Time: 0.00704958 [06/27/2024-06:27:10] [V] [TRT] Fastest Tactic: 0x000000000000001b Time: 0.00608914 [06/27/2024-06:27:10] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001b [06/27/2024-06:27:10] [V] [TRT] *************** Autotuning format combination: Float(1:4,400,20,1) -> Float(1:4,400,20,1) *************** [06/27/2024-06:27:10] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:10] [V] [TRT] *************** Autotuning format combination: Float(102400,400,20,1) -> Float(102400,400,20,1) *************** [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: Conv_154 (CudaDepthwiseConvolution) [06/27/2024-06:27:10] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: Conv_154 (FusedConvActConvolution) [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000008ffff Time: 0.0823589 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000009ffff Time: 0.0441417 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000000bffff Time: 0.0281112 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000000cffff Time: 0.0239177 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000011ffff Time: 0.0278187 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000018ffff Time: 0.0257707 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000001bffff Time: 0.0297545 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000001fffff Time: 0.0284038 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000002effff Time: 0.0292571 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000035ffff Time: 0.0531992 
[06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000003cffff Time: 0.026112 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000003dffff Time: 0.0238446 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000040ffff Time: 0.0265265 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000041ffff Time: 0.0249189 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000045ffff Time: 0.0282568 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000046ffff Time: 0.0283063 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000004affff Time: 0.028789 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000004effff Time: 0.0286427 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000051ffff Time: 0.0308955 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000055ffff Time: 0.0238202 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000059ffff Time: 0.028355 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000005effff Time: 0.0271848 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000005fffff Time: 0.0278918 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000061ffff Time: 0.0303113 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000066ffff Time: 0.0299886 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000006affff Time: 0.0285501 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000006bffff Time: 0.0270141 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000006fffff Time: 0.0286135 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000070ffff Time: 0.0243817 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000073ffff Time: 0.0295205 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000075ffff Time: 0.0599771 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000007cffff Time: 0.0359131 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000007dffff Time: 0.0288768 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000007effff Time: 0.0297838 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000083ffff Time: 0.0279398 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000008affff Time: 0.0284282 [06/27/2024-06:27:10] [V] [TRT] Tactic: 
0x000000000091ffff Time: 0.0285989 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000009dffff Time: 0.0267947 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000a3ffff Time: 0.025917 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000a6ffff Time: 0.0304274 [06/27/2024-06:27:10] [V] [TRT] Fastest Tactic: 0x000000000055ffff Time: 0.0238202 [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: Conv_154 (CudnnConvolution) [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.030837 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.052224 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.089824 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000004 Time: 2.04537 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.212114 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.0304567 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.0416549 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000003a Time: 0.0899657 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000003c Time: 2.05619 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000000003d Time: 0.214894 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.0311589 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.0373029 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.0898926 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000074 Time: 2.06789 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000075 Time: 0.209627 [06/27/2024-06:27:10] [V] [TRT] Fastest Tactic: 0x0000000000000038 Time: 0.0304567 [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: Conv_154 (CublasConvolution) [06/27/2024-06:27:10] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: Conv_154 (CaskConvolution) [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x1fc87d7eb370bb7a [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x1fc87d7eb370bb7a Time: 0.0163677 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x2ee10e11d6651675 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x2ee10e11d6651675 Time: 0.0456046 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.0293742 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.0364983 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1_aligna4_alignc4 Tactic: 0x7f0145cb49517338 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x7f0145cb49517338 Time: 0.0318025 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x865894c4635db7fd Time: 0.0167904 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x8e3884f0eaec3ecd [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x8e3884f0eaec3ecd Time: 0.0256 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: 0x9808072e706def96 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x9808072e706def96 
Time: 0.0346405 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9cd5cdc35441c505 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x9cd5cdc35441c505 Time: 0.0234057 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9de226a0c44627c4 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x9de226a0c44627c4 Time: 0.0442514 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1_aligna4_alignc4 Tactic: 0xa419b3b68f2da07b [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xa419b3b68f2da07b Time: 0.0302519 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.0368274 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: 0xa8ef60e712f8ad24 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xa8ef60e712f8ad24 Time: 0.0359886 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xc0b05b61d128e46e [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xc0b05b61d128e46e Time: 0.0236774 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xc3cf6e1d1c6aff27 Time: 0.0276968 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xe5603263b7f00303 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xe5603263b7f00303 Time: 0.0260632 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.0361326 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.0307493 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xfff46c7893896eb1 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xfff46c7893896eb1 Time: 0.0805547 [06/27/2024-06:27:10] [V] [TRT] Fastest Tactic: 0x1fc87d7eb370bb7a Time: 0.0163677 [06/27/2024-06:27:10] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x1fc87d7eb370bb7a [06/27/2024-06:27:10] [V] [TRT] *************** Autotuning format combination: Float(102400,1,5120,256) -> Float(102400,1,5120,256) *************** [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: Conv_154 (CublasConvolution) [06/27/2024-06:27:10] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: Conv_154 (CaskConvolution) [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x1022069e6f8d9aeb [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x1022069e6f8d9aeb Time: 0.0218168 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0245516 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x128_relu_exp_interior_nhwc_tn_v1 Tactic: 0x17173deba0b64484 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x17173deba0b64484 Time: 0.0361691 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.0234266 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x35f26f9c09557d86 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x35f26f9c09557d86 Time: 0.021922 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x3e191488237fab8f [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.0257707 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.0254057 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.0231145 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.0367177 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x55d80c17b1cd982d [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x55d80c17b1cd982d Time: 0.0149074 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x7bc32c782b800c48 Time: 0.0341723 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x90898977fc8ce537 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x90898977fc8ce537 Time: 0.0148187 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.0244785 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xae0c89d047932ba3 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xae0c89d047932ba3 Time: 0.0212532 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xbc0bba0ff1a92939 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xbc0bba0ff1a92939 Time: 0.0359497 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xc7b3afceb5fb03c0 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xc7b3afceb5fb03c0 Time: 0.0214204 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 
0xc7feb33970feefa7 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xc7feb33970feefa7 Time: 0.025088 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xd55ee6fd0b56f808 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xd55ee6fd0b56f808 Time: 0.0216085 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 Tactic: 0xd9031472c05adf51 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.0580754 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xe47307053a42b3e4 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xe47307053a42b3e4 Time: 0.0341723 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.0345527 [06/27/2024-06:27:10] [V] [TRT] Fastest Tactic: 0x90898977fc8ce537 Time: 0.0148187 [06/27/2024-06:27:10] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x90898977fc8ce537 [06/27/2024-06:27:10] [V] [TRT] *************** Autotuning format combination: Float(25600,1:4,1280,64) -> Float(25600,1:4,1280,64) *************** [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: Conv_154 (CublasConvolution) [06/27/2024-06:27:10] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: Conv_154 (CaskConvolution) [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.024576 [06/27/2024-06:27:10] [V] [TRT] Conv_154 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.024381 [06/27/2024-06:27:10] [V] [TRT] Fastest Tactic: 0x9dece0dc37e90462 Time: 0.024381 [06/27/2024-06:27:10] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:10] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:10] [V] [TRT] *************** Autotuning format combination: Float(102400,400,20,1) -> Float(102400,400,20,1) *************** [06/27/2024-06:27:10] [V] [TRT] *************** Autotuning format combination: Float(102400,1,5120,256) -> Float(102400,1,5120,256) *************** [06/27/2024-06:27:10] [V] [TRT] *************** Autotuning format combination: Float(25600,1:4,1280,64) -> Float(25600,1:4,1280,64) *************** [06/27/2024-06:27:10] [V] [TRT] *************** Autotuning format combination: Float(3200,400:32,20,1) -> Float(3200,400:32,20,1) *************** [06/27/2024-06:27:10] [V] [TRT] *************** Autotuning format combination: Float(1:4,400,20,1) -> Float(1:4,400,20,1) *************** [06/27/2024-06:27:10] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:10] [V] [TRT] *************** Autotuning format combination: Float(102400,400,20,1) -> Float(102400,400,20,1) *************** [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: Conv_157 (CudaDepthwiseConvolution) [06/27/2024-06:27:10] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: Conv_157 (FusedConvActConvolution) [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000007ffff Time: 0.157257 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000000affff Time: 0.190903 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000000effff Time: 0.0945737 [06/27/2024-06:27:10] [V] [TRT] Tactic: 
0x00000000000fffff Time: 0.177737 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000019ffff Time: 0.0996206 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000001affff Time: 0.0991817 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000024ffff Time: 0.131954 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000027ffff Time: 0.135387 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000002dffff Time: 0.0925989 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000036ffff Time: 0.0959634 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000004cffff Time: 0.116297 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000062ffff Time: 0.150967 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000006effff Time: 0.101815 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000077ffff Time: 0.125001 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000086ffff Time: 0.108325 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000089ffff Time: 0.118418 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000097ffff Time: 0.115493 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x000000000098ffff Time: 0.0991817 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x00000000009fffff Time: 0.0858697 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000a2ffff Time: 0.0969143 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000a4ffff Time: 0.172763 [06/27/2024-06:27:10] [V] [TRT] Fastest Tactic: 0x00000000009fffff Time: 0.0858697 [06/27/2024-06:27:10] [V] [TRT] --------------- Timing Runner: Conv_157 (CudnnConvolution) [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.166473 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.131438 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.185344 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000004 Time: 1.96871 [06/27/2024-06:27:10] [V] [TRT] Tactic: 0x0000000000000005 Time: 2.09408 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.113957 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.166619 
[06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.160768 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000000003a Time: 0.185198 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000000003c Time: 2.12626 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000000003d Time: 1.94794 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000000003e Time: 0.167643 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.199241 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.166619 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.254391 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000074 Time: 2.11588 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000075 Time: 2.06117 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000076 Time: 0.113006 [06/27/2024-06:27:11] [V] [TRT] Fastest Tactic: 0x0000000000000076 Time: 0.113006 [06/27/2024-06:27:11] [V] [TRT] --------------- Timing Runner: Conv_157 (CaskConvolution) [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x01cf8ce2da913006 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x01cf8ce2da913006 Time: 0.151406 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x12dbf7d94ee3696d [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x12dbf7d94ee3696d Time: 0.170715 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.207584 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x4727434768e46395 [06/27/2024-06:27:11] [V] [TRT] 
Tactic: 0x4727434768e46395 Time: 0.106057 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x4efce38acc876f5c [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x4efce38acc876f5c Time: 0.529408 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.316562 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: ampere_scudnn_128x128_relu_xregs_large_nn_v1 Tactic: 0x5403ad713f811a18 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x5403ad713f811a18 Time: 0.275017 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x5aa723e0481da855 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x5aa723e0481da855 Time: 0.151698 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: ampere_scudnn_128x64_relu_xregs_large_nn_v1 Tactic: 0x5deb29b7a8e275f7 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x5deb29b7a8e275f7 Time: 0.217819 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: ampere_scudnn_winograd_128x128_ldg1_ldg4_relu_tile148t_nt_v1 Tactic: 0x94119b4c514b211a [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x94119b4c514b211a Time: 0.0866034 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0xa31d27de74b895ff [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xa31d27de74b895ff Time: 0.125147 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.294619 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_aligna4_alignc4 Tactic: 0xbb8c3889c7eacd30 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xbb8c3889c7eacd30 Time: 0.536869 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0xd828f024626fa982 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xd828f024626fa982 Time: 0.202313 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.341285 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.237714 [06/27/2024-06:27:11] [V] [TRT] Fastest Tactic: 0x94119b4c514b211a Time: 0.0866034 [06/27/2024-06:27:11] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: FusedConvActConvolution Tactic: 0x00000000009fffff [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(102400,1,5120,256) -> Float(102400,1,5120,256) *************** [06/27/2024-06:27:11] [V] [TRT] --------------- Timing Runner: Conv_157 (CaskConvolution) [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x19b688348f983aa0 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x19b688348f983aa0 Time: 0.0994811 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x1da91d865428f237 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x1da91d865428f237 Time: 0.20875 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: 
ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.224841 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x3e191488237fab8f [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.164133 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.162085 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x3f0c846d6379bc98 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x3f0c846d6379bc98 Time: 0.259803 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.156379 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.342601 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x62835fce994f06dd [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x62835fce994f06dd Time: 0.0913554 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_large_nhwc_tn_v1 Tactic: 0x634e99502974e4da [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x634e99502974e4da Time: 0.260389 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0x65e41d81f093b482 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x65e41d81f093b482 Time: 0.179054 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x7bc32c782b800c48 Time: 0.25483 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x8014228ec08b4d49 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x8014228ec08b4d49 Time: 0.230107 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x94a7db94ba744c45 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x94a7db94ba744c45 Time: 0.0910629 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: 0x999e005e3b016ea6 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x999e005e3b016ea6 Time: 0.170423 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0xb443c221fcb1565b [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xb443c221fcb1565b Time: 0.173495 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_large_nhwc_tn_v1 Tactic: 0xbdfdef6b84f7ccc9 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xbdfdef6b84f7ccc9 Time: 0.190757 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: ampere_scudnn_128x128_relu_exp_large_nhwc_tn_v1 Tactic: 0xca7eeb8d9143d738 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xca7eeb8d9143d738 Time: 
0.33909 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0xd15dd11d64344e83 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xd15dd11d64344e83 Time: 0.193097 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 Tactic: 0xd9031472c05adf51 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.298715 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0xf48db81f02eca9ee [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xf48db81f02eca9ee Time: 0.208896 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.255269 [06/27/2024-06:27:11] [V] [TRT] Fastest Tactic: 0x94a7db94ba744c45 Time: 0.0910629 [06/27/2024-06:27:11] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x94a7db94ba744c45 [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(25600,1:4,1280,64) -> Float(25600,1:4,1280,64) *************** [06/27/2024-06:27:11] [V] [TRT] --------------- Timing Runner: Conv_157 (CaskConvolution) [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0x65e41d81f093b482 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x65e41d81f093b482 Time: 0.179054 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: 0x999e005e3b016ea6 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x999e005e3b016ea6 
Time: 0.207579 [06/27/2024-06:27:11] [V] [TRT] Conv_157 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0xb443c221fcb1565b [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xb443c221fcb1565b Time: 0.173934 [06/27/2024-06:27:11] [V] [TRT] Fastest Tactic: 0xb443c221fcb1565b Time: 0.173934 [06/27/2024-06:27:11] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xb443c221fcb1565b [06/27/2024-06:27:11] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(102400,400,20,1) -> Float(102400,400,20,1) *************** [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(102400,1,5120,256) -> Float(102400,1,5120,256) *************** [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(25600,1:4,1280,64) -> Float(25600,1:4,1280,64) *************** [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(3200,400:32,20,1) -> Float(3200,400:32,20,1) *************** [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(1:4,400,20,1) -> Float(1:4,400,20,1) *************** [06/27/2024-06:27:11] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(102400,400,20,1) -> Float(204800,400,20,1) *************** [06/27/2024-06:27:11] [V] [TRT] --------------- Timing Runner: Conv_160 (CudaDepthwiseConvolution) [06/27/2024-06:27:11] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:11] [V] [TRT] --------------- Timing Runner: Conv_160 (FusedConvActConvolution) [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000008ffff Time: 0.101157 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000009ffff Time: 0.044288 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000000bffff 
Time: 0.027843 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000000cffff Time: 0.0237616 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000011ffff Time: 0.0279406 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000018ffff Time: 0.0260632 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000001bffff Time: 0.0297838 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000001fffff Time: 0.0284038 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000002effff Time: 0.0453509 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000035ffff Time: 0.0279909 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000003cffff Time: 0.0259901 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000003dffff Time: 0.023761 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000040ffff Time: 0.0264533 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000041ffff Time: 0.0250149 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000045ffff Time: 0.0282819 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000046ffff Time: 0.0392533 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000004affff Time: 0.0289079 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000004effff Time: 0.0285013 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000051ffff Time: 0.0308672 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000055ffff Time: 0.0236983 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000059ffff Time: 0.0283063 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000005effff Time: 0.0271131 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000005fffff Time: 0.0278918 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000061ffff Time: 0.0302519 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000066ffff Time: 0.0303982 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000006affff Time: 0.0285013 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000006bffff Time: 0.0269897 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000006fffff Time: 0.0287305 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000070ffff Time: 0.0244541 [06/27/2024-06:27:11] [V] [TRT] 
Tactic: 0x000000000073ffff Time: 0.0294912 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000075ffff Time: 0.0263802 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000007cffff Time: 0.0354606 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000007dffff Time: 0.0288768 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000007effff Time: 0.0297253 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000083ffff Time: 0.0277455 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000008affff Time: 0.0288777 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000091ffff Time: 0.0289061 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000009dffff Time: 0.0265265 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000a3ffff Time: 0.0257714 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000a6ffff Time: 0.0304274 [06/27/2024-06:27:11] [V] [TRT] Fastest Tactic: 0x000000000055ffff Time: 0.0236983 [06/27/2024-06:27:11] [V] [TRT] --------------- Timing Runner: Conv_160 (CudnnConvolution) [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0238446 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0291986 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.106935 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000004 Time: 2.07067 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.202606 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.0239657 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.0375954 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000000003a Time: 0.156672 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000000003c Time: 2.02723 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000000003d Time: 0.202752 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.0234475 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.0313637 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.0854309 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000074 Time: 
2.06702 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000075 Time: 0.229961 [06/27/2024-06:27:11] [V] [TRT] Fastest Tactic: 0x0000000000000070 Time: 0.0234475 [06/27/2024-06:27:11] [V] [TRT] --------------- Timing Runner: Conv_160 (CublasConvolution) [06/27/2024-06:27:11] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:11] [V] [TRT] --------------- Timing Runner: Conv_160 (CaskConvolution) [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x2ee10e11d6651675 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x2ee10e11d6651675 Time: 0.0454217 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.0294327 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.0363897 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1_aligna4_alignc4 Tactic: 0x7f0145cb49517338 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x7f0145cb49517338 Time: 0.0314514 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x865894c4635db7fd Time: 0.0256549 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: 0x9808072e706def96 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x9808072e706def96 Time: 0.0347575 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: 
ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.0365714 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: 0xa8ef60e712f8ad24 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xa8ef60e712f8ad24 Time: 0.0359131 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xc0b05b61d128e46e [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xc0b05b61d128e46e Time: 0.0234893 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xc3cf6e1d1c6aff27 Time: 0.0275261 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xe5603263b7f00303 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xe5603263b7f00303 Time: 0.0259657 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.0362423 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.030603 [06/27/2024-06:27:11] [V] [TRT] Fastest Tactic: 0xc0b05b61d128e46e Time: 0.0234893 [06/27/2024-06:27:11] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CudnnConvolution Tactic: 0x0000000000000070 [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(102400,1,5120,256) -> Float(204800,1,10240,512) *************** [06/27/2024-06:27:11] [V] [TRT] --------------- Timing Runner: Conv_160 (CublasConvolution) [06/27/2024-06:27:11] [V] [TRT] 
CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:11] [V] [TRT] --------------- Timing Runner: Conv_160 (CaskConvolution) [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x1022069e6f8d9aeb [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x1022069e6f8d9aeb Time: 0.0216712 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0243566 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: ampere_scudnn_128x128_relu_exp_interior_nhwc_tn_v1 Tactic: 0x17173deba0b64484 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x17173deba0b64484 Time: 0.0359131 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.03968 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x35f26f9c09557d86 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x35f26f9c09557d86 Time: 0.0216908 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x3e191488237fab8f [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.0252587 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.0251368 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: 
ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.0228624 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.0364983 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x55d80c17b1cd982d [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x55d80c17b1cd982d Time: 0.0213431 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x7bc32c782b800c48 Time: 0.0339968 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xae0c89d047932ba3 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xae0c89d047932ba3 Time: 0.020962 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xc7feb33970feefa7 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xc7feb33970feefa7 Time: 0.024797 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 Tactic: 0xd9031472c05adf51 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.0359497 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xe47307053a42b3e4 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xe47307053a42b3e4 Time: 0.0339383 [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.0342601 [06/27/2024-06:27:11] [V] [TRT] 
Fastest Tactic: 0xae0c89d047932ba3 Time: 0.020962 [06/27/2024-06:27:11] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xae0c89d047932ba3 [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(25600,1:4,1280,64) -> Float(51200,1:4,2560,128) *************** [06/27/2024-06:27:11] [V] [TRT] --------------- Timing Runner: Conv_160 (CublasConvolution) [06/27/2024-06:27:11] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:11] [V] [TRT] --------------- Timing Runner: Conv_160 (CaskConvolution) [06/27/2024-06:27:11] [V] [TRT] Conv_160 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0244305 [06/27/2024-06:27:11] [V] [TRT] Fastest Tactic: 0x130df49cb195156b Time: 0.0244305 [06/27/2024-06:27:11] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x130df49cb195156b [06/27/2024-06:27:11] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(204800,400,20,1) -> Float(204800,400,20,1) *************** [06/27/2024-06:27:11] [V] [TRT] --------------- Timing Runner: BatchNormalization_163 (Scale) [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00585582 [06/27/2024-06:27:11] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.00585582 [06/27/2024-06:27:11] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Scale Tactic: 0x0000000000000000 [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(204800,1,10240,512) -> Float(204800,1,10240,512) *************** [06/27/2024-06:27:11] [V] [TRT] --------------- Timing Runner: BatchNormalization_163 (Scale) [06/27/2024-06:27:11] [V] [TRT] Scale has no valid tactics for this config, skipping [06/27/2024-06:27:11] [V] [TRT] 
*************** Autotuning format combination: Float(51200,1:4,2560,128) -> Float(51200,1:4,2560,128) *************** [06/27/2024-06:27:11] [V] [TRT] --------------- Timing Runner: BatchNormalization_163 (Scale) [06/27/2024-06:27:11] [V] [TRT] Scale has no valid tactics for this config, skipping [06/27/2024-06:27:11] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(204800,400,20,1) -> Float(204800,400,20,1) *************** [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(204800,1,10240,512) -> Float(204800,1,10240,512) *************** [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,2560,128) -> Float(51200,1:4,2560,128) *************** [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(6400,400:32,20,1) -> Float(6400,400:32,20,1) *************** [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(1:4,400,20,1) -> Float(1:4,400,20,1) *************** [06/27/2024-06:27:11] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(204800,400,20,1) -> Float(204800,400,20,1) *************** [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(204800,1,10240,512) -> Float(204800,1,10240,512) *************** [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,2560,128) -> Float(51200,1:4,2560,128) *************** [06/27/2024-06:27:11] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(204800,400,20,1) -> Float(204800,400,20,1) *************** [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(204800,1,10240,512) -> Float(204800,1,10240,512) *************** [06/27/2024-06:27:11] [V] [TRT] 
*************** Autotuning format combination: Float(51200,1:4,2560,128) -> Float(51200,1:4,2560,128) *************** [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(6400,400:32,20,1) -> Float(6400,400:32,20,1) *************** [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(1:4,400,20,1) -> Float(1:4,400,20,1) *************** [06/27/2024-06:27:11] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(204800,400,20,1) -> Float(102400,400,20,1) *************** [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(204800,1,10240,512) -> Float(102400,1,5120,256) *************** [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,2560,128) -> Float(25600,1:4,1280,64) *************** [06/27/2024-06:27:11] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(102400,400,20,1) -> Float(102400,400,20,1) *************** [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(102400,1,5120,256) -> Float(102400,1,5120,256) *************** [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(25600,1:4,1280,64) -> Float(25600,1:4,1280,64) *************** [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(3200,400:32,20,1) -> Float(3200,400:32,20,1) *************** [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(1:4,400,20,1) -> Float(1:4,400,20,1) *************** [06/27/2024-06:27:11] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(102400,400,20,1) -> Float(409600,1600,40,1) *************** [06/27/2024-06:27:11] [V] [TRT] --------------- Timing Runner: Resize_173 (Resize) 
[06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00907886 [06/27/2024-06:27:11] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.00907886 [06/27/2024-06:27:11] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Resize Tactic: 0x0000000000000000 [06/27/2024-06:27:11] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:11] [V] [TRT] *************** Autotuning format combination: Float(819200,1600,40,1) -> Float(409600,1600,40,1) *************** [06/27/2024-06:27:11] [V] [TRT] --------------- Timing Runner: Conv_175 || Conv_185 (CudaDepthwiseConvolution) [06/27/2024-06:27:11] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:11] [V] [TRT] --------------- Timing Runner: Conv_175 || Conv_185 (FusedConvActConvolution) [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000008ffff Time: 0.412965 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000009ffff Time: 0.155355 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000000bffff Time: 0.138459 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000000cffff Time: 0.199973 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000011ffff Time: 0.161719 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000018ffff Time: 0.148914 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000001bffff Time: 0.166766 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000001fffff Time: 0.198363 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000002effff Time: 0.156965 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000035ffff Time: 0.135753 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000003cffff Time: 0.216357 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000003dffff Time: 0.129682 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000040ffff Time: 0.149358 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000041ffff Time: 0.140361 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000045ffff Time: 0.142482 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000046ffff Time: 0.121271 
[06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000004affff Time: 0.149065 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000004effff Time: 0.199241 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000051ffff Time: 0.199387 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000055ffff Time: 0.129463 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000059ffff Time: 0.186222 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000005effff Time: 0.159451 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000005fffff Time: 0.180663 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000061ffff Time: 0.156233 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000066ffff Time: 0.209774 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000006affff Time: 0.158281 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000006bffff Time: 0.178615 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000006fffff Time: 0.130487 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000070ffff Time: 0.136265 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000073ffff Time: 0.148626 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000075ffff Time: 0.142336 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000007cffff Time: 0.202898 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000007dffff Time: 0.148773 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000007effff Time: 0.174665 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000083ffff Time: 0.164571 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000008affff Time: 0.155214 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000091ffff Time: 0.149358 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x00000000009dffff Time: 0.129609 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000a3ffff Time: 0.135973 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000a6ffff Time: 0.217234 [06/27/2024-06:27:11] [V] [TRT] Fastest Tactic: 0x000000000046ffff Time: 0.121271 [06/27/2024-06:27:11] [V] [TRT] --------------- Timing Runner: Conv_175 || Conv_185 (CudnnConvolution) [06/27/2024-06:27:11] [V] [TRT] 
Tactic: 0x0000000000000000 Time: 0.090624 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0674865 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.233911 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000004 Time: 29.707 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.479671 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.0905509 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.0905021 [06/27/2024-06:27:11] [V] [TRT] Tactic: 0x000000000000003a Time: 0.235374 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000000003c Time: 29.756 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000000003d Time: 0.470016 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.0905509 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.0773851 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.301787 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x0000000000000074 Time: 29.7476 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x0000000000000075 Time: 0.51083 [06/27/2024-06:27:12] [V] [TRT] Fastest Tactic: 0x0000000000000001 Time: 0.0674865 [06/27/2024-06:27:12] [V] [TRT] --------------- Timing Runner: Conv_175 || Conv_185 (CublasConvolution) [06/27/2024-06:27:12] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:12] [V] [TRT] --------------- Timing Runner: Conv_175 || Conv_185 (CaskConvolution) [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x1fc87d7eb370bb7a [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x1fc87d7eb370bb7a Time: 0.0851383 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x2ee10e11d6651675 
[06/27/2024-06:27:12] [V] [TRT] Tactic: 0x2ee10e11d6651675 Time: 0.0826514 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.0755566 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.066755 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1_aligna4_alignc4 Tactic: 0x7f0145cb49517338 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x7f0145cb49517338 Time: 0.056128 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x865894c4635db7fd Time: 0.0850651 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x8e3884f0eaec3ecd [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x8e3884f0eaec3ecd Time: 0.078336 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: 0x9808072e706def96 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x9808072e706def96 Time: 0.128439 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9cd5cdc35441c505 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x9cd5cdc35441c505 Time: 0.10101 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set 
Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9de226a0c44627c4 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x9de226a0c44627c4 Time: 0.0827246 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1_aligna4_alignc4 Tactic: 0xa419b3b68f2da07b [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xa419b3b68f2da07b Time: 0.0525166 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.0978651 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: 0xa8ef60e712f8ad24 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xa8ef60e712f8ad24 Time: 0.0658286 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xc0b05b61d128e46e [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xc0b05b61d128e46e Time: 0.146286 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xc3cf6e1d1c6aff27 Time: 0.0791893 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xe5603263b7f00303 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xe5603263b7f00303 Time: 0.0785554 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:12] [V] [TRT] Tactic: 
0xf067e6205da31c2e Time: 0.0659749 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.0776046 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xfff46c7893896eb1 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xfff46c7893896eb1 Time: 0.127854 [06/27/2024-06:27:12] [V] [TRT] Fastest Tactic: 0xa419b3b68f2da07b Time: 0.0525166 [06/27/2024-06:27:12] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xa419b3b68f2da07b [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(819200,1,20480,512) -> Float(409600,1,10240,256) *************** [06/27/2024-06:27:12] [V] [TRT] --------------- Timing Runner: Conv_175 || Conv_185 (CublasConvolution) [06/27/2024-06:27:12] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:12] [V] [TRT] --------------- Timing Runner: Conv_175 || Conv_185 (CaskConvolution) [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x1022069e6f8d9aeb [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x1022069e6f8d9aeb Time: 0.0632411 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0449463 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x128_relu_exp_interior_nhwc_tn_v1 Tactic: 0x17173deba0b64484 [06/27/2024-06:27:12] [V] [TRT] Tactic: 
0x17173deba0b64484 Time: 0.0900145 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.0736549 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x35f26f9c09557d86 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x35f26f9c09557d86 Time: 0.0867474 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x3e191488237fab8f [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.0800183 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.0792137 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.0734354 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.0720457 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x55d80c17b1cd982d [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x55d80c17b1cd982d Time: 0.074752 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:12] [V] [TRT] Tactic: 
0x7bc32c782b800c48 Time: 0.0658773 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x90898977fc8ce537 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x90898977fc8ce537 Time: 0.0876251 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.0447269 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xae0c89d047932ba3 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xae0c89d047932ba3 Time: 0.0929402 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xbc0bba0ff1a92939 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xbc0bba0ff1a92939 Time: 0.0658286 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xc7b3afceb5fb03c0 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xc7b3afceb5fb03c0 Time: 0.0629029 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xc7feb33970feefa7 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xc7feb33970feefa7 Time: 0.108325 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xd55ee6fd0b56f808 [06/27/2024-06:27:12] 
[V] [TRT] Tactic: 0xd55ee6fd0b56f808 Time: 0.0843093 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 Tactic: 0xd9031472c05adf51 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.0848945 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xe47307053a42b3e4 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xe47307053a42b3e4 Time: 0.0659276 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.0667063 [06/27/2024-06:27:12] [V] [TRT] Fastest Tactic: 0x9dece0dc37e90462 Time: 0.0447269 [06/27/2024-06:27:12] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(204800,1:4,5120,128) -> Float(102400,1:4,2560,64) *************** [06/27/2024-06:27:12] [V] [TRT] --------------- Timing Runner: Conv_175 || Conv_185 (CublasConvolution) [06/27/2024-06:27:12] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:12] [V] [TRT] --------------- Timing Runner: Conv_175 || Conv_185 (CaskConvolution) [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0449131 [06/27/2024-06:27:12] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.09216 [06/27/2024-06:27:12] 
[V] [TRT] Fastest Tactic: 0x130df49cb195156b Time: 0.0449131 [06/27/2024-06:27:12] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x130df49cb195156b [06/27/2024-06:27:12] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(12800,1600:32,40,1) -> Float(6400,1600:32,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:12] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** 
[06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(6400,1600:32,40,1) -> Float(6400,1600:32,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:12] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(6400,1600:32,40,1) -> Float(6400,1600:32,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(409600,1600,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(409600,1,10240,256) *************** [06/27/2024-06:27:12] [V] [TRT] 
*************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(102400,1:4,2560,64) *************** [06/27/2024-06:27:12] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(409600,1600,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(409600,1,10240,256) *************** [06/27/2024-06:27:12] [V] [TRT] --------------- Timing Runner: BatchNormalization_187 (Scale) [06/27/2024-06:27:12] [V] [TRT] Scale has no valid tactics for this config, skipping [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(102400,1:4,2560,64) *************** [06/27/2024-06:27:12] [V] [TRT] --------------- Timing Runner: BatchNormalization_187 (Scale) [06/27/2024-06:27:12] [V] [TRT] Scale has no valid tactics for this config, skipping [06/27/2024-06:27:12] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(409600,1600,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(409600,1,10240,256) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(102400,1:4,2560,64) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(12800,1600:32,40,1) -> Float(12800,1600:32,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(409600,1600,40,1) 
*************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(409600,1,10240,256) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(102400,1:4,2560,64) *************** [06/27/2024-06:27:12] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(409600,1600,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(409600,1,10240,256) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(102400,1:4,2560,64) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(12800,1600:32,40,1) -> Float(12800,1600:32,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:12] [V] [TRT] --------------- Timing Runner: Conv_193 (CudaDepthwiseConvolution) [06/27/2024-06:27:12] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:12] [V] [TRT] --------------- Timing Runner: Conv_193 (FusedConvActConvolution) [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000008ffff Time: 0.185929 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000009ffff Time: 0.0778971 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x00000000000bffff Time: 0.0849189 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x00000000000cffff Time: 0.0529554 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000011ffff Time: 
0.0474697 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000018ffff Time: 0.0507124 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x00000000001bffff Time: 0.0643657 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x00000000001fffff Time: 0.0533455 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x00000000002effff Time: 0.0517394 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000035ffff Time: 0.0529554 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x00000000003cffff Time: 0.0491032 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x00000000003dffff Time: 0.0549547 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000040ffff Time: 0.0476537 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000041ffff Time: 0.0518339 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000045ffff Time: 0.046336 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000046ffff Time: 0.0464457 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x00000000004affff Time: 0.0517851 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x00000000004effff Time: 0.0528091 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000051ffff Time: 0.0618301 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000055ffff Time: 0.0904777 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000059ffff Time: 0.0469246 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x00000000005effff Time: 0.0776777 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x00000000005fffff Time: 0.0474331 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000061ffff Time: 0.0516876 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000066ffff Time: 0.0908434 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x00000000006affff Time: 0.0530042 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x00000000006bffff Time: 0.0482743 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x00000000006fffff Time: 0.0879177 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000070ffff Time: 0.0505173 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000073ffff Time: 0.0560274 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000075ffff Time: 0.0522728 [06/27/2024-06:27:12] [V] [TRT] Tactic: 
0x00000000007cffff Time: 0.0727528 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x00000000007dffff Time: 0.0523703 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x00000000007effff Time: 0.0599284 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000083ffff Time: 0.0592457 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x00000000008affff Time: 0.0540282 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000091ffff Time: 0.0521752 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x00000000009dffff Time: 0.0483718 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x0000000000a3ffff Time: 0.052029 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x0000000000a6ffff Time: 0.0521783 [06/27/2024-06:27:12] [V] [TRT] Fastest Tactic: 0x000000000045ffff Time: 0.046336 [06/27/2024-06:27:12] [V] [TRT] --------------- Timing Runner: Conv_193 (CudnnConvolution) [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0404114 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0418011 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.150674 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x0000000000000004 Time: 7.54395 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.292425 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.0411063 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.0419474 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000000003a Time: 0.151845 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000000003c Time: 7.59808 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x000000000000003d Time: 0.19221 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.0405211 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.0418011 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.150821 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x0000000000000074 Time: 7.49436 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x0000000000000075 Time: 0.187099 [06/27/2024-06:27:12] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 
0.0404114 [06/27/2024-06:27:12] [V] [TRT] --------------- Timing Runner: Conv_193 (CublasConvolution) [06/27/2024-06:27:12] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:12] [V] [TRT] --------------- Timing Runner: Conv_193 (CaskConvolution) [06/27/2024-06:27:12] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x1fc87d7eb370bb7a [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x1fc87d7eb370bb7a Time: 0.0261851 [06/27/2024-06:27:12] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x2ee10e11d6651675 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x2ee10e11d6651675 Time: 0.0493958 [06/27/2024-06:27:12] [V] [TRT] Conv_193 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.0315392 [06/27/2024-06:27:12] [V] [TRT] Conv_193 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.03712 [06/27/2024-06:27:12] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1_aligna4_alignc4 Tactic: 0x7f0145cb49517338 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x7f0145cb49517338 Time: 0.0350501 [06/27/2024-06:27:12] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x865894c4635db7fd Time: 0.0259185 [06/27/2024-06:27:12] [V] [TRT] Conv_193 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x8e3884f0eaec3ecd [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x8e3884f0eaec3ecd Time: 0.0266484 [06/27/2024-06:27:12] [V] [TRT] Conv_193 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: 0x9808072e706def96 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x9808072e706def96 Time: 0.047104 [06/27/2024-06:27:12] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9cd5cdc35441c505 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x9cd5cdc35441c505 Time: 0.0250659 [06/27/2024-06:27:12] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9de226a0c44627c4 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0x9de226a0c44627c4 Time: 0.0490545 [06/27/2024-06:27:12] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1_aligna4_alignc4 Tactic: 0xa419b3b68f2da07b [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xa419b3b68f2da07b Time: 0.0310757 [06/27/2024-06:27:12] [V] [TRT] Conv_193 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.0485181 [06/27/2024-06:27:12] [V] [TRT] Conv_193 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: 0xa8ef60e712f8ad24 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xa8ef60e712f8ad24 Time: 0.0366811 [06/27/2024-06:27:12] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xc0b05b61d128e46e [06/27/2024-06:27:12] [V] [TRT] 
Tactic: 0xc0b05b61d128e46e Time: 0.0418011 [06/27/2024-06:27:12] [V] [TRT] Conv_193 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xc3cf6e1d1c6aff27 Time: 0.0300178 [06/27/2024-06:27:12] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xe5603263b7f00303 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xe5603263b7f00303 Time: 0.0271611 [06/27/2024-06:27:12] [V] [TRT] Conv_193 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.0369737 [06/27/2024-06:27:12] [V] [TRT] Conv_193 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.0327095 [06/27/2024-06:27:12] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xfff46c7893896eb1 [06/27/2024-06:27:12] [V] [TRT] Tactic: 0xfff46c7893896eb1 Time: 0.070656 [06/27/2024-06:27:12] [V] [TRT] Fastest Tactic: 0x9cd5cdc35441c505 Time: 0.0250659 [06/27/2024-06:27:12] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x9cd5cdc35441c505 [06/27/2024-06:27:12] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:12] [V] [TRT] --------------- Timing Runner: Conv_193 (CublasConvolution) [06/27/2024-06:27:12] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:13] [V] [TRT] --------------- Timing Runner: Conv_193 (CaskConvolution) [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x1022069e6f8d9aeb [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x1022069e6f8d9aeb Time: 0.0243566 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0257707 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: ampere_scudnn_128x128_relu_exp_interior_nhwc_tn_v1 Tactic: 0x17173deba0b64484 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x17173deba0b64484 Time: 0.0391314 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.0248442 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x35f26f9c09557d86 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x35f26f9c09557d86 Time: 0.0238202 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x3e191488237fab8f [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.0512 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.0266728 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.0248686 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: 
ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.03968 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x55d80c17b1cd982d [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x55d80c17b1cd982d Time: 0.0241387 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x7bc32c782b800c48 Time: 0.0368274 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x90898977fc8ce537 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x90898977fc8ce537 Time: 0.0240152 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.0373516 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xae0c89d047932ba3 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0xae0c89d047932ba3 Time: 0.0230713 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xbc0bba0ff1a92939 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0xbc0bba0ff1a92939 Time: 0.0379611 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xc7b3afceb5fb03c0 
[06/27/2024-06:27:13] [V] [TRT] Tactic: 0xc7b3afceb5fb03c0 Time: 0.0242606 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xc7feb33970feefa7 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0xc7feb33970feefa7 Time: 0.0261364 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xd55ee6fd0b56f808 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0xd55ee6fd0b56f808 Time: 0.0235102 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 Tactic: 0xd9031472c05adf51 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.039168 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xe47307053a42b3e4 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0xe47307053a42b3e4 Time: 0.0370103 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.0374857 [06/27/2024-06:27:13] [V] [TRT] Fastest Tactic: 0xae0c89d047932ba3 Time: 0.0230713 [06/27/2024-06:27:13] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xae0c89d047932ba3 [06/27/2024-06:27:13] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:13] [V] [TRT] --------------- Timing Runner: Conv_193 (CublasConvolution) [06/27/2024-06:27:13] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:13] [V] [TRT] --------------- Timing Runner: Conv_193 (CaskConvolution) [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0257463 [06/27/2024-06:27:13] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.0255756 [06/27/2024-06:27:13] [V] [TRT] Fastest Tactic: 0x9dece0dc37e90462 Time: 0.0255756 [06/27/2024-06:27:13] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:13] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:13] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:13] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:13] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:13] [V] [TRT] *************** Autotuning format combination: Float(6400,1600:32,40,1) -> Float(6400,1600:32,40,1) *************** [06/27/2024-06:27:13] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:13] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:13] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(819200,6400,80,1) *************** [06/27/2024-06:27:13] [V] [TRT] --------------- Timing Runner: Resize_197 (Resize) [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.019492 [06/27/2024-06:27:13] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.019492 [06/27/2024-06:27:13] [V] [TRT] 
>>>>>>>>>>>>>>> Chose Runner Type: Resize Tactic: 0x0000000000000000 [06/27/2024-06:27:13] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:13] [V] [TRT] *************** Autotuning format combination: Float(1638400,6400,80,1) -> Float(819200,6400,80,1) *************** [06/27/2024-06:27:13] [V] [TRT] --------------- Timing Runner: Conv_199 || Conv_209 (CudaDepthwiseConvolution) [06/27/2024-06:27:13] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:13] [V] [TRT] --------------- Timing Runner: Conv_199 || Conv_209 (FusedConvActConvolution) [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000008ffff Time: 0.365422 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000009ffff Time: 0.197047 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x00000000000bffff Time: 0.173934 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x00000000000cffff Time: 0.171739 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000011ffff Time: 0.158574 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000018ffff Time: 0.318171 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x00000000001bffff Time: 0.210798 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x00000000001fffff Time: 0.179054 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x00000000002effff Time: 0.205678 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000035ffff Time: 0.165303 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x00000000003cffff Time: 0.187543 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x00000000003dffff Time: 0.162377 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000040ffff Time: 0.219282 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000041ffff Time: 0.236398 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000045ffff Time: 0.154624 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000046ffff Time: 0.195438 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x00000000004affff Time: 0.166331 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x00000000004effff Time: 0.186514 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000051ffff 
Time: 0.238153 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000055ffff Time: 0.161646 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000059ffff Time: 0.148187 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x00000000005effff Time: 0.257317 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x00000000005fffff Time: 0.159744 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000061ffff Time: 0.165742 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000066ffff Time: 0.247822 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x00000000006affff Time: 0.219721 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x00000000006bffff Time: 0.152576 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x00000000006fffff Time: 0.161938 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000070ffff Time: 0.165157 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000073ffff Time: 0.317001 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000075ffff Time: 0.165157 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x00000000007cffff Time: 0.228059 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x00000000007dffff Time: 0.216503 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x00000000007effff Time: 0.21621 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000083ffff Time: 0.210359 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x00000000008affff Time: 0.247954 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000091ffff Time: 0.166619 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x00000000009dffff Time: 0.161353 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x0000000000a3ffff Time: 0.251904 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x0000000000a6ffff Time: 0.166766 [06/27/2024-06:27:13] [V] [TRT] Fastest Tactic: 0x000000000059ffff Time: 0.148187 [06/27/2024-06:27:13] [V] [TRT] --------------- Timing Runner: Conv_199 || Conv_209 (CudnnConvolution) [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.192951 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0707048 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.246638 [06/27/2024-06:27:13] 
[V] [TRT] Tactic: 0x0000000000000004 Time: 43.0803 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.396581 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.132903 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.143067 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000000003a Time: 0.247954 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000000003c Time: 44.1287 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x000000000000003d Time: 0.372443 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.153381 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.0928914 [06/27/2024-06:27:13] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.246784 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000074 Time: 43.0689 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000075 Time: 0.396873 [06/27/2024-06:27:14] [V] [TRT] Fastest Tactic: 0x0000000000000001 Time: 0.0707048 [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_199 || Conv_209 (CublasConvolution) [06/27/2024-06:27:14] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_199 || Conv_209 (CaskConvolution) [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x1fc87d7eb370bb7a [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x1fc87d7eb370bb7a Time: 0.0882103 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x2ee10e11d6651675 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x2ee10e11d6651675 Time: 0.150382 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb 
[06/27/2024-06:27:14] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.07936 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.0686568 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1_aligna4_alignc4 Tactic: 0x7f0145cb49517338 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x7f0145cb49517338 Time: 0.0705585 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x865894c4635db7fd Time: 0.153522 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x8e3884f0eaec3ecd [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x8e3884f0eaec3ecd Time: 0.0849189 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: 0x9808072e706def96 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x9808072e706def96 Time: 0.0884297 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9cd5cdc35441c505 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x9cd5cdc35441c505 Time: 0.0796526 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9de226a0c44627c4 [06/27/2024-06:27:14] [V] [TRT] 
Tactic: 0x9de226a0c44627c4 Time: 0.154478 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1_aligna4_alignc4 Tactic: 0xa419b3b68f2da07b [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xa419b3b68f2da07b Time: 0.0619276 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.0977188 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: 0xa8ef60e712f8ad24 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xa8ef60e712f8ad24 Time: 0.0681691 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xc0b05b61d128e46e [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xc0b05b61d128e46e Time: 0.0792137 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xc3cf6e1d1c6aff27 Time: 0.0735817 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xe5603263b7f00303 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xe5603263b7f00303 Time: 0.0863817 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.0689493 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:14] [V] [TRT] Tactic: 
0xf64396b97c889179 Time: 0.0809691 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xfff46c7893896eb1 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xfff46c7893896eb1 Time: 0.0744594 [06/27/2024-06:27:14] [V] [TRT] Fastest Tactic: 0xa419b3b68f2da07b Time: 0.0619276 [06/27/2024-06:27:14] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xa419b3b68f2da07b [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(1638400,1,20480,256) -> Float(819200,1,10240,128) *************** [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_199 || Conv_209 (CublasConvolution) [06/27/2024-06:27:14] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_199 || Conv_209 (CaskConvolution) [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x1022069e6f8d9aeb [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x1022069e6f8d9aeb Time: 0.101742 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0521752 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: ampere_scudnn_128x128_relu_exp_interior_nhwc_tn_v1 Tactic: 0x17173deba0b64484 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x17173deba0b64484 Time: 0.0724846 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:14] 
[V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.100498 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x35f26f9c09557d86 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x35f26f9c09557d86 Time: 0.106715 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x3e191488237fab8f [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.0852114 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.110738 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.0768 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.0735086 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x55d80c17b1cd982d [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x55d80c17b1cd982d Time: 0.0770194 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x7bc32c782b800c48 Time: 0.0690956 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x90898977fc8ce537 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x90898977fc8ce537 Time: 0.115639 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.09216 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xae0c89d047932ba3 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xae0c89d047932ba3 Time: 0.0690469 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xbc0bba0ff1a92939 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xbc0bba0ff1a92939 Time: 0.0662202 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xc7b3afceb5fb03c0 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xc7b3afceb5fb03c0 Time: 0.069632 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xc7feb33970feefa7 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xc7feb33970feefa7 Time: 0.0823589 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xd55ee6fd0b56f808 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xd55ee6fd0b56f808 Time: 0.0691444 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 
Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 Tactic: 0xd9031472c05adf51 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.0735817 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_interior_nhwc_tn_v1 Tactic: 0xe47307053a42b3e4 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xe47307053a42b3e4 Time: 0.0690469 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.0971337 [06/27/2024-06:27:14] [V] [TRT] Fastest Tactic: 0x130df49cb195156b Time: 0.0521752 [06/27/2024-06:27:14] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x130df49cb195156b [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(409600,1:4,5120,64) -> Float(204800,1:4,2560,32) *************** [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_199 || Conv_209 (CublasConvolution) [06/27/2024-06:27:14] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_199 || Conv_209 (CaskConvolution) [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0521265 [06/27/2024-06:27:14] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.0522728 [06/27/2024-06:27:14] [V] [TRT] Fastest Tactic: 0x130df49cb195156b Time: 0.0521265 [06/27/2024-06:27:14] [V] [TRT] 
>>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x130df49cb195156b [06/27/2024-06:27:14] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(819200,6400,80,1) -> Float(409600,6400,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(819200,1,10240,128) -> Float(409600,1,5120,64) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(204800,1:4,2560,32) -> Float(102400,1:4,1280,16) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(25600,6400:32,80,1) -> Float(12800,6400:32,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(1:4,6400,80,1) -> Float(1:4,6400,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(409600,6400,80,1) -> Float(409600,6400,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(409600,1,5120,64) -> Float(409600,1,5120,64) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,1280,16) -> Float(102400,1:4,1280,16) *************** [06/27/2024-06:27:14] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(409600,6400,80,1) -> Float(409600,6400,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(409600,1,5120,64) -> Float(409600,1,5120,64) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,1280,16) -> Float(102400,1:4,1280,16) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: 
Float(12800,6400:32,80,1) -> Float(12800,6400:32,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(1:4,6400,80,1) -> Float(1:4,6400,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(409600,6400,80,1) -> Float(409600,6400,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(409600,1,5120,64) -> Float(409600,1,5120,64) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,1280,16) -> Float(102400,1:4,1280,16) *************** [06/27/2024-06:27:14] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(409600,6400,80,1) -> Float(409600,6400,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(409600,1,5120,64) -> Float(409600,1,5120,64) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,1280,16) -> Float(102400,1:4,1280,16) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(12800,6400:32,80,1) -> Float(12800,6400:32,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(1:4,6400,80,1) -> Float(1:4,6400,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(409600,6400,80,1) -> Float(819200,6400,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(409600,1,5120,64) -> Float(819200,1,10240,128) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,1280,16) -> 
Float(204800,1:4,2560,32) *************** [06/27/2024-06:27:14] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(819200,6400,80,1) -> Float(819200,6400,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(819200,1,10240,128) -> Float(819200,1,10240,128) *************** [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: BatchNormalization_211 (Scale) [06/27/2024-06:27:14] [V] [TRT] Scale has no valid tactics for this config, skipping [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(204800,1:4,2560,32) -> Float(204800,1:4,2560,32) *************** [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: BatchNormalization_211 (Scale) [06/27/2024-06:27:14] [V] [TRT] Scale has no valid tactics for this config, skipping [06/27/2024-06:27:14] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(819200,6400,80,1) -> Float(819200,6400,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(819200,1,10240,128) -> Float(819200,1,10240,128) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(204800,1:4,2560,32) -> Float(204800,1:4,2560,32) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(25600,6400:32,80,1) -> Float(25600,6400:32,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(1:4,6400,80,1) -> Float(1:4,6400,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(819200,6400,80,1) -> Float(819200,6400,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format 
combination: Float(819200,1,10240,128) -> Float(819200,1,10240,128) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(204800,1:4,2560,32) -> Float(204800,1:4,2560,32) *************** [06/27/2024-06:27:14] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(819200,6400,80,1) -> Float(819200,6400,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(819200,1,10240,128) -> Float(819200,1,10240,128) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(204800,1:4,2560,32) -> Float(204800,1:4,2560,32) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(25600,6400:32,80,1) -> Float(25600,6400:32,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(1:4,6400,80,1) -> Float(1:4,6400,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(819200,6400,80,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_217 (CudaDepthwiseConvolution) [06/27/2024-06:27:14] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_217 (FusedConvActConvolution) [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000006ffff Time: 0.203483 [06/27/2024-06:27:14] [V] [TRT] Fastest Tactic: 0x000000000006ffff Time: 0.203483 [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_217 (CudnnConvolution) [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.142629 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0922331 [06/27/2024-06:27:14] [V] [TRT] Tactic: 
0x0000000000000002 Time: 0.226011 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000005 Time: 2.61573 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.179127 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.0934766 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000000003a Time: 0.18827 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000000003d Time: 2.57931 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.143214 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.142921 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.188123 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000075 Time: 2.58853 [06/27/2024-06:27:14] [V] [TRT] Fastest Tactic: 0x0000000000000001 Time: 0.0922331 [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_217 (CaskConvolution) [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x01cf8ce2da913006 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x01cf8ce2da913006 Time: 0.0811154 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x12dbf7d94ee3696d [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x12dbf7d94ee3696d Time: 0.0907703 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.168448 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x4727434768e46395 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x4727434768e46395 Time: 0.09216 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x4efce38acc876f5c [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x4efce38acc876f5c Time: 0.312466 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.136119 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: ampere_scudnn_128x128_relu_xregs_large_nn_v1 Tactic: 0x5403ad713f811a18 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x5403ad713f811a18 Time: 0.164059 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x5aa723e0481da855 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x5aa723e0481da855 Time: 0.0812617 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: ampere_scudnn_128x64_relu_xregs_large_nn_v1 Tactic: 0x5deb29b7a8e275f7 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x5deb29b7a8e275f7 Time: 0.131072 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0xa31d27de74b895ff [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xa31d27de74b895ff Time: 0.09216 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.173787 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_aligna4_alignc4 Tactic: 0xbb8c3889c7eacd30 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xbb8c3889c7eacd30 Time: 0.334702 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0xd828f024626fa982 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xd828f024626fa982 Time: 0.143799 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.18213 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.118418 [06/27/2024-06:27:14] [V] [TRT] Fastest Tactic: 0x01cf8ce2da913006 Time: 0.0811154 [06/27/2024-06:27:14] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x01cf8ce2da913006 [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(819200,1,10240,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_217 (CaskConvolution) [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x19b688348f983aa0 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x19b688348f983aa0 Time: 0.0893074 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x1da91d865428f237 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x1da91d865428f237 Time: 0.0788503 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.0823589 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 
0x3e191488237fab8f [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.0869669 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.0860891 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x3f0c846d6379bc98 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x3f0c846d6379bc98 Time: 0.14219 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.0825051 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.198217 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x62835fce994f06dd [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x62835fce994f06dd Time: 0.0823589 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_large_nhwc_tn_v1 Tactic: 0x634e99502974e4da [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x634e99502974e4da Time: 0.207726 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0x65e41d81f093b482 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x65e41d81f093b482 Time: 0.0940617 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:14] [V] [TRT] Tactic: 
0x7bc32c782b800c48 Time: 0.135022 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x8014228ec08b4d49 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x8014228ec08b4d49 Time: 0.152064 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x94a7db94ba744c45 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x94a7db94ba744c45 Time: 0.0831634 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: 0x999e005e3b016ea6 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x999e005e3b016ea6 Time: 0.0894537 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0xb443c221fcb1565b [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xb443c221fcb1565b Time: 0.110523 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_large_nhwc_tn_v1 Tactic: 0xbdfdef6b84f7ccc9 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xbdfdef6b84f7ccc9 Time: 0.632759 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: ampere_scudnn_128x128_relu_exp_large_nhwc_tn_v1 Tactic: 0xca7eeb8d9143d738 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xca7eeb8d9143d738 Time: 0.148626 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0xd15dd11d64344e83 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xd15dd11d64344e83 Time: 0.0785554 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 
Tactic: 0xd9031472c05adf51 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.217088 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0xf48db81f02eca9ee [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xf48db81f02eca9ee Time: 0.107739 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.13627 [06/27/2024-06:27:14] [V] [TRT] Fastest Tactic: 0xd15dd11d64344e83 Time: 0.0785554 [06/27/2024-06:27:14] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xd15dd11d64344e83 [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(204800,1:4,2560,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_217 (CaskConvolution) [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0x65e41d81f093b482 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x65e41d81f093b482 Time: 0.0937646 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: 0x999e005e3b016ea6 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x999e005e3b016ea6 Time: 0.0898926 [06/27/2024-06:27:14] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0xb443c221fcb1565b [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xb443c221fcb1565b Time: 0.0912091 [06/27/2024-06:27:14] [V] [TRT] Fastest Tactic: 0x999e005e3b016ea6 Time: 0.0898926 [06/27/2024-06:27:14] [V] 
[TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x999e005e3b016ea6 [06/27/2024-06:27:14] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(819200,6400,80,1) -> Float(115200,6400,80,1) *************** [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_261 (CudaDepthwiseConvolution) [06/27/2024-06:27:14] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_261 (FusedConvActConvolution) [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000008ffff Time: 0.064512 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x00000000000bffff Time: 0.0284038 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000011ffff Time: 0.0359497 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000018ffff Time: 0.0291109 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x00000000001bffff Time: 0.0367531 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x00000000001fffff Time: 0.0319488 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000035ffff Time: 0.0371943 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x00000000003cffff Time: 0.0289061 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000040ffff Time: 0.0286427 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000041ffff Time: 0.030603 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000045ffff Time: 0.0271863 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000046ffff Time: 0.0369737 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x00000000004affff Time: 0.0307776 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x00000000004effff Time: 0.0378514 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000051ffff Time: 0.0501272 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000059ffff Time: 0.0381806 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x00000000005effff Time: 0.038912 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x00000000005fffff Time: 0.0361326 [06/27/2024-06:27:14] [V] [TRT] Tactic: 
0x000000000066ffff Time: 0.0343479 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x00000000006affff Time: 0.0319781 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x00000000006bffff Time: 0.0340261 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x00000000006fffff Time: 0.0395691 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000070ffff Time: 0.0365714 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000073ffff Time: 0.032885 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000075ffff Time: 0.0600259 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x00000000007cffff Time: 0.0343186 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x00000000007dffff Time: 0.0303689 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x00000000007effff Time: 0.0351671 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000083ffff Time: 0.0338798 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x00000000008affff Time: 0.0329143 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000091ffff Time: 0.0307209 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x00000000009dffff Time: 0.0289646 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000a3ffff Time: 0.0378857 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000a6ffff Time: 0.0291986 [06/27/2024-06:27:14] [V] [TRT] Fastest Tactic: 0x000000000045ffff Time: 0.0271863 [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_261 (CudnnConvolution) [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0310711 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0372297 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0861623 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000004 Time: 3.39983 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.137435 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.0305737 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.0460434 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000000003a Time: 0.0861623 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000000003c Time: 
3.35228 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000000003d Time: 0.140069 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.03712 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.0527604 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.0863817 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000074 Time: 3.34732 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000075 Time: 0.135973 [06/27/2024-06:27:14] [V] [TRT] Fastest Tactic: 0x0000000000000038 Time: 0.0305737 [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_261 (CublasConvolution) [06/27/2024-06:27:14] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_261 (CaskConvolution) [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x1fc87d7eb370bb7a [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x1fc87d7eb370bb7a Time: 0.0280869 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x2ee10e11d6651675 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x2ee10e11d6651675 Time: 0.0891611 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.025917 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.039424 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1_aligna4_alignc4 
Tactic: 0x7f0145cb49517338 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x7f0145cb49517338 Time: 0.0424229 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x865894c4635db7fd Time: 0.054272 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x8e3884f0eaec3ecd [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x8e3884f0eaec3ecd Time: 0.0267703 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: 0x9808072e706def96 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x9808072e706def96 Time: 0.0283291 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9cd5cdc35441c505 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x9cd5cdc35441c505 Time: 0.045568 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9de226a0c44627c4 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x9de226a0c44627c4 Time: 0.0893806 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1_aligna4_alignc4 Tactic: 0xa419b3b68f2da07b [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xa419b3b68f2da07b Time: 0.0381074 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.0293157 
[06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: 0xa8ef60e712f8ad24 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xa8ef60e712f8ad24 Time: 0.0388754 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xc0b05b61d128e46e [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xc0b05b61d128e46e Time: 0.045824 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xc3cf6e1d1c6aff27 Time: 0.0241874 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xe5603263b7f00303 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xe5603263b7f00303 Time: 0.027331 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.0395337 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.0265021 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xfff46c7893896eb1 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xfff46c7893896eb1 Time: 0.0424229 [06/27/2024-06:27:14] [V] [TRT] Fastest Tactic: 0xc3cf6e1d1c6aff27 Time: 0.0241874 [06/27/2024-06:27:14] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(819200,1,10240,128) -> 
Float(115200,1,1440,18) *************** [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_261 (CublasConvolution) [06/27/2024-06:27:14] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_261 (CaskConvolution) [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x1022069e6f8d9aeb [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x1022069e6f8d9aeb Time: 0.0401909 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x35f26f9c09557d86 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x35f26f9c09557d86 Time: 0.0239924 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x55d80c17b1cd982d [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x55d80c17b1cd982d Time: 0.025795 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x90898977fc8ce537 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x90898977fc8ce537 Time: 0.025184 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xbc0bba0ff1a92939 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xbc0bba0ff1a92939 Time: 0.029579 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 
0xc7b3afceb5fb03c0 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xc7b3afceb5fb03c0 Time: 0.0392411 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xd55ee6fd0b56f808 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0xd55ee6fd0b56f808 Time: 0.0241615 [06/27/2024-06:27:14] [V] [TRT] Fastest Tactic: 0x35f26f9c09557d86 Time: 0.0239924 [06/27/2024-06:27:14] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x35f26f9c09557d86 [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(204800,1:4,2560,32) -> Float(32000,1:4,400,5) *************** [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_261 (CublasConvolution) [06/27/2024-06:27:14] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: Conv_261 (CaskConvolution) [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0303963 [06/27/2024-06:27:14] [V] [TRT] Conv_261 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.0298715 [06/27/2024-06:27:14] [V] [TRT] Fastest Tactic: 0x9dece0dc37e90462 Time: 0.0298715 [06/27/2024-06:27:14] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:14] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(409600,1600,40,1) *************** 
[06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_218), Mul_219) (PointWise) [06/27/2024-06:27:14] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_218), Mul_219) (PointWiseV2) [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00664935 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.006144 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00610133 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00622032 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00626484 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0065627 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00651636 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0066959 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00708441 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00680664 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0062404 [06/27/2024-06:27:14] [V] [TRT] Fastest Tactic: 0x0000000000000002 Time: 0.00610133 [06/27/2024-06:27:14] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000002 [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(409600,1,10240,256) *************** [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_218), Mul_219) (PointWise) [06/27/2024-06:27:14] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_218), Mul_219) (PointWiseV2) [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00693116 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00545608 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000002 Time: 
0.00622032 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00630301 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00701518 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00644254 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00593353 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00599219 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00639841 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00639821 [06/27/2024-06:27:14] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0172492 [06/27/2024-06:27:14] [V] [TRT] Fastest Tactic: 0x0000000000000001 Time: 0.00545608 [06/27/2024-06:27:14] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000001 [06/27/2024-06:27:14] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(102400,1:4,2560,64) *************** [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_218), Mul_219) (PointWise) [06/27/2024-06:27:14] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:14] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_218), Mul_219) (PointWiseV2) [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00812698 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00852787 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00588069 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00608305 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00881033 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00637297 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00551965 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00659138 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0072192 [06/27/2024-06:27:15] [V] [TRT] Tactic: 
0x0000000000000009 Time: 0.00946343 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x000000000000000a Time: 0.00633481 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x000000000000000b Time: 0.00629029 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x000000000000000c Time: 0.00673642 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x000000000000000d Time: 0.0061501 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x000000000000000e Time: 0.006144 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x000000000000000f Time: 0.00643001 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.00590994 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.00569987 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.00572873 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.0056269 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.0159777 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.00741006 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.00568316 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.00667013 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0066959 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x000000000000001d Time: 0.00678234 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x000000000000001e Time: 0.00709137 [06/27/2024-06:27:15] [V] [TRT] Fastest Tactic: 0x0000000000000006 Time: 0.00551965 [06/27/2024-06:27:15] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000006 [06/27/2024-06:27:15] [V] [TRT] *************** Autotuning format combination: Float(6400,1600:32,40,1) -> Float(12800,1600:32,40,1) *************** [06/27/2024-06:27:15] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_218), Mul_219) (PointWise) [06/27/2024-06:27:15] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:15] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_218), Mul_219) (PointWiseV2) [06/27/2024-06:27:15] [V] [TRT] 
Tactic: 0x0000000000000018 Time: 0.00697295 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.00676218 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x000000000000001a Time: 0.00676239 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x000000000000001b Time: 0.00634753 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x000000000000001f Time: 0.00644293 [06/27/2024-06:27:15] [V] [TRT] Fastest Tactic: 0x000000000000001b Time: 0.00634753 [06/27/2024-06:27:15] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001b [06/27/2024-06:27:15] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:15] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:15] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(409600,1600,40,1) *************** [06/27/2024-06:27:15] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(409600,1,10240,256) *************** [06/27/2024-06:27:15] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(102400,1:4,2560,64) *************** [06/27/2024-06:27:15] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:15] [V] [TRT] *************** Autotuning format combination: Float(115200,6400,80,1) -> Float(115200,38400,480,6,1) *************** [06/27/2024-06:27:15] [V] [TRT] --------------- Timing Runner: Reshape_275 + Transpose_276 (Shuffle) [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0103693 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0261851 [06/27/2024-06:27:15] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0103693 [06/27/2024-06:27:15] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000000 [06/27/2024-06:27:15] [V] [TRT] *************** Autotuning format combination: Float(115200,1,1440,18) -> Float(115200,38400,1,480,80) *************** 
[06/27/2024-06:27:15] [V] [TRT] --------------- Timing Runner: Reshape_275 + Transpose_276 (Shuffle) [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0154039 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0211278 [06/27/2024-06:27:15] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0154039 [06/27/2024-06:27:15] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000000 [06/27/2024-06:27:15] [V] [TRT] *************** Autotuning format combination: Float(32000,1:4,400,5) -> Float(28800,9600,1:4,120,20) *************** [06/27/2024-06:27:15] [V] [TRT] --------------- Timing Runner: Reshape_275 + Transpose_276 (Shuffle) [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0153746 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0211076 [06/27/2024-06:27:15] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0153746 [06/27/2024-06:27:15] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000000 [06/27/2024-06:27:15] [V] [TRT] *************** Autotuning format combination: Float(6400,6400:32,80,1) -> Float(4320,1440,480:32,6,1) *************** [06/27/2024-06:27:15] [V] [TRT] --------------- Timing Runner: Reshape_275 + Transpose_276 (Shuffle) [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0236356 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0220473 [06/27/2024-06:27:15] [V] [TRT] Fastest Tactic: 0x0000000000000001 Time: 0.0220473 [06/27/2024-06:27:15] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000001 [06/27/2024-06:27:15] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:15] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:15] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:15] [V] 
[TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:15] [V] [TRT] *************** Autotuning format combination: Float(12800,1600:32,40,1) -> Float(6400,1600:32,40,1) *************** [06/27/2024-06:27:15] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:15] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:15] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:15] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:15] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:15] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:15] [V] [TRT] *************** Autotuning format combination: Float(115200,38400,480,6,1) -> Float(115200,38400,480,6,1) *************** [06/27/2024-06:27:15] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_277) (PointWise) [06/27/2024-06:27:15] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:15] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_277) (PointWiseV2) [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00635906 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0101297 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00536804 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00651616 [06/27/2024-06:27:15] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00666327 [06/27/2024-06:27:16] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00658805 [06/27/2024-06:27:16] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0112622 [06/27/2024-06:27:16] [V] [TRT] Tactic: 
0x0000000000000007 Time: 0.00629545 [06/27/2024-06:27:16] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00617886 [06/27/2024-06:27:16] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00676779 [06/27/2024-06:27:16] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0103027 [06/27/2024-06:27:16] [V] [TRT] Fastest Tactic: 0x0000000000000002 Time: 0.00536804 [06/27/2024-06:27:16] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000002 [06/27/2024-06:27:16] [V] [TRT] *************** Autotuning format combination: Float(115200,38400,1,480,80) -> Float(115200,38400,1,480,80) *************** [06/27/2024-06:27:16] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_277) (PointWise) [06/27/2024-06:27:16] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:16] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_277) (PointWiseV2) [06/27/2024-06:27:16] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0103758 [06/27/2024-06:27:16] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00917029 [06/27/2024-06:27:16] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00864807 [06/27/2024-06:27:16] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00904229 [06/27/2024-06:27:16] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00878575 [06/27/2024-06:27:16] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0222208 [06/27/2024-06:27:16] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0103654 [06/27/2024-06:27:16] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00914286 [06/27/2024-06:27:16] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00869109 [06/27/2024-06:27:16] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00898743 [06/27/2024-06:27:16] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0101327 [06/27/2024-06:27:16] [V] [TRT] Fastest Tactic: 0x0000000000000002 Time: 0.00864807 [06/27/2024-06:27:16] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000002 [06/27/2024-06:27:16] [V] [TRT] *************** Autotuning format combination: 
Float(28800,9600,1:4,120,20) -> Float(28800,9600,1:4,120,20) *************** [06/27/2024-06:27:16] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_277) (PointWise) [06/27/2024-06:27:16] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:16] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_277) (PointWiseV2) [06/27/2024-06:27:17] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00864807 [06/27/2024-06:27:17] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00939886 [06/27/2024-06:27:17] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0110052 [06/27/2024-06:27:17] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0115341 [06/27/2024-06:27:17] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0138306 [06/27/2024-06:27:18] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.015755 [06/27/2024-06:27:18] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0173105 [06/27/2024-06:27:18] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0219011 [06/27/2024-06:27:18] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.024771 [06/27/2024-06:27:18] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0310126 [06/27/2024-06:27:18] [V] [TRT] Tactic: 0x000000000000000a Time: 0.0150528 [06/27/2024-06:27:19] [V] [TRT] Tactic: 0x000000000000000b Time: 0.0155794 [06/27/2024-06:27:19] [V] [TRT] Tactic: 0x000000000000000c Time: 0.0170179 [06/27/2024-06:27:19] [V] [TRT] Tactic: 0x000000000000000d Time: 0.0192 [06/27/2024-06:27:19] [V] [TRT] Tactic: 0x000000000000000e Time: 0.0176193 [06/27/2024-06:27:19] [V] [TRT] Tactic: 0x000000000000000f Time: 0.0182674 [06/27/2024-06:27:20] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.0248198 [06/27/2024-06:27:20] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.02211 [06/27/2024-06:27:20] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.0219847 [06/27/2024-06:27:20] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.0239909 [06/27/2024-06:27:20] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.0228415 [06/27/2024-06:27:20] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.0195474 
[06/27/2024-06:27:21] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.0719238 [06/27/2024-06:27:21] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.0213786 [06/27/2024-06:27:21] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0162052 [06/27/2024-06:27:21] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0172942 [06/27/2024-06:27:21] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0682109 [06/27/2024-06:27:21] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.00864807 [06/27/2024-06:27:21] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000000 [06/27/2024-06:27:21] [V] [TRT] *************** Autotuning format combination: Float(4320,1440,480:32,6,1) -> Float(4320,1440,480:32,6,1) *************** [06/27/2024-06:27:21] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_277) (PointWise) [06/27/2024-06:27:21] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:21] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_277) (PointWiseV2) [06/27/2024-06:27:21] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.0212741 [06/27/2024-06:27:22] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.339822 [06/27/2024-06:27:22] [V] [TRT] Tactic: 0x000000000000001a Time: 0.0258194 [06/27/2024-06:27:22] [V] [TRT] Tactic: 0x000000000000001b Time: 0.0383269 [06/27/2024-06:27:22] [V] [TRT] Tactic: 0x000000000000001f Time: 0.0211278 [06/27/2024-06:27:22] [V] [TRT] Fastest Tactic: 0x000000000000001f Time: 0.0211278 [06/27/2024-06:27:22] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001f [06/27/2024-06:27:22] [V] [TRT] *************** Autotuning format combination: Float(38400,1:4,480,6,1) -> Float(38400,1:4,480,6,1) *************** [06/27/2024-06:27:22] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_277) (PointWiseV2) [06/27/2024-06:27:22] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.724553 [06/27/2024-06:27:22] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.343479 [06/27/2024-06:27:23] [V] [TRT] Tactic: 
0x0000000000000002 Time: 0.374053 [06/27/2024-06:27:23] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.402139 [06/27/2024-06:27:23] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.509659 [06/27/2024-06:27:23] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.550766 [06/27/2024-06:27:23] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.51829 [06/27/2024-06:27:24] [V] [TRT] Tactic: 0x0000000000000007 Time: 1.68799 [06/27/2024-06:27:24] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.704658 [06/27/2024-06:27:24] [V] [TRT] Tactic: 0x0000000000000009 Time: 1.00425 [06/27/2024-06:27:24] [V] [TRT] Tactic: 0x000000000000000a Time: 0.251173 [06/27/2024-06:27:24] [V] [TRT] Tactic: 0x000000000000000b Time: 0.331045 [06/27/2024-06:27:25] [V] [TRT] Tactic: 0x000000000000000c Time: 0.286427 [06/27/2024-06:27:25] [V] [TRT] Tactic: 0x000000000000000d Time: 0.423643 [06/27/2024-06:27:25] [V] [TRT] Tactic: 0x000000000000000e Time: 0.389851 [06/27/2024-06:27:25] [V] [TRT] Tactic: 0x000000000000000f Time: 0.323291 [06/27/2024-06:27:25] [V] [TRT] Tactic: 0x0000000000000010 Time: 1.17541 [06/27/2024-06:27:25] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.516827 [06/27/2024-06:27:26] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.458167 [06/27/2024-06:27:26] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.433298 [06/27/2024-06:27:26] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.182272 [06/27/2024-06:27:26] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.259803 [06/27/2024-06:27:26] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.362789 [06/27/2024-06:27:27] [V] [TRT] Tactic: 0x0000000000000017 Time: 1.52591 [06/27/2024-06:27:27] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0190354 [06/27/2024-06:27:27] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0229042 [06/27/2024-06:27:27] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0308663 [06/27/2024-06:27:27] [V] [TRT] Fastest Tactic: 0x000000000000001c Time: 0.0190354 [06/27/2024-06:27:27] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001c 
[06/27/2024-06:27:27] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:27] [V] [TRT] *************** Autotuning format combination: Float(115200,38400,480,6,1) -> Float(38400,12800,160,2,1) *************** [06/27/2024-06:27:27] [V] [TRT] --------------- Timing Runner: Split_278 (Padding) [06/27/2024-06:27:27] [V] [TRT] Padding has no valid tactics for this config, skipping [06/27/2024-06:27:27] [V] [TRT] --------------- Timing Runner: Split_278 (Slice) [06/27/2024-06:27:27] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0155648 [06/27/2024-06:27:27] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0155648 [06/27/2024-06:27:27] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Slice Tactic: 0x0000000000000000 [06/27/2024-06:27:27] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:27] [V] [TRT] *************** Autotuning format combination: Float(115200,38400,480,6,1) -> Float(38400,12800,160,2,1) *************** [06/27/2024-06:27:27] [V] [TRT] --------------- Timing Runner: Split_278_59 (Padding) [06/27/2024-06:27:27] [V] [TRT] Padding has no valid tactics for this config, skipping [06/27/2024-06:27:27] [V] [TRT] --------------- Timing Runner: Split_278_59 (Slice) [06/27/2024-06:27:27] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.227182 [06/27/2024-06:27:27] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.227182 [06/27/2024-06:27:27] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Slice Tactic: 0x0000000000000000 [06/27/2024-06:27:27] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:27] [V] [TRT] *************** Autotuning format combination: Float(115200,38400,480,6,1) -> Float(115200,38400,480,6,1) *************** [06/27/2024-06:27:27] [V] [TRT] --------------- Timing Runner: Split_278_60 (Padding) [06/27/2024-06:27:27] [V] [TRT] Padding has no valid tactics for this config, skipping [06/27/2024-06:27:27] [V] [TRT] --------------- Timing Runner: Split_278_60 (Slice) [06/27/2024-06:27:27] [V] [TRT] Tactic: 
0x0000000000000000 Time: 0.0172292 [06/27/2024-06:27:27] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0172292 [06/27/2024-06:27:27] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Slice Tactic: 0x0000000000000000 [06/27/2024-06:27:27] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:27] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:27] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:27] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:27] [V] [TRT] *************** Autotuning format combination: Float(6400,1600:32,40,1) -> Float(6400,1600:32,40,1) *************** [06/27/2024-06:27:27] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:27] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:27] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:27] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(204800,1,5120,128) *************** [06/27/2024-06:27:27] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:27] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:27] [V] [TRT] *************** Autotuning format combination: Float(38400,12800,160,2,1), Float(38400,12800,160,2,1) -> Float(38400,12800,160,2,1) *************** [06/27/2024-06:27:27] [V] [TRT] --------------- Timing Runner: PWN(PWN(457 + (Unnamed Layer* 250) [Shuffle] + Mul_280, Add_282), 461 + (Unnamed Layer* 255) [Shuffle] + Mul_284) (PointWise) [06/27/2024-06:27:27] [V] [TRT] 
PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:27] [V] [TRT] --------------- Timing Runner: PWN(PWN(457 + (Unnamed Layer* 250) [Shuffle] + Mul_280, Add_282), 461 + (Unnamed Layer* 255) [Shuffle] + Mul_284) (PointWiseV2) [06/27/2024-06:27:27] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0136578 [06/27/2024-06:27:27] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0135115 [06/27/2024-06:27:28] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0133386 [06/27/2024-06:27:28] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0161077 [06/27/2024-06:27:28] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0153893 [06/27/2024-06:27:28] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0160589 [06/27/2024-06:27:28] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0196389 [06/27/2024-06:27:28] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0169041 [06/27/2024-06:27:29] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0161077 [06/27/2024-06:27:29] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.017408 [06/27/2024-06:27:29] [V] [TRT] Tactic: 0x000000000000001c Time: 0.014848 [06/27/2024-06:27:29] [V] [TRT] Fastest Tactic: 0x0000000000000002 Time: 0.0133386 [06/27/2024-06:27:29] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000002 [06/27/2024-06:27:29] [V] [TRT] *************** Autotuning format combination: Float(38400,12800,1,160,80), Float(38400,12800,1,160,80) -> Float(38400,12800,1,160,80) *************** [06/27/2024-06:27:29] [V] [TRT] --------------- Timing Runner: PWN(PWN(457 + (Unnamed Layer* 250) [Shuffle] + Mul_280, Add_282), 461 + (Unnamed Layer* 255) [Shuffle] + Mul_284) (PointWise) [06/27/2024-06:27:29] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:29] [V] [TRT] --------------- Timing Runner: PWN(PWN(457 + (Unnamed Layer* 250) [Shuffle] + Mul_280, Add_282), 461 + (Unnamed Layer* 255) [Shuffle] + Mul_284) (PointWiseV2) [06/27/2024-06:27:29] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0150674 
[06/27/2024-06:27:29] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0147456 [06/27/2024-06:27:29] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.228557 [06/27/2024-06:27:29] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0160427 [06/27/2024-06:27:29] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0154185 [06/27/2024-06:27:29] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0160427 [06/27/2024-06:27:29] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0196023 [06/27/2024-06:27:29] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0169366 [06/27/2024-06:27:29] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0160264 [06/27/2024-06:27:29] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0173592 [06/27/2024-06:27:29] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0149211 [06/27/2024-06:27:29] [V] [TRT] Fastest Tactic: 0x0000000000000001 Time: 0.0147456 [06/27/2024-06:27:29] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000001 [06/27/2024-06:27:29] [V] [TRT] *************** Autotuning format combination: Float(9600,3200,1:4,40,20), Float(9600,3200,1:4,40,20) -> Float(9600,3200,1:4,40,20) *************** [06/27/2024-06:27:29] [V] [TRT] --------------- Timing Runner: PWN(PWN(457 + (Unnamed Layer* 250) [Shuffle] + Mul_280, Add_282), 461 + (Unnamed Layer* 255) [Shuffle] + Mul_284) (PointWise) [06/27/2024-06:27:29] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:29] [V] [TRT] --------------- Timing Runner: PWN(PWN(457 + (Unnamed Layer* 250) [Shuffle] + Mul_280, Add_282), 461 + (Unnamed Layer* 255) [Shuffle] + Mul_284) (PointWiseV2) [06/27/2024-06:27:29] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0142163 [06/27/2024-06:27:29] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0151698 [06/27/2024-06:27:30] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0198217 [06/27/2024-06:27:30] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0181029 [06/27/2024-06:27:30] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0181211 [06/27/2024-06:27:30] [V] [TRT] 
Tactic: 0x0000000000000005 Time: 0.0191634 [06/27/2024-06:27:30] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0235938 [06/27/2024-06:27:30] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0230504 [06/27/2024-06:27:31] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0235938 [06/27/2024-06:27:31] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0246491 [06/27/2024-06:27:31] [V] [TRT] Tactic: 0x000000000000000a Time: 0.0124343 [06/27/2024-06:27:31] [V] [TRT] Tactic: 0x000000000000000b Time: 0.0127756 [06/27/2024-06:27:31] [V] [TRT] Tactic: 0x000000000000000c Time: 0.0136046 [06/27/2024-06:27:32] [V] [TRT] Tactic: 0x000000000000000d Time: 0.0144823 [06/27/2024-06:27:32] [V] [TRT] Tactic: 0x000000000000000e Time: 0.131131 [06/27/2024-06:27:32] [V] [TRT] Tactic: 0x000000000000000f Time: 0.0219011 [06/27/2024-06:27:32] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.0226952 [06/27/2024-06:27:32] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.025917 [06/27/2024-06:27:32] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.0297545 [06/27/2024-06:27:33] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.0264533 [06/27/2024-06:27:33] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.0155648 [06/27/2024-06:27:33] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.0151845 [06/27/2024-06:27:33] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.0160914 [06/27/2024-06:27:33] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.0197669 [06/27/2024-06:27:34] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0162052 [06/27/2024-06:27:34] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0153015 [06/27/2024-06:27:34] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0155355 [06/27/2024-06:27:34] [V] [TRT] Fastest Tactic: 0x000000000000000a Time: 0.0124343 [06/27/2024-06:27:34] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000000a [06/27/2024-06:27:34] [V] [TRT] *************** Autotuning format combination: Float(1440,480,160:32,2,1), Float(1440,480,160:32,2,1) -> Float(1440,480,160:32,2,1) *************** 
[06/27/2024-06:27:34] [V] [TRT] --------------- Timing Runner: PWN(PWN(457 + (Unnamed Layer* 250) [Shuffle] + Mul_280, Add_282), 461 + (Unnamed Layer* 255) [Shuffle] + Mul_284) (PointWise) [06/27/2024-06:27:34] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:34] [V] [TRT] --------------- Timing Runner: PWN(PWN(457 + (Unnamed Layer* 250) [Shuffle] + Mul_280, Add_282), 461 + (Unnamed Layer* 255) [Shuffle] + Mul_284) (PointWiseV2) [06/27/2024-06:27:34] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.0190903 [06/27/2024-06:27:34] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.0242103 [06/27/2024-06:27:34] [V] [TRT] Tactic: 0x000000000000001a Time: 0.031744 [06/27/2024-06:27:35] [V] [TRT] Tactic: 0x000000000000001b Time: 0.0497859 [06/27/2024-06:27:35] [V] [TRT] Tactic: 0x000000000000001f Time: 0.132827 [06/27/2024-06:27:35] [V] [TRT] Fastest Tactic: 0x0000000000000018 Time: 0.0190903 [06/27/2024-06:27:35] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000018 [06/27/2024-06:27:35] [V] [TRT] *************** Autotuning format combination: Float(12800,1:4,160,2,1), Float(12800,1:4,160,2,1) -> Float(12800,1:4,160,2,1) *************** [06/27/2024-06:27:35] [V] [TRT] --------------- Timing Runner: PWN(PWN(457 + (Unnamed Layer* 250) [Shuffle] + Mul_280, Add_282), 461 + (Unnamed Layer* 255) [Shuffle] + Mul_284) (PointWiseV2) [06/27/2024-06:27:35] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.151259 [06/27/2024-06:27:35] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.172617 [06/27/2024-06:27:35] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.253074 [06/27/2024-06:27:35] [V] [TRT] Tactic: 0x0000000000000003 Time: 1.18009 [06/27/2024-06:27:36] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.191488 [06/27/2024-06:27:36] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.191927 [06/27/2024-06:27:36] [V] [TRT] Tactic: 0x0000000000000006 Time: 1.31028 [06/27/2024-06:27:36] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.246491 
[06/27/2024-06:27:36] [V] [TRT] Tactic: 0x0000000000000008 Time: 1.11996 [06/27/2024-06:27:37] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.206702 [06/27/2024-06:27:37] [V] [TRT] Tactic: 0x000000000000000a Time: 0.103936 [06/27/2024-06:27:37] [V] [TRT] Tactic: 0x000000000000000b Time: 0.131145 [06/27/2024-06:27:37] [V] [TRT] Tactic: 0x000000000000000c Time: 0.129755 [06/27/2024-06:27:37] [V] [TRT] Tactic: 0x000000000000000d Time: 1.146 [06/27/2024-06:27:38] [V] [TRT] Tactic: 0x000000000000000e Time: 0.189733 [06/27/2024-06:27:38] [V] [TRT] Tactic: 0x000000000000000f Time: 0.169399 [06/27/2024-06:27:38] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.222062 [06/27/2024-06:27:38] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.235666 [06/27/2024-06:27:38] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.252489 [06/27/2024-06:27:39] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.289061 [06/27/2024-06:27:39] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.0850651 [06/27/2024-06:27:39] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.115785 [06/27/2024-06:27:39] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.152283 [06/27/2024-06:27:39] [V] [TRT] Tactic: 0x0000000000000017 Time: 1.18418 [06/27/2024-06:27:39] [V] [TRT] Tactic: 0x000000000000001c Time: 0.018304 [06/27/2024-06:27:40] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0172617 [06/27/2024-06:27:40] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0201509 [06/27/2024-06:27:40] [V] [TRT] Fastest Tactic: 0x000000000000001d Time: 0.0172617 [06/27/2024-06:27:40] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001d [06/27/2024-06:27:40] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:40] [V] [TRT] *************** Autotuning format combination: Float(38400,12800,160,2,1), Float(38400,12800,160,2,1) -> Float(38400,12800,160,2,1) *************** [06/27/2024-06:27:40] [V] [TRT] --------------- Timing Runner: PWN(PWN(463 + (Unnamed Layer* 258) [Shuffle] + Mul_286, PWN(465 + (Unnamed Layer* 261) 
[Shuffle], Pow_288)), Mul_290) (PointWise) [06/27/2024-06:27:40] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:40] [V] [TRT] --------------- Timing Runner: PWN(PWN(463 + (Unnamed Layer* 258) [Shuffle] + Mul_286, PWN(465 + (Unnamed Layer* 261) [Shuffle], Pow_288)), Mul_290) (PointWiseV2) [06/27/2024-06:27:40] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.014965 [06/27/2024-06:27:40] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0145993 [06/27/2024-06:27:40] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0148334 [06/27/2024-06:27:40] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0159939 [06/27/2024-06:27:41] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0155648 [06/27/2024-06:27:41] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0159939 [06/27/2024-06:27:41] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0193463 [06/27/2024-06:27:41] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0184686 [06/27/2024-06:27:41] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0162702 [06/27/2024-06:27:41] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0149211 [06/27/2024-06:27:42] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0131258 [06/27/2024-06:27:42] [V] [TRT] Fastest Tactic: 0x000000000000001c Time: 0.0131258 [06/27/2024-06:27:42] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001c [06/27/2024-06:27:42] [V] [TRT] *************** Autotuning format combination: Float(38400,12800,1,160,80), Float(38400,12800,1,160,80) -> Float(38400,12800,1,160,80) *************** [06/27/2024-06:27:42] [V] [TRT] --------------- Timing Runner: PWN(PWN(463 + (Unnamed Layer* 258) [Shuffle] + Mul_286, PWN(465 + (Unnamed Layer* 261) [Shuffle], Pow_288)), Mul_290) (PointWise) [06/27/2024-06:27:42] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:42] [V] [TRT] --------------- Timing Runner: PWN(PWN(463 + (Unnamed Layer* 258) [Shuffle] + Mul_286, PWN(465 + (Unnamed Layer* 261) [Shuffle], Pow_288)), Mul_290) (PointWiseV2) 
[06/27/2024-06:27:42] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0131258 [06/27/2024-06:27:42] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0127269 [06/27/2024-06:27:42] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.332605 [06/27/2024-06:27:42] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0139902 [06/27/2024-06:27:42] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.013578 [06/27/2024-06:27:42] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0147456 [06/27/2024-06:27:42] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0168066 [06/27/2024-06:27:42] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.333385 [06/27/2024-06:27:42] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0140434 [06/27/2024-06:27:42] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0148626 [06/27/2024-06:27:42] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0130726 [06/27/2024-06:27:42] [V] [TRT] Fastest Tactic: 0x0000000000000001 Time: 0.0127269 [06/27/2024-06:27:42] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000001 [06/27/2024-06:27:42] [V] [TRT] *************** Autotuning format combination: Float(9600,3200,1:4,40,20), Float(9600,3200,1:4,40,20) -> Float(9600,3200,1:4,40,20) *************** [06/27/2024-06:27:42] [V] [TRT] --------------- Timing Runner: PWN(PWN(463 + (Unnamed Layer* 258) [Shuffle] + Mul_286, PWN(465 + (Unnamed Layer* 261) [Shuffle], Pow_288)), Mul_290) (PointWise) [06/27/2024-06:27:42] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:42] [V] [TRT] --------------- Timing Runner: PWN(PWN(463 + (Unnamed Layer* 258) [Shuffle] + Mul_286, PWN(465 + (Unnamed Layer* 261) [Shuffle], Pow_288)), Mul_290) (PointWiseV2) [06/27/2024-06:27:42] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0136977 [06/27/2024-06:27:42] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0148626 [06/27/2024-06:27:42] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0189257 [06/27/2024-06:27:42] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0181029 [06/27/2024-06:27:43] [V] [TRT] 
Tactic: 0x0000000000000004 Time: 0.0171967 [06/27/2024-06:27:43] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0182309 [06/27/2024-06:27:43] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0260145 [06/27/2024-06:27:43] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.113371 [06/27/2024-06:27:43] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0242347 [06/27/2024-06:27:44] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0328558 [06/27/2024-06:27:44] [V] [TRT] Tactic: 0x000000000000000a Time: 0.0156672 [06/27/2024-06:27:44] [V] [TRT] Tactic: 0x000000000000000b Time: 0.0967314 [06/27/2024-06:27:44] [V] [TRT] Tactic: 0x000000000000000c Time: 0.0168716 [06/27/2024-06:27:44] [V] [TRT] Tactic: 0x000000000000000d Time: 0.0178306 [06/27/2024-06:27:44] [V] [TRT] Tactic: 0x000000000000000e Time: 0.019456 [06/27/2024-06:27:45] [V] [TRT] Tactic: 0x000000000000000f Time: 0.0218593 [06/27/2024-06:27:45] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.023134 [06/27/2024-06:27:45] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.0258926 [06/27/2024-06:27:45] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.0301056 [06/27/2024-06:27:45] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.0264777 [06/27/2024-06:27:46] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.0155355 [06/27/2024-06:27:46] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.0152137 [06/27/2024-06:27:46] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.0160264 [06/27/2024-06:27:46] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.0192914 [06/27/2024-06:27:46] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0166766 [06/27/2024-06:27:46] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0154478 [06/27/2024-06:27:47] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0155502 [06/27/2024-06:27:47] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0136977 [06/27/2024-06:27:47] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000000 [06/27/2024-06:27:47] [V] [TRT] *************** Autotuning format combination: Float(1440,480,160:32,2,1), 
Float(1440,480,160:32,2,1) -> Float(1440,480,160:32,2,1) *************** [06/27/2024-06:27:47] [V] [TRT] --------------- Timing Runner: PWN(PWN(463 + (Unnamed Layer* 258) [Shuffle] + Mul_286, PWN(465 + (Unnamed Layer* 261) [Shuffle], Pow_288)), Mul_290) (PointWise) [06/27/2024-06:27:47] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:47] [V] [TRT] --------------- Timing Runner: PWN(PWN(463 + (Unnamed Layer* 258) [Shuffle] + Mul_286, PWN(465 + (Unnamed Layer* 261) [Shuffle], Pow_288)), Mul_290) (PointWiseV2) [06/27/2024-06:27:47] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.019072 [06/27/2024-06:27:47] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.0236147 [06/27/2024-06:27:47] [V] [TRT] Tactic: 0x000000000000001a Time: 0.031627 [06/27/2024-06:27:47] [V] [TRT] Tactic: 0x000000000000001b Time: 0.0495909 [06/27/2024-06:27:48] [V] [TRT] Tactic: 0x000000000000001f Time: 0.0194377 [06/27/2024-06:27:48] [V] [TRT] Fastest Tactic: 0x0000000000000018 Time: 0.019072 [06/27/2024-06:27:48] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000018 [06/27/2024-06:27:48] [V] [TRT] *************** Autotuning format combination: Float(12800,1:4,160,2,1), Float(12800,1:4,160,2,1) -> Float(12800,1:4,160,2,1) *************** [06/27/2024-06:27:48] [V] [TRT] --------------- Timing Runner: PWN(PWN(463 + (Unnamed Layer* 258) [Shuffle] + Mul_286, PWN(465 + (Unnamed Layer* 261) [Shuffle], Pow_288)), Mul_290) (PointWiseV2) [06/27/2024-06:27:48] [V] [TRT] Tactic: 0x0000000000000000 Time: 1.1106 [06/27/2024-06:27:48] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.177883 [06/27/2024-06:27:48] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.25995 [06/27/2024-06:27:48] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.213723 [06/27/2024-06:27:48] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.328265 [06/27/2024-06:27:49] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.190171 [06/27/2024-06:27:49] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.284233 
[06/27/2024-06:27:49] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.397897 [06/27/2024-06:27:49] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.251026 [06/27/2024-06:27:49] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.210066 [06/27/2024-06:27:50] [V] [TRT] Tactic: 0x000000000000000a Time: 0.880201 [06/27/2024-06:27:50] [V] [TRT] Tactic: 0x000000000000000b Time: 0.613376 [06/27/2024-06:27:50] [V] [TRT] Tactic: 0x000000000000000c Time: 0.131145 [06/27/2024-06:27:50] [V] [TRT] Tactic: 0x000000000000000d Time: 0.163401 [06/27/2024-06:27:50] [V] [TRT] Tactic: 0x000000000000000e Time: 0.190464 [06/27/2024-06:27:51] [V] [TRT] Tactic: 0x000000000000000f Time: 0.17013 [06/27/2024-06:27:51] [V] [TRT] Tactic: 0x0000000000000010 Time: 1.21651 [06/27/2024-06:27:51] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.235813 [06/27/2024-06:27:51] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.253367 [06/27/2024-06:27:51] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.289792 [06/27/2024-06:27:52] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.0838217 [06/27/2024-06:27:52] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.119296 [06/27/2024-06:27:52] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.158281 [06/27/2024-06:27:52] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.23669 [06/27/2024-06:27:52] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0184686 [06/27/2024-06:27:52] [V] [TRT] Tactic: 0x000000000000001d Time: 0.210846 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0198217 [06/27/2024-06:27:53] [V] [TRT] Fastest Tactic: 0x000000000000001c Time: 0.0184686 [06/27/2024-06:27:53] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001c [06/27/2024-06:27:53] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(204800,1600,40,1) *************** [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> 
Float(204800,1,5120,128) *************** [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(51200,1:4,1280,32) *************** [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(6400,1600:32,40,1) -> Float(6400,1600:32,40,1) *************** [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:53] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(204800,1600,40,1) -> Float(409600,1600,40,1) *************** [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(204800,1,5120,128) -> Float(409600,1,10240,256) *************** [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,1280,32) -> Float(102400,1:4,2560,64) *************** [06/27/2024-06:27:53] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(115200,38400,480,6,1) -> Float(115200,6,1) *************** [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: Reshape_298 (Shuffle) [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0166441 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0443246 [06/27/2024-06:27:53] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0166441 [06/27/2024-06:27:53] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000000 [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(115200,38400,1,480,80) -> Float(1,6,1) *************** [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: Reshape_298 (Shuffle) [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0523215 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000001 Time: 12.6085 
[06/27/2024-06:27:53] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0523215 [06/27/2024-06:27:53] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000000 [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(28800,9600,1:4,120,20) -> Float(1:4,6,1) *************** [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: Reshape_298 (Shuffle) [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000000 Time: 3.24564 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000001 Time: 1.55004 [06/27/2024-06:27:53] [V] [TRT] Fastest Tactic: 0x0000000000000001 Time: 1.55004 [06/27/2024-06:27:53] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000001 [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(4320,1440,480:32,6,1) -> Float(115200:32,6,1) *************** [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: Reshape_298 (Shuffle) [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000000 Time: 9.48326 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.12288 [06/27/2024-06:27:53] [V] [TRT] Fastest Tactic: 0x0000000000000001 Time: 0.12288 [06/27/2024-06:27:53] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000001 [06/27/2024-06:27:53] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(409600,1600,40,1) *************** [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(409600,1,10240,256) *************** [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: BatchNormalization_233 (Scale) [06/27/2024-06:27:53] [V] [TRT] Scale has no valid tactics for this config, skipping [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(102400,1:4,2560,64) *************** 
[06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: BatchNormalization_233 (Scale) [06/27/2024-06:27:53] [V] [TRT] Scale has no valid tactics for this config, skipping [06/27/2024-06:27:53] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(409600,1600,40,1) *************** [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(409600,1,10240,256) *************** [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(102400,1:4,2560,64) *************** [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(12800,1600:32,40,1) -> Float(12800,1600:32,40,1) *************** [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:53] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(409600,1600,40,1) *************** [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(409600,1,10240,256) *************** [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(102400,1:4,2560,64) *************** [06/27/2024-06:27:53] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(409600,1600,40,1) *************** [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(409600,1,10240,256) *************** [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(102400,1:4,2560,64) 
*************** [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(12800,1600:32,40,1) -> Float(12800,1600:32,40,1) *************** [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(1:4,1600,40,1) -> Float(1:4,1600,40,1) *************** [06/27/2024-06:27:53] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(102400,400,20,1) *************** [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: Conv_239 (CudaDepthwiseConvolution) [06/27/2024-06:27:53] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: Conv_239 (FusedConvActConvolution) [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000006ffff Time: 0.264631 [06/27/2024-06:27:53] [V] [TRT] Fastest Tactic: 0x000000000006ffff Time: 0.264631 [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: Conv_239 (CudnnConvolution) [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.246784 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.137074 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.196754 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000005 Time: 4.80066 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.203191 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.136704 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000000003a Time: 0.194999 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000000003d Time: 4.78164 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.239909 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.194706 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.246345 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000075 Time: 4.86078 [06/27/2024-06:27:53] [V] [TRT] Fastest Tactic: 
0x0000000000000039 Time: 0.136704 [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: Conv_239 (CaskConvolution) [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x01cf8ce2da913006 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x01cf8ce2da913006 Time: 0.1536 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x12dbf7d94ee3696d [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x12dbf7d94ee3696d Time: 0.17291 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.210213 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x4727434768e46395 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x4727434768e46395 Time: 0.108471 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x4efce38acc876f5c [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x4efce38acc876f5c Time: 0.535698 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.29696 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: ampere_scudnn_128x128_relu_xregs_large_nn_v1 Tactic: 0x5403ad713f811a18 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x5403ad713f811a18 Time: 0.271067 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x5aa723e0481da855 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x5aa723e0481da855 Time: 0.276773 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: ampere_scudnn_128x64_relu_xregs_large_nn_v1 Tactic: 0x5deb29b7a8e275f7 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x5deb29b7a8e275f7 Time: 0.215918 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0xa31d27de74b895ff [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xa31d27de74b895ff Time: 0.108032 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.384731 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_aligna4_alignc4 Tactic: 0xbb8c3889c7eacd30 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xbb8c3889c7eacd30 Time: 0.614254 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0xd828f024626fa982 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xd828f024626fa982 Time: 0.204069 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.267858 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.232155 [06/27/2024-06:27:53] [V] [TRT] Fastest Tactic: 0xa31d27de74b895ff Time: 0.108032 [06/27/2024-06:27:53] [V] [TRT] 
>>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0xa31d27de74b895ff [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(102400,1,5120,256) *************** [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: Conv_239 (CaskConvolution) [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x19b688348f983aa0 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x19b688348f983aa0 Time: 0.118491 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x1da91d865428f237 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x1da91d865428f237 Time: 0.139483 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x27b316f52c109002 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x27b316f52c109002 Time: 0.156677 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x3e191488237fab8f [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x3e191488237fab8f Time: 0.24181 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: ampere_scudnn_128x32_sliced1x4_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0x3e2b881168d9689d [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x3e2b881168d9689d Time: 0.165449 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x3f0c846d6379bc98 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x3f0c846d6379bc98 Time: 0.262144 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x412c44dfeaf9161d 
[06/27/2024-06:27:53] [V] [TRT] Tactic: 0x412c44dfeaf9161d Time: 0.156087 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: ampere_scudnn_128x128_relu_exp_small_nhwc_tn_v1 Tactic: 0x5030121339a48bf3 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x5030121339a48bf3 Time: 0.287744 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0x62835fce994f06dd [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x62835fce994f06dd Time: 0.0931109 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_large_nhwc_tn_v1 Tactic: 0x634e99502974e4da [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x634e99502974e4da Time: 0.262437 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0x65e41d81f093b482 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x65e41d81f093b482 Time: 0.181102 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_medium_nhwc_tn_v1 Tactic: 0x7bc32c782b800c48 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x7bc32c782b800c48 Time: 0.257024 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 0x8014228ec08b4d49 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x8014228ec08b4d49 Time: 0.233179 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x94a7db94ba744c45 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x94a7db94ba744c45 Time: 0.165449 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: 0x999e005e3b016ea6 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x999e005e3b016ea6 Time: 0.172325 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0xb443c221fcb1565b [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xb443c221fcb1565b Time: 0.213138 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: ampere_scudnn_128x64_sliced1x2_ldg4_relu_exp_large_nhwc_tn_v1 Tactic: 0xbdfdef6b84f7ccc9 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xbdfdef6b84f7ccc9 Time: 0.1536 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: ampere_scudnn_128x128_relu_exp_large_nhwc_tn_v1 Tactic: 0xca7eeb8d9143d738 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xca7eeb8d9143d738 Time: 0.343771 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0xd15dd11d64344e83 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xd15dd11d64344e83 Time: 0.174446 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: ampere_scudnn_128x128_relu_exp_medium_nhwc_tn_v1 Tactic: 0xd9031472c05adf51 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xd9031472c05adf51 Time: 0.321243 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0xf48db81f02eca9ee [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xf48db81f02eca9ee Time: 0.139045 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: ampere_scudnn_128x128_ldg4_relu_exp_small_nhwc_tn_v1 Tactic: 0xf90060ce8193b811 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xf90060ce8193b811 Time: 0.259365 [06/27/2024-06:27:53] [V] [TRT] Fastest Tactic: 0x62835fce994f06dd 
Time: 0.0931109 [06/27/2024-06:27:53] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x62835fce994f06dd [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(102400,1:4,2560,64) -> Float(25600,1:4,1280,64) *************** [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: Conv_239 (CaskConvolution) [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0x65e41d81f093b482 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x65e41d81f093b482 Time: 0.181394 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: 0x999e005e3b016ea6 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x999e005e3b016ea6 Time: 0.172032 [06/27/2024-06:27:53] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 0xb443c221fcb1565b [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xb443c221fcb1565b Time: 0.175397 [06/27/2024-06:27:53] [V] [TRT] Fastest Tactic: 0x999e005e3b016ea6 Time: 0.172032 [06/27/2024-06:27:53] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x999e005e3b016ea6 [06/27/2024-06:27:53] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(409600,1600,40,1) -> Float(28800,1600,40,1) *************** [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: Conv_299 (CudaDepthwiseConvolution) [06/27/2024-06:27:53] [V] [TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: Conv_299 (FusedConvActConvolution) [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000008ffff Time: 0.0588556 
[06/27/2024-06:27:53] [V] [TRT] Tactic: 0x00000000000bffff Time: 0.0188526 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000011ffff Time: 0.0361691 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000018ffff Time: 0.0142334 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x00000000001bffff Time: 0.0198949 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x00000000001fffff Time: 0.0166603 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000035ffff Time: 0.0175543 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x00000000003cffff Time: 0.0148187 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000040ffff Time: 0.0138439 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000041ffff Time: 0.0152297 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000045ffff Time: 0.0231944 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000046ffff Time: 0.0208993 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x00000000004affff Time: 0.0150368 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x00000000004effff Time: 0.0189806 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000051ffff Time: 0.0234893 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000059ffff Time: 0.0207935 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x00000000005effff Time: 0.0181577 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x00000000005fffff Time: 0.0179206 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000066ffff Time: 0.0159451 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x00000000006affff Time: 0.0166603 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x00000000006bffff Time: 0.0175705 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x00000000006fffff Time: 0.0218599 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000070ffff Time: 0.0164815 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000073ffff Time: 0.0176518 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000075ffff Time: 0.020316 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x00000000007cffff Time: 0.0167253 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x00000000007dffff Time: 0.0149943 [06/27/2024-06:27:53] [V] [TRT] Tactic: 
0x00000000007effff Time: 0.0188349 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000083ffff Time: 0.0182674 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x00000000008affff Time: 0.0165953 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000091ffff Time: 0.0150382 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x00000000009dffff Time: 0.0130726 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000a3ffff Time: 0.0192731 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000a6ffff Time: 0.0224026 [06/27/2024-06:27:53] [V] [TRT] Fastest Tactic: 0x00000000009dffff Time: 0.0130726 [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: Conv_299 (CudnnConvolution) [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0314514 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0362514 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.134217 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000004 Time: 1.11133 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.112421 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.0535406 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.0381074 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000000003a Time: 0.187538 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000000003c Time: 1.19062 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x000000000000003d Time: 0.177591 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.0312466 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.0458971 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.133632 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000074 Time: 1.25937 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x0000000000000075 Time: 0.116955 [06/27/2024-06:27:53] [V] [TRT] Fastest Tactic: 0x0000000000000070 Time: 0.0312466 [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: Conv_299 (CublasConvolution) [06/27/2024-06:27:53] [V] [TRT] CublasConvolution 
has no valid tactics for this config, skipping [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: Conv_299 (CaskConvolution) [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x1fc87d7eb370bb7a [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x1fc87d7eb370bb7a Time: 0.0162215 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x2ee10e11d6651675 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x2ee10e11d6651675 Time: 0.0462263 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.0294619 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.0367909 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1_aligna4_alignc4 Tactic: 0x7f0145cb49517338 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x7f0145cb49517338 Time: 0.0315383 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x865894c4635db7fd Time: 0.0169046 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x8e3884f0eaec3ecd [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x8e3884f0eaec3ecd 
Time: 0.0255756 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: 0x9808072e706def96 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x9808072e706def96 Time: 0.0338222 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9cd5cdc35441c505 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x9cd5cdc35441c505 Time: 0.023869 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9de226a0c44627c4 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x9de226a0c44627c4 Time: 0.0672914 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1_aligna4_alignc4 Tactic: 0xa419b3b68f2da07b [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xa419b3b68f2da07b Time: 0.0304567 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.0350501 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: 0xa8ef60e712f8ad24 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xa8ef60e712f8ad24 Time: 0.0362057 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xc0b05b61d128e46e [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xc0b05b61d128e46e Time: 0.024064 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xc3cf6e1d1c6aff27 
Time: 0.0276724 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xe5603263b7f00303 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xe5603263b7f00303 Time: 0.0262339 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.052992 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.0306615 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xfff46c7893896eb1 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xfff46c7893896eb1 Time: 0.0990354 [06/27/2024-06:27:53] [V] [TRT] Fastest Tactic: 0x1fc87d7eb370bb7a Time: 0.0162215 [06/27/2024-06:27:53] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: FusedConvActConvolution Tactic: 0x00000000009dffff [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: Float(409600,1,10240,256) -> Float(28800,1,720,18) *************** [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: Conv_299 (CublasConvolution) [06/27/2024-06:27:53] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: Conv_299 (CaskConvolution) [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x1022069e6f8d9aeb [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x1022069e6f8d9aeb Time: 0.0213786 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x35f26f9c09557d86 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x35f26f9c09557d86 Time: 0.0214831 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x55d80c17b1cd982d [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x55d80c17b1cd982d Time: 0.015477 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x90898977fc8ce537 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x90898977fc8ce537 Time: 0.0145847 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xbc0bba0ff1a92939 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xbc0bba0ff1a92939 Time: 0.0360229 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xc7b3afceb5fb03c0 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xc7b3afceb5fb03c0 Time: 0.0210227 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xd55ee6fd0b56f808 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0xd55ee6fd0b56f808 Time: 0.0210233 [06/27/2024-06:27:53] [V] [TRT] Fastest Tactic: 0x90898977fc8ce537 Time: 0.0145847 [06/27/2024-06:27:53] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x90898977fc8ce537 [06/27/2024-06:27:53] [V] [TRT] *************** Autotuning format combination: 
Float(102400,1:4,2560,64) -> Float(8000,1:4,200,5) *************** [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: Conv_299 (CublasConvolution) [06/27/2024-06:27:53] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:27:53] [V] [TRT] --------------- Timing Runner: Conv_299 (CaskConvolution) [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0364983 [06/27/2024-06:27:53] [V] [TRT] Conv_299 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:53] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.0244297 [06/27/2024-06:27:53] [V] [TRT] Fastest Tactic: 0x9dece0dc37e90462 Time: 0.0244297 [06/27/2024-06:27:53] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x9dece0dc37e90462 [06/27/2024-06:27:53] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(102400,400,20,1) -> Float(204800,400,20,1) *************** [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_240), Mul_241) (PointWise) [06/27/2024-06:27:54] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_240), Mul_241) (PointWiseV2) [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00662275 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00693029 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00643657 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00647573 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000004 
Time: 0.00641113 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00692419 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00645565 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00709834 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00672914 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00662899 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00659616 [06/27/2024-06:27:54] [V] [TRT] Fastest Tactic: 0x0000000000000004 Time: 0.00641113 [06/27/2024-06:27:54] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000004 [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(102400,1,5120,256) -> Float(204800,1,10240,512) *************** [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_240), Mul_241) (PointWise) [06/27/2024-06:27:54] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_240), Mul_241) (PointWiseV2) [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00623284 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00651636 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00641829 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00637357 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00608457 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0106237 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00669818 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00649361 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0063994 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00656187 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0059509 [06/27/2024-06:27:54] [V] [TRT] Fastest Tactic: 
0x000000000000001c Time: 0.0059509 [06/27/2024-06:27:54] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001c [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(25600,1:4,1280,64) -> Float(51200,1:4,2560,128) *************** [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_240), Mul_241) (PointWise) [06/27/2024-06:27:54] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_240), Mul_241) (PointWiseV2) [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0066747 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00980328 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00655231 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00660301 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.011985 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00633501 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00700016 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00689676 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00671584 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00636065 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000000a Time: 0.00662442 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000000b Time: 0.00691069 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000000c Time: 0.00650307 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000000d Time: 0.00701649 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000000e Time: 0.00637297 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000000f Time: 0.00665891 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.00683581 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.00662275 [06/27/2024-06:27:54] [V] [TRT] Tactic: 
0x0000000000000012 Time: 0.00637933 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.00694509 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.00623284 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.00686878 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.00553635 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.00672229 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00695249 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000001d Time: 0.00643657 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000001e Time: 0.00638569 [06/27/2024-06:27:54] [V] [TRT] Fastest Tactic: 0x0000000000000016 Time: 0.00553635 [06/27/2024-06:27:54] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000016 [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(3200,400:32,20,1) -> Float(6400,400:32,20,1) *************** [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_240), Mul_241) (PointWise) [06/27/2024-06:27:54] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: PWN(PWN(Sigmoid_240), Mul_241) (PointWiseV2) [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.0103549 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.00656291 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000001a Time: 0.00676904 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000001b Time: 0.00571077 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000001f Time: 0.00659408 [06/27/2024-06:27:54] [V] [TRT] Fastest Tactic: 0x000000000000001b Time: 0.00571077 [06/27/2024-06:27:54] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001b [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(1:4,400,20,1) -> Float(1:4,400,20,1) *************** 
[06/27/2024-06:27:54] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(204800,400,20,1) -> Float(204800,400,20,1) *************** [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(204800,1,10240,512) -> Float(204800,1,10240,512) *************** [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,2560,128) -> Float(51200,1:4,2560,128) *************** [06/27/2024-06:27:54] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(28800,1600,40,1) -> Float(28800,9600,240,6,1) *************** [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: Reshape_313 + Transpose_314 (Shuffle) [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00677008 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0228833 [06/27/2024-06:27:54] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.00677008 [06/27/2024-06:27:54] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000000 [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(28800,1,720,18) -> Float(28800,9600,1,240,40) *************** [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: Reshape_313 + Transpose_314 (Shuffle) [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00633461 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0221962 [06/27/2024-06:27:54] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.00633461 [06/27/2024-06:27:54] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000000 [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(8000,1:4,200,5) -> Float(7200,2400,1:4,60,10) *************** [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: Reshape_313 + Transpose_314 (Shuffle) 
[06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00654296 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0226743 [06/27/2024-06:27:54] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.00654296 [06/27/2024-06:27:54] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000000 [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(1600,1600:32,40,1) -> Float(1440,480,240:32,6,1) *************** [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: Reshape_313 + Transpose_314 (Shuffle) [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0129928 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.025917 [06/27/2024-06:27:54] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0129928 [06/27/2024-06:27:54] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000000 [06/27/2024-06:27:54] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(204800,400,20,1) -> Float(102400,400,20,1) *************** [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(204800,1,10240,512) -> Float(102400,1,5120,256) *************** [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,2560,128) -> Float(25600,1:4,1280,64) *************** [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(6400,400:32,20,1) -> Float(3200,400:32,20,1) *************** [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(1:4,400,20,1) -> Float(1:4,400,20,1) *************** [06/27/2024-06:27:54] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(102400,400,20,1) -> Float(102400,400,20,1) *************** [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format 
combination: Float(102400,1,5120,256) -> Float(102400,1,5120,256) *************** [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(25600,1:4,1280,64) -> Float(25600,1:4,1280,64) *************** [06/27/2024-06:27:54] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(28800,9600,240,6,1) -> Float(28800,9600,240,6,1) *************** [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_315) (PointWise) [06/27/2024-06:27:54] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_315) (PointWiseV2) [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00610743 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0063036 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00637873 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00657766 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00761576 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00594907 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00711227 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00687652 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00650764 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00860504 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00693116 [06/27/2024-06:27:54] [V] [TRT] Fastest Tactic: 0x0000000000000005 Time: 0.00594907 [06/27/2024-06:27:54] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000005 [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(28800,9600,1,240,40) -> Float(28800,9600,1,240,40) *************** [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_315) (PointWise) 
[06/27/2024-06:27:54] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_315) (PointWiseV2) [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00768385 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00642564 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00622688 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0121585 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00742537 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00748251 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00680914 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00692419 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00671044 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00993989 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00615619 [06/27/2024-06:27:54] [V] [TRT] Fastest Tactic: 0x000000000000001c Time: 0.00615619 [06/27/2024-06:27:54] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001c [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(7200,2400,1:4,60,10) -> Float(7200,2400,1:4,60,10) *************** [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_315) (PointWise) [06/27/2024-06:27:54] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_315) (PointWiseV2) [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00609714 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00607048 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0068406 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00859022 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000004 
Time: 0.00607086 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00614324 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00691701 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00581632 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00582217 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00608933 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000000a Time: 0.00754526 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000000b Time: 0.00624835 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000000c Time: 0.00598034 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000000d Time: 0.00695206 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000000e Time: 0.00686846 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000000f Time: 0.00760541 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.00684778 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.00652966 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.00650017 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.0066427 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.00798933 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.00608933 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.00654151 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.00673579 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0062394 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000001d Time: 0.00825549 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000001e Time: 0.00643021 [06/27/2024-06:27:54] [V] [TRT] Fastest Tactic: 0x0000000000000007 Time: 0.00581632 [06/27/2024-06:27:54] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000007 [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(1440,480,240:32,6,1) -> 
Float(1440,480,240:32,6,1) *************** [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_315) (PointWise) [06/27/2024-06:27:54] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_315) (PointWiseV2) [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.00972495 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.00655543 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000001a Time: 0.00685453 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000001b Time: 0.00595524 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000001f Time: 0.00661486 [06/27/2024-06:27:54] [V] [TRT] Fastest Tactic: 0x000000000000001b Time: 0.00595524 [06/27/2024-06:27:54] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001b [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(9600,1:4,240,6,1) -> Float(9600,1:4,240,6,1) *************** [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_315) (PointWiseV2) [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0108251 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0141153 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0159619 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0154331 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0176193 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0187246 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0177331 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0219638 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0232594 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0323291 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000000a Time: 0.00996815 [06/27/2024-06:27:54] [V] [TRT] Tactic: 
0x000000000000000b Time: 0.0217971 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000000c Time: 0.011219 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000000d Time: 0.0149797 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000000e Time: 0.015243 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000000f Time: 0.0125196 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.0203337 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.0177981 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.0160919 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.0153746 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.00881156 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.0114215 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.0148197 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.0216921 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00640179 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000001d Time: 0.00683075 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x000000000000001e Time: 0.00874487 [06/27/2024-06:27:54] [V] [TRT] Fastest Tactic: 0x000000000000001c Time: 0.00640179 [06/27/2024-06:27:54] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001c [06/27/2024-06:27:54] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(28800,9600,240,6,1) -> Float(9600,3200,80,2,1) *************** [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: Split_316 (Padding) [06/27/2024-06:27:54] [V] [TRT] Padding has no valid tactics for this config, skipping [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: Split_316 (Slice) [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00668883 [06/27/2024-06:27:54] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.00668883 
[06/27/2024-06:27:54] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Slice Tactic: 0x0000000000000000 [06/27/2024-06:27:54] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(28800,9600,240,6,1) -> Float(9600,3200,80,2,1) *************** [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: Split_316_65 (Padding) [06/27/2024-06:27:54] [V] [TRT] Padding has no valid tactics for this config, skipping [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: Split_316_65 (Slice) [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00976579 [06/27/2024-06:27:54] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.00976579 [06/27/2024-06:27:54] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Slice Tactic: 0x0000000000000000 [06/27/2024-06:27:54] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(28800,9600,240,6,1) -> Float(28800,9600,240,6,1) *************** [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: Split_316_66 (Padding) [06/27/2024-06:27:54] [V] [TRT] Padding has no valid tactics for this config, skipping [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: Split_316_66 (Slice) [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00635687 [06/27/2024-06:27:54] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.00635687 [06/27/2024-06:27:54] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Slice Tactic: 0x0000000000000000 [06/27/2024-06:27:54] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(102400,400,20,1) -> Float(102400,400,20,1) *************** [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(102400,1,5120,256) -> Float(102400,1,5120,256) *************** [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format 
combination: Float(25600,1:4,1280,64) -> Float(25600,1:4,1280,64) *************** [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(3200,400:32,20,1) -> Float(3200,400:32,20,1) *************** [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(1:4,400,20,1) -> Float(1:4,400,20,1) *************** [06/27/2024-06:27:54] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(102400,400,20,1) -> Float(102400,400,20,1) *************** [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(102400,1,5120,256) -> Float(102400,1,5120,256) *************** [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(25600,1:4,1280,64) -> Float(25600,1:4,1280,64) *************** [06/27/2024-06:27:54] [V] [TRT] =============== Computing costs for [06/27/2024-06:27:54] [V] [TRT] *************** Autotuning format combination: Float(9600,3200,80,2,1), Float(9600,3200,80,2,1) -> Float(9600,3200,80,2,1) *************** [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: PWN(PWN(503 + (Unnamed Layer* 297) [Shuffle] + Mul_318, Add_320), 507 + (Unnamed Layer* 302) [Shuffle] + Mul_322) (PointWise) [06/27/2024-06:27:54] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:54] [V] [TRT] --------------- Timing Runner: PWN(PWN(503 + (Unnamed Layer* 297) [Shuffle] + Mul_318, Add_320), 507 + (Unnamed Layer* 302) [Shuffle] + Mul_322) (PointWiseV2) [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0063521 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.006204 [06/27/2024-06:27:54] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00631632 [06/27/2024-06:27:55] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00636164 [06/27/2024-06:27:55] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00934944 [06/27/2024-06:27:55] [V] [TRT] Tactic: 
0x0000000000000005 Time: 0.00550468 [06/27/2024-06:27:55] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.006608 [06/27/2024-06:27:55] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00981577 [06/27/2024-06:27:55] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00629665 [06/27/2024-06:27:56] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00670296 [06/27/2024-06:27:56] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00674909 [06/27/2024-06:27:56] [V] [TRT] Fastest Tactic: 0x0000000000000005 Time: 0.00550468 [06/27/2024-06:27:56] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000005 [06/27/2024-06:27:56] [V] [TRT] *************** Autotuning format combination: Float(9600,3200,1,80,40), Float(9600,3200,1,80,40) -> Float(9600,3200,1,80,40) *************** [06/27/2024-06:27:56] [V] [TRT] --------------- Timing Runner: PWN(PWN(503 + (Unnamed Layer* 297) [Shuffle] + Mul_318, Add_320), 507 + (Unnamed Layer* 302) [Shuffle] + Mul_322) (PointWise) [06/27/2024-06:27:56] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:56] [V] [TRT] --------------- Timing Runner: PWN(PWN(503 + (Unnamed Layer* 297) [Shuffle] + Mul_318, Add_320), 507 + (Unnamed Layer* 302) [Shuffle] + Mul_322) (PointWiseV2) [06/27/2024-06:27:56] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00704958 [06/27/2024-06:27:56] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00774544 [06/27/2024-06:27:56] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00849318 [06/27/2024-06:27:56] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.009088 [06/27/2024-06:27:56] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00728503 [06/27/2024-06:27:56] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00726309 [06/27/2024-06:27:56] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00968411 [06/27/2024-06:27:56] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00886319 [06/27/2024-06:27:56] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0400823 [06/27/2024-06:27:56] [V] [TRT] Tactic: 0x0000000000000009 Time: 
0.0102296 [06/27/2024-06:27:56] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00667595 [06/27/2024-06:27:56] [V] [TRT] Fastest Tactic: 0x000000000000001c Time: 0.00667595 [06/27/2024-06:27:56] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001c [06/27/2024-06:27:56] [V] [TRT] *************** Autotuning format combination: Float(2400,800,1:4,20,10), Float(2400,800,1:4,20,10) -> Float(2400,800,1:4,20,10) *************** [06/27/2024-06:27:56] [V] [TRT] --------------- Timing Runner: PWN(PWN(503 + (Unnamed Layer* 297) [Shuffle] + Mul_318, Add_320), 507 + (Unnamed Layer* 302) [Shuffle] + Mul_322) (PointWise) [06/27/2024-06:27:56] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:27:56] [V] [TRT] --------------- Timing Runner: PWN(PWN(503 + (Unnamed Layer* 297) [Shuffle] + Mul_318, Add_320), 507 + (Unnamed Layer* 302) [Shuffle] + Mul_322) (PointWiseV2) [06/27/2024-06:27:56] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00831391 [06/27/2024-06:27:56] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00893257 [06/27/2024-06:27:56] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0123246 [06/27/2024-06:27:57] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0121539 [06/27/2024-06:27:57] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0126293 [06/27/2024-06:27:57] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0136711 [06/27/2024-06:27:57] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0205218 [06/27/2024-06:27:57] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0210442 [06/27/2024-06:27:57] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0232385 [06/27/2024-06:27:58] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0258926 [06/27/2024-06:27:58] [V] [TRT] Tactic: 0x000000000000000a Time: 0.0104803 [06/27/2024-06:27:58] [V] [TRT] Tactic: 0x000000000000000b Time: 0.0119101 [06/27/2024-06:27:58] [V] [TRT] Tactic: 0x000000000000000c Time: 0.0130992 [06/27/2024-06:27:58] [V] [TRT] Tactic: 0x000000000000000d Time: 0.0152869 [06/27/2024-06:27:58] [V] 
[TRT] Tactic: 0x000000000000000e Time: 0.016319 [06/27/2024-06:27:59] [V] [TRT] Tactic: 0x000000000000000f Time: 0.0187429 [06/27/2024-06:27:59] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.0207308 [06/27/2024-06:27:59] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.0240152 [06/27/2024-06:27:59] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.0276236 [06/27/2024-06:27:59] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.0230087 [06/27/2024-06:28:00] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.0112978 [06/27/2024-06:28:00] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.0115003 [06/27/2024-06:28:00] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.0132322 [06/27/2024-06:28:00] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.0171154 [06/27/2024-06:28:00] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0115678 [06/27/2024-06:28:00] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0112527 [06/27/2024-06:28:01] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0113315 [06/27/2024-06:28:01] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.00831391 [06/27/2024-06:28:01] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000000 [06/27/2024-06:28:01] [V] [TRT] *************** Autotuning format combination: Float(480,160,80:32,2,1), Float(480,160,80:32,2,1) -> Float(480,160,80:32,2,1) *************** [06/27/2024-06:28:01] [V] [TRT] --------------- Timing Runner: PWN(PWN(503 + (Unnamed Layer* 297) [Shuffle] + Mul_318, Add_320), 507 + (Unnamed Layer* 302) [Shuffle] + Mul_322) (PointWise) [06/27/2024-06:28:01] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:28:01] [V] [TRT] --------------- Timing Runner: PWN(PWN(503 + (Unnamed Layer* 297) [Shuffle] + Mul_318, Add_320), 507 + (Unnamed Layer* 302) [Shuffle] + Mul_322) (PointWiseV2) [06/27/2024-06:28:01] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.014848 [06/27/2024-06:28:01] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.0202057 [06/27/2024-06:28:01] [V] [TRT] Tactic: 0x000000000000001a Time: 
0.618642 [06/27/2024-06:28:01] [V] [TRT] Tactic: 0x000000000000001b Time: 0.0474331 [06/27/2024-06:28:01] [V] [TRT] Tactic: 0x000000000000001f Time: 0.0145262 [06/27/2024-06:28:01] [V] [TRT] Fastest Tactic: 0x000000000000001f Time: 0.0145262 [06/27/2024-06:28:01] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001f [06/27/2024-06:28:01] [V] [TRT] *************** Autotuning format combination: Float(3200,1:4,80,2,1), Float(3200,1:4,80,2,1) -> Float(3200,1:4,80,2,1) *************** [06/27/2024-06:28:01] [V] [TRT] --------------- Timing Runner: PWN(PWN(503 + (Unnamed Layer* 297) [Shuffle] + Mul_318, Add_320), 507 + (Unnamed Layer* 302) [Shuffle] + Mul_322) (PointWiseV2) [06/27/2024-06:28:02] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0453851 [06/27/2024-06:28:02] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0518827 [06/27/2024-06:28:02] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0750446 [06/27/2024-06:28:02] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0625128 [06/27/2024-06:28:02] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0574903 [06/27/2024-06:28:03] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0594895 [06/27/2024-06:28:03] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0809691 [06/27/2024-06:28:03] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.979529 [06/27/2024-06:28:03] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.07168 [06/27/2024-06:28:03] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0604648 [06/27/2024-06:28:04] [V] [TRT] Tactic: 0x000000000000000a Time: 0.0345819 [06/27/2024-06:28:04] [V] [TRT] Tactic: 0x000000000000000b Time: 0.24971 [06/27/2024-06:28:04] [V] [TRT] Tactic: 0x000000000000000c Time: 0.0413257 [06/27/2024-06:28:04] [V] [TRT] Tactic: 0x000000000000000d Time: 0.0490545 [06/27/2024-06:28:04] [V] [TRT] Tactic: 0x000000000000000e Time: 0.0564663 [06/27/2024-06:28:04] [V] [TRT] Tactic: 0x000000000000000f Time: 0.0519802 [06/27/2024-06:28:05] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.0648046 [06/27/2024-06:28:05] 
[V] [TRT] Tactic: 0x0000000000000011 Time: 0.068803 [06/27/2024-06:28:05] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.0729234 [06/27/2024-06:28:05] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.081408 [06/27/2024-06:28:05] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.0292864 [06/27/2024-06:28:06] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.0369371 [06/27/2024-06:28:06] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.649655 [06/27/2024-06:28:06] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.0657798 [06/27/2024-06:28:06] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0144091 [06/27/2024-06:28:06] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0136977 [06/27/2024-06:28:06] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0135381 [06/27/2024-06:28:06] [V] [TRT] Fastest Tactic: 0x000000000000001e Time: 0.0135381 [06/27/2024-06:28:06] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001e [06/27/2024-06:28:06] [V] [TRT] =============== Computing costs for [06/27/2024-06:28:06] [V] [TRT] *************** Autotuning format combination: Float(9600,3200,80,2,1), Float(9600,3200,80,2,1) -> Float(9600,3200,80,2,1) *************** [06/27/2024-06:28:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(509 + (Unnamed Layer* 305) [Shuffle] + Mul_324, PWN(511 + (Unnamed Layer* 308) [Shuffle], Pow_326)), Mul_328) (PointWise) [06/27/2024-06:28:06] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:28:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(509 + (Unnamed Layer* 305) [Shuffle] + Mul_324, PWN(511 + (Unnamed Layer* 308) [Shuffle], Pow_326)), Mul_328) (PointWiseV2) [06/27/2024-06:28:06] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0125928 [06/27/2024-06:28:06] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0128 [06/27/2024-06:28:06] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0128244 [06/27/2024-06:28:06] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0146578 [06/27/2024-06:28:06] [V] [TRT] Tactic: 0x0000000000000004 Time: 
0.0135115 [06/27/2024-06:28:06] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0131524 [06/27/2024-06:28:06] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0186514 [06/27/2024-06:28:06] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0159939 [06/27/2024-06:28:06] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0140567 [06/27/2024-06:28:06] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.014731 [06/27/2024-06:28:06] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0126171 [06/27/2024-06:28:06] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0125928 [06/27/2024-06:28:06] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000000 [06/27/2024-06:28:06] [V] [TRT] *************** Autotuning format combination: Float(9600,3200,1,80,40), Float(9600,3200,1,80,40) -> Float(9600,3200,1,80,40) *************** [06/27/2024-06:28:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(509 + (Unnamed Layer* 305) [Shuffle] + Mul_324, PWN(511 + (Unnamed Layer* 308) [Shuffle], Pow_326)), Mul_328) (PointWise) [06/27/2024-06:28:06] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:28:06] [V] [TRT] --------------- Timing Runner: PWN(PWN(509 + (Unnamed Layer* 305) [Shuffle] + Mul_324, PWN(511 + (Unnamed Layer* 308) [Shuffle], Pow_326)), Mul_328) (PointWiseV2) [06/27/2024-06:28:06] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0126537 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0699099 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0127634 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0147456 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0135115 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0118041 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.016449 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0140567 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.012483 [06/27/2024-06:28:07] [V] 
[TRT] Tactic: 0x0000000000000009 Time: 0.0130327 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0111065 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x000000000000001c Time: 0.0111065 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001c [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(2400,800,1:4,20,10), Float(2400,800,1:4,20,10) -> Float(2400,800,1:4,20,10) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(509 + (Unnamed Layer* 305) [Shuffle] + Mul_324, PWN(511 + (Unnamed Layer* 308) [Shuffle], Pow_326)), Mul_328) (PointWise) [06/27/2024-06:28:07] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(509 + (Unnamed Layer* 305) [Shuffle] + Mul_324, PWN(511 + (Unnamed Layer* 308) [Shuffle], Pow_326)), Mul_328) (PointWiseV2) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0118716 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0139237 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0181394 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0184869 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0172617 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0175543 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0273067 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.024893 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0251124 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0264533 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000a Time: 0.0115453 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000b Time: 0.0121539 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000c Time: 0.0130726 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000d 
Time: 0.0146286 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000e Time: 0.0156818 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000f Time: 0.0176681 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.0197669 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.0217966 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.0256731 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.0213995 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.0112077 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.0114553 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.0129707 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.0164328 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0118491 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0114103 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0110615 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x000000000000001e Time: 0.0110615 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001e [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(480,160,80:32,2,1), Float(480,160,80:32,2,1) -> Float(480,160,80:32,2,1) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(509 + (Unnamed Layer* 305) [Shuffle] + Mul_324, PWN(511 + (Unnamed Layer* 308) [Shuffle], Pow_326)), Mul_328) (PointWise) [06/27/2024-06:28:07] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(509 + (Unnamed Layer* 305) [Shuffle] + Mul_324, PWN(511 + (Unnamed Layer* 308) [Shuffle], Pow_326)), Mul_328) (PointWiseV2) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.00819073 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.00670317 
[06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001a Time: 0.00559297 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001b Time: 0.00952808 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001f Time: 0.00656499 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x000000000000001a Time: 0.00559297 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001a [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(3200,1:4,80,2,1), Float(3200,1:4,80,2,1) -> Float(3200,1:4,80,2,1) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(509 + (Unnamed Layer* 305) [Shuffle] + Mul_324, PWN(511 + (Unnamed Layer* 308) [Shuffle], Pow_326)), Mul_328) (PointWiseV2) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0104405 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00805435 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.010543 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00914286 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.013046 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00958171 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0113203 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0148041 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0103337 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.009216 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000a Time: 0.00575269 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000b Time: 0.0074608 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000c Time: 0.00724846 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000d Time: 0.00891563 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000e Time: 0.0254537 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000f Time: 0.00896914 
[06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.00929914 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.00961707 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.0104072 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.011444 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.0115214 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.00730176 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.0102403 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.00955733 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0061501 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001d Time: 0.00635588 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001e Time: 0.00583936 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x000000000000000a Time: 0.00575269 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000000a [06/27/2024-06:28:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(102400,400,20,1) -> Float(102400,400,20,1) *************** [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(102400,1,5120,256) -> Float(102400,1,5120,256) *************** [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(25600,1:4,1280,64) -> Float(25600,1:4,1280,64) *************** [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(3200,400:32,20,1) -> Float(3200,400:32,20,1) *************** [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(1:4,400,20,1) -> Float(1:4,400,20,1) *************** [06/27/2024-06:28:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(102400,400,20,1) -> 
Float(204800,400,20,1) *************** [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(102400,1,5120,256) -> Float(204800,1,10240,512) *************** [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(25600,1:4,1280,64) -> Float(51200,1:4,2560,128) *************** [06/27/2024-06:28:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(28800,9600,240,6,1) -> Float(28800,6,1) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Reshape_336 (Shuffle) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0066693 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0300101 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0066693 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000000 [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(28800,9600,1,240,40) -> Float(1,6,1) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Reshape_336 (Shuffle) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00721189 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.282917 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.00721189 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000000 [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(7200,2400,1:4,60,10) -> Float(1:4,6,1) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Reshape_336 (Shuffle) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0335872 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0433371 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0335872 
[06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000000 [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(1440,480,240:32,6,1) -> Float(28800:32,6,1) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Reshape_336 (Shuffle) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.314514 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0411429 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x0000000000000001 Time: 0.0411429 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000001 [06/27/2024-06:28:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(204800,400,20,1) -> Float(204800,400,20,1) *************** [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(204800,1,10240,512) -> Float(204800,1,10240,512) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: BatchNormalization_255 (Scale) [06/27/2024-06:28:07] [V] [TRT] Scale has no valid tactics for this config, skipping [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,2560,128) -> Float(51200,1:4,2560,128) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: BatchNormalization_255 (Scale) [06/27/2024-06:28:07] [V] [TRT] Scale has no valid tactics for this config, skipping [06/27/2024-06:28:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(204800,400,20,1) -> Float(204800,400,20,1) *************** [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(204800,1,10240,512) -> Float(204800,1,10240,512) *************** [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: 
Float(51200,1:4,2560,128) -> Float(51200,1:4,2560,128) *************** [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(6400,400:32,20,1) -> Float(6400,400:32,20,1) *************** [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(1:4,400,20,1) -> Float(1:4,400,20,1) *************** [06/27/2024-06:28:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(204800,400,20,1) -> Float(204800,400,20,1) *************** [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(204800,1,10240,512) -> Float(204800,1,10240,512) *************** [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,2560,128) -> Float(51200,1:4,2560,128) *************** [06/27/2024-06:28:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(204800,400,20,1) -> Float(204800,400,20,1) *************** [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(204800,1,10240,512) -> Float(204800,1,10240,512) *************** [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,2560,128) -> Float(51200,1:4,2560,128) *************** [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(6400,400:32,20,1) -> Float(6400,400:32,20,1) *************** [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(1:4,400,20,1) -> Float(1:4,400,20,1) *************** [06/27/2024-06:28:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(204800,400,20,1) -> Float(7200,400,20,1) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Conv_337 (CudaDepthwiseConvolution) [06/27/2024-06:28:07] [V] 
[TRT] CudaDepthwiseConvolution has no valid tactics for this config, skipping [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Conv_337 (FusedConvActConvolution) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000008ffff Time: 0.109861 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x00000000000bffff Time: 0.0261608 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000011ffff Time: 0.0167903 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000018ffff Time: 0.0179383 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x00000000001bffff Time: 0.0293157 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x00000000001fffff Time: 0.0116698 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000035ffff Time: 0.0234906 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x00000000003cffff Time: 0.0162052 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000040ffff Time: 0.0123855 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000041ffff Time: 0.0211487 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000045ffff Time: 0.0163027 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000046ffff Time: 0.016449 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x00000000004affff Time: 0.0163515 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x00000000004effff Time: 0.0148914 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000051ffff Time: 0.0173105 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000059ffff Time: 0.0167253 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x00000000005effff Time: 0.0221309 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x00000000005fffff Time: 0.0168229 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000066ffff Time: 0.0154194 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x00000000006affff Time: 0.0120686 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x00000000006bffff Time: 0.0230713 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x00000000006fffff Time: 0.017409 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000070ffff Time: 0.0194926 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000073ffff Time: 0.0236147 [06/27/2024-06:28:07] [V] 
[TRT] Tactic: 0x000000000075ffff Time: 0.0413989 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x00000000007cffff Time: 0.0206263 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x00000000007dffff Time: 0.0162042 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x00000000007effff Time: 0.0247954 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000083ffff Time: 0.0236356 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x00000000008affff Time: 0.0117704 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000091ffff Time: 0.0163515 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x00000000009dffff Time: 0.014731 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000a3ffff Time: 0.0223399 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000a6ffff Time: 0.0610011 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x00000000001fffff Time: 0.0116698 [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Conv_337 (CudnnConvolution) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.030837 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0594408 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0869669 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.412389 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.111177 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000038 Time: 0.0301349 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000039 Time: 0.0586606 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000003a Time: 0.0866789 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000003c Time: 0.465778 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000003d Time: 0.114103 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000070 Time: 0.0306615 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000071 Time: 0.0418011 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000072 Time: 0.0867474 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000074 Time: 0.314222 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000075 
Time: 0.163109 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x0000000000000038 Time: 0.0301349 [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Conv_337 (CublasConvolution) [06/27/2024-06:28:07] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Conv_337 (CaskConvolution) [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x1fc87d7eb370bb7a [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x1fc87d7eb370bb7a Time: 0.0262583 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x2ee10e11d6651675 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x2ee10e11d6651675 Time: 0.0773851 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 0x3f243c490d502deb [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x3f243c490d502deb Time: 0.0514438 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x503619c69ae500ff Time: 0.0650971 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1_aligna4_alignc4 Tactic: 0x7f0145cb49517338 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x7f0145cb49517338 Time: 0.0773364 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x865894c4635db7fd Time: 0.027453 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set 
Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x8e3884f0eaec3ecd [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x8e3884f0eaec3ecd Time: 0.0437029 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: 0x9808072e706def96 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x9808072e706def96 Time: 0.0770194 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9cd5cdc35441c505 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x9cd5cdc35441c505 Time: 0.0393509 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9de226a0c44627c4 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x9de226a0c44627c4 Time: 0.0765074 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1_aligna4_alignc4 Tactic: 0xa419b3b68f2da07b [06/27/2024-06:28:07] [V] [TRT] Tactic: 0xa419b3b68f2da07b Time: 0.0509562 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0xa8609adc4e0ceb90 Time: 0.0632442 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: 0xa8ef60e712f8ad24 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0xa8ef60e712f8ad24 Time: 0.0641722 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xc0b05b61d128e46e [06/27/2024-06:28:07] 
[V] [TRT] Tactic: 0xc0b05b61d128e46e Time: 0.0393897 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0xc3cf6e1d1c6aff27 Time: 0.0476404 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xe5603263b7f00303 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0xe5603263b7f00303 Time: 0.044288 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: 0xf067e6205da31c2e [06/27/2024-06:28:07] [V] [TRT] Tactic: 0xf067e6205da31c2e Time: 0.0646095 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: 0xf64396b97c889179 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0xf64396b97c889179 Time: 0.0758491 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xfff46c7893896eb1 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0xfff46c7893896eb1 Time: 0.163547 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x1fc87d7eb370bb7a Time: 0.0262583 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: FusedConvActConvolution Tactic: 0x00000000001fffff [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(204800,1,10240,512) -> Float(7200,1,360,18) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Conv_337 (CublasConvolution) [06/27/2024-06:28:07] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Conv_337 (CaskConvolution) [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x1022069e6f8d9aeb [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x1022069e6f8d9aeb Time: 0.0344649 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x35f26f9c09557d86 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x35f26f9c09557d86 Time: 0.0354597 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x55d80c17b1cd982d [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x55d80c17b1cd982d Time: 0.0225489 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x90898977fc8ce537 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x90898977fc8ce537 Time: 0.022319 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xbc0bba0ff1a92939 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0xbc0bba0ff1a92939 Time: 0.0853821 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xc7b3afceb5fb03c0 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0xc7b3afceb5fb03c0 Time: 0.0342016 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0xd55ee6fd0b56f808 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0xd55ee6fd0b56f808 Time: 
0.0350501 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x90898977fc8ce537 Time: 0.022319 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x90898977fc8ce537 [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(51200,1:4,2560,128) -> Float(2000,1:4,100,5) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Conv_337 (CublasConvolution) [06/27/2024-06:28:07] [V] [TRT] CublasConvolution has no valid tactics for this config, skipping [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Conv_337 (CaskConvolution) [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r1s1 Tactic: 0x130df49cb195156b [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x130df49cb195156b Time: 0.0425691 [06/27/2024-06:28:07] [V] [TRT] Conv_337 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x9dece0dc37e90462 Time: 0.0424594 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x9dece0dc37e90462 Time: 0.0424594 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 0x9dece0dc37e90462 [06/27/2024-06:28:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(7200,400,20,1) -> Float(7200,2400,120,6,1) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Reshape_351 + Transpose_352 (Shuffle) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0119432 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.022528 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0119432 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose 
Runner Type: Shuffle Tactic: 0x0000000000000000 [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(7200,1,360,18) -> Float(7200,2400,1,120,20) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Reshape_351 + Transpose_352 (Shuffle) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00712664 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0218175 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.00712664 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000000 [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(2000,1:4,100,5) -> Float(1800,600,1:4,30,5) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Reshape_351 + Transpose_352 (Shuffle) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00688239 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0283063 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.00688239 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000000 [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(400,400:32,20,1) -> Float(360,120,120:32,6,1) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Reshape_351 + Transpose_352 (Shuffle) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0060541 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0402651 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0060541 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000000 [06/27/2024-06:28:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(7200,2400,120,6,1) -> Float(7200,2400,120,6,1) 
*************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_353) (PointWise) [06/27/2024-06:28:07] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_353) (PointWiseV2) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00627995 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00574025 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00671481 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00701845 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00637933 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00577335 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00579236 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00618648 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00600411 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00920886 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00599143 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x0000000000000001 Time: 0.00574025 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000001 [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(7200,2400,1,120,20) -> Float(7200,2400,1,120,20) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_353) (PointWise) [06/27/2024-06:28:07] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_353) (PointWiseV2) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00624038 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00623695 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00600503 [06/27/2024-06:28:07] [V] 
[TRT] Tactic: 0x0000000000000003 Time: 0.00671065 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00667616 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00633044 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00657642 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00683657 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0063048 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00615524 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00928935 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x0000000000000002 Time: 0.00600503 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000002 [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(1800,600,1:4,30,5) -> Float(1800,600,1:4,30,5) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_353) (PointWise) [06/27/2024-06:28:07] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_353) (PointWiseV2) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00658016 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00589312 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00661777 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00681382 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00656478 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0061501 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00686694 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00585161 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00578724 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00579913 [06/27/2024-06:28:07] [V] [TRT] Tactic: 
0x000000000000000a Time: 0.011284 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000b Time: 0.0057931 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000c Time: 0.00596819 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000d Time: 0.00681274 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000e Time: 0.00655938 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000f Time: 0.0065125 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.0120438 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.00871289 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.00675117 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.0068049 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.00637754 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.00621356 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.0106697 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.0065765 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00630241 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001d Time: 0.00621456 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001e Time: 0.00629684 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x0000000000000008 Time: 0.00578724 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000008 [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(360,120,120:32,6,1) -> Float(360,120,120:32,6,1) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_353) (PointWise) [06/27/2024-06:28:07] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_353) (PointWiseV2) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.0110389 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000019 Time: 
0.00616838 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001a Time: 0.00658265 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001b Time: 0.00593957 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001f Time: 0.00639861 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x000000000000001b Time: 0.00593957 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001b [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(2400,1:4,120,6,1) -> Float(2400,1:4,120,6,1) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: PWN(Sigmoid_353) (PointWiseV2) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00659636 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00596724 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00575196 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0073941 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0080462 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0102296 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0130065 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00860504 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00832254 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0105319 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000a Time: 0.00668904 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000b Time: 0.00554233 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000c Time: 0.00649381 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000d Time: 0.00623581 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000e Time: 0.00585161 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000000f Time: 0.00553072 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.00910629 [06/27/2024-06:28:07] [V] [TRT] Tactic: 
0x0000000000000011 Time: 0.00707744 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.00727771 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.00657013 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.00667595 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.00639841 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.0110727 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.00898971 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00607124 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001d Time: 0.00603638 [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x000000000000001e Time: 0.00818454 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x000000000000000f Time: 0.00553072 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000000f [06/27/2024-06:28:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(7200,2400,120,6,1) -> Float(2400,800,40,2,1) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Split_354 (Padding) [06/27/2024-06:28:07] [V] [TRT] Padding has no valid tactics for this config, skipping [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Split_354 (Slice) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00675179 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.00675179 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Slice Tactic: 0x0000000000000000 [06/27/2024-06:28:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(7200,2400,120,6,1) -> Float(2400,800,40,2,1) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Split_354_71 (Padding) [06/27/2024-06:28:07] [V] [TRT] Padding has no valid tactics for 
this config, skipping [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Split_354_71 (Slice) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00624855 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.00624855 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Slice Tactic: 0x0000000000000000 [06/27/2024-06:28:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(7200,2400,120,6,1) -> Float(7200,2400,120,6,1) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Split_354_72 (Padding) [06/27/2024-06:28:07] [V] [TRT] Padding has no valid tactics for this config, skipping [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: Split_354_72 (Slice) [06/27/2024-06:28:07] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0057644 [06/27/2024-06:28:07] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0057644 [06/27/2024-06:28:07] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Slice Tactic: 0x0000000000000000 [06/27/2024-06:28:07] [V] [TRT] =============== Computing costs for [06/27/2024-06:28:07] [V] [TRT] *************** Autotuning format combination: Float(2400,800,40,2,1), Float(2400,800,40,2,1) -> Float(2400,800,40,2,1) *************** [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(549 + (Unnamed Layer* 344) [Shuffle] + Mul_356, Add_358), 553 + (Unnamed Layer* 349) [Shuffle] + Mul_360) (PointWise) [06/27/2024-06:28:07] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:28:07] [V] [TRT] --------------- Timing Runner: PWN(PWN(549 + (Unnamed Layer* 344) [Shuffle] + Mul_356, Add_358), 553 + (Unnamed Layer* 349) [Shuffle] + Mul_360) (PointWiseV2) [06/27/2024-06:28:08] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00632825 [06/27/2024-06:28:08] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00633739 [06/27/2024-06:28:08] [V] [TRT] Tactic: 
0x0000000000000002 Time: 0.00654255 [06/27/2024-06:28:08] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00650971 [06/27/2024-06:28:08] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00639722 [06/27/2024-06:28:08] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00662254 [06/27/2024-06:28:09] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00653631 [06/27/2024-06:28:09] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00641212 [06/27/2024-06:28:09] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00629665 [06/27/2024-06:28:09] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00695902 [06/27/2024-06:28:09] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00735817 [06/27/2024-06:28:09] [V] [TRT] Fastest Tactic: 0x0000000000000008 Time: 0.00629665 [06/27/2024-06:28:09] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000008 [06/27/2024-06:28:09] [V] [TRT] *************** Autotuning format combination: Float(2400,800,1,40,20), Float(2400,800,1,40,20) -> Float(2400,800,1,40,20) *************** [06/27/2024-06:28:09] [V] [TRT] --------------- Timing Runner: PWN(PWN(549 + (Unnamed Layer* 344) [Shuffle] + Mul_356, Add_358), 553 + (Unnamed Layer* 349) [Shuffle] + Mul_360) (PointWise) [06/27/2024-06:28:09] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:28:09] [V] [TRT] --------------- Timing Runner: PWN(PWN(549 + (Unnamed Layer* 344) [Shuffle] + Mul_356, Add_358), 553 + (Unnamed Layer* 349) [Shuffle] + Mul_360) (PointWiseV2) [06/27/2024-06:28:09] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0068406 [06/27/2024-06:28:09] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0066427 [06/27/2024-06:28:09] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00712055 [06/27/2024-06:28:09] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00762226 [06/27/2024-06:28:09] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00660945 [06/27/2024-06:28:09] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00619886 [06/27/2024-06:28:09] [V] [TRT] Tactic: 0x0000000000000006 Time: 
0.0239177 [06/27/2024-06:28:09] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00794006 [06/27/2024-06:28:09] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00647473 [06/27/2024-06:28:09] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00743131 [06/27/2024-06:28:09] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00728571 [06/27/2024-06:28:09] [V] [TRT] Fastest Tactic: 0x0000000000000005 Time: 0.00619886 [06/27/2024-06:28:09] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000005 [06/27/2024-06:28:09] [V] [TRT] *************** Autotuning format combination: Float(600,200,1:4,10,5), Float(600,200,1:4,10,5) -> Float(600,200,1:4,10,5) *************** [06/27/2024-06:28:09] [V] [TRT] --------------- Timing Runner: PWN(PWN(549 + (Unnamed Layer* 344) [Shuffle] + Mul_356, Add_358), 553 + (Unnamed Layer* 349) [Shuffle] + Mul_360) (PointWise) [06/27/2024-06:28:09] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:28:09] [V] [TRT] --------------- Timing Runner: PWN(PWN(549 + (Unnamed Layer* 344) [Shuffle] + Mul_356, Add_358), 553 + (Unnamed Layer* 349) [Shuffle] + Mul_360) (PointWiseV2) [06/27/2024-06:28:09] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0211789 [06/27/2024-06:28:10] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00937143 [06/27/2024-06:28:10] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0103654 [06/27/2024-06:28:10] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0102296 [06/27/2024-06:28:10] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0105326 [06/27/2024-06:28:10] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0110727 [06/27/2024-06:28:10] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.015755 [06/27/2024-06:28:11] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.015755 [06/27/2024-06:28:11] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0165465 [06/27/2024-06:28:11] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0185417 [06/27/2024-06:28:11] [V] [TRT] Tactic: 0x000000000000000a Time: 0.00955733 [06/27/2024-06:28:11] [V] 
[TRT] Tactic: 0x000000000000000b Time: 0.00969387 [06/27/2024-06:28:12] [V] [TRT] Tactic: 0x000000000000000c Time: 0.0108356 [06/27/2024-06:28:12] [V] [TRT] Tactic: 0x000000000000000d Time: 0.012739 [06/27/2024-06:28:12] [V] [TRT] Tactic: 0x000000000000000e Time: 0.0141232 [06/27/2024-06:28:12] [V] [TRT] Tactic: 0x000000000000000f Time: 0.0168716 [06/27/2024-06:28:12] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.0194377 [06/27/2024-06:28:12] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.0225698 [06/27/2024-06:28:13] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.0272091 [06/27/2024-06:28:13] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.0246979 [06/27/2024-06:28:13] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.0115116 [06/27/2024-06:28:13] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.0123246 [06/27/2024-06:28:13] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.0138572 [06/27/2024-06:28:14] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.0176681 [06/27/2024-06:28:14] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0121661 [06/27/2024-06:28:14] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0114215 [06/27/2024-06:28:14] [V] [TRT] Tactic: 0x000000000000001e Time: 0.0110502 [06/27/2024-06:28:14] [V] [TRT] Fastest Tactic: 0x0000000000000001 Time: 0.00937143 [06/27/2024-06:28:14] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000001 [06/27/2024-06:28:14] [V] [TRT] *************** Autotuning format combination: Float(120,40,40:32,2,1), Float(120,40,40:32,2,1) -> Float(120,40,40:32,2,1) *************** [06/27/2024-06:28:14] [V] [TRT] --------------- Timing Runner: PWN(PWN(549 + (Unnamed Layer* 344) [Shuffle] + Mul_356, Add_358), 553 + (Unnamed Layer* 349) [Shuffle] + Mul_360) (PointWise) [06/27/2024-06:28:14] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:28:14] [V] [TRT] --------------- Timing Runner: PWN(PWN(549 + (Unnamed Layer* 344) [Shuffle] + Mul_356, Add_358), 553 + (Unnamed Layer* 349) [Shuffle] + Mul_360) 
(PointWiseV2) [06/27/2024-06:28:14] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.0138838 [06/27/2024-06:28:14] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.0184503 [06/27/2024-06:28:15] [V] [TRT] Tactic: 0x000000000000001a Time: 0.0260389 [06/27/2024-06:28:15] [V] [TRT] Tactic: 0x000000000000001b Time: 0.0406309 [06/27/2024-06:28:15] [V] [TRT] Tactic: 0x000000000000001f Time: 0.0916376 [06/27/2024-06:28:15] [V] [TRT] Fastest Tactic: 0x0000000000000018 Time: 0.0138838 [06/27/2024-06:28:15] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000018 [06/27/2024-06:28:15] [V] [TRT] *************** Autotuning format combination: Float(800,1:4,40,2,1), Float(800,1:4,40,2,1) -> Float(800,1:4,40,2,1) *************** [06/27/2024-06:28:15] [V] [TRT] --------------- Timing Runner: PWN(PWN(549 + (Unnamed Layer* 344) [Shuffle] + Mul_356, Add_358), 553 + (Unnamed Layer* 349) [Shuffle] + Mul_360) (PointWiseV2) [06/27/2024-06:28:15] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0181211 [06/27/2024-06:28:15] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0197669 [06/27/2024-06:28:15] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0256975 [06/27/2024-06:28:16] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0221936 [06/27/2024-06:28:16] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0212114 [06/27/2024-06:28:16] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.018816 [06/27/2024-06:28:16] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0266728 [06/27/2024-06:28:16] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0244297 [06/27/2024-06:28:17] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.024576 [06/27/2024-06:28:17] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0226952 [06/27/2024-06:28:17] [V] [TRT] Tactic: 0x000000000000000a Time: 0.0148626 [06/27/2024-06:28:17] [V] [TRT] Tactic: 0x000000000000000b Time: 0.0161402 [06/27/2024-06:28:17] [V] [TRT] Tactic: 0x000000000000000c Time: 0.0169854 [06/27/2024-06:28:18] [V] [TRT] Tactic: 0x000000000000000d Time: 0.0181211 
[06/27/2024-06:28:18] [V] [TRT] Tactic: 0x000000000000000e Time: 0.0199497 [06/27/2024-06:28:18] [V] [TRT] Tactic: 0x000000000000000f Time: 0.0196389 [06/27/2024-06:28:18] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.0234475 [06/27/2024-06:28:18] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.0244785 [06/27/2024-06:28:18] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.0248442 [06/27/2024-06:28:19] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.0293449 [06/27/2024-06:28:19] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.0139104 [06/27/2024-06:28:19] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.154185 [06/27/2024-06:28:19] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.0181211 [06/27/2024-06:28:19] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.0226743 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000001c Time: 0.173056 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0115453 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000001e Time: 0.011084 [06/27/2024-06:28:20] [V] [TRT] Fastest Tactic: 0x000000000000001e Time: 0.011084 [06/27/2024-06:28:20] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001e [06/27/2024-06:28:20] [V] [TRT] =============== Computing costs for [06/27/2024-06:28:20] [V] [TRT] *************** Autotuning format combination: Float(2400,800,40,2,1), Float(2400,800,40,2,1) -> Float(2400,800,40,2,1) *************** [06/27/2024-06:28:20] [V] [TRT] --------------- Timing Runner: PWN(PWN(555 + (Unnamed Layer* 352) [Shuffle] + Mul_362, PWN(557 + (Unnamed Layer* 355) [Shuffle], Pow_364)), Mul_366) (PointWise) [06/27/2024-06:28:20] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:28:20] [V] [TRT] --------------- Timing Runner: PWN(PWN(555 + (Unnamed Layer* 352) [Shuffle] + Mul_362, PWN(557 + (Unnamed Layer* 355) [Shuffle], Pow_364)), Mul_366) (PointWiseV2) [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0106371 [06/27/2024-06:28:20] [V] [TRT] Tactic: 
0x0000000000000001 Time: 0.0113765 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0114553 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0131258 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.0119345 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0119223 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0988241 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0144384 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0127025 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0133253 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0106684 [06/27/2024-06:28:20] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0106371 [06/27/2024-06:28:20] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000000 [06/27/2024-06:28:20] [V] [TRT] *************** Autotuning format combination: Float(2400,800,1,40,20), Float(2400,800,1,40,20) -> Float(2400,800,1,40,20) *************** [06/27/2024-06:28:20] [V] [TRT] --------------- Timing Runner: PWN(PWN(555 + (Unnamed Layer* 352) [Shuffle] + Mul_362, PWN(557 + (Unnamed Layer* 355) [Shuffle], Pow_364)), Mul_366) (PointWise) [06/27/2024-06:28:20] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:28:20] [V] [TRT] --------------- Timing Runner: PWN(PWN(555 + (Unnamed Layer* 352) [Shuffle] + Mul_362, PWN(557 + (Unnamed Layer* 355) [Shuffle], Pow_364)), Mul_366) (PointWiseV2) [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.090112 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.011309 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.011444 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.0131524 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.011971 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0119101 
[06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0168554 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0143506 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0127634 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0132056 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000001c Time: 0.0106057 [06/27/2024-06:28:20] [V] [TRT] Fastest Tactic: 0x000000000000001c Time: 0.0106057 [06/27/2024-06:28:20] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001c [06/27/2024-06:28:20] [V] [TRT] *************** Autotuning format combination: Float(600,200,1:4,10,5), Float(600,200,1:4,10,5) -> Float(600,200,1:4,10,5) *************** [06/27/2024-06:28:20] [V] [TRT] --------------- Timing Runner: PWN(PWN(555 + (Unnamed Layer* 352) [Shuffle] + Mul_362, PWN(557 + (Unnamed Layer* 355) [Shuffle], Pow_364)), Mul_366) (PointWise) [06/27/2024-06:28:20] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:28:20] [V] [TRT] --------------- Timing Runner: PWN(PWN(555 + (Unnamed Layer* 352) [Shuffle] + Mul_362, PWN(557 + (Unnamed Layer* 355) [Shuffle], Pow_364)), Mul_366) (PointWiseV2) [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0119345 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0142921 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.0187246 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.018944 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.017603 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.0177656 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.0264533 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.0230713 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.0233012 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.0251855 [06/27/2024-06:28:20] [V] [TRT] Tactic: 
0x000000000000000a Time: 0.49408 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000000b Time: 0.0122758 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000000c Time: 0.0131391 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000000d Time: 0.00736549 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000000e Time: 0.00526596 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000000f Time: 0.00554778 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.00866608 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.006656 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.00670192 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.00682688 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.00629128 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.00658348 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.00656062 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.00649621 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00662816 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000001d Time: 0.0076379 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000001e Time: 0.00629168 [06/27/2024-06:28:20] [V] [TRT] Fastest Tactic: 0x000000000000000e Time: 0.00526596 [06/27/2024-06:28:20] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000000e [06/27/2024-06:28:20] [V] [TRT] *************** Autotuning format combination: Float(120,40,40:32,2,1), Float(120,40,40:32,2,1) -> Float(120,40,40:32,2,1) *************** [06/27/2024-06:28:20] [V] [TRT] --------------- Timing Runner: PWN(PWN(555 + (Unnamed Layer* 352) [Shuffle] + Mul_362, PWN(557 + (Unnamed Layer* 355) [Shuffle], Pow_364)), Mul_366) (PointWise) [06/27/2024-06:28:20] [V] [TRT] PointWise has no valid tactics for this config, skipping [06/27/2024-06:28:20] [V] [TRT] --------------- Timing Runner: PWN(PWN(555 + (Unnamed Layer* 352) [Shuffle] + 
Mul_362, PWN(557 + (Unnamed Layer* 355) [Shuffle], Pow_364)), Mul_366) (PointWiseV2) [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000018 Time: 0.00956709 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000019 Time: 0.0151698 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000001a Time: 0.00637933 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000001b Time: 0.00970362 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000001f Time: 0.00654483 [06/27/2024-06:28:20] [V] [TRT] Fastest Tactic: 0x000000000000001a Time: 0.00637933 [06/27/2024-06:28:20] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x000000000000001a [06/27/2024-06:28:20] [V] [TRT] *************** Autotuning format combination: Float(800,1:4,40,2,1), Float(800,1:4,40,2,1) -> Float(800,1:4,40,2,1) *************** [06/27/2024-06:28:20] [V] [TRT] --------------- Timing Runner: PWN(PWN(555 + (Unnamed Layer* 352) [Shuffle] + Mul_362, PWN(557 + (Unnamed Layer* 355) [Shuffle], Pow_364)), Mul_366) (PointWiseV2) [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00687586 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.00691723 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000002 Time: 0.00811886 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000003 Time: 0.00633481 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000004 Time: 0.00572343 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000005 Time: 0.00691766 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000006 Time: 0.00580462 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000007 Time: 0.00658286 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000008 Time: 0.00618724 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000009 Time: 0.00693116 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000000a Time: 0.00640417 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000000b Time: 0.0059981 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000000c Time: 
0.00644929 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000000d Time: 0.00632209 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000000e Time: 0.00658951 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000000f Time: 0.00682623 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000010 Time: 0.00656956 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000011 Time: 0.00672914 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000012 Time: 0.00700016 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000013 Time: 0.00571685 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000014 Time: 0.00597429 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000015 Time: 0.00609543 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000016 Time: 0.00664935 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000017 Time: 0.006848 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000001c Time: 0.00658951 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000001d Time: 0.00676322 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x000000000000001e Time: 0.00680873 [06/27/2024-06:28:20] [V] [TRT] Fastest Tactic: 0x0000000000000013 Time: 0.00571685 [06/27/2024-06:28:20] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0x0000000000000013 [06/27/2024-06:28:20] [V] [TRT] =============== Computing costs for [06/27/2024-06:28:20] [V] [TRT] *************** Autotuning format combination: Float(7200,2400,120,6,1) -> Float(7200,6,1) *************** [06/27/2024-06:28:20] [V] [TRT] --------------- Timing Runner: Reshape_374 (Shuffle) [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.00650991 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0212532 [06/27/2024-06:28:20] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.00650991 [06/27/2024-06:28:20] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000000 [06/27/2024-06:28:20] [V] [TRT] *************** Autotuning format combination: Float(7200,2400,1,120,20) -> 
Float(1,6,1) *************** [06/27/2024-06:28:20] [V] [TRT] --------------- Timing Runner: Reshape_374 (Shuffle) [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0115678 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.080896 [06/27/2024-06:28:20] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0115678 [06/27/2024-06:28:20] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000000 [06/27/2024-06:28:20] [V] [TRT] *************** Autotuning format combination: Float(1800,600,1:4,30,5) -> Float(1:4,6,1) *************** [06/27/2024-06:28:20] [V] [TRT] --------------- Timing Runner: Reshape_374 (Shuffle) [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0121295 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.0226116 [06/27/2024-06:28:20] [V] [TRT] Fastest Tactic: 0x0000000000000000 Time: 0.0121295 [06/27/2024-06:28:20] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000000 [06/27/2024-06:28:20] [V] [TRT] *************** Autotuning format combination: Float(360,120,120:32,6,1) -> Float(7200:32,6,1) *************** [06/27/2024-06:28:20] [V] [TRT] --------------- Timing Runner: Reshape_374 (Shuffle) [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000000 Time: 0.0841874 [06/27/2024-06:28:20] [V] [TRT] Tactic: 0x0000000000000001 Time: 0.023552 [06/27/2024-06:28:20] [V] [TRT] Fastest Tactic: 0x0000000000000001 Time: 0.023552 [06/27/2024-06:28:20] [V] [TRT] >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0x0000000000000001 [06/27/2024-06:28:20] [V] [TRT] Adding reformat layer: Reformatted Input Tensor 0 to Conv_102 (267) from Float(1638400,6400,80,1) to Float(409600,1:4,5120,64) [06/27/2024-06:28:20] [V] [TRT] Adding reformat layer: Reformatted Input Tensor 0 to Conv_105 || Conv_130 (270) from Float(102400,1:4,2560,64) to Float(409600,1,10240,256) [06/27/2024-06:28:20] [V] [TRT] Adding reformat layer: Reformatted Output Tensor 0 to Conv_105 || 
Conv_130 (Conv_105 || Conv_130) from Float(409600,1,10240,256) to Float(102400,1:4,2560,64) [06/27/2024-06:28:20] [V] [TRT] Adding reformat layer: Reformatted Output Tensor 0 to PWN(PWN(Sigmoid_106), Mul_107) (273) from Float(51200,1:4,1280,32) to Float(204800,1600,40,1) [06/27/2024-06:28:20] [V] [TRT] Adding reformat layer: Reformatted Input Tensor 0 to Conv_138 (303) from Float(819200,1600,40,1) to Float(819200,1,20480,512) [06/27/2024-06:28:20] [V] [TRT] Adding reformat layer: Reformatted Input Tensor 0 to PWN(PWN(Sigmoid_142), Mul_143) (307) from Float(102400,1,5120,256) to Float(25600,1:4,1280,64) [06/27/2024-06:28:20] [V] [TRT] Adding reformat layer: Reformatted Output Tensor 0 to PWN(PWN(Sigmoid_142), Mul_143) (309) from Float(25600,1:4,1280,64) to Float(102400,400,20,1) [06/27/2024-06:28:20] [V] [TRT] Formats and tactics selection completed in 131.585 seconds. [06/27/2024-06:28:20] [V] [TRT] After reformat layers: 202 layers [06/27/2024-06:28:20] [V] [TRT] Pre-optimized block assignment. 
[06/27/2024-06:28:20] [V] [TRT] Block size 2457600 [06/27/2024-06:28:20] [V] [TRT] Block size 2457600 [06/27/2024-06:28:20] [V] [TRT] Block size 2457600 [06/27/2024-06:28:20] [V] [TRT] Block size 2457600 [06/27/2024-06:28:20] [V] [TRT] Block size 4915200 [06/27/2024-06:28:20] [V] [TRT] Block size 13107200 [06/27/2024-06:28:20] [V] [TRT] Block size 13107200 [06/27/2024-06:28:20] [V] [TRT] Block size 6553600 [06/27/2024-06:28:20] [V] [TRT] Block size 6553600 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 6553600 [06/27/2024-06:28:20] [V] [TRT] Block size 6553600 [06/27/2024-06:28:20] [V] [TRT] Block size 6553600 [06/27/2024-06:28:20] [V] [TRT] Block size 6553600 [06/27/2024-06:28:20] [V] [TRT] Block size 6553600 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 
[06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 409600 [06/27/2024-06:28:20] [V] [TRT] Block size 409600 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 409600 [06/27/2024-06:28:20] [V] [TRT] Block size 409600 [06/27/2024-06:28:20] [V] [TRT] Block size 409600 [06/27/2024-06:28:20] [V] [TRT] Block size 409600 [06/27/2024-06:28:20] [V] [TRT] Block size 409600 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 409600 [06/27/2024-06:28:20] [V] [TRT] Block size 409600 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block 
size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 6553600 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 409600 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 409600 [06/27/2024-06:28:20] [V] [TRT] Block size 409600 
[06/27/2024-06:28:20] [V] [TRT] Block size 409600 [06/27/2024-06:28:20] [V] [TRT] Block size 409600 [06/27/2024-06:28:20] [V] [TRT] Block size 409600 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 460800 [06/27/2024-06:28:20] [V] [TRT] Block size 460800 [06/27/2024-06:28:20] [V] [TRT] Block size 460800 [06/27/2024-06:28:20] [V] [TRT] Block size 153600 [06/27/2024-06:28:20] [V] [TRT] Block size 153600 [06/27/2024-06:28:20] [V] [TRT] Block size 153600 [06/27/2024-06:28:20] [V] [TRT] Block size 153600 [06/27/2024-06:28:20] [V] [TRT] Block size 460800 [06/27/2024-06:28:20] [V] [TRT] Block size 115200 [06/27/2024-06:28:20] [V] [TRT] Block size 115200 [06/27/2024-06:28:20] [V] [TRT] Block size 115200 [06/27/2024-06:28:20] [V] [TRT] Block size 38400 [06/27/2024-06:28:20] [V] [TRT] Block size 38400 [06/27/2024-06:28:20] [V] [TRT] Block size 38400 [06/27/2024-06:28:20] [V] [TRT] Block size 38400 [06/27/2024-06:28:20] [V] [TRT] Block size 115200 [06/27/2024-06:28:20] [V] [TRT] Block size 29184 [06/27/2024-06:28:20] [V] [TRT] Block size 29184 [06/27/2024-06:28:20] [V] [TRT] Block size 29184 [06/27/2024-06:28:20] [V] [TRT] Block size 9728 [06/27/2024-06:28:20] [V] [TRT] Block size 9728 [06/27/2024-06:28:20] [V] [TRT] Block size 9728 [06/27/2024-06:28:20] [V] [TRT] Block size 9728 [06/27/2024-06:28:20] [V] [TRT] Block size 29184 [06/27/2024-06:28:20] [V] [TRT] Block size 6553600 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 4 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 
[06/27/2024-06:28:20] [V] [TRT] Block size 4 [06/27/2024-06:28:20] [V] [TRT] Block size 4 [06/27/2024-06:28:20] [V] [TRT] Block size 4 [06/27/2024-06:28:20] [V] [TRT] Block size 4 [06/27/2024-06:28:20] [V] [TRT] Block size 3276800 [06/27/2024-06:28:20] [V] [TRT] Block size 4 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 819200 [06/27/2024-06:28:20] [V] [TRT] Block size 1638400 [06/27/2024-06:28:20] [V] [TRT] Block size 4 [06/27/2024-06:28:20] [V] [TRT] Block size 409600 [06/27/2024-06:28:20] [V] [TRT] Block size 12884377600 [06/27/2024-06:28:20] [V] [TRT] Total Activation Memory: 13166133276 [06/27/2024-06:28:20] [I] [TRT] Detected 1 inputs and 4 output network tensors. [06/27/2024-06:28:20] [V] [TRT] Conv_41 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: 0xa8609adc4e0ceb90 [06/27/2024-06:28:20] [V] [TRT] Conv_44 Set Tactic Name: ampere_scudnn_128x64_relu_xregs_large_nn_v1 Tactic: 0x5deb29b7a8e275f7 [06/27/2024-06:28:20] [V] [TRT] Conv_47 || Conv_58 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:28:20] [V] [TRT] Conv_50 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: 0x9808072e706def96 [06/27/2024-06:28:20] [V] [TRT] Conv_57 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: 0x9808072e706def96 [06/27/2024-06:28:20] [V] [TRT] Conv_63 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:28:20] [V] [TRT] Conv_66 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 0x503619c69ae500ff [06/27/2024-06:28:20] [V] [TRT] Conv_69 || Conv_94 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: 0xa8ef60e712f8ad24 [06/27/2024-06:28:20] [V] [TRT] Conv_72 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:28:20] [V] [TRT] Conv_79 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 
[06/27/2024-06:28:20] [V] [TRT] Conv_86 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:28:20] [V] [TRT] Conv_93 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:28:20] [V] [TRT] Conv_99 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: 0xa8ef60e712f8ad24 [06/27/2024-06:28:20] [V] [TRT] Conv_102 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: 0x999e005e3b016ea6 [06/27/2024-06:28:20] [V] [TRT] Conv_105 || Conv_130 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1 Tactic: 0x9dece0dc37e90462 [06/27/2024-06:28:20] [V] [TRT] Conv_108 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:28:20] [V] [TRT] Conv_111 Set Tactic Name: ampere_scudnn_winograd_128x128_ldg1_ldg4_relu_tile148t_nt_v1 Tactic: 0x94119b4c514b211a [06/27/2024-06:28:20] [V] [TRT] Conv_115 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:28:20] [V] [TRT] Conv_118 Set Tactic Name: ampere_scudnn_winograd_128x128_ldg1_ldg4_relu_tile148t_nt_v1 Tactic: 0x94119b4c514b211a [06/27/2024-06:28:20] [V] [TRT] Conv_122 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:28:20] [V] [TRT] Conv_125 Set Tactic Name: ampere_scudnn_winograd_128x128_ldg1_ldg4_relu_tile148t_nt_v1 Tactic: 0x94119b4c514b211a [06/27/2024-06:28:20] [V] [TRT] Conv_129 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xc0b05b61d128e46e [06/27/2024-06:28:20] [V] [TRT] Conv_135 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1_aligna4_alignc4 Tactic: 0xa419b3b68f2da07b [06/27/2024-06:28:20] [V] [TRT] Conv_138 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0xd15dd11d64344e83 [06/27/2024-06:28:20] [V] [TRT] Conv_141 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x90898977fc8ce537 [06/27/2024-06:28:20] [V] [TRT] Conv_154 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x1fc87d7eb370bb7a [06/27/2024-06:28:20] [V] [TRT] Conv_169 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x1fc87d7eb370bb7a [06/27/2024-06:28:20] [V] [TRT] Conv_175 || Conv_185 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1_aligna4_alignc4 Tactic: 0xa419b3b68f2da07b [06/27/2024-06:28:20] [V] [TRT] Conv_178 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:28:20] [V] [TRT] Conv_181 Set Tactic Name: ampere_scudnn_winograd_128x128_ldg1_ldg4_relu_tile148t_nt_v1 Tactic: 0x94119b4c514b211a [06/27/2024-06:28:20] [V] [TRT] Conv_184 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xc0b05b61d128e46e [06/27/2024-06:28:20] [V] [TRT] Conv_190 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1_aligna4_alignc4 Tactic: 0xa419b3b68f2da07b [06/27/2024-06:28:20] [V] [TRT] Conv_193 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x9cd5cdc35441c505 [06/27/2024-06:28:20] [V] [TRT] Conv_199 || Conv_209 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1_aligna4_alignc4 Tactic: 0xa419b3b68f2da07b [06/27/2024-06:28:20] [V] [TRT] Conv_202 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:28:20] [V] [TRT] Conv_208 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:28:20] [V] [TRT] Conv_214 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: 0xa8ef60e712f8ad24 [06/27/2024-06:28:20] [V] [TRT] Conv_217 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 0x01cf8ce2da913006 [06/27/2024-06:28:20] [V] [TRT] Conv_261 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: 0xc3cf6e1d1c6aff27 [06/27/2024-06:28:20] [V] [TRT] Conv_221 || Conv_231 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1_aligna4_alignc4 Tactic: 0xa419b3b68f2da07b [06/27/2024-06:28:20] [V] [TRT] Conv_224 Set Tactic Name: 
sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0x865894c4635db7fd [06/27/2024-06:28:20] [V] [TRT] Conv_227 Set Tactic Name: ampere_scudnn_winograd_128x128_ldg1_ldg4_relu_tile148t_nt_v1 Tactic: 0x94119b4c514b211a [06/27/2024-06:28:20] [V] [TRT] Conv_230 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r1s1_aligna4_alignc4 Tactic: 0xc0b05b61d128e46e [06/27/2024-06:28:20] [V] [TRT] Conv_236 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nchwkrsc_nchw_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_simple_t1r1s1_aligna4_alignc4 Tactic: 0xa419b3b68f2da07b [06/27/2024-06:28:21] [V] [TRT] Conv_239 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 0xa31d27de74b895ff [06/27/2024-06:28:21] [V] [TRT] Conv_246 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_simple_t1r1s1_aligna4_alignc4 Tactic: 0x1fc87d7eb370bb7a [06/27/2024-06:28:21] [V] [TRT] Layer: Reformatting CopyNode for Network Input images Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 459 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 467 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 505 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 513 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 551 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 559 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Slice_4 Host Persistent: 0 
Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Slice_14 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Slice_24 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Slice_34 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Slice_9 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Slice_19 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Slice_29 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Slice_39 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_41 Host Persistent: 1664 Device Persistent: 614912 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_42), Mul_43) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_44 Host Persistent: 4224 Device Persistent: 154112 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_45), Mul_46) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_47 || Conv_58 Host Persistent: 3200 Device Persistent: 154112 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_48), Mul_49) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_50 Host Persistent: 3200 Device Persistent: 154112 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_51), Mul_52) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_53 Host Persistent: 32 Device Persistent: 0 Scratch Memory: 103936 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(PWN(Sigmoid_54), Mul_55), Add_56) Host Persistent: 340 Device Persistent: 0 Scratch 
Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_57 Host Persistent: 3200 Device Persistent: 154112 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 224 copy Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: BatchNormalization_60 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_61), Mul_62) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_63 Host Persistent: 3200 Device Persistent: 154112 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_64), Mul_65) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_66 Host Persistent: 1664 Device Persistent: 38912 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_67), Mul_68) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_69 || Conv_94 Host Persistent: 3200 Device Persistent: 38912 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_70), Mul_71) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_72 Host Persistent: 3200 Device Persistent: 38912 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_73), Mul_74) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_75 Host Persistent: 32 Device Persistent: 0 Scratch Memory: 411136 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(PWN(Sigmoid_76), Mul_77), Add_78) Host Persistent: 340 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_79 Host Persistent: 3200 Device Persistent: 38912 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_80), Mul_81) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_82 Host 
Persistent: 32 Device Persistent: 0 Scratch Memory: 411136 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(PWN(Sigmoid_83), Mul_84), Add_85) Host Persistent: 340 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_86 Host Persistent: 3200 Device Persistent: 38912 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_87), Mul_88) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_89 Host Persistent: 32 Device Persistent: 0 Scratch Memory: 411136 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(PWN(Sigmoid_90), Mul_91), Add_92) Host Persistent: 340 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_93 Host Persistent: 3200 Device Persistent: 38912 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 260 copy Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: BatchNormalization_96 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_97), Mul_98) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_99 Host Persistent: 3200 Device Persistent: 38912 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_100), Mul_101) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Reformatting CopyNode for Input Tensor 0 to Conv_102 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_102 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_103), Mul_104) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Reformatting CopyNode for Input Tensor 0 to Conv_105 || Conv_130 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_105 
|| Conv_130 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Reformatting CopyNode for Output Tensor 0 to Conv_105 || Conv_130 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_106), Mul_107) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Reformatting CopyNode for Output Tensor 0 to PWN(PWN(Sigmoid_106), Mul_107) Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_108 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_109), Mul_110) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_111 Host Persistent: 512 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(PWN(Sigmoid_112), Mul_113), Add_114) Host Persistent: 340 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_115 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_116), Mul_117) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_118 Host Persistent: 512 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(PWN(Sigmoid_119), Mul_120), Add_121) Host Persistent: 340 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_122 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_123), Mul_124) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_125 Host Persistent: 512 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(PWN(Sigmoid_126), Mul_127), Add_128) Host Persistent: 340 Device Persistent: 0 
Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_129 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 296 copy Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: BatchNormalization_132 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_133), Mul_134) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_135 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_136), Mul_137) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Reformatting CopyNode for Input Tensor 0 to Conv_138 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_138 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_139), Mul_140) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_141 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Reformatting CopyNode for Input Tensor 0 to PWN(PWN(Sigmoid_142), Mul_143) Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_142), Mul_143) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Reformatting CopyNode for Output Tensor 0 to PWN(PWN(Sigmoid_142), Mul_143) Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: MaxPool_144 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: MaxPool_145 Host Persistent: 48 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: MaxPool_146 Host Persistent: 48 Device 
Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 309 copy Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_148 Host Persistent: 32 Device Persistent: 0 Scratch Memory: 3736064 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_149), Mul_150) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_151 || Conv_161 Host Persistent: 32 Device Persistent: 0 Scratch Memory: 1868288 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_152), Mul_153) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_154 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_155), Mul_156) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_157 Host Persistent: 2192 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_158), Mul_159) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_160 Host Persistent: 32 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 327 copy Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: BatchNormalization_163 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_164), Mul_165) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_166 Host Persistent: 32 Device Persistent: 0 Scratch Memory: 1868288 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_167), Mul_168) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_169 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_170), Mul_171) 
Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Resize_173 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 342 copy Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_175 || Conv_185 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_176), Mul_177) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_178 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_179), Mul_180) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_181 Host Persistent: 512 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_182), Mul_183) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_184 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 354 copy Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: BatchNormalization_187 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_188), Mul_189) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_190 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_191), Mul_192) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_193 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_194), Mul_195) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Resize_197 Host 
Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 369 copy Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_199 || Conv_209 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_200), Mul_201) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_202 Host Persistent: 3200 Device Persistent: 38912 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_203), Mul_204) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_205 Host Persistent: 32 Device Persistent: 0 Scratch Memory: 411136 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_206), Mul_207) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_208 Host Persistent: 3200 Device Persistent: 38912 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 381 copy Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: BatchNormalization_211 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_212), Mul_213) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_214 Host Persistent: 3200 Device Persistent: 38912 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_215), Mul_216) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_217 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_261 Host Persistent: 3200 Device Persistent: 38912 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_218), Mul_219) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 
364 copy Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_221 || Conv_231 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Reshape_275 + Transpose_276 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_222), Mul_223) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_224 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(Sigmoid_277) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Split_278 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Split_278_59 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Split_278_60 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_225), Mul_226) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_227 Host Persistent: 512 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(457 + (Unnamed Layer* 250) [Shuffle] + Mul_280, Add_282), 461 + (Unnamed Layer* 255) [Shuffle] + Mul_284) Host Persistent: 340 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(463 + (Unnamed Layer* 258) [Shuffle] + Mul_286, PWN(465 + (Unnamed Layer* 261) [Shuffle], Pow_288)), Mul_290) Host Persistent: 340 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_228), Mul_229) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 462 copy Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 468 copy Host Persistent: 0 Device Persistent: 0 Scratch 
Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_230 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Reshape_298 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Reshape_298_copy_output Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 403 copy Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: BatchNormalization_233 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_234), Mul_235) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_236 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_237), Mul_238) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_239 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_299 Host Persistent: 2192 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_240), Mul_241) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 337 copy Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_243 || Conv_253 Host Persistent: 32 Device Persistent: 0 Scratch Memory: 1868288 [06/27/2024-06:28:21] [V] [TRT] Layer: Reshape_313 + Transpose_314 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_244), Mul_245) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_246 Host Persistent: 2784 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(Sigmoid_315) Host Persistent: 244 Device Persistent: 0 Scratch 
Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Split_316 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Split_316_65 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Split_316_66 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_247), Mul_248) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_249 Host Persistent: 2192 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(503 + (Unnamed Layer* 297) [Shuffle] + Mul_318, Add_320), 507 + (Unnamed Layer* 302) [Shuffle] + Mul_322) Host Persistent: 340 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(509 + (Unnamed Layer* 305) [Shuffle] + Mul_324, PWN(511 + (Unnamed Layer* 308) [Shuffle], Pow_326)), Mul_328) Host Persistent: 340 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_250), Mul_251) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 508 copy Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 514 copy Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_252 Host Persistent: 32 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Reshape_336 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Reshape_336_copy_output Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 425 copy Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: BatchNormalization_255 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_256), Mul_257) Host Persistent: 
244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_258 Host Persistent: 32 Device Persistent: 0 Scratch Memory: 1868288 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(Sigmoid_259), Mul_260) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Conv_337 Host Persistent: 2192 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Reshape_351 + Transpose_352 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(Sigmoid_353) Host Persistent: 244 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Split_354 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Split_354_71 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Split_354_72 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(549 + (Unnamed Layer* 344) [Shuffle] + Mul_356, Add_358), 553 + (Unnamed Layer* 349) [Shuffle] + Mul_360) Host Persistent: 340 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: PWN(PWN(555 + (Unnamed Layer* 352) [Shuffle] + Mul_362, PWN(557 + (Unnamed Layer* 355) [Shuffle], Pow_364)), Mul_366) Host Persistent: 340 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 554 copy Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: 560 copy Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Reshape_374 Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [V] [TRT] Layer: Reshape_374_copy_output Host Persistent: 0 Device Persistent: 0 Scratch Memory: 0 [06/27/2024-06:28:21] [I] [TRT] Total Host Persistent Memory: 149632 [06/27/2024-06:28:21] [I] [TRT] Total Device Persistent Memory: 1813504 [06/27/2024-06:28:21] 
[I] [TRT] Total Scratch Memory: 3736064 [06/27/2024-06:28:21] [I] [TRT] [MemUsageStats] Peak memory usage of TRT CPU/GPU memory allocators: CPU 7 MiB, GPU 4369 MiB [06/27/2024-06:28:21] [I] [TRT] [BlockAssignment] Algorithm ShiftNTopDown took 29.7663ms to assign 8 blocks to 179 nodes requiring 35942400 bytes. [06/27/2024-06:28:21] [V] [TRT] Optimized block assignment. [06/27/2024-06:28:21] [V] [TRT] Block size 13107200 [06/27/2024-06:28:21] [V] [TRT] Block size 13107200 [06/27/2024-06:28:21] [V] [TRT] Block size 3276800 [06/27/2024-06:28:21] [V] [TRT] Block size 3276800 [06/27/2024-06:28:21] [V] [TRT] Block size 2457600 [06/27/2024-06:28:21] [V] [TRT] Block size 409600 [06/27/2024-06:28:21] [V] [TRT] Block size 153600 [06/27/2024-06:28:21] [V] [TRT] Block size 153600 [06/27/2024-06:28:21] [I] [TRT] Total Activation Memory: 35942400 [06/27/2024-06:28:21] [V] [TRT] Disabling unused tactic source: CUBLAS, CUBLAS_LT [06/27/2024-06:28:21] [V] [TRT] Using cuDNN as a tactic source [06/27/2024-06:28:21] [I] [TRT] [MemUsageChange] Init cuDNN: CPU +0, GPU +10, now: CPU 10232, GPU 2234 (MiB) [06/27/2024-06:28:21] [V] [TRT] Engine generation completed in 133.72 seconds. [06/27/2024-06:28:21] [V] [TRT] Deleting timing cache: 1265 entries, served 2768 hits since creation. 
[06/27/2024-06:28:21] [V] [TRT] Engine Layer Information: Layer(NoOp): Reformatting CopyNode for Network Input images, Tactic: 0x0000000000000000, images[Float(1,3,640,640)] -> Reformatted images[Float(1,3,640,640)] Layer(Constant): 459, Tactic: 0x0000000000000000, -> (Unnamed Layer* 252) [Constant]_output[Float(1,3,80,80,2)] Layer(Constant): 467, Tactic: 0x0000000000000000, -> (Unnamed Layer* 263) [Constant]_output[Float(1,3,80,80,2)] Layer(Constant): 505, Tactic: 0x0000000000000000, -> (Unnamed Layer* 299) [Constant]_output[Float(1,3,40,40,2)] Layer(Constant): 513, Tactic: 0x0000000000000000, -> (Unnamed Layer* 310) [Constant]_output[Float(1,3,40,40,2)] Layer(Constant): 551, Tactic: 0x0000000000000000, -> (Unnamed Layer* 346) [Constant]_output[Float(1,3,20,20,2)] Layer(Constant): 559, Tactic: 0x0000000000000000, -> (Unnamed Layer* 357) [Constant]_output[Float(1,3,20,20,2)] Layer(Slice): Slice_4, Tactic: 0x0000000000000000, Reformatted images[Float(1,3,640,640)] -> 170[Float(1,3,320,640)] Layer(Slice): Slice_14, Tactic: 0x0000000000000000, Reformatted images[Float(1,3,640,640)] -> 180[Float(1,3,320,640)] Layer(Slice): Slice_24, Tactic: 0x0000000000000000, Reformatted images[Float(1,3,640,640)] -> 190[Float(1,3,320,640)] Layer(Slice): Slice_34, Tactic: 0x0000000000000000, Reformatted images[Float(1,3,640,640)] -> 200[Float(1,3,320,640)] Layer(Slice): Slice_9, Tactic: 0x0000000000000000, 170[Float(1,3,320,640)] -> 206[Float(1,3,320,320)] Layer(Slice): Slice_19, Tactic: 0x0000000000000000, 180[Float(1,3,320,640)] -> 206[Float(1,3,320,320)] Layer(Slice): Slice_29, Tactic: 0x0000000000000000, 190[Float(1,3,320,640)] -> 206[Float(1,3,320,320)] Layer(Slice): Slice_39, Tactic: 0x0000000000000000, 200[Float(1,3,320,640)] -> 206[Float(1,3,320,320)] Layer(CaskConvolution): Conv_41, Tactic: 0xa8609adc4e0ceb90, 206[Float(1,12,320,320)] -> 207[Float(1,32,320,320)] Layer(PointWiseV2): PWN(PWN(Sigmoid_42), Mul_43), Tactic: 0x0000000000000005, 207[Float(1,32,320,320)] -> 
209[Float(1,32,320,320)] Layer(CaskConvolution): Conv_44, Tactic: 0x5deb29b7a8e275f7, 209[Float(1,32,320,320)] -> 210[Float(1,64,160,160)] Layer(PointWiseV2): PWN(PWN(Sigmoid_45), Mul_46), Tactic: 0x0000000000000005, 210[Float(1,64,160,160)] -> 212[Float(1,64,160,160)] Layer(CaskConvolution): Conv_47 || Conv_58, Tactic: 0xc3cf6e1d1c6aff27, 212[Float(1,64,160,160)] -> Conv_47 || Conv_58[Float(1,64,160,160)] Layer(PointWiseV2): PWN(PWN(Sigmoid_48), Mul_49), Tactic: 0x0000000000000008, Conv_47 || Conv_58[Float(1,32,160,160)] -> 215[Float(1,32,160,160)] Layer(CaskConvolution): Conv_50, Tactic: 0x9808072e706def96, 215[Float(1,32,160,160)] -> 216[Float(1,32,160,160)] Layer(PointWiseV2): PWN(PWN(Sigmoid_51), Mul_52), Tactic: 0x0000000000000002, 216[Float(1,32,160,160)] -> 218[Float(1,32,160,160)] Layer(CudnnConvolution): Conv_53, Tactic: 0x000000000000003e, 218[Float(1,32,160,160)] -> 219[Float(1,32,160,160)] Layer(PointWiseV2): PWN(PWN(PWN(Sigmoid_54), Mul_55), Add_56), Tactic: 0x0000000000000002, 219[Float(1,32,160,160)], 215[Float(1,32,160,160)] -> 222[Float(1,32,160,160)] Layer(CaskConvolution): Conv_57, Tactic: 0x9808072e706def96, 222[Float(1,32,160,160)] -> 225[Float(1,32,160,160)] Layer(Reformat): 224 copy, Tactic: 0x00000000000003e8, Conv_47 || Conv_58[Float(1,32,160,160)] -> 225[Float(1,32,160,160)] Layer(Scale): BatchNormalization_60, Tactic: 0x0000000000000000, 225[Float(1,64,160,160)] -> 226[Float(1,64,160,160)] Layer(PointWiseV2): PWN(PWN(Sigmoid_61), Mul_62), Tactic: 0x0000000000000005, 226[Float(1,64,160,160)] -> 228[Float(1,64,160,160)] Layer(CaskConvolution): Conv_63, Tactic: 0xc3cf6e1d1c6aff27, 228[Float(1,64,160,160)] -> 229[Float(1,64,160,160)] Layer(PointWiseV2): PWN(PWN(Sigmoid_64), Mul_65), Tactic: 0x0000000000000005, 229[Float(1,64,160,160)] -> 231[Float(1,64,160,160)] Layer(CaskConvolution): Conv_66, Tactic: 0x503619c69ae500ff, 231[Float(1,64,160,160)] -> 232[Float(1,128,80,80)] Layer(PointWiseV2): PWN(PWN(Sigmoid_67), Mul_68), Tactic: 
0x0000000000000001, 232[Float(1,128,80,80)] -> 234[Float(1,128,80,80)] Layer(CaskConvolution): Conv_69 || Conv_94, Tactic: 0xa8ef60e712f8ad24, 234[Float(1,128,80,80)] -> Conv_69 || Conv_94[Float(1,128,80,80)] Layer(PointWiseV2): PWN(PWN(Sigmoid_70), Mul_71), Tactic: 0x0000000000000005, Conv_69 || Conv_94[Float(1,64,80,80)] -> 237[Float(1,64,80,80)] Layer(CaskConvolution): Conv_72, Tactic: 0xc3cf6e1d1c6aff27, 237[Float(1,64,80,80)] -> 238[Float(1,64,80,80)] Layer(PointWiseV2): PWN(PWN(Sigmoid_73), Mul_74), Tactic: 0x0000000000000005, 238[Float(1,64,80,80)] -> 240[Float(1,64,80,80)] Layer(CudnnConvolution): Conv_75, Tactic: 0x0000000000000076, 240[Float(1,64,80,80)] -> 241[Float(1,64,80,80)] Layer(PointWiseV2): PWN(PWN(PWN(Sigmoid_76), Mul_77), Add_78), Tactic: 0x0000000000000002, 241[Float(1,64,80,80)], 237[Float(1,64,80,80)] -> 244[Float(1,64,80,80)] Layer(CaskConvolution): Conv_79, Tactic: 0xc3cf6e1d1c6aff27, 244[Float(1,64,80,80)] -> 245[Float(1,64,80,80)] Layer(PointWiseV2): PWN(PWN(Sigmoid_80), Mul_81), Tactic: 0x0000000000000005, 245[Float(1,64,80,80)] -> 247[Float(1,64,80,80)] Layer(CudnnConvolution): Conv_82, Tactic: 0x0000000000000076, 247[Float(1,64,80,80)] -> 248[Float(1,64,80,80)] Layer(PointWiseV2): PWN(PWN(PWN(Sigmoid_83), Mul_84), Add_85), Tactic: 0x0000000000000002, 248[Float(1,64,80,80)], 244[Float(1,64,80,80)] -> 251[Float(1,64,80,80)] Layer(CaskConvolution): Conv_86, Tactic: 0xc3cf6e1d1c6aff27, 251[Float(1,64,80,80)] -> 252[Float(1,64,80,80)] Layer(PointWiseV2): PWN(PWN(Sigmoid_87), Mul_88), Tactic: 0x0000000000000005, 252[Float(1,64,80,80)] -> 254[Float(1,64,80,80)] Layer(CudnnConvolution): Conv_89, Tactic: 0x0000000000000076, 254[Float(1,64,80,80)] -> 255[Float(1,64,80,80)] Layer(PointWiseV2): PWN(PWN(PWN(Sigmoid_90), Mul_91), Add_92), Tactic: 0x0000000000000002, 255[Float(1,64,80,80)], 251[Float(1,64,80,80)] -> 258[Float(1,64,80,80)] Layer(CaskConvolution): Conv_93, Tactic: 0xc3cf6e1d1c6aff27, 258[Float(1,64,80,80)] -> 261[Float(1,64,80,80)] 
Layer(Reformat): 260 copy, Tactic: 0x0000000000000000, Conv_69 || Conv_94[Float(1,64,80,80)] -> 261[Float(1,64,80,80)] Layer(Scale): BatchNormalization_96, Tactic: 0x0000000000000000, 261[Float(1,128,80,80)] -> 262[Float(1,128,80,80)] Layer(PointWiseV2): PWN(PWN(Sigmoid_97), Mul_98), Tactic: 0x0000000000000001, 262[Float(1,128,80,80)] -> 264[Float(1,128,80,80)] Layer(CaskConvolution): Conv_99, Tactic: 0xa8ef60e712f8ad24, 264[Float(1,128,80,80)] -> 265[Float(1,128,80,80)] Layer(PointWiseV2): PWN(PWN(Sigmoid_100), Mul_101), Tactic: 0x0000000000000005, 265[Float(1,128,80,80)] -> 370[Float(1,128,80,80)] Layer(Reformat): Reformatting CopyNode for Input Tensor 0 to Conv_102, Tactic: 0x00000000000003ea, 370[Float(1,128,80,80)] -> Reformatted Input Tensor 0 to Conv_102[Float(1,128,80,80)] Layer(CaskConvolution): Conv_102, Tactic: 0x999e005e3b016ea6, Reformatted Input Tensor 0 to Conv_102[Float(1,128,80,80)] -> 268[Float(1,256,40,40)] Layer(PointWiseV2): PWN(PWN(Sigmoid_103), Mul_104), Tactic: 0x000000000000000a, 268[Float(1,256,40,40)] -> 270[Float(1,256,40,40)] Layer(NoOp): Reformatting CopyNode for Input Tensor 0 to Conv_105 || Conv_130, Tactic: 0x0000000000000000, 270[Float(1,256,40,40)] -> Reformatted Input Tensor 0 to Conv_105 || Conv_130[Float(1,256,40,40)] Layer(CaskConvolution): Conv_105 || Conv_130, Tactic: 0x9dece0dc37e90462, Reformatted Input Tensor 0 to Conv_105 || Conv_130[Float(1,256,40,40)] -> Reformatted Output Tensor 0 to Conv_105 || Conv_130[Float(1,256,40,40)] Layer(NoOp): Reformatting CopyNode for Output Tensor 0 to Conv_105 || Conv_130, Tactic: 0x0000000000000000, Reformatted Output Tensor 0 to Conv_105 || Conv_130[Float(1,256,40,40)] -> Conv_105 || Conv_130[Float(1,256,40,40)] Layer(PointWiseV2): PWN(PWN(Sigmoid_106), Mul_107), Tactic: 0x0000000000000016, Conv_105 || Conv_130[Float(1,128,40,40)] -> Reformatted Output Tensor 0 to PWN(PWN(Sigmoid_106), Mul_107)[Float(1,128,40,40)] Layer(Reformat): Reformatting CopyNode for Output Tensor 0 to 
PWN(PWN(Sigmoid_106), Mul_107), Tactic: 0x00000000000003ea, Reformatted Output Tensor 0 to PWN(PWN(Sigmoid_106), Mul_107)[Float(1,128,40,40)] -> 273[Float(1,128,40,40)] Layer(CaskConvolution): Conv_108, Tactic: 0x865894c4635db7fd, 273[Float(1,128,40,40)] -> 274[Float(1,128,40,40)] Layer(PointWiseV2): PWN(PWN(Sigmoid_109), Mul_110), Tactic: 0x0000000000000008, 274[Float(1,128,40,40)] -> 276[Float(1,128,40,40)] Layer(CaskConvolution): Conv_111, Tactic: 0x94119b4c514b211a, 276[Float(1,128,40,40)] -> 277[Float(1,128,40,40)] Layer(PointWiseV2): PWN(PWN(PWN(Sigmoid_112), Mul_113), Add_114), Tactic: 0x0000000000000007, 277[Float(1,128,40,40)], 273[Float(1,128,40,40)] -> 280[Float(1,128,40,40)] Layer(CaskConvolution): Conv_115, Tactic: 0x865894c4635db7fd, 280[Float(1,128,40,40)] -> 281[Float(1,128,40,40)] Layer(PointWiseV2): PWN(PWN(Sigmoid_116), Mul_117), Tactic: 0x0000000000000008, 281[Float(1,128,40,40)] -> 283[Float(1,128,40,40)] Layer(CaskConvolution): Conv_118, Tactic: 0x94119b4c514b211a, 283[Float(1,128,40,40)] -> 284[Float(1,128,40,40)] Layer(PointWiseV2): PWN(PWN(PWN(Sigmoid_119), Mul_120), Add_121), Tactic: 0x0000000000000007, 284[Float(1,128,40,40)], 280[Float(1,128,40,40)] -> 287[Float(1,128,40,40)] Layer(CaskConvolution): Conv_122, Tactic: 0x865894c4635db7fd, 287[Float(1,128,40,40)] -> 288[Float(1,128,40,40)] Layer(PointWiseV2): PWN(PWN(Sigmoid_123), Mul_124), Tactic: 0x0000000000000008, 288[Float(1,128,40,40)] -> 290[Float(1,128,40,40)] Layer(CaskConvolution): Conv_125, Tactic: 0x94119b4c514b211a, 290[Float(1,128,40,40)] -> 291[Float(1,128,40,40)] Layer(PointWiseV2): PWN(PWN(PWN(Sigmoid_126), Mul_127), Add_128), Tactic: 0x0000000000000007, 291[Float(1,128,40,40)], 287[Float(1,128,40,40)] -> 294[Float(1,128,40,40)] Layer(CaskConvolution): Conv_129, Tactic: 0xc0b05b61d128e46e, 294[Float(1,128,40,40)] -> 297[Float(1,128,40,40)] Layer(Reformat): 296 copy, Tactic: 0x00000000000003e8, Conv_105 || Conv_130[Float(1,128,40,40)] -> 297[Float(1,128,40,40)] Layer(Scale): 
BatchNormalization_132, Tactic: 0x0000000000000000, 297[Float(1,256,40,40)] -> 298[Float(1,256,40,40)] Layer(PointWiseV2): PWN(PWN(Sigmoid_133), Mul_134), Tactic: 0x0000000000000009, 298[Float(1,256,40,40)] -> 300[Float(1,256,40,40)] Layer(CaskConvolution): Conv_135, Tactic: 0xa419b3b68f2da07b, 300[Float(1,256,40,40)] -> 301[Float(1,256,40,40)] Layer(PointWiseV2): PWN(PWN(Sigmoid_136), Mul_137), Tactic: 0x0000000000000009, 301[Float(1,256,40,40)] -> 343[Float(1,256,40,40)] Layer(Reformat): Reformatting CopyNode for Input Tensor 0 to Conv_138, Tactic: 0x00000000000003ea, 343[Float(1,256,40,40)] -> Reformatted Input Tensor 0 to Conv_138[Float(1,256,40,40)] Layer(CaskConvolution): Conv_138, Tactic: 0xd15dd11d64344e83, Reformatted Input Tensor 0 to Conv_138[Float(1,256,40,40)] -> 304[Float(1,512,20,20)] Layer(PointWiseV2): PWN(PWN(Sigmoid_139), Mul_140), Tactic: 0x000000000000001c, 304[Float(1,512,20,20)] -> 306[Float(1,512,20,20)] Layer(CaskConvolution): Conv_141, Tactic: 0x90898977fc8ce537, 306[Float(1,512,20,20)] -> 307[Float(1,256,20,20)] Layer(NoOp): Reformatting CopyNode for Input Tensor 0 to PWN(PWN(Sigmoid_142), Mul_143), Tactic: 0x0000000000000000, 307[Float(1,256,20,20)] -> Reformatted Input Tensor 0 to PWN(PWN(Sigmoid_142), Mul_143)[Float(1,256,20,20)] Layer(PointWiseV2): PWN(PWN(Sigmoid_142), Mul_143), Tactic: 0x000000000000000a, Reformatted Input Tensor 0 to PWN(PWN(Sigmoid_142), Mul_143)[Float(1,256,20,20)] -> Reformatted Output Tensor 0 to PWN(PWN(Sigmoid_142), Mul_143)[Float(1,256,20,20)] Layer(Reformat): Reformatting CopyNode for Output Tensor 0 to PWN(PWN(Sigmoid_142), Mul_143), Tactic: 0x00000000000003e8, Reformatted Output Tensor 0 to PWN(PWN(Sigmoid_142), Mul_143)[Float(1,256,20,20)] -> 309[Float(1,256,20,20)] Layer(TiledPooling): MaxPool_144, Tactic: 0x0000000000760a14, 309[Float(1,256,20,20)] -> 313[Float(1,256,20,20)] Layer(CudnnPooling): MaxPool_145, Tactic: 0xffffffffffffffff, 309[Float(1,256,20,20)] -> 313[Float(1,256,20,20)] 
Layer(CudnnPooling): MaxPool_146, Tactic: 0xffffffffffffffff, 309[Float(1,256,20,20)] -> 313[Float(1,256,20,20)] Layer(Reformat): 309 copy, Tactic: 0x00000000000003e8, 309[Float(1,256,20,20)] -> 313[Float(1,256,20,20)] Layer(CudnnConvolution): Conv_148, Tactic: 0x0000000000000039, 313[Float(1,1024,20,20)] -> 314[Float(1,512,20,20)] Layer(PointWiseV2): PWN(PWN(Sigmoid_149), Mul_150), Tactic: 0x0000000000000000, 314[Float(1,512,20,20)] -> 316[Float(1,512,20,20)] Layer(CudnnConvolution): Conv_151 || Conv_161, Tactic: 0x0000000000000039, 316[Float(1,512,20,20)] -> Conv_151 || Conv_161[Float(1,512,20,20)] Layer(PointWiseV2): PWN(PWN(Sigmoid_152), Mul_153), Tactic: 0x0000000000000008, Conv_151 || Conv_161[Float(1,256,20,20)] -> 319[Float(1,256,20,20)] Layer(CaskConvolution): Conv_154, Tactic: 0x1fc87d7eb370bb7a, 319[Float(1,256,20,20)] -> 320[Float(1,256,20,20)] Layer(PointWiseV2): PWN(PWN(Sigmoid_155), Mul_156), Tactic: 0x0000000000000001, 320[Float(1,256,20,20)] -> 322[Float(1,256,20,20)] Layer(FusedConvActConvolution): Conv_157, Tactic: 0x00000000009fffff, 322[Float(1,256,20,20)] -> 323[Float(1,256,20,20)] Layer(PointWiseV2): PWN(PWN(Sigmoid_158), Mul_159), Tactic: 0x0000000000000001, 323[Float(1,256,20,20)] -> 325[Float(1,256,20,20)] Layer(CudnnConvolution): Conv_160, Tactic: 0x0000000000000070, 325[Float(1,256,20,20)] -> 328[Float(1,256,20,20)] Layer(Reformat): 327 copy, Tactic: 0x00000000000003e8, Conv_151 || Conv_161[Float(1,256,20,20)] -> 328[Float(1,256,20,20)] Layer(Scale): BatchNormalization_163, Tactic: 0x0000000000000000, 328[Float(1,512,20,20)] -> 329[Float(1,512,20,20)] Layer(PointWiseV2): PWN(PWN(Sigmoid_164), Mul_165), Tactic: 0x0000000000000000, 329[Float(1,512,20,20)] -> 331[Float(1,512,20,20)] Layer(CudnnConvolution): Conv_166, Tactic: 0x0000000000000039, 331[Float(1,512,20,20)] -> 332[Float(1,512,20,20)] Layer(PointWiseV2): PWN(PWN(Sigmoid_167), Mul_168), Tactic: 0x0000000000000000, 332[Float(1,512,20,20)] -> 334[Float(1,512,20,20)] 
Layer(CaskConvolution): Conv_169, Tactic: 0x1fc87d7eb370bb7a, 334[Float(1,512,20,20)] -> 335[Float(1,256,20,20)] Layer(PointWiseV2): PWN(PWN(Sigmoid_170), Mul_171), Tactic: 0x0000000000000001, 335[Float(1,256,20,20)] -> 337[Float(1,256,20,20)] Layer(Resize): Resize_173, Tactic: 0x0000000000000000, 337[Float(1,256,20,20)] -> 342[Float(1,256,40,40)] Layer(Reformat): 342 copy, Tactic: 0x0000000000000000, 342[Float(1,256,40,40)] -> 343[Float(1,256,40,40)] Layer(CaskConvolution): Conv_175 || Conv_185, Tactic: 0xa419b3b68f2da07b, 343[Float(1,512,40,40)] -> Conv_175 || Conv_185[Float(1,256,40,40)] Layer(PointWiseV2): PWN(PWN(Sigmoid_176), Mul_177), Tactic: 0x0000000000000001, Conv_175 || Conv_185[Float(1,128,40,40)] -> 346[Float(1,128,40,40)] Layer(CaskConvolution): Conv_178, Tactic: 0x865894c4635db7fd, 346[Float(1,128,40,40)] -> 347[Float(1,128,40,40)] Layer(PointWiseV2): PWN(PWN(Sigmoid_179), Mul_180), Tactic: 0x0000000000000008, 347[Float(1,128,40,40)] -> 349[Float(1,128,40,40)] Layer(CaskConvolution): Conv_181, Tactic: 0x94119b4c514b211a, 349[Float(1,128,40,40)] -> 350[Float(1,128,40,40)] Layer(PointWiseV2): PWN(PWN(Sigmoid_182), Mul_183), Tactic: 0x0000000000000008, 350[Float(1,128,40,40)] -> 352[Float(1,128,40,40)] Layer(CaskConvolution): Conv_184, Tactic: 0xc0b05b61d128e46e, 352[Float(1,128,40,40)] -> 355[Float(1,128,40,40)] Layer(Reformat): 354 copy, Tactic: 0x00000000000003e8, Conv_175 || Conv_185[Float(1,128,40,40)] -> 355[Float(1,128,40,40)] Layer(Scale): BatchNormalization_187, Tactic: 0x0000000000000000, 355[Float(1,256,40,40)] -> 356[Float(1,256,40,40)] Layer(PointWiseV2): PWN(PWN(Sigmoid_188), Mul_189), Tactic: 0x0000000000000009, 356[Float(1,256,40,40)] -> 358[Float(1,256,40,40)] Layer(CaskConvolution): Conv_190, Tactic: 0xa419b3b68f2da07b, 358[Float(1,256,40,40)] -> 359[Float(1,256,40,40)] Layer(PointWiseV2): PWN(PWN(Sigmoid_191), Mul_192), Tactic: 0x0000000000000009, 359[Float(1,256,40,40)] -> 361[Float(1,256,40,40)] Layer(CaskConvolution): Conv_193, 
Tactic: 0x9cd5cdc35441c505, 361[Float(1,256,40,40)] -> 362[Float(1,128,40,40)] Layer(PointWiseV2): PWN(PWN(Sigmoid_194), Mul_195), Tactic: 0x0000000000000008, 362[Float(1,128,40,40)] -> 364[Float(1,128,40,40)] Layer(Resize): Resize_197, Tactic: 0x0000000000000000, 364[Float(1,128,40,40)] -> 369[Float(1,128,80,80)] Layer(Reformat): 369 copy, Tactic: 0x0000000000000000, 369[Float(1,128,80,80)] -> 370[Float(1,128,80,80)] Layer(CaskConvolution): Conv_199 || Conv_209, Tactic: 0xa419b3b68f2da07b, 370[Float(1,256,80,80)] -> Conv_199 || Conv_209[Float(1,128,80,80)] Layer(PointWiseV2): PWN(PWN(Sigmoid_200), Mul_201), Tactic: 0x0000000000000005, Conv_199 || Conv_209[Float(1,64,80,80)] -> 373[Float(1,64,80,80)] Layer(CaskConvolution): Conv_202, Tactic: 0xc3cf6e1d1c6aff27, 373[Float(1,64,80,80)] -> 374[Float(1,64,80,80)] Layer(PointWiseV2): PWN(PWN(Sigmoid_203), Mul_204), Tactic: 0x0000000000000005, 374[Float(1,64,80,80)] -> 376[Float(1,64,80,80)] Layer(CudnnConvolution): Conv_205, Tactic: 0x0000000000000076, 376[Float(1,64,80,80)] -> 377[Float(1,64,80,80)] Layer(PointWiseV2): PWN(PWN(Sigmoid_206), Mul_207), Tactic: 0x0000000000000005, 377[Float(1,64,80,80)] -> 379[Float(1,64,80,80)] Layer(CaskConvolution): Conv_208, Tactic: 0xc3cf6e1d1c6aff27, 379[Float(1,64,80,80)] -> 382[Float(1,64,80,80)] Layer(Reformat): 381 copy, Tactic: 0x0000000000000000, Conv_199 || Conv_209[Float(1,64,80,80)] -> 382[Float(1,64,80,80)] Layer(Scale): BatchNormalization_211, Tactic: 0x0000000000000000, 382[Float(1,128,80,80)] -> 383[Float(1,128,80,80)] Layer(PointWiseV2): PWN(PWN(Sigmoid_212), Mul_213), Tactic: 0x0000000000000001, 383[Float(1,128,80,80)] -> 385[Float(1,128,80,80)] Layer(CaskConvolution): Conv_214, Tactic: 0xa8ef60e712f8ad24, 385[Float(1,128,80,80)] -> 386[Float(1,128,80,80)] Layer(PointWiseV2): PWN(PWN(Sigmoid_215), Mul_216), Tactic: 0x0000000000000001, 386[Float(1,128,80,80)] -> 388[Float(1,128,80,80)] Layer(CaskConvolution): Conv_217, Tactic: 0x01cf8ce2da913006, 
388[Float(1,128,80,80)] -> 389[Float(1,128,40,40)] Layer(CaskConvolution): Conv_261, Tactic: 0xc3cf6e1d1c6aff27, 388[Float(1,128,80,80)] -> 433[Float(1,18,80,80)] Layer(PointWiseV2): PWN(PWN(Sigmoid_218), Mul_219), Tactic: 0x0000000000000002, 389[Float(1,128,40,40)] -> 392[Float(1,128,40,40)] Layer(Reformat): 364 copy, Tactic: 0x00000000000003e8, 364[Float(1,128,40,40)] -> 392[Float(1,128,40,40)] Layer(CaskConvolution): Conv_221 || Conv_231, Tactic: 0xa419b3b68f2da07b, 392[Float(1,256,40,40)] -> Conv_221 || Conv_231[Float(1,256,40,40)] Layer(Shuffle): Reshape_275 + Transpose_276, Tactic: 0x0000000000000000, 433[Float(1,18,80,80)] -> 452[Float(1,3,80,80,6)] Layer(PointWiseV2): PWN(PWN(Sigmoid_222), Mul_223), Tactic: 0x0000000000000001, Conv_221 || Conv_231[Float(1,128,40,40)] -> 395[Float(1,128,40,40)] Layer(CaskConvolution): Conv_224, Tactic: 0x865894c4635db7fd, 395[Float(1,128,40,40)] -> 396[Float(1,128,40,40)] Layer(PointWiseV2): PWN(Sigmoid_277), Tactic: 0x0000000000000002, 452[Float(1,3,80,80,6)] -> 453[Float(1,3,80,80,6)] Layer(Slice): Split_278, Tactic: 0x0000000000000000, 453[Float(1,3,80,80,6)] -> 454[Float(1,3,80,80,2)] Layer(Slice): Split_278_59, Tactic: 0x0000000000000000, 453[Float(1,3,80,80,6)] -> 455[Float(1,3,80,80,2)] Layer(Slice): Split_278_60, Tactic: 0x0000000000000000, 453[Float(1,3,80,80,6)] -> 469[Float(1,3,80,80,2)] Layer(PointWiseV2): PWN(PWN(Sigmoid_225), Mul_226), Tactic: 0x0000000000000008, 396[Float(1,128,40,40)] -> 398[Float(1,128,40,40)] Layer(CaskConvolution): Conv_227, Tactic: 0x94119b4c514b211a, 398[Float(1,128,40,40)] -> 399[Float(1,128,40,40)] Layer(PointWiseV2): PWN(PWN(457 + (Unnamed Layer* 250) [Shuffle] + Mul_280, Add_282), 461 + (Unnamed Layer* 255) [Shuffle] + Mul_284), Tactic: 0x0000000000000002, 454[Float(1,3,80,80,2)], (Unnamed Layer* 252) [Constant]_output[Float(1,3,80,80,2)] -> 462[Float(1,3,80,80,2)] Layer(PointWiseV2): PWN(PWN(463 + (Unnamed Layer* 258) [Shuffle] + Mul_286, PWN(465 + (Unnamed Layer* 261) [Shuffle], 
Pow_288)), Mul_290), Tactic: 0x000000000000001c, 455[Float(1,3,80,80,2)], (Unnamed Layer* 263) [Constant]_output[Float(1,3,80,80,2)] -> 468[Float(1,3,80,80,2)] Layer(PointWiseV2): PWN(PWN(Sigmoid_228), Mul_229), Tactic: 0x0000000000000008, 399[Float(1,128,40,40)] -> 401[Float(1,128,40,40)] Layer(Reformat): 462 copy, Tactic: 0x00000000000003e8, 462[Float(1,3,80,80,2)] -> 469[Float(1,3,80,80,2)] Layer(Reformat): 468 copy, Tactic: 0x00000000000003e8, 468[Float(1,3,80,80,2)] -> 469[Float(1,3,80,80,2)] Layer(CaskConvolution): Conv_230, Tactic: 0xc0b05b61d128e46e, 401[Float(1,128,40,40)] -> 404[Float(1,128,40,40)] Layer(NoOp): Reshape_298, Tactic: 0x0000000000000000, 469[Float(1,3,80,80,6)] -> Reshape_298_copy_output[Float(1,19200,6)] Layer(Reformat): Reshape_298_copy_output, Tactic: 0x0000000000000000, Reshape_298_copy_output[Float(1,19200,6)] -> output0[Float(1,19200,6)] Layer(Reformat): 403 copy, Tactic: 0x00000000000003e8, Conv_221 || Conv_231[Float(1,128,40,40)] -> 404[Float(1,128,40,40)] Layer(Scale): BatchNormalization_233, Tactic: 0x0000000000000000, 404[Float(1,256,40,40)] -> 405[Float(1,256,40,40)] Layer(PointWiseV2): PWN(PWN(Sigmoid_234), Mul_235), Tactic: 0x0000000000000009, 405[Float(1,256,40,40)] -> 407[Float(1,256,40,40)] Layer(CaskConvolution): Conv_236, Tactic: 0xa419b3b68f2da07b, 407[Float(1,256,40,40)] -> 408[Float(1,256,40,40)] Layer(PointWiseV2): PWN(PWN(Sigmoid_237), Mul_238), Tactic: 0x0000000000000009, 408[Float(1,256,40,40)] -> 410[Float(1,256,40,40)] Layer(CaskConvolution): Conv_239, Tactic: 0xa31d27de74b895ff, 410[Float(1,256,40,40)] -> 411[Float(1,256,20,20)] Layer(FusedConvActConvolution): Conv_299, Tactic: 0x00000000009dffff, 410[Float(1,256,40,40)] -> 479[Float(1,18,40,40)] Layer(PointWiseV2): PWN(PWN(Sigmoid_240), Mul_241), Tactic: 0x0000000000000004, 411[Float(1,256,20,20)] -> 414[Float(1,256,20,20)] Layer(Reformat): 337 copy, Tactic: 0x00000000000003e8, 337[Float(1,256,20,20)] -> 414[Float(1,256,20,20)] Layer(CudnnConvolution): Conv_243 
|| Conv_253, Tactic: 0x0000000000000039, 414[Float(1,512,20,20)] -> Conv_243 || Conv_253[Float(1,512,20,20)] Layer(Shuffle): Reshape_313 + Transpose_314, Tactic: 0x0000000000000000, 479[Float(1,18,40,40)] -> 498[Float(1,3,40,40,6)] Layer(PointWiseV2): PWN(PWN(Sigmoid_244), Mul_245), Tactic: 0x0000000000000008, Conv_243 || Conv_253[Float(1,256,20,20)] -> 417[Float(1,256,20,20)] Layer(CaskConvolution): Conv_246, Tactic: 0x1fc87d7eb370bb7a, 417[Float(1,256,20,20)] -> 418[Float(1,256,20,20)] Layer(PointWiseV2): PWN(Sigmoid_315), Tactic: 0x0000000000000005, 498[Float(1,3,40,40,6)] -> 499[Float(1,3,40,40,6)] Layer(Slice): Split_316, Tactic: 0x0000000000000000, 499[Float(1,3,40,40,6)] -> 500[Float(1,3,40,40,2)] Layer(Slice): Split_316_65, Tactic: 0x0000000000000000, 499[Float(1,3,40,40,6)] -> 501[Float(1,3,40,40,2)] Layer(Slice): Split_316_66, Tactic: 0x0000000000000000, 499[Float(1,3,40,40,6)] -> 515[Float(1,3,40,40,2)] Layer(PointWiseV2): PWN(PWN(Sigmoid_247), Mul_248), Tactic: 0x0000000000000001, 418[Float(1,256,20,20)] -> 420[Float(1,256,20,20)] Layer(FusedConvActConvolution): Conv_249, Tactic: 0x00000000009fffff, 420[Float(1,256,20,20)] -> 421[Float(1,256,20,20)] Layer(PointWiseV2): PWN(PWN(503 + (Unnamed Layer* 297) [Shuffle] + Mul_318, Add_320), 507 + (Unnamed Layer* 302) [Shuffle] + Mul_322), Tactic: 0x0000000000000005, 500[Float(1,3,40,40,2)], (Unnamed Layer* 299) [Constant]_output[Float(1,3,40,40,2)] -> 508[Float(1,3,40,40,2)] Layer(PointWiseV2): PWN(PWN(509 + (Unnamed Layer* 305) [Shuffle] + Mul_324, PWN(511 + (Unnamed Layer* 308) [Shuffle], Pow_326)), Mul_328), Tactic: 0x0000000000000000, 501[Float(1,3,40,40,2)], (Unnamed Layer* 310) [Constant]_output[Float(1,3,40,40,2)] -> 514[Float(1,3,40,40,2)] Layer(PointWiseV2): PWN(PWN(Sigmoid_250), Mul_251), Tactic: 0x0000000000000001, 421[Float(1,256,20,20)] -> 423[Float(1,256,20,20)] Layer(Reformat): 508 copy, Tactic: 0x0000000000000000, 508[Float(1,3,40,40,2)] -> 515[Float(1,3,40,40,2)] Layer(Reformat): 514 copy, 
Tactic: 0x0000000000000000, 514[Float(1,3,40,40,2)] -> 515[Float(1,3,40,40,2)] Layer(CudnnConvolution): Conv_252, Tactic: 0x0000000000000070, 423[Float(1,256,20,20)] -> 426[Float(1,256,20,20)] Layer(NoOp): Reshape_336, Tactic: 0x0000000000000000, 515[Float(1,3,40,40,6)] -> Reshape_336_copy_output[Float(1,4800,6)] Layer(Reformat): Reshape_336_copy_output, Tactic: 0x00000000000003e8, Reshape_336_copy_output[Float(1,4800,6)] -> output0[Float(1,4800,6)] Layer(Reformat): 425 copy, Tactic: 0x00000000000003e8, Conv_243 || Conv_253[Float(1,256,20,20)] -> 426[Float(1,256,20,20)] Layer(Scale): BatchNormalization_255, Tactic: 0x0000000000000000, 426[Float(1,512,20,20)] -> 427[Float(1,512,20,20)] Layer(PointWiseV2): PWN(PWN(Sigmoid_256), Mul_257), Tactic: 0x0000000000000000, 427[Float(1,512,20,20)] -> 429[Float(1,512,20,20)] Layer(CudnnConvolution): Conv_258, Tactic: 0x0000000000000039, 429[Float(1,512,20,20)] -> 430[Float(1,512,20,20)] Layer(PointWiseV2): PWN(PWN(Sigmoid_259), Mul_260), Tactic: 0x0000000000000000, 430[Float(1,512,20,20)] -> 432[Float(1,512,20,20)] Layer(FusedConvActConvolution): Conv_337, Tactic: 0x00000000001fffff, 432[Float(1,512,20,20)] -> 525[Float(1,18,20,20)] Layer(Shuffle): Reshape_351 + Transpose_352, Tactic: 0x0000000000000000, 525[Float(1,18,20,20)] -> 544[Float(1,3,20,20,6)] Layer(PointWiseV2): PWN(Sigmoid_353), Tactic: 0x0000000000000001, 544[Float(1,3,20,20,6)] -> 545[Float(1,3,20,20,6)] Layer(Slice): Split_354, Tactic: 0x0000000000000000, 545[Float(1,3,20,20,6)] -> 546[Float(1,3,20,20,2)] Layer(Slice): Split_354_71, Tactic: 0x0000000000000000, 545[Float(1,3,20,20,6)] -> 547[Float(1,3,20,20,2)] Layer(Slice): Split_354_72, Tactic: 0x0000000000000000, 545[Float(1,3,20,20,6)] -> 561[Float(1,3,20,20,2)] Layer(PointWiseV2): PWN(PWN(549 + (Unnamed Layer* 344) [Shuffle] + Mul_356, Add_358), 553 + (Unnamed Layer* 349) [Shuffle] + Mul_360), Tactic: 0x0000000000000008, 546[Float(1,3,20,20,2)], (Unnamed Layer* 346) [Constant]_output[Float(1,3,20,20,2)] -> 
554[Float(1,3,20,20,2)] Layer(PointWiseV2): PWN(PWN(555 + (Unnamed Layer* 352) [Shuffle] + Mul_362, PWN(557 + (Unnamed Layer* 355) [Shuffle], Pow_364)), Mul_366), Tactic: 0x0000000000000000, 547[Float(1,3,20,20,2)], (Unnamed Layer* 357) [Constant]_output[Float(1,3,20,20,2)] -> 560[Float(1,3,20,20,2)] Layer(Reformat): 554 copy, Tactic: 0x00000000000003e8, 554[Float(1,3,20,20,2)] -> 561[Float(1,3,20,20,2)] Layer(Reformat): 560 copy, Tactic: 0x00000000000003e8, 560[Float(1,3,20,20,2)] -> 561[Float(1,3,20,20,2)] Layer(NoOp): Reshape_374, Tactic: 0x0000000000000000, 561[Float(1,3,20,20,6)] -> Reshape_374_copy_output[Float(1,1200,6)] Layer(Reformat): Reshape_374_copy_output, Tactic: 0x00000000000003e8, Reshape_374_copy_output[Float(1,1200,6)] -> output0[Float(1,1200,6)] [06/27/2024-06:28:21] [I] [TRT] [MemUsageChange] TensorRT-managed allocation in building engine: CPU +3, GPU +32, now: CPU 3, GPU 32 (MiB) [06/27/2024-06:28:21] [W] [TRT] The getMaxBatchSize() function should not be used with an engine built from a network created with NetworkDefinitionCreationFlag::kEXPLICIT_BATCH flag. This function will always return 1. [06/27/2024-06:28:21] [W] [TRT] The getMaxBatchSize() function should not be used with an engine built from a network created with NetworkDefinitionCreationFlag::kEXPLICIT_BATCH flag. This function will always return 1. [06/27/2024-06:28:21] [I] Engine built in 137.541 sec. [06/27/2024-06:28:21] [I] [TRT] [MemUsageChange] Init CUDA: CPU +0, GPU +0, now: CPU 9926, GPU 2076 (MiB) [06/27/2024-06:28:21] [I] [TRT] Loaded engine size: 32 MiB [06/27/2024-06:28:21] [V] [TRT] Using cuDNN as a tactic source [06/27/2024-06:28:21] [I] [TRT] [MemUsageChange] Init cuDNN: CPU +0, GPU +10, now: CPU 9935, GPU 2118 (MiB) [06/27/2024-06:28:21] [V] [TRT] Deserialization required 57518 microseconds. 
[06/27/2024-06:28:21] [I] [TRT] [MemUsageChange] TensorRT-managed allocation in engine deserialization: CPU +0, GPU +31, now: CPU 0, GPU 31 (MiB) [06/27/2024-06:28:21] [I] Engine deserialized in 0.0590293 sec. [06/27/2024-06:28:21] [V] [TRT] Using cuDNN as a tactic source [06/27/2024-06:28:21] [I] [TRT] [MemUsageChange] Init cuDNN: CPU +0, GPU +8, now: CPU 9935, GPU 2118 (MiB) [06/27/2024-06:28:21] [V] [TRT] Total per-runner device persistent memory is 1813504 [06/27/2024-06:28:21] [V] [TRT] Total per-runner host persistent memory is 149632 [06/27/2024-06:28:21] [V] [TRT] Allocated activation device memory of size 35942400 [06/27/2024-06:28:21] [I] [TRT] [MemUsageChange] TensorRT-managed allocation in IExecutionContext creation: CPU +0, GPU +36, now: CPU 0, GPU 67 (MiB) [06/27/2024-06:28:21] [I] Using random values for input images [06/27/2024-06:28:21] [I] Created input binding for images with dimensions 1x3x640x640 [06/27/2024-06:28:21] [I] Using random values for output output0 [06/27/2024-06:28:21] [I] Created output binding for output0 with dimensions 1x25200x6 [06/27/2024-06:28:21] [I] Starting inference [06/27/2024-06:28:24] [I] Warmup completed 40 queries over 200 ms [06/27/2024-06:28:24] [I] Timing trace has 607 queries over 3.01077 s [06/27/2024-06:28:24] [I] [06/27/2024-06:28:24] [I] === Trace details === [06/27/2024-06:28:24] [I] Trace averages of 10 runs: [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 5.12725 ms - Host latency: 5.34526 ms (enqueue 1.50237 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.76289 ms - Host latency: 4.9808 ms (enqueue 1.53262 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.69172 ms - Host latency: 4.911 ms (enqueue 1.50132 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.63809 ms - Host latency: 4.85598 ms (enqueue 1.47682 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.69015 ms - Host latency: 4.90805 ms (enqueue 1.49055 ms) 
[06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.66891 ms - Host latency: 4.88725 ms (enqueue 1.51058 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.62473 ms - Host latency: 4.8426 ms (enqueue 1.49664 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.65402 ms - Host latency: 4.8719 ms (enqueue 1.50006 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.64511 ms - Host latency: 4.86321 ms (enqueue 1.51636 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.85778 ms - Host latency: 5.07575 ms (enqueue 1.56909 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.8169 ms - Host latency: 5.03513 ms (enqueue 1.56017 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.77653 ms - Host latency: 4.99433 ms (enqueue 1.51624 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.81594 ms - Host latency: 5.03389 ms (enqueue 1.52562 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.79667 ms - Host latency: 5.01467 ms (enqueue 1.55388 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.70493 ms - Host latency: 4.92321 ms (enqueue 1.52064 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.679 ms - Host latency: 4.89688 ms (enqueue 1.49452 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.78045 ms - Host latency: 4.99824 ms (enqueue 1.49407 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.61534 ms - Host latency: 4.8334 ms (enqueue 1.48124 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.71984 ms - Host latency: 4.93771 ms (enqueue 1.49098 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.67263 ms - Host latency: 4.89053 ms (enqueue 1.5611 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 5.15114 ms - Host latency: 5.36935 ms (enqueue 1.51099 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.69977 ms - Host latency: 4.91765 ms 
(enqueue 1.53557 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.6184 ms - Host latency: 4.83627 ms (enqueue 1.51288 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.53395 ms - Host latency: 4.75217 ms (enqueue 1.51486 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.63195 ms - Host latency: 4.8501 ms (enqueue 1.50763 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.62062 ms - Host latency: 4.83854 ms (enqueue 1.49963 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.6422 ms - Host latency: 4.86014 ms (enqueue 1.49907 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.62695 ms - Host latency: 4.84504 ms (enqueue 1.48645 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.61379 ms - Host latency: 4.83164 ms (enqueue 1.51901 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.61687 ms - Host latency: 4.83468 ms (enqueue 1.48748 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.62833 ms - Host latency: 4.84617 ms (enqueue 1.49177 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.62269 ms - Host latency: 4.84053 ms (enqueue 1.49237 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.61995 ms - Host latency: 4.83785 ms (enqueue 1.48519 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.57549 ms - Host latency: 4.79364 ms (enqueue 1.50269 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.62235 ms - Host latency: 4.84028 ms (enqueue 1.49054 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.62091 ms - Host latency: 4.83881 ms (enqueue 1.48737 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.57817 ms - Host latency: 4.79631 ms (enqueue 1.49587 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.76704 ms - Host latency: 4.98542 ms (enqueue 1.56793 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.62859 ms - Host 
latency: 4.84651 ms (enqueue 1.49761 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.6321 ms - Host latency: 4.85005 ms (enqueue 1.50154 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.57021 ms - Host latency: 4.78811 ms (enqueue 1.49763 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 5.10222 ms - Host latency: 5.32021 ms (enqueue 1.48577 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.62546 ms - Host latency: 4.84326 ms (enqueue 1.51318 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.62693 ms - Host latency: 4.845 ms (enqueue 1.50857 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.6261 ms - Host latency: 4.84412 ms (enqueue 1.5033 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.56323 ms - Host latency: 4.78118 ms (enqueue 1.49827 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.62197 ms - Host latency: 4.83987 ms (enqueue 1.50332 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.63093 ms - Host latency: 4.84878 ms (enqueue 1.49998 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.69111 ms - Host latency: 4.90908 ms (enqueue 1.50032 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.58088 ms - Host latency: 4.79915 ms (enqueue 1.52319 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.62075 ms - Host latency: 4.83862 ms (enqueue 1.48149 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.60498 ms - Host latency: 4.8228 ms (enqueue 1.49221 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.62383 ms - Host latency: 4.84316 ms (enqueue 1.49114 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.61924 ms - Host latency: 4.83728 ms (enqueue 1.48909 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.63093 ms - Host latency: 4.849 ms (enqueue 1.49177 ms) [06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 
4.63982 ms - Host latency: 4.85891 ms (enqueue 1.46755 ms)
[06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.61719 ms - Host latency: 4.83518 ms (enqueue 1.48618 ms)
[06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.63518 ms - Host latency: 4.85305 ms (enqueue 1.48828 ms)
[06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.62539 ms - Host latency: 4.84341 ms (enqueue 1.49407 ms)
[06/27/2024-06:28:24] [I] Average on 10 runs - GPU latency: 4.69148 ms - Host latency: 4.9106 ms (enqueue 1.52371 ms)
[06/27/2024-06:28:24] [I]
[06/27/2024-06:28:24] [I] === Performance summary ===
[06/27/2024-06:28:24] [I] Throughput: 201.61 qps
[06/27/2024-06:28:24] [I] Latency: min = 4.6687 ms, max = 9.61401 ms, mean = 4.89614 ms, median = 4.71338 ms, percentile(99%) = 5.67871 ms
[06/27/2024-06:28:24] [I] Enqueue Time: min = 1.43579 ms, max = 1.90771 ms, mean = 1.50502 ms, median = 1.50037 ms, percentile(99%) = 1.64319 ms
[06/27/2024-06:28:24] [I] H2D Latency: min = 0.190674 ms, max = 0.196594 ms, mean = 0.191028 ms, median = 0.190918 ms, percentile(99%) = 0.193726 ms
[06/27/2024-06:28:24] [I] GPU Compute Time: min = 4.45117 ms, max = 9.39624 ms, mean = 4.67807 ms, median = 4.49438 ms, percentile(99%) = 5.461 ms
[06/27/2024-06:28:24] [I] D2H Latency: min = 0.0266113 ms, max = 0.0402222 ms, mean = 0.0270339 ms, median = 0.0268555 ms, percentile(99%) = 0.02771 ms
[06/27/2024-06:28:24] [I] Total Host Walltime: 3.01077 s
[06/27/2024-06:28:24] [I] Total GPU Compute Time: 2.83959 s
[06/27/2024-06:28:24] [W] * GPU compute time is unstable, with coefficient of variance = 8.92558%.
[06/27/2024-06:28:24] [W]   If not already in use, locking GPU clock frequency or adding --useSpinWait may improve the stability.
[06/27/2024-06:28:24] [I] Explanations of the performance metrics are printed in the verbose logs.
[06/27/2024-06:28:24] [V]
[06/27/2024-06:28:24] [V] === Explanations of the performance metrics ===
[06/27/2024-06:28:24] [V] Total Host Walltime: the host walltime from when the first query (after warmups) is enqueued to when the last query is completed.
[06/27/2024-06:28:24] [V] GPU Compute Time: the GPU latency to execute the kernels for a query.
[06/27/2024-06:28:24] [V] Total GPU Compute Time: the summation of the GPU Compute Time of all the queries. If this is significantly shorter than Total Host Walltime, the GPU may be under-utilized because of host-side overheads or data transfers.
[06/27/2024-06:28:24] [V] Throughput: the observed throughput computed by dividing the number of queries by the Total Host Walltime. If this is significantly lower than the reciprocal of GPU Compute Time, the GPU may be under-utilized because of host-side overheads or data transfers.
[06/27/2024-06:28:24] [V] Enqueue Time: the host latency to enqueue a query. If this is longer than GPU Compute Time, the GPU may be under-utilized.
[06/27/2024-06:28:24] [V] H2D Latency: the latency for host-to-device data transfers for input tensors of a single query.
[06/27/2024-06:28:24] [V] D2H Latency: the latency for device-to-host data transfers for output tensors of a single query.
[06/27/2024-06:28:24] [V] Latency: the summation of H2D Latency, GPU Compute Time, and D2H Latency. This is the latency to infer a single query.
[06/27/2024-06:28:24] [I]
&&&& PASSED TensorRT.trtexec [TensorRT v8401] # trtexec --onnx=best.onnx --saveEngine=best.trt --verbose

C:\TensorRT-8.4.1.5\bin>
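The metric definitions printed in the verbose log can be checked against the numbers in the performance summary. The following is a minimal sketch in Python, not part of trtexec; the variable names are illustrative and the values are copied by hand from the log above. It recomputes throughput as queries divided by Total Host Walltime, and mean latency as the sum of the mean H2D Latency, GPU Compute Time, and D2H Latency.

```python
# Values copied by hand from the "Performance summary" section of the log.
# Variable names are illustrative, not trtexec identifiers.
queries = 607                # "Timing trace has 607 queries"
host_walltime_s = 3.01077    # Total Host Walltime
total_gpu_compute_s = 2.83959  # Total GPU Compute Time
h2d_mean_ms = 0.191028       # H2D Latency, mean
gpu_compute_mean_ms = 4.67807  # GPU Compute Time, mean
d2h_mean_ms = 0.0270339      # D2H Latency, mean

# Throughput: number of queries divided by Total Host Walltime.
throughput_qps = queries / host_walltime_s

# Latency: H2D Latency + GPU Compute Time + D2H Latency (using the means).
latency_mean_ms = h2d_mean_ms + gpu_compute_mean_ms + d2h_mean_ms

# Share of the wall time the GPU spent computing; a value well below 1.0
# would point at host-side overhead or data transfers, per the log's note.
gpu_busy_fraction = total_gpu_compute_s / host_walltime_s

print(f"throughput   : {throughput_qps:.2f} qps")
print(f"mean latency : {latency_mean_ms:.5f} ms")
print(f"GPU busy     : {gpu_busy_fraction:.1%}")
```

The recomputed throughput and mean latency agree with the reported 201.61 qps and 4.89614 ms to the precision shown in the log, and the GPU is busy for roughly 94% of the wall time, so host-side overhead appears small for this run.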