duplicate some ops to enable more fusion opportunities and reduce memory footprint #433
Conversation
// Modern NN networks are usually composed of multiple similar layers. Thus the
// above patterns are very common especially when we enable shape constraint ir
// optimization (if enabled, we will do shape propagation eagerly, and may
// further enable cross layer CSE, which in turn increases the change of the
change -> chance?
Done, thanks.
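
For context, a minimal sketch of the scalar-bcast duplication this PR performs, assuming mhlo-style ops; the value names and shapes are hypothetical, not taken from the PR:

```mlir
// Before: one cheap broadcast feeds two would-be fusions. Fusing it into
// either consumer would still leave the other reading a materialized
// buffer, so the broadcast result has to be written to memory.
%s  = "mhlo.constant"() {value = dense<1.0> : tensor<f32>} : () -> tensor<f32>
%b  = "mhlo.broadcast_in_dim"(%s) {broadcast_dimensions = dense<> : tensor<0xi64>}
        : (tensor<f32>) -> tensor<1024xf32>
%r0 = "mhlo.add"(%a0, %b) : (tensor<1024xf32>, tensor<1024xf32>) -> tensor<1024xf32>
%r1 = "mhlo.multiply"(%a1, %b) : (tensor<1024xf32>, tensor<1024xf32>) -> tensor<1024xf32>

// After duplication: each consumer owns a private copy of the broadcast,
// so each fusion can absorb it and the intermediate tensor never hits
// memory. Recomputing a scalar broadcast is cheaper than materializing it.
%b0 = "mhlo.broadcast_in_dim"(%s) {broadcast_dimensions = dense<> : tensor<0xi64>}
        : (tensor<f32>) -> tensor<1024xf32>
%r0 = "mhlo.add"(%a0, %b0) : (tensor<1024xf32>, tensor<1024xf32>) -> tensor<1024xf32>
%b1 = "mhlo.broadcast_in_dim"(%s) {broadcast_dimensions = dense<> : tensor<0xi64>}
        : (tensor<f32>) -> tensor<1024xf32>
%r1 = "mhlo.multiply"(%a1, %b1) : (tensor<1024xf32>, tensor<1024xf32>) -> tensor<1024xf32>
```

This is exactly the situation the comment above describes: cross-layer CSE merges identical broadcasts from similar layers into one shared op, and this pass splits them back apart where that unlocks fusion.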
let options = [
  Option<"gpu_enabled_", "gpu-enabled", "bool",
         /*default=*/"true", "whether gpu is available.">,
  Option<"fusion_strategy_", "fusion-strategy", "std::string",
Do we actually need the `fusion-strategy` option? If it is always `base`, we can remove it in this PR and add this option back later if it's required.
I prefer to leave the option here even though we do not actually use it yet; it's better if we can make use of such a config later. The current implementation is just a conservative strategy. Furthermore, we only duplicate the scalar-bcast pattern in this PR, while eventually we will need a general "duplicate fusion" pass like XLA's.
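
For reference, a sketch of how the kept option might read with its default spelled out; the `"base"` default follows from the review comment above, while the description string is an assumption:

```tablegen
let options = [
  Option<"gpu_enabled_", "gpu-enabled", "bool",
         /*default=*/"true", "whether gpu is available.">,
  Option<"fusion_strategy_", "fusion-strategy", "std::string",
         /*default=*/"\"base\"",
         "duplication strategy; only \"base\" (scalar-bcast duplication) "
         "is implemented so far.">
];
```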
BTW, do you have any benchmark data for this pass on a model?
In my test case, it reduces latency by around 1.5ms (e2e is ~6.5ms). I haven't tested this feature on other models, so I do not enable it by default (it is guarded by the shape-constraint-ir flag). I'll evaluate it on more models.
No description provided.