
Add delta-delta-p (ddp) tree inference approach #327

Merged
merged 3 commits on Jun 8, 2021

Conversation

jroessler
Contributor

Proposed changes

As suggested in #300, I implemented the delta-delta-p (in short: DDP) tree-based inference method originally proposed in Hansotia and Rukstales, "Incremental value modeling" (2002). I simply added another evaluationFunction called 'DDP' in causalml/inference/tree/models.py.
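For readers unfamiliar with the criterion: DDP scores a node by the difference in response probability between the treatment group and the control group ("delta-p"), and a split is chosen by the difference of this score between the two children ("delta-delta-p"). A minimal sketch of the idea, assuming a node summary maps each group name to a (response probability, sample count) pair; the actual data structures and signatures in causalml/inference/tree/models.py may differ:

```python
def evaluate_ddp(node_summary, control_name="control"):
    """Delta-p of a node: P(Y=1 | treatment) - P(Y=1 | control).

    node_summary: dict mapping group name -> (response_prob, n_samples).
    Assumes exactly one treatment group besides the control group.
    """
    p_control, _ = node_summary[control_name]
    treatments = [g for g in node_summary if g != control_name]
    if len(treatments) != 1:
        raise ValueError("DDP supports exactly one treatment group plus control.")
    p_treatment, _ = node_summary[treatments[0]]
    return p_treatment - p_control


# A split is then scored by the absolute delta-delta-p between the children:
left = {"control": (0.10, 500), "treatment": (0.18, 500)}   # delta-p = 0.08
right = {"control": (0.12, 500), "treatment": (0.13, 500)}  # delta-p = 0.01
ddp_gain = abs(evaluate_ddp(left) - evaluate_ddp(right))    # 0.07
```

The names `evaluate_ddp` and the summary layout above are illustrative, not the PR's exact code.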

Types of changes

  • Bugfix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Documentation Update (if none of the other choices apply)

Checklist

  • I have read the CONTRIBUTING doc
  • I have signed the CLA
  • Lint and unit tests pass locally with my changes
  • I have added tests that prove my fix is effective or that my feature works
  • I have added necessary documentation (if appropriate)
  • Any dependent changes have been merged and published in downstream modules

Further comments

There was no need to add a separate test to prove that the new tree-based inference method works: you can simply run the test_uplift_trees.py test and pass evaluationFunction='DDP' when creating the UpliftRandomForestClassifier and UpliftTreeClassifier instances. I noticed that my test does not pass with N_SAMPLE = 1000, because random targeting performed better than the DDP approach. However, increasing N_SAMPLE to, for example, 10,000 worked.

If you need additional information, don't hesitate to ask me!

@CLAassistant

CLAassistant commented May 3, 2021

CLA assistant check
All committers have signed the CLA.

@paullo0106 paullo0106 requested a review from t-tte May 4, 2021 22:54
leftScore1 = evaluationFunction(leftNodeSummary, control_name=self.control_name)
rightScore2 = evaluationFunction(rightNodeSummary, control_name=self.control_name)
gain = np.abs(leftScore1 - rightScore2)
gain_for_imp = (len(X_l) * leftScore1 - len(X_r) * rightScore2)
Collaborator


@jroessler I'm not familiar with this method, but I just want to confirm: is it intentional that gain takes the absolute difference while gain_for_imp doesn't?

Contributor Author


@paullo0106 Thanks for the pointer! Actually, it's a mistake: it should be the absolute value for gain_for_imp as well. I'll fix it and resubmit.
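With the fix the author describes, both quantities take the absolute value. A sketch with illustrative numbers; leftScore1/rightScore2 stand in for the evaluationFunction outputs of the two children, and the lengths for the child sample sizes:

```python
import numpy as np

# Illustrative delta-p scores for the left/right children and their sizes.
leftScore1, rightScore2 = 0.08, 0.12
len_X_l, len_X_r = 400, 600

gain = np.abs(leftScore1 - rightScore2)
# Corrected per the review: the importance gain also takes the absolute value.
gain_for_imp = np.abs(len_X_l * leftScore1 - len_X_r * rightScore2)
```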

…P is used with more than two treatment options
@jroessler
Contributor Author

I added an exception that is raised when the DDP approach is used with more than two treatment options, as mentioned here:
#300 (comment)

Further, regarding the tests, I added a method called generate_classification_data_two_treatments, which creates a synthetic data set with only two treatment options. You can use this method within test_UpliftTreeClassifier and test_UpliftRandomForestClassifier to verify that the DDP approach works as expected. As a side note: I noticed that the results for all evaluation functions (not only for the DDP approach but also for KL, ED, and CHI) changed from one test run (RandomForestTest) to the next. Thus, it seems that something with random_state is not working properly.
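The guard described above can be as simple as counting the distinct treatment labels before training. A sketch of such a check, assuming treatment labels arrive as an array of group names; the function name and where the check lives in the PR are assumptions, not the actual code:

```python
import numpy as np

def check_ddp_treatments(treatment):
    """Raise if DDP is requested with more than two groups (one treatment + control)."""
    groups = np.unique(treatment)
    if len(groups) > 2:
        raise ValueError(
            "The DDP approach supports only a single treatment group plus control, "
            f"but got {len(groups)} groups: {list(groups)}"
        )

check_ddp_treatments(np.array(["control", "treatment1"] * 50))  # passes silently
try:
    check_ddp_treatments(np.array(["control", "treatment1", "treatment2"]))
except ValueError as e:
    print("rejected:", e)
```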

@jeongyoonlee
Collaborator

Thanks, @jroessler for the contribution. It looks good to me. @t-tte, could you check this PR?

Collaborator

@t-tte t-tte left a comment


Thanks for the contribution.

LGTM also.

@jeongyoonlee jeongyoonlee merged commit df3830d into uber:master Jun 8, 2021
5 participants