StochasticTMLE #131
Conversation
@CamDavidsonPilon This is the branch I would like you to review. Specifically, I added the `StochasticTMLE` estimator. There are some tests that fail (I am aware of these and am fixing them in v0.8.2 #122). The only tests that matter for this new estimator are the ones covering `StochasticTMLE`.

As for the simulations, I assessed the double-robustness property. Four combinations of model specification (Q-model and g-model both correct, one of them wrong, and both wrong) are plotted for bias and coverage. In expectation, the correct Q-model and g-model should be unbiased and have nominal 95% CL coverage. Scenarios where one model is wrong should be unbiased, but confidence interval coverage has no guarantees. There are no guarantees when both models are wrong (expected to be biased).

Tutorial location: LINK
Simulation location: LINK
```python
        return expected

    def test_error_continuous_exp(self, df):
        with pytest.raises(ValueError):
```
Tests like this have bitten me before: the call being tested could fail for many reasons and throw a `ValueError`. You probably are hoping it fails for a specific `ValueError`, so I suggest using the `match` parameter in `pytest.raises` to narrow it down.
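A minimal sketch of the suggestion above. The `set_exposure` function here is a hypothetical stand-in for the estimator call under test; the point is that `match=` applies a regex search to the exception message, so only the intended `ValueError` makes the test pass.

```python
import pytest


def set_exposure(value):
    # Hypothetical stand-in for the estimator call under test
    if value not in (0, 1):
        raise ValueError("StochasticTMLE only supports binary exposures")


def test_error_continuous_exp():
    # match= runs a regex search against the exception message, so an
    # unrelated ValueError from elsewhere cannot satisfy this test
    with pytest.raises(ValueError, match="binary exposures"):
        set_exposure(0.5)
```

With `match`, a `ValueError` raised for any other reason fails the test instead of silently passing.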
```python
        df = pd.DataFrame()
        df['A'] = [1, 1, 0, 0, np.nan]
        df['Y'] = [np.nan, 0, 1, 0, 1]
        with pytest.warns(UserWarning):
```
ditto above comment. This could start "passing" from some warning thrown in a dependent library.
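The same fix applies to `pytest.warns`: its `match` parameter pins the test to a specific warning message. A small sketch, where `fit_with_missing_data` is a hypothetical stand-in for the code that emits the warning:

```python
import warnings

import pytest


def fit_with_missing_data():
    # Hypothetical stand-in: warn that rows with missing data were dropped
    warnings.warn("missing data detected; rows were dropped", UserWarning)


def test_missing_warns():
    # match= pins the test to this specific warning message, so a stray
    # UserWarning raised inside a dependency cannot make the test pass
    with pytest.warns(UserWarning, match="missing data"):
        fit_with_missing_data()
```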
```python
        npt.assert_allclose(sas_preds, est_preds, atol=1e-6)

    def test_qmodel_params2(self, simple_df):
        # Comparing to SAS linear model
```
if possible, I suggest putting a reproducible SAS command here. It'll save you time later when/if you have to reproduce it.
```python
def stochastic_check_conditional(df, conditional):
    """Check that conditionals are exclusive for the stochastic fit process.
    Generates a warning if not true
    """
    a = np.array([0] * df.shape[0])
```
`np.zeros(df.shape[0])`
""" | ||
a = np.array([0] * df.shape[0]) | ||
for c in conditional: | ||
a = np.add(a, np.where(eval(c), 1, 0)) |
Why not `a = a + np.where(eval(c), 1, 0)`?
```python
    for c in conditional:
        a = np.add(a, np.where(eval(c), 1, 0))

    if np.sum(np.where(a > 1, 1, 0)):
```
Is this the same as `np.any(a > 1)`?
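It is, at least in truth value: `np.sum(np.where(a > 1, 1, 0))` counts the elements above 1 and is truthy exactly when that count is nonzero. A quick sketch:

```python
import numpy as np

a = np.array([0, 1, 2, 1, 3])

# Original form: count of elements exceeding 1, truthy iff any exceed 1
overlap_count = np.sum(np.where(a > 1, 1, 0))

# Suggested form: asks "is any element > 1?" directly, and short-circuits
overlap_any = np.any(a > 1)

assert bool(overlap_count) == bool(overlap_any)
```

`np.any` also states the intent more clearly than counting and relying on the truthiness of the sum.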
```diff
@@ -800,15 +801,537 @@ def plot_love(self, color_unweighted='r', color_weighted='b', shape_unweighted='
                           shape_unweighted=shape_unweighted, shape_weighted=shape_weighted)
         return ax


 class StochasticTMLE:
     r"""Implementation of target maximum likelihood estimator for stochastic treatment plans. This implementation
```
nice docs here 👍
```python
# Functions that all TMLEs can call are below
def _tmle_unit_bounds_(y, mini, maxi, bound):
```
This makes me think you may want a `_TMLE` super class with (at least) these two methods, and any other common overlap between `TMLE` and `StochasticTMLE`. It may be premature, though, and premature abstractions can be bad.
```python
        raise ValueError("StochasticTMLE only supports binary exposures currently")

    # Manage outcomes
    if df[outcome].dropna().value_counts().index.isin([0, 1]).all():
```
How often do you do this "trick"? I feel like it is probably repeated many times in zepid. Maybe time to wrap it into a function?
Often enough that it should be put into a function.
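One way the check could be factored out; the helper name `is_binary` is hypothetical, but the body is the same pandas idiom used in the diff above:

```python
import numpy as np
import pandas as pd


def is_binary(series):
    """Hypothetical helper: True if all non-missing values are 0 or 1."""
    return series.dropna().value_counts().index.isin([0, 1]).all()


df = pd.DataFrame({'Y': [0, 1, 1, np.nan, 0],        # binary outcome
                   'C': [0.2, 1.4, 0.0, 1.0, 3.1]})  # continuous outcome

assert is_binary(df['Y'])
assert not is_binary(df['C'])
```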
```rst
.. autoclass:: StochasticTMLE
    :members:

    .. rubric:: Methods
```
TIL
I mostly reviewed the Python code, and not the correctness of the algorithm. Overall, lgtm!
Addition of `StochasticTMLE` as mentioned in #52. `StochasticTMLE` is for stochastic treatment plans, similar to `StochasticIPTW` and `TimeFixedGFormula.fit_stochastic()`.