Inference for optimal dynamic treatment regimes using an adaptive m-out-of-n bootstrap scheme. A dynamic treatment regime consists of a set of decision rules that dictate how to individualize treatment to patients based on available treatment and covariate history. A common method for estimating an optimal dynamic treatment regime from data is Q-learning which involves nonsmooth operations of the data. This nonsmoothness causes standard asymptotic approaches for inference like the bootstrap or Taylor series arguments to breakdown if applied without correction. Here, we consider the m-out-of-n bootstrap for constructing confidence intervals for the parameters indexing the optimal dynamic regime. We propose an adaptive choice of m and show that it produces asymptotically correct confidence sets under fixed alternatives. Furthermore, the proposed method has the advantage of being conceptually and computationally much simple than competing methods possessing this same theoretical property. We provide an extensive simulation study to compare the proposed method with currently available inference procedures. The results suggest that the proposed method delivers nominal coverage while being less conservative than alternatives. The proposed methods are implemented in the qLearn R-package and have been made available on the Comprehensive R-Archive Network (http://cran.r-project.org/). Analysis of the Sequenced Treatment Alternatives to Relieve Depression (STAR * D) study is used as an illustrative example.
Keywords for this software
References in zbMATH (referenced in 5 articles )
Showing results 1 to 5 of 5.
- Laber, Eric B.; Lizotte, Daniel J.; Qian, Min; Pelham, William E.; Murphy, Susan A.: Dynamic treatment regimes: technical challenges and applications (2014)
- Chakraborty, Bibhas; Laber, Eric B.; Zhao, Yingqi: Inference for optimal dynamic treatment regimes using an adaptive $m$-out-of-$n$ bootstrap scheme (2013)
- Chakraborty, Bibhas; Moodie, Erica E.M.: Statistical methods for dynamic treatment regimes. Reinforcement learning, causal inference, and personalized medicine (2013)
- Zhang, Baqun; Tsiatis, Anastasios A.; Laber, Eric B.; Davidian, Marie: Robust estimation of optimal dynamic treatment regimes for sequential treatment decisions (2013)
- Wang, Weiwei; Scharfstein, Daniel; Wang, Chenguang; Daniels, Michael; Needham, Dale; Brower, Roy: Estimating the causal effect of low tidal volume ventilation on survival in patients with acute lung injury (2011)