« search calendars« TRIPODS (Transdisciplinary Research in Principles of Data Science) Seminar Series

« Statistical Inference for High-dimensional Block-wise Missing Data

Statistical Inference for High-dimensional Block-wise Missing Data

March 02, 2022, 11:45 AM - 12:45 PM

Location:

Online Event

Fei Xue, Purdue University

For multi-source data, blocks of variable information from certain sources are likely missing. Most existing methods for handling missing data do not take structures of block-wise missing data into consideration. In this talk, I will describe a Multiple Block-wise Imputation (MBI) approach, which incorporates imputations based on both complete and incomplete observations. Specifically, for a given missing pattern group, the imputations in MBI incorporate more samples from groups with fewer observed variables in addition to the group with complete observations. We propose to construct estimating equations based on all available information, and integrate all estimating functions to achieve efficient estimators. In addition, we propose a nearly unbiased estimator for each individual regression coefficient, which is asymptotically normally distributed under mild conditions. Based on these debiased estimators, asymptotically valid confidence intervals and statistical tests about each regression coefficient are constructed. Numerical studies and ADNI data application confirm that the proposed method outperforms existing methods under various missing mechanisms.
.
Bio: Fei is an Assistant Professor of Statistics at Purdue University. She got her PhD from UIUC and was a postdoc at University of Pennsylvania. Her research interests are data integration, missing data, mediation analysis, machine learning, and statistical genetics.