Papers to Appear

Cover image: Diseases can be interconnected. The study by Hao Mei, Ruofan Jia, Guanzhong Qiao, Zhenqiu Lin, and Shuangge Ma, in the article “Human disease clinical treatment network for the elderly: Analysis of the Medicare inpatient length of stay and readmission data,” uniquely examines disease interconnections from a clinical treatment perspective. Medicare data is analyzed. In this plot, a node represents a Clinical Classifications Software (CCS) defined disease. Two nodes are connected with an edge if their length of stay and/or readmission are correlated. The size of a node is proportional to its importance in this disease network. Different colors correspond to different disease groups, which are composed of tightly interconnected diseases. Overall, this figure and study can advance our understanding of disease interconnections.

Report of the Editors - 2022

Biometric Methodology

Discussion Paper

Adaptive enrichment designs with a continuous biomarker

Nigel Stallard


Rachael V. Phillips and Mark J. van der Laan


James M. S. Wason


Christopher Jennison


Nancy Flournoy and Sergey Tarima


Nigel Stallard

Covariate adjustment in continuous biomarker assessment

Ziyi Li, Yijian Huang, Dattatraya Patil, and Martin G. Sanda

Elastic priors to dynamically borrow information from historical data in clinical trials

Liyun Jiang, Lei Nie, and Ying Yuan

On restricted mean time in favor of treatment

Lu Mao

Dynamic logistic state-space prediction model for clinical decision making

Jiakun Jiang, Wei Yang, Erin M. Schnellinger, Stephen E. Kimmel, and Wensheng Guo

A novel statistical test for treatment differences in clinical trials using a response adaptive forward looking Gittins Index Rule

Helen Yvette Barnett, Sofia S. Villar, Helena Geys, and Thomas Jaki

Sample size considerations for stepped wedge designs with subclusters

Kendra Davis-Plourde, Monica Taljaard, and Fan Li

Functional additive models for optimizing individualized treatment rules

Hyung Park, Eva Petkova, Thaddeus Tarpey, and R. Todd Ogden

Estimation of separable direct and indirect effects in continuous time

Torben Martinussen and Mats Julius Stensrud

Non-iterative adjustment to regression estimators with population-based auxiliary information for semiparametric models

Fei Gao and K. C. G. Chan

Bayesian non-parametric quantile process regression and estimation of marginal quantile effects

Steven G. Xu and Brian J. Reich

Accelerated failure time modeling via nonparametric mixtures

Byungtae Seo and Sangwook Kang 

Jackknife model averaging for high-dimensional quantile regression

Miaomiao Wang, Xinyu Zhang, Alan T. K. Wan, Kang You, and Guohua Zou

Risk prediction with imperfect survival outcome information from electronic health records

Jue Hou, Stephanie F. Chan, Xuan Wang, and Tianxi Cai

CASANOVA: Permutation inference in factorial survival designs

Marc Ditzhaus, Jon Genuneit, Arnold Janssen, and Markus Pauly

A matching procedure for sequential experiments that iteratively learns which covariates improve power

Adam Kapelner and Abba Krieger

SMIM: a unified framework of survival sensitivity analysis using multiple imputation and martingale

Shu Yang, Yilong Zhang, Guanghan Frank Liu, and Qian Guan

Logistic regression analysis of two-phase studies using generalized method of moments

Prosenjit Kundu and Nilanjan Chatterjee

Instrumental variable estimation of complier causal treatment effect with interval-censored data

Shuwei Li and Limin Peng

Integrating sample similarities into latent class analysis: A tree-structured shrinkage approach

Mengbing Li, Daniel E. Park, Maliha Aziz, Cindy M. Liu, Lance B. Price, and Zhenke Wu

Latent group detection in functional partially linear regression models

Wu Wang, Ying Sun, and Huixia Judy Wang

Variable selection in nonlinear function-on-scalar regression

Rahul Ghosal and Arnab Maity

Spectra in Low-Rank Localized Layers (SpeLLL) for interpretable time-frequency analysis

Marie Tuft, Martica H. Hall, and Robert T. Krafty

Modelling publication bias and p-hacking

Jonas Moss and Riccardo De Bin

Generalized case-control sampling under generalized linear models

Jacob M. Maronge, Ran Tao, Jonathan S. Schildcrout, and Paul J. Rathouz

De-biased Lasso for generalized linear models with a diverging number of covariates

Lu Xia, Bin Nan, and Yi Li

Biometric Practice

Accounting for post-randomization variables in meta-analysis: A joint meta-regression approach

Qinshu Lian, Jing Zhang, James S. Hodges, Yong Chen, and Haitao Chu

Interim monitoring in sequential multiple assignment randomized trials

Liwen Wu, Junyao Wang, and Abdus S. Wahed

Inference in response-adaptive clinical trials when the enrolled population varies over time

Massimiliano Russo, Steffen Ventz, Victoria Xin Wang, and Lorenzo Trippa

Improving efficiency of inference in clinical trials with external control data

Xinyu Li, Wang Miao, Fang Lu, and Xiao-Hua Zhou

Human disease clinical treatment network for the elderly: Analysis of the Medicare inpatient length of stay and readmission data

Hao Mei, Ruofan Jia, Guanzhong Qiao, Zhenqiu Lin, and Shuangge Ma

A linear mixed model to estimate COVID-19-induced excess mortality

Johan Verbeeck, Christel Faes, Thomas Neyens, Niel Hens, Geert Verbeke, Patrick Deboosere, and Geert Molenberghs

An individual level infectious disease model in the presence of uncertainty from multiple, imperfect diagnostic tests

Caitlin Ward, Grant D. Brown, and Jacob J. Oleson

A smoothed corrected score approach for proportional hazards model with misclassified discretized covariates induced by error-contaminated continuous time-dependent exposure

Xiao Song, Edward C. Chao, and Ching-Yun Wang

Estimating perinatal critical windows of susceptibility to environmental mixtures via structured Bayesian regression tree pairs

Daniel Mork and Ander Wilson

Bayesian multiple index models for environmental mixtures

Glen McGee, Ander Wilson, Thomas F. Webster, and Brent A. Coull

Sensitivity analyses informed by tests for bias in observational studies

Paul R. Rosenbaum

Evaluating the association between latent classes and competing risks outcomes with multi-phenotype data

Teng Fei, John Hanfelt, and Limin Peng

Causal inference with outcomes truncated by death in multiarm studies

Shanshan Luo, Wei Li, and Yangbo He

Reader Reaction

Reader Reaction to "Outcome-adaptive lasso: variable selection for causal inference" by Shortreed and Ertefaie (2017)

Ismaila Balde, Yi Yang, and Genevieve Lefebvre


Jeremiah Jones, Ashkan Ertefaie, and Susan M. Shortreed

Book Reviews

Gene Expression Data Analysis: A Statistical and Machine Learning Perspective (Dhruba Kumar Bhattacharyya, Jugal Kumar Kalita, and Pankaj Barah)

Reviewed by Amrita Chattopadhyay

Confidence Intervals for Discrete Data in Clinical Research (Vivek Pradhan, Ashis Gangopadhyay, Sandeep M. Menon, Cynthia Basu, and Tathagata Banerjee)

Reviewed by Naitee Ting 

The Effect: An Introduction to Research Design and Causality (Nick Huntington-Klein)

Reviewed by Hung-Ching Chang and Michael T. Gorczyca

Papers to appear in future issues of Biometrics

A novel Bayesian functional spatial partitioning method with application to prostate cancer lesion detection using MRI

Maria Masotti, Lin Zhang, Ethan Leng, Greg J. Metzger, and Joseph S. Koopmeiners

Semiparametric additive time-varying coefficients model for longitudinal data with censored time origin

Yanqing Sun, Qiong Shou, Peter B. Gilbert, Fei Heng, and Xiyuan Qian

Testing for heterogeneity in the utility of a surrogate marker

Layla Parast, Tianxi Cai, and Lu Tian

Selective prediction-set models with coverage rate guarantees

Jean Feng, Arjun Sondhi, Jessica Perry, and Noah Simon

Supervised two-dimensional functional principal component analysis with time-to-event outcomes and mammogram imaging data

Shu Jiang, Jiguo Cao, Bernard Rosner, and Graham A. Colditz

Estimation of the odds ratio in a proportional odds model with censored time-lagged outcome in a randomized clinical trial

Anastasios A. Tsiatis, Marie Davidian, and Shannon T. Holloway

Variable selection in regression-based estimation of dynamic treatment regimes

Zeyu Bian, Erica E.M. Moodie, Susan M. Shortreed, and Sahir Bhatnagar

Assessing intervention effects in a randomized trial within a social network

Shaina J. Alexandria, Michael G. Hudgens, and Allison E. Aiello

Improving trial generalizability using observational studies

Dasom Lee, Shu Yang, Lin Dong, Xiaofei Wang, Donglin Zeng, and Jianwen Cai

A formal causal interpretation of the case-crossover design

Zach Shahn, Miguel A. Hernan, and James M. Robins

Discussion on "A formal causal interpretation of the case-crossover design"

Per Kragh Andersen and Torben Martinussen

Discussion on "A formal causal interpretation of the case-crossover design"

Ruth M. Pfeiffer and Mitchell H. Gail

Discussion on "A formal causal interpretation of the case-crossover design"

Thomas Lumley

Rejoinder to Discussions on "A formal causal interpretation of the case-crossover design"

Zach Shahn, Miguel A. Hernan, and James M. Robins

Estimating cell type composition using isoform expression one gene at a time

Hillary M. Heiling, Douglas R. Wilson, Naim U. Rashid, Wei Sun, and Joseph G. Ibrahim

Design and analysis of two-phase studies with multivariate longitudinal data

Chiara Di Gravio, Ran Tao, and Jonathan S. Schildcrout

Ultra-high dimensional variable selection for doubly robust causal inference

Dingke Tang, Dehan Kong, Wenliang Pan, and Linbo Wang

Bayesian inference for stationary points in Gaussian process regression models for event-related potentials analysis

Cheng-Han Yu, Meng Li, Colin Noe, Simon Fischer-Baum, and Marina Vannucci

Inference for nonparanormal partial correlation via regularized rank-based nodewise regression

Haoyan Hu and Yumou Qiu


Neural networks for clustered and longitudinal data using mixed effects models

Francesca Mandel, Riddhi Pratim Ghosh, and Ian Barnett

Semiparametric count data regression for self-reported mental health

Daniel R. Kowal and Bohan Wu

A repeated measures approach to pooled and calibrated biomarker data

Abigail Sloan, Chao Cheng, Bernard Rosner, Regina G. Ziegler, Stephanie A. Smith-Warner, and Molin Wang

Bayesian nonparametric analysis of restricted mean survival time

Chenyang Zhang and Guosheng Yin

Generalized network structured models with mixed responses subject to measurement error and misclassification

Qihuang Zhang and Yun Grace Yi

Leveraging a surrogate outcome to improve inference on a partially missing target outcome

Zachary R. McCaw, Sheila M. Gaynor, Ryan Sun, and Xihong Lin

Screening methods for linear errors-in-variables models in high dimensions

Linh H. Nghiem, Francis K.C. Hui, Samuel Muller, and A.H. Welsh

Bayesian nonparametric analysis for the detection of spikes in noisy calcium imaging data

Laura D'Angelo, Antonio Canale, Zhaoxia Yu, and Michele Guindani

A joint fairness model with applications to risk predictions for under-represented populations

Hyungrok Do, Shinjini Nandi, Preston Putzel, Padhraic Smyth, and Judy Zhong

Bayesian spatiotemporal modeling on complex-valued fMRI signals via kernel convolutions

Cheng-Han Yu, Raquel Prado, Hernando Ombao, and Daniel Rowe

Multi-source single-cell data integration by MAW Barycenter for Gaussian mixture models

Lin Lin, Wei Shi, Jianbo Ye, and Jia Li

Estimated quadratic inference function for correlated failure time data

Feifei Yan, Yanyan Liu, Jianwen Cai, and Haibo Zhou

The generalized Fisher’s combination and accurate p-value calculation under dependence

Hong Zhang and Zheyang Wu

Joint gene network construction by single-cell RNA sequencing data

Meichen Dong, Yiping He, Yuchao Jiang, and Fei Zou

Inference for set-based effects in genetic association studies with interval-censored outcomes

Ryan Sun, Liang Zhu, Yimei Li, Yutaka Yasui, and Leslie Robison

Bayes optimal informer sets for early-stage drug discovery

Peng Yu, Spencer Ericksen, Anthony Gitter, and Michael A. Newton

Instrumental variable estimation of causal hazard ratio

Linbo Wang, Eric Tchetgen Tchetgen, Torben Martinussen, and Stijn Vansteelandt

Discussion on "Instrumental variable estimation of causal hazard ratio" by Linbo Wang, Eric Tchetgen Tchetgen, Torben Martinussen, and Stijn Vansteelandt

Brigham R. Frandsen

Discussion on "Instrumental variable estimation of causal hazard ratio" by Linbo Wang, Eric Tchetgen Tchetgen, Torben Martinussen, and Stijn Vansteelandt

Benjamin R. Baer, Robert L. Strawderman, and Ashkan Ertefaie

Discussion on "Instrumental variable estimation of causal hazard ratio" by Linbo Wang, Eric Tchetgen Tchetgen, Torben Martinussen, and Stijn Vansteelandt

A. James O’Malley, Pablo Martinez-Camblor, and Todd A. MacKenzie

Rejoinder to discussions on "Instrumental variable estimation of the causal hazard ratio"

Linbo Wang, Eric Tchetgen Tchetgen, Torben Martinussen, and Stijn Vansteelandt

Functional data analysis for longitudinal data with informative observation times

Caleb Weaver, Luo Xiao, and Wenbin Lu

A hierarchical model for analyzing multi-site individual-level disease surveillance data from multiple systems

Yuzi Zhang, Howard H. Chang, Qu Cheng, Philip A. Collender, Ting Li, Jinge He, and Justin V. Remais

Zero-inflated Poisson models with measurement error in response

Qihuang Zhang and Grace Y. Yi

Evaluating treatment effects in group sequential multivariate longitudinal studies with covariate adjustment

Neal O. Jeffries, James F. Troendle, and Nancy L. Geller

A time-heterogeneous D-vine copula model for unbalanced and unequally spaced longitudinal data

Md. Erfanul Hoque, Elif F. Acar, and Mahmoud Torabi

Bayesian interaction selection model for multi-modal neuroimaging data analysis

Yize Zhao, Ben Wu, and Jian Kang

Bayesian sample size determination using commensurate priors to leverage pre-experimental data

Haiyan Zheng, Thomas Jaki, and James M.S. Wason

Feature screening with latent responses

Congran Yu, Wenwen Guo, Xinyuan Song, and Hengjian Cui

Cross-trait prediction accuracy of summary statistics in genome-wide association studies

Bingxin Zhao, Fei Zou, and Hongtu Zhu

Increasing efficiency and reducing bias when assessing HPV vaccination efficacy by using non-targeted HPV strains

Lola Etievant, Joshua N. Sampson, and Mitchell H. Gail

Instrumented difference-in-differences

Ting Ye, Ashkan Ertefaie, James Flory, Sean Hennessy, and Dylan S. Small

Discussion on "Instrumented difference-in-differences" by Ting Ye, Ashkan Ertefaie, James Flory, Sean Hennessy, and Dylan S. Small

Jad Beyhum, Jean-Pierre Florens, and Ingrid Van Keilegom

Discussion on "Instrumented difference-in-differences" by Ting Ye, Ashkan Ertefaie, James Flory, Sean Hennessy, and Dylan S. Small

Zhiqiang Tan

Discussion on "Instrumented difference-in-differences" by Ting Ye, Ashkan Ertefaie, James Flory, Sean Hennessy, and Dylan S. Small

Hyunseung Kang

Discussion on "Instrumented difference-in-differences" by Ting Ye, Ashkan Ertefaie, James Flory, Sean Hennessy, and Dylan S. Small

Karla Diaz-Ordaz

Rejoinder to Discussions on "Instrumented difference-in-differences"

Ting Ye, Ashkan Ertefaie, James Flory, Sean Hennessy, and Dylan S. Small

Decomposition of variation of mixed variables by a latent mixed Gaussian copula model

Yutong Liu, Toni Darville, Xiaojing Zheng, and Quefeng Li

Continuous time-interaction processes for population size estimation, with an application to drug dealing in Italy

Linda Altieri, Alessio Farcomeni, and Danilo Alunni Fegatelli

Clustering high-dimensional data via feature selection

Tianqi Liu, Yu Lu, Biqing Zhu, and Hongyu Zhao

A general framework of nonparametric feature selection in high-dimensional data

Hang Yu, Yuanjia Wang, and Donglin Zeng

A note on familywise error rate for a primary and secondary endpoint

Michael A. Proschan and Dean A. Follmann

Nonparametric and semiparametric estimation with sequentially truncated survival data

Rebecca A. Betensky, Jing Qian, and Jingyao Hou

Multi-kink quantile regression for longitudinal data with application to progesterone data analysis

Chuang Wan, Wei Zhong, Wenyang Zhang, and Changliang Zou

Nonparametric estimation of the causal effect of a stochastic threshold-based intervention

Lars van der Laan, Wenbo Zhang, and Peter B. Gilbert

Score test for missing at random or not under logistic missingness models

Hairu Wang, Zhiping Lu, and Yukun Liu

Domain selection and family-wise error rate for functional data: a unified framework

Konrad Abramowicz, Alessia Pini, Lina Schelin, Sara Sjostedt de Luna, Aymeric Stamm, and Simone Vantini

Robust approach to combining multiple markers to improve surrogacy

Xuan Wang, Layla Parast, Larry Han, Lu Tian, and Tianxi Cai

Model-based clustering of high-dimensional longitudinal data via regularization

Luoying Yang and Tong Tong Wu

Robust Bayesian variable selection for gene-environment interactions

Jie Ren, Fei Zhou, Xiaoxi Li, Shuangge Ma, Yu Jiang, and Cen Wu

An alternative metric for evaluating the potential patient benefit of response-adaptive randomization procedures

Jennifer Proper and Thomas A. Murray

Random projection ensemble classification with high-dimensional time series

Fuli Zhang and Kung-Sik Chan

Coherent modeling of longitudinal causal effects on binary outcomes

Linbo Wang, Xiang Meng, Thomas Richardson, and James Robins

A Bayesian model with application for adaptive platform trials having temporal changes

Chenguang Wang, Min Lin, Gary L. Rosner, and Guoxing Soon

A compound decision approach to covariance matrix estimation

Huiqin Xin and Sihai Dave Zhao

A cross-validation statistical framework for asymmetric data integration

Lam Tran, Kevin He, Di Wang, and Hui Jiang

A Bayesian platform trial design to simultaneously evaluate multiple drugs in multiple indications with mixed endpoints

Yujie Zhao, Rui (Sammi) Tang, Yeting Du, and Ying Yuan

Functional group bridge for simultaneous regression and support estimation

Zhengjia Wang, John Magnotti, Michael S. Beauchamp, and Meng Li

Exact-corrected confidence interval for risk difference in noninferiority binomial trials

Nour Hawila and Arthur Berg

An eigenvalue ratio approach to inferring population structure from whole genome sequencing data

Yuyang Xu, Zhonghua Liu, and Jianfeng Yao

Power analysis for cluster randomized trials with continuous co-primary endpoints

Siyun Yang, Mirjam Moerbeek, Monica Taljaard, and Fan Li

Simplifying the estimation of diagnostic testing accuracy over time for high specificity tests in the absence of a gold standard

Clara Drew, Moses Badio, Dehkontee Dennis, Lisa Hensley, Elizabeth Higgs, Michael Sneller, Mosoka Fallah, and Cavan Reilly

A Bayesian functional data model for surveys collected under informative sampling with application to mortality estimation using NHANES

Paul A. Parker and Scott H. Holan

Translocation detection from Hi-C data via scan statistics

Anthony Cheng, Disheng Mao, Yuping Zhang, Joseph Glaz, and Zhengqing Ouyang

Closed testing with Globaltest, with application in metabolomics

Ningning Xu, Aldo Solari, and Jelle J. Goeman

Robust functional principal component analysis via a functional pairwise spatial sign operator

Guangxing Wang, Sisheng Liu, Fang Han, and Chong-Zhi Di

Efficient and robust methods for causally interpretable meta-analysis: transporting inferences from multiple randomized trials to a target population

Issa J. Dahabreh, Sarah E. Robertson, Lucia C. Petito, Miguel A. Hernan, and Jon A. Steingrimsson

Nonparametric inverse probability weighted estimators based on the highly adaptive lasso

Ashkan Ertefaie, Nima S. Hejazi, and Mark J. van der Laan

It's all relative: regression analysis with compositional predictors

Gen Li, Yan Li, and Kun Chen

Post-treatment confounding in causal mediation studies: A cutting-edge problem and a novel solution via sensitivity analysis

Guanglei Hong, Fan Yang, and Xu Qin

Optimal test procedures for multiple hypotheses controlling the familywise expected loss

Willi Maurer, Frank Bretz, and Xiaolei Xun

Discussion on "Optimal test procedures for multiple hypotheses controlling the familywise expected loss" by Willi Maurer, Frank Bretz, and Xiaolei Xun

Yoav Benjamini, Ruth Heller, Abba Krieger, and Saharon Rosset

Discussion on "Optimal test procedures for multiple hypotheses controlling the familywise expected loss" by Willi Maurer, Frank Bretz, and Xiaolei Xun

Sudipto Banerjee

Discussion on "Optimal test procedures for multiple hypotheses controlling the familywise expected loss" by Willi Maurer, Frank Bretz, and Xiaolei Xun

Lisa M. LaVange, Ethan M. Alt, and Joseph G. Ibrahim

Discussion on "Optimal test procedures for multiple hypotheses controlling the familywise expected loss" by Willi Maurer, Frank Bretz, and Xiaolei Xun

Werner Brannath

Flexible copula model for integrating correlated multi-omics data from single-cell experiments

Zichen Ma, Shannon W. Davis, and Yen-Yi Ho

Semiparametric distributed lag quantile regression for modeling time-dependent exposure mixtures

Yuyan Wang, Akhgar Ghassabian, Bo Gu, Yelena Afanasyeva, Yiwei Li, Leonardo Trasande, and Mengling Liu

Structural cumulative survival models for estimation of treatment effects accounting for treatment switching in randomized experiments

Andrew Ying and Eric J. Tchetgen Tchetgen

Elastic analysis of irregularly or sparsely sampled curves

Lisa Steyer, Almond Stocker, and Sonja Greven

Automated analysis of low-field brain MRI in cerebral malaria

Danni Tu, Manu S. Goyal, Jordan D. Dworkin, Samuel Kampondeni, Lorenna Vidal, Eric Biondo-Savin, Sandeep Juvvadi, Prashant Raghavan, Jennifer Nicholas, Karen Chetcuti, Kelly Clark, Timothy Robert-Fitzgerald, Theodore D. Satterthwaite, Paul Yushkevich, Christos Davatzikos, Guray Erus, Nicholas J. Tustison, Douglas G. Postels, Terrie E. Taylor, Dylan S. Small, and Russell T. Shinohara

Subset selection for linear mixed models

Daniel R. Kowal

General independent censoring in event-driven trials with staggered entry

Jasmin Ruhl, Jan Beyersmann, and Sarah Friedrich

A general framework for subgroup detection via one-step value difference estimation

Dana Johnson, Wenbin Lu, and Marie Davidian

The central role of the identifying assumption in population size estimation

Serge Aleshin-Guendel, Mauricio Sadinle, and Jon Wakefield

Discussion on "The central role of the identifying assumption in population size estimation" by Serge Aleshin-Guendel, Mauricio Sadinle, and Jon Wakefield

John Whitehead

Discussion on "The central role of the identifying assumption in population size estimation" by Serge Aleshin-Guendel, Mauricio Sadinle, and Jon Wakefield

Li-Chun Zhang

Discussion on "The central role of the identifying assumption in population size estimation" by Serge Aleshin-Guendel, Mauricio Sadinle, and Jon Wakefield

Ruth King, Rachel McCrea, and Antony Overstall

Discussion on "The central role of the identifying assumption in population size estimation" by Serge Aleshin-Guendel, Mauricio Sadinle, and Jon Wakefield

Daniel Manrique-Vallier

Multi-wave validation sampling for error-prone electronic health records

Bryan E. Shepherd, Kyunghee Han, Tong Chen, Aihua Bian, Shannon Pugh, Stephany N. Duda, Thomas Lumley, William J. Heerman, and Pamela A. Shaw

Nonparametric inference of general while-alive estimands for recurrent events

Lu Mao

Dimension reduction for integrative survival analysis

Aaron J. Molstad and Rohit K. Patra

Pair-switching rerandomization

Ke Zhu and Hanzhong Liu

Infinite hidden Markov models for multiple multivariate time series with missing data

Lauren Hoskovec, Matthew D. Koslovsky, Kirsten Koehler, Nicholas Good, Jennifer L. Peel, John Volckens, and Ander Wilson

Concordance indices with left-truncated and right-censored data

Nicholas Hartman, Sehee Kim, Kevin He, and John D. Kalbfleisch

Solutions for surrogacy validation with longitudinal outcomes for a gene therapy

Emily K. Roberts, Michael R. Elliott, and Jeremy M. G. Taylor

Prioritizing candidate peptides for cancer vaccines through predicting peptide presentation by HLA-I proteins

Laura Y. Zhou, Fei Zou, and Wei Sun

Grouped generalized estimating equations for longitudinal data analysis

Tsubasa Ito and Shonosuke Sugasawa

A high-dimensional mediation model for a neuroimaging mediator: integrating clinical, neuroimaging, and neurocognitive data to mitigate late effects in pediatric cancer

Xiaoqing Jade Wang, Yimei Li, Wilburn E. Reddick, Heather M. Conklin, John O. Glass, Arzu Onar-Thomas, Amar Gajjar, Cheng Cheng, and Zhao-Hua Lu

Double reduction estimation and equilibrium tests in natural autopolyploid populations

David Gerard 

A robust approach for electronic health record-based case-control studies with contaminated case pools

Guorong Dai, Yanyuan Ma, Jill Schnall, Jinbo Chen, and Raymond J. Carroll

Centre-augmented $\ell_2$-type regularization for subgroup learning

Ling Zhou, Ye He, Yingcun Xia, and Huazhen Lin

Quantile regression for nonignorable missing data with its application of analyzing electronic medical records

Aiai Yu, Yujie Zhong, Xingdong Feng, and Ying Wei

A Bayesian multivariate mixture model for high throughput spatial transcriptomics

Carter Allen, Yuzhou Chang, Brian Neelon, Won Chang, Hang J. Kim, Zihai Li, Qin Ma, and Dongjun Chung

Joint semiparametric models for case-cohort designs

Weibin Zhong and Guoqing Diao

Tractable Bayes of skew-elliptical link models for correlated binary data

Zhongwei Zhang, Reinaldo B. Arellano-Valle, Marc G. Genton, and Raphael Huser

Optimal multiple testing and design in clinical trials

Ruth Heller, Abba Krieger, and Saharon Rosset

A general modelling framework for open wildlife populations based on the Polya Tree prior

Alex Diana, Eleni Matechou, Jim Griffin, Todd Arnold, Simone Tenan, and Stefano Volponi

A novel penalized inverse-variance weighted estimator for Mendelian randomization with applications to COVID-19 outcomes

Siqi Xu, Peng Wang, Wing Kam Fung, and Zhonghua Liu

Testing weak nulls in matched observational studies

Colin B. Fogarty

Neural network on interval censored data with application to the prediction of Alzheimer’s disease

Tao Sun and Ying Ding

A latent state space model for estimating brain dynamics from electroencephalogram (EEG) data

Qinxia Wang, Ji Loh, Xiaofu He, and Yuanjia Wang

Mendelian randomization mixed-scale treatment effect robust identification and estimation for causal inference

Zhonghua Liu, Ting Ye, Baoluo Sun, Mary Schooling, and Eric Tchetgen Tchetgen

Integrative Bayesian models using post-selective inference: a case study in radiogenomics

Snigdha Panigrahi, Shariq Mohammed, Arvind Rao, and Veerabhadran Baladandayuthapani

Bayesian treatment screening and selection using subgroup-specific utilities of response and toxicity

Juhee Lee, Peter F. Thall, and Pavlos Msaouel

Delivering spatially comparable inference on the risks of multiple severities of respiratory disease from spatially misaligned disease count data

Duncan Lee and Craig Anderson

Generalized propensity score approach to causal inference with spatial interference

A. Giffin, B. J. Reich, S. Yang, and A. G. Rappold

Bayesian regression analysis of skewed tensor responses

Inkoo Lee, Debajyoti Sinha, Qing Mai, Xin Zhang, and Dipankar Bandyopadhyay

Functional data analysis with covariate-dependent mean and covariance structures

Chenlin Zhang, Huazhen Lin, Li Liu, Jin Liu, and Yi Li

Multidimensional adaptive P-splines with application to neurons’ activity studies

Maria Xose Rodriguez-Alvarez, Maria Durban, Paul H. C. Eilers, Dae-Jin Lee, and Francisco Gonzalez

Simultaneous cluster structure learning and estimation of heterogeneous graphs for matrix-variate fMRI data

Dong Liu, Changwei Zhao, Yong He, Lei Liu, Ying Guo, and Xinsheng Zhang

Estimating tree-based dynamic treatment regimes using observational data with restricted treatment sequences

Nina Zhou, Lu Wang, and Daniel Almirall

Joint inference for competing risks data using multiple endpoints

Jiyang Wen, Chen Hu, and Mei-Cheng Wang

Bayesian hierarchical quantile regression with application to characterizing the immune architecture of lung cancer

Priyam Das, Christine B. Peterson, Yang Ni, Alexandre Reuben, Jiexin Zhang, Jianjun Zhang, Kim-Anh Do, and Veerabhadran Baladandayuthapani

Segmented correspondence curve regression for quantifying covariate effects on the reproducibility of high-throughput experiments

Feipeng Zhang and Qunhua Li

Maximum likelihood estimation in the additive hazards model

Chengyuan Lu, Jelle Goeman, and Hein Putter

Frequentist model averaging for undirected Gaussian graphical models

Huihang Liu and Xinyu Zhang

Misdiagnosis-related harm quantification through mixture models and harm measures

Yuxin Zhu, Zheyu Wang, and David Newman-Toker

Adaptive Bayesian sum of trees model for covariate dependent spectral analysis

Yakun Wang, Zeda Li, and Scott A. Bruce

Penalized estimation of frailty-based illness-death models for semi-competing risks

Harrison T. Reeder, Junwei Lu, and Sebastien J. Haneuse

Change-plane analysis for subgroup detection with a continuous treatment

Peng Jin, Wenbin Lu, Yu Chen, and Mengling Liu

Concave likelihood-based regression with finite-support response variables

K. O. Ekvall and M. Bottai

Boosting distributional copula regression

Nicolai Hans, Nadja Klein, Florian Faschingbauer, Michael Schneider, and Andreas Mayr 

Associating somatic mutation with clinical outcomes through kernel regression and optimal transport

Paul Little, Li Hsu, and Wei Sun

Asynchronous functional linear regression models for longitudinal data in reproducing kernel Hilbert space

Ting Li, Huichen Zhu, Tengfei Li, and Hongtu Zhu

Comparing COVID-19 incidences longitudinally per economic sector against the background of preventive measures and vaccination

Florian Stijven, Johan Verbeeck, and Geert Molenberghs

On generalized latent factor modeling and inference for high-dimensional binomial data

Ting Fung Ma, Fangfang Wang, and Jun Zhu

Microbiome subcommunity learning with logistic-tree normal latent Dirichlet allocation

Patrick LeBlanc and Li Ma

Spatial dependence modeling of latent susceptibility and time to joint damage in psoriatic arthritis

Fangya Mao and Richard J. Cook

Pattern-based clustering of daily weigh-in trajectories using dynamic time warping

Samantha Bothwell, Alex Kaizer, Ryan Peterson, Danielle Ostendorf, Victoria Catenacci, and Julia Wrobel

Combining parametric and nonparametric models to estimate treatment effects in observational studies

Daniel Daly-Grafstein and Paul Gustafson

Identifying brain hierarchical structures associated with Alzheimer’s disease

Yi Zhao, Bingkai Wang, Chin-Fu Liu, Andreia V. Faria, Michael I. Miller, Brian S. Caffo, and Xi Luo 

How well can fine balance work for covariate balancing

Ruoqi Yu

Semiparametric estimation of the transformation model by leveraging external aggregate data in the presence of population heterogeneity

Yu-Jen Cheng, Yen-Chun Liu, Chang-Yu Tsai, and Chiung-Yu Huang

Contrasting principal stratum and hypothetical strategy estimands in multi-period crossover trials with incomplete data

John N.S. Matthews, Sofia Bazakou, Robin Henderson, and Linda D. Sharples

CEDAR: Communication Efficient Distributed Analysis for Regressions

C. Chang, Z. Bu, and Q. Long

Statistical inference and power analysis for direct and spillover effects in two-stage randomized experiments

Zhichao Jiang, Kosuke Imai, and Anup Malani

Estimating the area under the ROC curve when transporting a prediction model to a target population

Bing Li, Constantine Gatsonis, Issa J. Dahabreh, and Jon A. Steingrimsson

Latent multinomial models for extended batch mark data

Wei Zhang, Simon Bonner, and Rachel McCrea

A sensitivity analysis approach for the causal hazard ratio in randomized and observational studies

Rachel Axelrod and Daniel Nevo

Marginal proportional hazards models for clustered interval-censored data with time-dependent covariates

Kaitlyn Cook, Wenbin Lu, and Rui Wang

Efficient targeted learning of heterogeneous treatment effects for multiple subgroups

Waverly Wei, Maya Petersen, Mark J. van der Laan, Zeyu Zheng, Chong Wu, and Jingshen Wang

Improved semiparametric estimation of the proportional rate model with recurrent event data

Ming-Yueh Huang and Chiung-Yu Huang

Nonparametric scanning tests of homogeneity for hierarchical models with continuous covariates

David Todem, Wei-Wen Hsu, and KyungMann Kim

Identifying alert concentrations using a model-based bootstrap approach

Kathrin Möllenhoff, Kirsten Schorning, and Franziska Kappenberg

Hospital profiling using Bayesian decision theory

Johannes Hengelbrock, Johannes Rauh, Jona Cederbaum, Maximilian Kähler, and Michael Höhle

A semiparametric joint model for cluster size and subunit-specific interval-censored outcomes

Chun Yin Lee, Kin Yau Wong, K. F. Lam, and Dipankar Bandyopadhyay

Non-parametric estimation of the age-at-onset distribution from a cross-sectional sample

Soutrik Mandal, Jing Qin, and Ruth M. Pfeiffer

An information ratio based goodness-of-fit test for copula models on censored data

Tao Sun, Yu Cheng, and Ying Ding

Assessing exposure-time treatment effect heterogeneity in stepped wedge cluster randomized trials

Lara Maleyeff, Fan Li, Sebastien Haneuse, and Rui Wang

Design considerations for two stage enrichment clinical trials

Rosamarie Frieri, William F. Rosenberger, Nancy Flournoy, and Zhantao Lin

Bayesian sample size calculations for comparing two strategies in SMART studies

Armando Turchetta, Erica E. M. Moodie, David A. Stephens, and Sylvie D. Lambert

Efficient and robust approaches for analysis of SMARTs: illustration using the ADAPT-R trial

Lina Montoya, Michael Kosorok, Elvin Geng, Joshua Schwab, Thomas Odeny, and Maya Petersen

Inference for the dimension of a regression relationship using pseudo-covariates

Shih-Hao Huang, Kerby Shedden, and Hsin-wen Chang

Model uncertainty quantification in Cox regression

Gonzalo Garcia-Donato, Stefano Cabras, and Maria Eugenia Castellanos

Age-related model for estimating the symptomatic and asymptomatic transmissibility of COVID-19 patients

Jianbin Tan, Ye Shen, Yang Ge, Leonardo Martinez, and Hui Huang

Tensor response quantile regression with neuroimaging data

Bo Wei, Limin Peng, Ying Guo, Amita Manatunga, and Jennifer Stevens

Correcting delayed reporting of COVID-19 using the Generalized-Dirichlet-Multinomial method

Oliver Stoner, Alba Halliday, and Theo Economou

Two-level Bayesian interaction analysis for survival data incorporating pathway information

Xing Qin, Shuangge Ma, and Mengyun Wu

Fast Bayesian inference for large occupancy datasets

Alex Diana, Emily B. Dennis, Eleni Matechou, and Byron J.T. Morgan

Adjusting for publication bias in meta-analysis via inverse probability weighting using clinical trial registries

Ao Huang, Kosuke Morikawa, Tim Friede, and Satoshi Hattori

Consistent estimation of the number of communities via regularized network embedding

Mingyang Ren, Sanguo Zhang, and Junhui Wang

Spatially adaptive calibrations of AirBox PM2.5 data

Hsin-Cheng Huang

Additive subdistribution hazards regression for competing risks data in case-cohort studies

Adane F. Wogu, Haolin Li, Shanshan Zhao, Hazel B. Nichols, and Jianwen Cai

Stabilized direct learning for efficient estimation of individualized treatment rules

Kushal S. Shah, Haoda Fu, and Michael R. Kosorok

Bayesian design of multi-regional clinical trials with time-to-event endpoints

Nathan W. Bean, Joseph G. Ibrahim, and Matthew A. Psioda

Relative contrast estimation and inference for treatment recommendation

Muxuan Liang and Menggang Yu

Competition-based control of the false discovery proportion

Dong Luo, Arya Ebadi, Kristen Emery, Yilun He, William Stafford Noble, and Uri Keich

Optimal sampling for positive only electronic health record data

Seong-ho Lee, Yanyuan Ma, Ying Wei, and Jinbo Chen

Combining observational and experimental datasets using shrinkage estimators

Evan T. R. Rosenman, Guillaume Basse, Art B. Owen, and Mike Baiocchi

Bayesian inference for a principal stratum estimand on recurrent events truncated by death

Tianmeng Lyu, Bjorn Bornkamp, Guenther Mueller-Velten, and Heinz Schmidli

Entropy balancing for causal generalization with target sample summary information

Rui Chen, Guanhua Chen, and Menggang Yu

A seasonality-adjusted sequential test for vaccine safety surveillance

Rex Shen, Keran Moll, Ying Lu, and Lu Tian

Estimating population size: the importance of model and estimator choice

Matthew R. Schofield, Richard J. Barker, William A. Link, and Heloise Pavanato

Latent trajectory models for spatio-temporal dynamics in Alaskan ecosystems

Xinyi Lu, Mevin B. Hooten, Ann M. Raiho, David K. Swanson, Carl A. Roland, and Sarah E. Stehn

Bayesian nonparametric adjustment of confounding

Chanmin Kim, Mauricio Tec, and Corwin Zigler

Nonlinear function-on-scalar regression via functional universal approximation

Ruiyan Luo and Xin Qi

Homogeneity tests of covariance for high-dimensional functional data with applications to event segmentation

Ping-Shou Zhong

DROID: Dose-ranging approach to optimizing dose in oncology drug development

Beibei Guo and Ying Yuan

Analyzing data in complicated 3D domains: smoothing, semiparametric regression and functional principal component analysis

Eleonora Arnone, Luca Negri, Ferruccio Panzica, and Laura M. Sangalli

Improved inference for doubly robust estimators of heterogeneous treatment effects

Heejun Shin and Joseph Antonelli

Spatial modeling of M. tuberculosis transmission with dyadic genetic relatedness data

Joshua L. Warren, Melanie H. Chitwood, Benjamin Sobkowiak, Caroline Colijn, and Ted Cohen

A nonparametric test of group distributional differences for hierarchically-clustered functional data

Alexander S. Long, Brian J. Reich, Ana-Maria Staicu, and John Meitzen

Individualized causal discovery with latent trajectory embedded Bayesian networks

Fangting Zhou, Kejun He, and Yang Ni

Finding influential subjects in a network using a causal framework

Youjin Lee, Ashley Buchanan, Elizabeth Ogburn, Samuel R. Friedman, M. Elizabeth Halloran, Natallia V. Katenka, Jing Wu, and Georgios Nikolopoulos

Modelling COVID-19 contact-tracing using the ratio regression capture-recapture approach

D. Böhning, R. Lerdsuwansri, and P. Sangnawakij

Latent deformation models for multivariate functional data and time warping separability

Cody Carroll and Hans-Georg Mueller

Covariate-adjusted response-adaptive designs based on semiparametric approaches

Hai Zhu and Hongjian Zhu

Efficient and flexible estimation of natural direct and indirect effects under intermediate confounding and monotonicity constraints

Kara E. Rudolph, Nicholas Williams, and Ivan Diaz

Causal mediation analysis using image mediator bounded in irregular domain with an application to breast cancer

Shu Jiang and Graham A. Colditz

FDR controlled multiple testing for union null hypotheses: A knockoff-based approach

Ran Dai and Cheng Zheng

Nonparametric failure time: time-to-event machine learning with heteroskedastic Bayesian additive regression trees and low information omnibus Dirichlet process mixtures

R.A. Sparapani, B.R. Logan, M.J. Maiers, P.W. Laud, and R.E. McCulloch

Estimation of time-specific intervention effects on continuously distributed time-to-event outcomes by targeted maximum likelihood estimation

Helene C. W. Rytgaard, Frank Eriksson, and Mark J. van der Laan

A Bayesian zero-inflated Dirichlet-multinomial regression model for multivariate compositional count data

Matthew D. Koslovsky

A synthetic data integration framework to leverage external summary-level information from heterogeneous populations

Tian Gu, Jeremy M.G. Taylor, and Bhramar Mukherjee

Interim monitoring of sequential multiple assignment randomized trials using partial information

Cole Manschot, Eric Laber, and Marie Davidian

Longitudinal incremental propensity score interventions for limited resource settings

Aaron Sarvet, Kerollos N. Wanis, Jessica Young, Roberto Hernandez-Alejandro, and Mats J. Stensrud

Information criteria for detecting change-points in the Cox proportional hazards model

Ryoto Ozaki and Yoshiyuki Ninomiya

An efficient data integration scheme for synthesizing information from multiple secondary datasets for the parameter inference of the main analysis

Chixiang Chen, Ming Wang, and Shuo Chen

Supervised convex clustering

Minjie Wang, Tianyi Yao, and Genevera I. Allen

Detecting the spatial clustering of exposure-response relationships with estimation error: a novel spatial scan statistic

Wei Wang, Sheng Li, Tao Zhang, Fei Yin, and Yue Ma

Asynchronous and error-prone longitudinal data analysis via functional calibration

Xinyue Chang, Yehua Li, and Yi Li

Combining mixed effects hidden Markov models with latent alternating recurrent event processes to model diurnal active-rest cycles

Benny Ren and Ian Barnett

Identifying and estimating effects of sustained interventions under parallel trends assumptions

Audrey Renson, Michael G. Hudgens, Alexander P. Keil, Paul N. Zivich, and Allison E. Aiello

Conditional cross-design synthesis estimators for generalizability in Medicaid

Irina Degtiar, Tim Layton, Jacob Wallace, and Sherri Rose

Estimating optimal individualized treatment rules with multistate processes

Giorgos Bakoyannis

Sparse Bayesian modeling of hierarchical independent component analysis: reliable estimation of individual differences in brain networks

Joshua Lukemire, Giuseppe Pagnoni, and Ying Guo

Sparse estimation in semi-parametric finite mixture of varying coefficient regression models

Abbas Khalili, Farhad Shokoohi, Masoud Asgharian, and Shili Lin

Imputation-based Q-learning for optimizing dynamic treatment regimes with right-censored survival outcome

Lingyun Lyu, Yu Cheng, and Abdus S. Wahed

Bi-level structured functional analysis for genome-wide association studies

Mengyun Wu, Fan Wang, Yeheng Ge, Shuangge Ma, and Yang Li

On interquantile smoothness of censored quantile regression with induced smoothing (CQRIS)

Zexi Cai and Tony Sit

An accelerated failure time regression model for illness-death data: A frailty approach

Lea Kats and Malka Gorfine

A case study of glucose levels during sleep using multilevel fast function on scalar regression inference

Renat Sergazinov, Andrew Leroux, Erjia Cui, Ciprian Crainiceanu, R. Nisha Aurora, Naresh M. Punjabi, and Irina Gaynanova

Pathological imaging-assisted cancer gene-environment interaction analysis

Kuangnan Fang, Jingmao Li, Qingzhao Zhang, Yaqing Xu, and Shuangge Ma

Correcting for bias due to mismeasured exposure history in longitudinal studies with continuous outcomes

Jiachen Cai, Ning Zhang, Xin Zhou, Donna Spiegelman, and Molin Wang

Group variable selection for Cox model with interval-censored failure time data

Yuxiang Wu, Hui Zhao, and Jianguo Sun

Instability of inverse probability weighting methods and a remedy for non-ignorable missing data

Pengfei Li, Jing Qin, and Yukun Liu

Bayesian functional data analysis over dependent regions and its application for identification of differentially methylated regions

Suvo Chatterjee, Shrabanti Chowdhury, Duchwan Ryu, and Sanjib Basu

Hierarchical nuclear norm penalization for multi-view data integration

Sangyoon Yi, Raymond K. W. Wong, and Irina Gaynanova

Flexible joint modeling of mean and dispersion for the directional tuning of neuronal spike counts

Maria Alonso-Pena, Irene Gijbels, and Rosa M. Crujeiras

A double robust test for high-dimensional gene co-expression networks conditioning on clinical information

Maomao Ding, Ruosha Li, Jin Qin, and Jing Ning

Prior and posterior checking of implicit causal assumptions

Antonio R. Linero

Constructing time-invariant dynamic surveillance rules for optimal monitoring schedules

Xinyuan Dong, Yingye Zheng, Daniel W. Lin, Lisa Newcomb, and Ying-Qi Zhao

Dirichlet process mixture models for the analysis of repeated attempt designs

M.J. Daniels, M. Lee, and W. Feng

Conditional inference in cis-Mendelian randomization using weak genetic factors

Ashish Patel, Dipender Gill, Paul Newcombe, and Stephen Burgess

A stochastic block Ising model for multi-layer networks with inter-layer dependence

Jingnan Zhang, Chengye Li, and Junhui Wang

Dynamic enrichment of Bayesian small sample, sequential, multiple assignment randomized trial (snSMART) design using natural history data: A case study from Duchenne muscular dystrophy

Sidi Wang, Kelley M. Kidwell, and Satrajit Roychoudhury

Ensuring valid inference for Cox hazard ratios after variable selection

Kelly Van Lancker, Oliver Dukes, and Stijn Vansteelandt