Abstract
Multiple reference panels often exist for a given tissue or across multiple tissues, and multiple regression methods can be used to train gene expression imputation models for transcriptome-wide association studies (TWAS). To leverage expression imputation models (i.e., base models) trained with multiple reference panels, regression methods, and tissues, we develop a Stacked Regression based TWAS (SR-TWAS) tool that obtains optimal linear combinations of base models for a given validation transcriptomic dataset. Both simulation and real studies showed that SR-TWAS improved power, owing to increased effective training sample sizes and strength borrowed across multiple regression methods and tissues. Leveraging base models across multiple reference panels, tissues, and regression methods, our real application studies identified 6 independent significant risk genes for Alzheimer’s disease (AD) dementia for supplementary motor area tissue and 9 independent significant risk genes for Parkinson’s disease (PD) for substantia nigra tissue. Relevant biological interpretations were found for these significant risk genes.
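As an illustration of what "optimal linear combinations of base models" can mean in practice, the following minimal sketch fits stacking weights on a validation set. It is not the SR-TWAS implementation: the function name stack_weights, the simulated data, the squared-error objective, and the constraint that weights are non-negative and sum to 1 are all assumptions made for this example.

```python
import numpy as np
from scipy.optimize import minimize


def stack_weights(base_preds, observed):
    """Fit stacking weights for one gene on a validation dataset.

    base_preds: (n_samples, n_models) expression predicted by each base model.
    observed:   (n_samples,) measured expression in the validation dataset.
    Assumption for this sketch: weights are non-negative and sum to 1.
    """
    n_models = base_preds.shape[1]

    def loss(w):
        # Sum of squared errors of the weighted combination of base models.
        resid = observed - base_preds @ w
        return float(resid @ resid)

    result = minimize(
        loss,
        x0=np.full(n_models, 1.0 / n_models),  # start from equal weights
        method="SLSQP",
        bounds=[(0.0, 1.0)] * n_models,
        constraints={"type": "eq", "fun": lambda w: np.sum(w) - 1.0},
    )
    return result.x


# Simulated validation data (hypothetical, for illustration only).
rng = np.random.default_rng(0)
n = 200
true_expr = rng.normal(size=n)
base_preds = np.column_stack([
    true_expr + rng.normal(scale=0.3, size=n),  # accurate base model
    true_expr + rng.normal(scale=1.0, size=n),  # noisier base model
    rng.normal(size=n),                         # uninformative base model
])

weights = stack_weights(base_preds, true_expr)
stacked_pred = base_preds @ weights

# Compare the stacked predictor with a naive equal-weight average.
mse_stacked = float(np.mean((true_expr - stacked_pred) ** 2))
mse_average = float(np.mean((true_expr - base_preds.mean(axis=1)) ** 2))
print("weights:", np.round(weights, 3))
print("MSE stacked:", round(mse_stacked, 4), "MSE average:", round(mse_average, 4))
```

In this toy setting the fitted weights favor the most accurate base model, so the stacked predictor should do at least as well on the validation data as the equal-weight average, which is one feasible choice of weights under the same constraints.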
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
RP and JY are supported by National Institutes of Health (NIH/NIGMS) grant award R35GM138313. MPE was supported by NIH/NIGMS grant award R01GM117946 and NIH/NIA grant award RF1AG071170. ROS/MAP study data were provided by the Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, IL. Data collection was supported through funding by NIA grants P30AG10161, R01AG15819, R01AG17917, R01AG30146, R01AG36836, R01AG56352, U01AG32984, U01AG46152, U01AG61356, the Illinois Department of Public Health, and the Translational Genomics Research Institute.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
ROS/MAP data are available through the Rush Alzheimer's Disease Center Research Resource Sharing Hub: https://www.radc.rush.edu. GTEx V8 data are available from dbGaP with accession phs000424.v8.p2. TIGAR_GTEx base models trained from GTEx V8 are available at Synapse: https://www.synapse.org/TIGAR_V2_Resource_GTExV8. PrediXcan_GTEx base models trained from GTEx V8 are available from: https://predictdb.org/. GWAS summary data of AD are available from: https://ctg.cncr.nl/software/summary_statistics. GWAS summary data of PD are available from: https://bit.ly/2ofzGrk.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
Updating to include figures that were not uploaded in the previous revision.
Data availability
All ROS/MAP data analyzed in this study are de-identified and available to any qualified investigator on application through the Rush Alzheimer’s Disease Center Research Resource Sharing Hub, https://www.radc.rush.edu, which describes the studies and the available data. GTEx V8 data are available from dbGaP with accession phs000424.v8.p2 and from the GTEx Portal, https://www.gtexportal.org/home/. TIGAR DPR base models trained from GTEx V8 are available at Synapse, https://www.synapse.org/TIGAR_V2_Resource_GTExV8. PrediXcan Elastic-Net base models trained from GTEx V8 are available from https://predictdb.org/. GWAS summary data of AD are available from https://ctg.cncr.nl/software/summary_statistics, and GWAS summary data of PD are available from https://bit.ly/2ofzGrk. The following are freely available from Synapse, https://doi.org/10.7303/syn53437281: TIGAR DPR and PrediXcan Elastic-Net base models of ROS/MAP tissues (DLPFC, SMA); SR-TWAS and Avg-valid+SR models trained in this study from ROS/MAP SMA tissue and GTEx brain substantia nigra tissue; and all TWAS summary statistics generated in this study.