Evaluating and implementing block jackknife resampling Mendelian randomization to mitigate bias induced by overlapping samples

11Citations
Citations of this article
26Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Participant overlap can induce overfitting bias into Mendelian randomization (MR) and polygenic risk score (PRS) studies. Here, we evaluated a block jackknife resampling framework for genome-wide association studies (GWAS) and PRS construction to mitigate overfitting bias in MR analyses and implemented this study design in a causal inference setting using data from the UK Biobank. We simulated PRS and MR under three scenarios: (1) using weighted SNP estimates from an external GWAS, (2) using weighted SNP estimates from an overlapping GWAS sample and (3) using a block jackknife resampling framework. Based on a P-value threshold to derive genetic instruments for MR studies (P < 5 × 10-8) and a 10% variance in the exposure explained by all SNPs, block-jackknifing PRS did not suffer from overfitting bias (mean R2 = 0.034) compared with the externally weighted PRS (mean R2 = 0.040). In contrast, genetic instruments derived from overlapping samples explained a higher variance (mean R2 = 0.048) compared with the externally derived score. Overfitting became considerably more severe when using a more liberal P-value threshold to construct PRS (e.g. P < 0.05, overlapping sample PRS mean R2 = 0.103, externally weighted PRS mean R2 = 0.086), whereas estimates using jackknife score remained robust to overfitting (mean R2 = 0.084). Using block jackknife resampling MR in an applied analysis, we examined the effects of body mass index on circulating biomarkers which provided comparable estimates to an externally weighted instrument, whereas the overfitted scores typically provided narrower confidence intervals. Furthermore, we extended this framework into sex-stratified, multivariate and bidirectional settings to investigate the effect of childhood body size on adult testosterone levels.

References Powered by Scopus

Second-generation PLINK: Rising to the challenge of larger and richer datasets

7134Citations
N/AReaders
Get full text

UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age

6874Citations
N/AReaders
Get full text

The UK Biobank resource with deep phenotyping and genomic data

4592Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Constructing an atlas of associations between polygenic scores from across the human phenome and circulating metabolic biomarkers

13Citations
N/AReaders
Get full text

Low levels of small HDL particles predict but do not influence risk of sepsis

7Citations
N/AReaders
Get full text

Exploring pleiotropy in Mendelian randomisation analyses: What are genetic variants associated with ‘cigarette smoking initiation’ really capturing?

1Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Fang, S., Hemani, G., Richardson, T. G., Gaunt, T. R., & Davey Smith, G. (2023). Evaluating and implementing block jackknife resampling Mendelian randomization to mitigate bias induced by overlapping samples. Human Molecular Genetics, 32(2), 192–203. https://doi.org/10.1093/hmg/ddac186

Readers' Seniority

Tooltip

Researcher 7

47%

PhD / Post grad / Masters / Doc 6

40%

Professor / Associate Prof. 2

13%

Readers' Discipline

Tooltip

Biochemistry, Genetics and Molecular Bi... 6

50%

Medicine and Dentistry 3

25%

Agricultural and Biological Sciences 2

17%

Psychology 1

8%

Save time finding and organizing research with Mendeley

Sign up for free