Inclusion bias affects common variant discovery and replication in a health-system linked biobank
We quantify inclusion bias in a health-system-linked biobank using classification models to distinguish enrolled individuals from the background population. To evaluate its impact on genetic findings observed in biobanks, we reweight analyses by enrollment probability and find increased replication rates of known variant-trait associations and altered polygenic associations.
