Hartman Institute for Therapeutic Organ Regeneration

Modeling islet enhancers using deep learning identifies candidate causal variants at loci associated with T2D and glycemic traits.

TitleModeling islet enhancers using deep learning identifies candidate causal variants at loci associated with T2D and glycemic traits.
Publication TypeJournal Article
Year of Publication2023
AuthorsHudaiberdiev S, D Taylor L, Song W, Narisu N, Bhuiyan RM, Taylor HJ, Tang X, Yan T, Swift AJ, Bonnycastle LL, Consortium D, Chen S, Stitzel ML, Erdos MR, Ovcharenko I, Collins FS
JournalProc Natl Acad Sci U S A
Volume120
Issue35
Paginatione2206612120
Date Published2023 Aug 29
ISSN1091-6490
KeywordsComputer Simulation, Deep Learning, Diabetes Mellitus, Type 2, Enhancer Elements, Genetic, Genetic Variation, Humans, Islets of Langerhans
Abstract

<p>Genetic association studies have identified hundreds of independent signals associated with type 2 diabetes (T2D) and related traits. Despite these successes, the identification of specific causal variants underlying a genetic association signal remains challenging. In this study, we describe a deep learning (DL) method to analyze the impact of sequence variants on enhancers. Focusing on pancreatic islets, a T2D relevant tissue, we show that our model learns islet-specific transcription factor (TF) regulatory patterns and can be used to prioritize candidate causal variants. At 101 genetic signals associated with T2D and related glycemic traits where multiple variants occur in linkage disequilibrium, our method nominates a single causal variant for each association signal, including three variants previously shown to alter reporter activity in islet-relevant cell types. For another signal associated with blood glucose levels, we biochemically test all candidate causal variants from statistical fine-mapping using a pancreatic islet beta cell line and show biochemical evidence of allelic effects on TF binding for the model-prioritized variant. To aid in future research, we publicly distribute our model and islet enhancer perturbation scores across ~67 million genetic variants. We anticipate that DL methods like the one presented in this study will enhance the prioritization of candidate causal variants for functional studies.</p>

DOI10.1073/pnas.2206612120
Alternate JournalProc Natl Acad Sci U S A
PubMed ID37603758
PubMed Central IDPMC10469333
Grant ListZIA HG000024 / ImNIH / Intramural NIH HHS / United States
ZIA LM200881 / ImNIH / Intramural NIH HHS / United States
R01 DK118011 / DK / NIDDK NIH HHS / United States

Weill Cornell Medicine
Hartman Institute for Therapeutic Organ Regeneration
1300 York Ave, Box 136 New York, NY 10065