Authors

Theo A Knijnenburg, Institute for Systems Biology, Seattle, US.Follow
Joseph G Vockley
Nyasha Chambwe, Institute for Systems Biology, Seattle, WA 98109.Follow
David L Gibbs, Institute for Systems Biology, Seattle, Washington.Follow
Crystal Humphries, Institute for Systems Biology, Seattle, WA 98109.
Kathi C Huddleston
Elisabeth Klein
Prachi Kothiyal
Ryan Tasseff, Institute for Systems Biology, Seattle, WA 98109.
Varsha Dhankani, Institute for Systems Biology, Seattle, WA 98109.Follow
Dale L Bodian
Wendy S W Wong
Gustavo Glusman, Institute for Systems Biology, Seattle, Washington, United States of America.Follow
Denise E Mauldin, Institute for Systems Biology, Seattle, WA 98109.
Michael Miller, Institute for Systems Biology, Seattle, WA 98109.
Joseph Slagel, Institute for Systems Biology, Seattle, WA 98109.Follow
Summer Elasady, Institute for Systems Biology, Seattle, WA 98109.
Jared C Roach, Institute for Systems Biology, Seattle, WA 98109.Follow
Roger Kramer, Institute for Systems Biology, Seattle, WA 98109.
Kalle Leinonen, Institute for Systems Biology, Seattle, WA 98109.
Jasper Linthorst, Institute for Systems Biology, Seattle, WA 98109.
Rajiv Baveja
Robin Baker
Benjamin D Solomon
Greg Eley
Ramaswamy K Iyer
George L Maxwell
Brady Bernard, Institute for Systems Biology, Seattle, WA 98109.Follow
Ilya Shmulevich, Institute for Systems Biology, Seattle, WA 98109.Follow
Leroy Hood, Institute for Systems Biology, Seattle, Washington, United States of America.Follow
John E Niederhuber

Document Type

Article

Publication Date

3-19-2019

Publication Title

Proceedings of the National Academy of Sciences of the United States of America

Keywords

family trios; genomic variants; integrative computational analysis; preterm birth; whole genome sequencing

Abstract

Preterm birth (PTB) complications are the leading cause of long-term morbidity and mortality in children. By using whole blood samples, we integrated whole-genome sequencing (WGS), RNA sequencing (RNA-seq), and DNA methylation data for 270 PTB and 521 control families. We analyzed this combined dataset to identify genomic variants associated with PTB and secondary analyses to identify variants associated with very early PTB (VEPTB) as well as other subcategories of disease that may contribute to PTB. We identified differentially expressed genes (DEGs) and methylated genomic loci and performed expression and methylation quantitative trait loci analyses to link genomic variants to these expression and methylation changes. We performed enrichment tests to identify overlaps between new and known PTB candidate gene systems. We identified 160 significant genomic variants associated with PTB-related phenotypes. The most significant variants, DEGs, and differentially methylated loci were associated with VEPTB. Integration of all data types identified a set of 72 candidate biomarker genes for VEPTB, encompassing genes and those previously associated with PTB. Notably, PTB-associated genes RAB31 and RBPJ were identified by all three data types (WGS, RNA-seq, and methylation). Pathways associated with VEPTB include EGFR and prolactin signaling pathways, inflammation- and immunity-related pathways, chemokine signaling, IFN-γ signaling, and Notch1 signaling. Progress in identifying molecular components of a complex disease is aided by integrated analyses of multiple molecular data types and clinical data. With these data, and by stratifying PTB by subphenotype, we have identified associations between VEPTB and the underlying biology.

Clinical Institute

Women & Children

Department

Institute for Systems Biology

Department

Perinatology/Neonatology

Share

COinS