Skip to main navigation Skip to search Skip to main content

Multiomics and deep learning dissect regulatory syntax in human development

  • Betty B. Liu
  • , Selin Jessa
  • , Samuel H. Kim
  • , Yan Ting Ng
  • , Soon Il Higashino
  • , Georgi K. Marinov
  • , Derek C. Chen
  • , Benjamin E. Parks
  • , Li Li
  • , Tri C. Nguyen
  • , Austin T. Wang
  • , Sean K. Wang
  • , Meng How Tan
  • , Serena Y. Tan
  • , Michael Kosicki
  • , Len A. Pennacchio
  • , Eyal Ben-David
  • , Anca M. Pasca
  • , Anshul Kundaje*
  • , Kyle K.H. Farh*
  • William J. Greenleaf*
*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Transcription factors establish cell identity during development by binding regulatory DNA in a sequence-specific manner, often promoting local chromatin accessibility and regulating gene expression1. Mapping accessible chromatin offers critical insights into transcriptional control, but available datasets for human development are restricted to bulk tissue, single organs or single modalities2. Here we present the Human Development Multiomic Atlas, a single-cell atlas of chromatin accessibility and gene expression from 817,740 fetal cells across 12 organs, spanning 203 cell types and more than 1 million candidate cis-regulatory elements, many of which exhibit organ-specific in vivo enhancer activity. Deep learning models trained to predict accessibility from local DNA sequence unravel a comprehensive lexicon of motifs that influence accessibility, including composite motifs exhibiting distinct syntactic constraints that are predicted to mediate transcription factor cooperativity. We identify ‘hard’ syntactic rules requiring precise motif spacing and orientation, ‘soft’ rules allowing flexible motif arrangements, and ubiquitous motifs inhibiting accessibility. Model-based interpretation of genetic variants reveals that disruption of motifs with positive and negative effects is associated with concordant effects on gene expression. Our work delineates how motif syntax governs cell-type-specific chromatin accessibility and provides a foundational resource for decoding cis-regulatory logic and interpreting genetic variation during human development.

Original languageEnglish
JournalNature
DOIs
StateAccepted/In press - 2026
Externally publishedYes

Bibliographical note

Publisher Copyright:
© The Author(s) 2026.

Fingerprint

Dive into the research topics of 'Multiomics and deep learning dissect regulatory syntax in human development'. Together they form a unique fingerprint.

Cite this