Expanded encyclopaedias of DNA elements in the human and mouse genomes.

Nature · 2020 · Vol. 583 (7818) · pp. 699-710

Abstract

The human and mouse genomes contain instructions that specify RNAs and proteins and govern the timing, magnitude, and cellular context of their production. To better delineate these elements, phase III of the Encyclopedia of DNA Elements (ENCODE) Project has expanded analysis of the cell and tissue repertoires of RNA transcription, chromatin structure and modification, DNA methylation, chromatin looping, and occupancy by transcription factors and RNA-binding proteins. Here we summarize these efforts, which have produced 5,992 new experimental datasets, including systematic determinations across mouse fetal development. All data are available through the ENCODE data portal (https://www.encodeproject.org), including phase II ENCODE¹ and Roadmap Epigenomics² data. We have developed a registry of 926,535 human and 339,815 mouse candidate cis-regulatory elements, covering 7.9 and 3.4% of their respective genomes, by integrating selected datatypes associated with gene regulation, and constructed a web-based server (SCREEN; http://screen.encodeproject.org) to provide flexible, user-defined access to this resource. Collectively, the ENCODE data and registry provide an expansive resource for the scientific community to build a better understanding of the organization and function of the human and mouse genomes.

Publication Types

Journal Article; Research Support, N.I.H., Extramural

MeSH Terms

Animals; Chromatin; DNA; DNA Footprinting; DNA Methylation; DNA Replication Timing; Databases, Genetic; Deoxyribonuclease I; Genome; Genome, Human; Genomics; Histones; Humans; Mice; Mice, Transgenic; Molecular Sequence Annotation; RNA-Binding Proteins; Registries; Regulatory Sequences, Nucleic Acid; Transcription, Genetic; Transposases

Funding

P30 CA014195 NCI NIH HHS (United States)
P30 CA008748 NCI NIH HHS (United States)
K99 HG009530 NHGRI NIH HHS (United States)
U41 HG007000 NHGRI NIH HHS (United States)
UM1 HG009411 NHGRI NIH HHS (United States)
U24 HG009446 NHGRI NIH HHS (United States)
T32 GM087237 NIGMS NIH HHS (United States)
U54 HG007005 NHGRI NIH HHS (United States)
U54 HG006991 NHGRI NIH HHS (United States)
F32 HG006993 NHGRI NIH HHS (United States)
U01 HG007036 NHGRI NIH HHS (United States)
UM1 HG009390 NHGRI NIH HHS (United States)
UM1 HG009442 NHGRI NIH HHS (United States)
U54 HG007004 NHGRI NIH HHS (United States)
U01 HG009380 NHGRI NIH HHS (United States)
U01 HG007033 NHGRI NIH HHS (United States)
U54 HG006996 NHGRI NIH HHS (United States)
U01 HG009431 NHGRI NIH HHS (United States)
R01 HG003143 NHGRI NIH HHS (United States)
U01 HG007037 NHGRI NIH HHS (United States)
R01 HG012367 NHGRI NIH HHS (United States)
U54 HG007002 NHGRI NIH HHS (United States)
U41 HG006992 NHGRI NIH HHS (United States)
T32 HG000044 NHGRI NIH HHS (United States)
P30 CA045508 NCI NIH HHS (United States)
R01 DK068634 NIDDK NIH HHS (United States)
U01 HG007019 NHGRI NIH HHS (United States)
U54 HG007010 NHGRI NIH HHS (United States)
R24 DK106766 NIDDK NIH HHS (United States)
U54 HG006994 NHGRI NIH HHS (United States)
U54 HG006997 NHGRI NIH HHS (United States)
R01 GM083337 NIGMS NIH HHS (United States)
R37 DK050107 NIDDK NIH HHS (United States)
U54 HG006998 NHGRI NIH HHS (United States)