← Back to search

Comprehensive RNA-binding protein analyses and deep learning uncover genetic constraints and disease associations in protein-RNA interfaces.

Cell systems · 2026 · pp. 101588

Abstract

RNA-binding proteins (RBPs) orchestrate post-transcriptional processes, including splicing, cleavage and polyadenylation, and translation. Our updated RBP resource integrates data from 92 additional RBPs (286 in total) profiled by enhanced CLIP (eCLIP), enabling comprehensive characterization of RNA elements within human K562 and HepG2 cells. To interrogate RBP-binding syntax, we trained deep-learning models on eCLIP profiles, allowing us to score genetic variants and quantify constraints on RBP-binding sites. We observed opposing selective-constraint profiles at splicing enhancers versus silencers, including an unexpected enrichment of strengthening mutations in ELAVL1- and HNRNPC-binding sites. Finally, our model prioritizes disease variants, exposing unexpected RBP-related mechanisms of pathogenesis, exemplified by the enrichment of weakening mutations in spliceosomal protein-binding sites among retinal disease variants. The complete eCLIP resource offers an integrated platform for exploring RBP-RNA interactomes.

Funding

U24 HG009889 NHGRI NIH HHS (United States)
R01 HG004659 NHGRI NIH HHS (United States)
RF1 MH126719 NIMH NIH HHS (United States)
R01 HG011864 NHGRI NIH HHS (United States)

Linked Datasets (3)

GSE315347 GSE via ncbi_elink
GEO

Comprehensive RNA-binding protein analyses using enhanced CLIP (ENCORE) [dataset1]

GSE315406 GSE via ncbi_elink
GEO

Comprehensive RNA-binding protein analyses using enhanced CLIP (ENCORE) [dataset2]

GSE315407 GSE via ncbi_elink
GEO

Comprehensive RNA-binding protein analyses using enhanced CLIP (ENCORE)

Analysis Pipelines (3)

eCLIP geo_data_processing GSE315347
eCLIP geo_data_processing GSE315406
eCLIP geo_data_processing GSE315407