Mudskipper detects combinatorial RNA binding protein interactions in multiplexed CLIP data.
Abstract
The uncovering of protein-RNA interactions enables a deeper understanding of RNA processing. Recent multiplexed crosslinking and immunoprecipitation (CLIP) technologies such as antibody-barcoded eCLIP (ABC) dramatically increase the throughput of mapping RNA binding protein (RBP) binding sites. However, multiplex CLIP datasets are multivariate, and each RBP suffers non-uniform signal-to-noise ratio. To address this, we developed Mudskipper, a versatile computational suite comprising two components: a Dirichlet multinomial mixture model to account for the multivariate nature of ABC datasets and a softmasking approach that identifies and removes non-specific protein-RNA interactions in RBPs with low signal-to-noise ratio. Mudskipper demonstrates superior precision and recall over existing tools on multiplex datasets and supports analysis of repetitive elements and small non-coding RNAs. Our findings unravel splicing outcomes and variant-associated disruptions, enabling higher-throughput investigations into diseases and regulation mediated by RBPs.
Publication Types
MeSH Terms
Funding
Potentially Related Datasets (7)
These accessions were text-mined from the PMC full text. They may be referenced for comparison, cited from other studies, or otherwise mentioned without being primary data for this paper.
Multiplexed transcriptome discovery of RNA binding protein binding sites by antibody-barcode eCLIP
eCLIP control experiment on K562 against SLBP
shRNA knockdown against PRPF8 in K562 cells followed by RNA-seq. (PRPF8_BGKLV19)
shRNA knockdown against SF3B4 in K562 cells followed by RNA-seq. (SF3B4-LV08)