GSE230717
GSE GEOLarge-scale map of RNA binding protein interactomes across the mRNA life-cycle
Relations
Summary
RNA-binding proteins (RBPs) target RNA in a context-dependent manner to regulate gene expression. Protein-protein interaction (PPI) maps of RBP complexes and networks are critical for defining RBP function and RNA targeting. Yet, PPI networks under-represent RBP baits and need more information about RNA-driven interactions. Therefore, we generated an RNA-aware, RBP-interactome map combining two strategies that use systematic proteomic methods to identify 1) protein interactions using co-immunoprecipitation in the presence or absence of RNase treatment of a ~100 RBPs across the RNA life-cycle and 2) RNA-dependent complexes using Size Exclusion Chromatography. Together, this dataset provides proteome-wide, cell-type specific, and quantitative identification of RNA-protein interactions across multiple RNA processing events. In the resulting PPI network, several hundred database-supported interactions establish many complexes operating at each of these mRNA life-cycle stages. Nearly a thousand novel interactions imply new functions for RBPs across multiple steps of the RNA life cycle. Overlapping our network with eCLIP data, we find RNA targets between interactors and uncover complex-driven binding signatures. Betweenness-centrality scores identify multi-functional RBPs that participate across multiple mRNA life-cycle steps. We characterize the novel interactions and functions of different classes of multi-functional proteins. We find the scaffolding protein, ERH, interacts with numerous nuclear speckle proteins and facilitates splicing and mRNA export. Finally, we show that the splicing factor, SNRNP200, interacts with nuclear export, localization, and translation proteins and is an essential factor of RNA granule formation during stress. Our large-scale RBP interaction network provides new insights and a valuable resource for exploring new RBP complexes operating to control gene expression.
Overall Design
eCLIP-seq of human SNRNP200, CAPRIN, and G3BP1 eCLIP of RBP of interest. Each sample has an input and IP sample
Analysis (9 steps)
View Data Processing- Data was processed using the eCLIP pipeline and available at https://github.com/YeoLab/skipper
- Unique Molecular Identifiers (UMIs) were extracted using fastp 0.11.5 (https://github.com/OpenGene/fastp)
- Post-umi-extracted reads were trimmed for adapter sequences and barcode sequences (eCLIP samples)
- Mapping was then performed against the full human genome (hg38) including a database of splice junctions with STAR (v 2.7.6) allowing up to 100 multimapped regions.
- Reads were then PCR deduplicated using UMIcollaps (https://github.com/Daniel-Liu-c0deb0t/UMICollapse
- Enriched âwindowsâ (IP versus SM-Input) was called on deduplicated reads using a GC-bias aware beta-binomial model.
- Each window (~100 b.p.) were partitioned from Gencode v38 and associated with a specific type of genomic region (CDS, UTR, proximal introns near splice site⦠etc).
- Windows are filtered using FDR < 0.2 and only reproducible windows between two replicates were used.
Supplementary Files (6)
Dataset Citations (1)
SRA Experiments (24) and Runs (24)
Total: 15788 MBSample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321529 | 22797958 | 1709846850 | 575.66 | HEK-Input-SNRNP200_S15_L001_R1_001.fastq.gz, SRR24321529, SRR24321529… | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321528 | 30112202 | 3041332402 | 930.21 | HEK-baseline-SNRNP200-Input_S22_L002_R1_001.fastq.gz, SRR24321528, SR… | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321527 | 19855426 | 2005398026 | 673.74 | SNRNP200_cytoplasmic_control_in1_S152_L001_R1_001.fastq.gz, SRR243215… | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321526 | 19556555 | 1975212055 | 662.58 | SNRNP200_cytoplasmic_control_in2_S154_L001_R1_001.fastq.gz, SRR243215… | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321525 | 12796123 | 1292408423 | 445.5 | CAPRIN_US_IN1_S164_L001_R1_001.fastq.gz, SRR24321525, SRR24321525.lite | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321524 | 15998602 | 1615858802 | 552.8 | CAPRIN_US_IN2_S165_L001_R1_001.fastq.gz, SRR24321524, SRR24321524.lite | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321523 | 12137799 | 910334925 | 354.16 | KR-A-IN_S50_L002_R1_001.fastq.gz, SRR24321523, SRR24321523.lite | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321522 | 10457974 | 784348050 | 306.46 | KR-B-IN_S52_L002_R1_001.fastq.gz, SRR24321522, SRR24321522.lite | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321521 | 20750336 | 1556275200 | 523.65 | HEK-Stress-Input-SNRNP200_S3_L001_R1_001.fastq.gz, SRR24321521, SRR24… | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321520 | 30036610 | 3033697610 | 923.03 | HEK-stress-SNRNP200-Input_S26_L002_R1_001.fastq.gz, SRR24321520, SRR2… | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321519 | 26211402 | 2647351602 | 879.62 | SNRNP200_cytoplasmic_arsenite_in1_S156_L001_R1_001.fastq.gz, SRR24321… | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321518 | 27399709 | 2767370609 | 927.35 | SNRNP200_cytoplasmic_arsenite_in2_S158_L001_R1_001.fastq.gz, SRR24321… | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321517 | 21091403 | 1581855225 | 534.97 | HEK-IP-SNRNP200_S16_L001_R1_001.fastq.gz, SRR24321517, SRR24321517.li… | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321516 | 34778360 | 3512614360 | 1056.61 | HEK-baseline-SNRNP200-IP_S23_L002_R1_001.fastq.gz, SRR24321516, SRR24… | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321515 | 21709357 | 2192645057 | 733.46 | SNRNP200_cytoplasmic_control_ip1_S153_L001_R1_001.fastq.gz, SRR243215… | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321514 | 18900571 | 1908957671 | 636.92 | SNRNP200_cytoplasmic_control_ip2_S155_L001_R1_001.fastq.gz, SRR243215… | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321513 | 16064937 | 1622558637 | 558.73 | CAPRIN_US_IP1_S172_L001_R1_001.fastq.gz, SRR24321513, SRR24321513.lite | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321512 | 16773273 | 1694100573 | 581.88 | CAPRIN_US_IP2_S173_L001_R1_001.fastq.gz, SRR24321512, SRR24321512.lite | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321511 | 4789369 | 359202675 | 143.54 | KR-A-IP_S49_L002_R1_001.fastq.gz, SRR24321511, SRR24321511.lite | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321510 | 9784369 | 733827675 | 293.08 | KR-B-IP_S51_L002_R1_001.fastq.gz, SRR24321510, SRR24321510.lite | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321509 | 24971886 | 1872891450 | 642.89 | HEK-Stress-IP-SNRNP200_S4_L001_R1_001.fastq.gz, SRR24321509, SRR24321… | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321508 | 26947857 | 2721733557 | 831.58 | HEK-stress-SNRNP200-IP_S27_L002_R1_001.fastq.gz, SRR24321508, SRR2432… | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321507 | 28571341 | 2885705441 | 964.99 | SNRNP200_cytoplasmic_arsenite_ip1_S157_L001_R1_001.fastq.gz, SRR24321… | SRA |
Sample attributes
Original files (1)
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR24321506 | 31017987 | 3132816687 | 1054.32 | SNRNP200_cytoplasmic_arsenite_ip2_S159_L001_R1_001.fastq.gz, SRR24321… | SRA |
Linked Publications (2)
Data Files (26)
| Accession | File Name | Stored Type | Output Type | Mapping Assembly | Size | Download | |
|---|---|---|---|---|---|---|---|
| — | CAPRIN_US_IN1_S164_L001_R1_001.fastq.gz | RIP-Seq | 445.5 MB | link | |||
| — | CAPRIN_US_IN2_S165_L001_R1_001.fastq.gz | RIP-Seq | 552.8 MB | link | |||
| — | CAPRIN_US_IP1_S172_L001_R1_001.fastq.gz | RIP-Seq | 558.7 MB | link | |||
| — | CAPRIN_US_IP2_S173_L001_R1_001.fastq.gz | RIP-Seq | 581.9 MB | link | |||
| — | HEK-baseline-SNRNP200-Input_S22_L002_R1_001.fastq.gz | RIP-Seq | 930.2 MB | link | |||
| — | HEK-baseline-SNRNP200-IP_S23_L002_R1_001.fastq.gz | RIP-Seq | 1.0 GB | link | |||
| — | HEK-Input-SNRNP200_S15_L001_R1_001.fastq.gz | RIP-Seq | 575.7 MB | link | |||
| — | HEK-IP-SNRNP200_S16_L001_R1_001.fastq.gz | RIP-Seq | 535.0 MB | link | |||
| — | HEK-Stress-Input-SNRNP200_S3_L001_R1_001.fastq.gz | RIP-Seq | 523.6 MB | link | |||
| — | HEK-Stress-IP-SNRNP200_S4_L001_R1_001.fastq.gz | RIP-Seq | 642.9 MB | link | |||
| — | HEK-stress-SNRNP200-Input_S26_L002_R1_001.fastq.gz | RIP-Seq | 923.0 MB | link | |||
| — | HEK-stress-SNRNP200-IP_S27_L002_R1_001.fastq.gz | RIP-Seq | 831.6 MB | link | |||
| — | KR-A-IN_S50_L002_R1_001.fastq.gz | RIP-Seq | 354.2 MB | link | |||
| — | KR-A-IP_S49_L002_R1_001.fastq.gz | RIP-Seq | 143.5 MB | link | |||
| — | KR-B-IN_S52_L002_R1_001.fastq.gz | RIP-Seq | 306.5 MB | link | |||
| — | KR-B-IP_S51_L002_R1_001.fastq.gz | RIP-Seq | 293.1 MB | link | |||
| — | SNRNP200_cytoplasmic_arsenite_in1_S156_L001_R1_001.fas… | RIP-Seq | 879.6 MB | link | |||
| — | SNRNP200_cytoplasmic_arsenite_in2_S158_L001_R1_001.fas… | RIP-Seq | 927.3 MB | link | |||
| — | SNRNP200_cytoplasmic_arsenite_ip1_S157_L001_R1_001.fas… | RIP-Seq | 965.0 MB | link | |||
| — | SNRNP200_cytoplasmic_arsenite_ip2_S159_L001_R1_001.fas… | RIP-Seq | 1.0 GB | link | |||
| — | SNRNP200_cytoplasmic_control_in1_S152_L001_R1_001.fast… | RIP-Seq | 673.7 MB | link | |||
| — | SNRNP200_cytoplasmic_control_in2_S154_L001_R1_001.fast… | RIP-Seq | 662.6 MB | link | |||
| — | SNRNP200_cytoplasmic_control_ip1_S153_L001_R1_001.fast… | RIP-Seq | 733.5 MB | link | |||
| — | SNRNP200_cytoplasmic_control_ip2_S155_L001_R1_001.fast… | RIP-Seq | 636.9 MB | link | |||
| — | SRR24321506.lite | RIP-Seq | 1.0 GB | link | |||
| — | SRR24321507.lite | RIP-Seq | 965.0 MB | link |