← All datasets

GSE262542

GSE GEO
View on GEO Export SRA CSV

Evaluation of novel computational methods that identify RNA-binding protein footprints from structural data

Organism: Homo sapiens
Platform: GPL24676
Samples: 20
Experiment Types:
Expression profiling by high throughput sequencing
Submitted: Mar 26 2024
Last Updated: May 23 2025
Status: Public on May 23 2025
Contact: Brian,,Yee (UCSD)

Relations

BioProject: https://www.ncbi.nlm.nih.gov/bioproject/PRJNA1092254

Summary

RNA binding proteins (RBP) play diverse roles in mRNA processing and function. However, from over 1,000 RBPs encoded in the human genome, a detailed molecular understanding of their interactions with RNA is available only for a small fraction. In most cases, our knowledge of the combination of RNA sequence and structure required for specific binding is insufficient for enabling exhaustive prediction of binding sites transcriptome-wide. In that context, the rapidly expanding collection of transcriptomic datasets that map distinct, yet intertwined post-transcriptional marks, such as RNA structure and RBP binding, presents an opportunity to integratively analyze them in order to better characterize binding. A grand challenge faced by our community is that relatively little information on the structural context of RNA-protein interactions has been gleaned from integrating such datasets, partially due to lack of suitable methods. To engage scientists from diverse backgrounds in addressing this gap, the RNA Society organized the RBP Footprint Grand Challenge⸺an international community effort to develop new methods or leverage existing ones for predicting RBP binding sites through analysis of a growing volume of sequence, structure, and binding data and to experimentally validate select predictions. Here, we report the initiative, analyses and methods developed by the participants, validation results, and several new in vivo binding datasets generated for validation. We hope this work will inspire additional innovation in computational methods, further utilization of available data resources, and future endeavors to engage the community in collaborating towards closing other critical data analysis gaps.

Overall Design

eCLIP for four RBPs in five cell types was conducted in replicates for each condition.

Analysis (4 steps)

View Data Processing
Processing steps for GSE262542
  1. data processing was done using the Skipper pipeline, freelly available at https://github.com/yeolab/skipper.
  2. Adapters trimming was done with Skewer
  3. proccessed reads were mapped with STAR (2.7.10a_alpha_220314)
  4. PCR bias was removed using UMIcollapse

Supplementary Files (5)

GSE262542_PRPF17_HeLa.tsv.gz Download
GSE262542_PRPF17_Hek293T.tsv.gz Download
GSE262542_SND1_K562.tsv.gz Download
GSE262542_hnRNPA2B1_HepG2.tsv.gz Download
GSE262542_hnRNPC_K562.tsv.gz Download
GEO Samples (20)

Dataset Citations (1)

Evaluation of novel computational methods to identify RNA-binding protein footprints from structural data.
PMID 40399037 · 2025 · RNA (New York, N.Y.)
Orel Mizrahi, Meredith Corley, Ori Feldman, Thorben Fröhlking, Lei Sun, Alison Ziesel, Maciej Antczak, Mattia Bernetti, Shaimae I Elhajjajy, Wenze Huang, Grady G Nguyen, Samuel S Park, Raul I Perez Martell, Luke Trinity, Kui Xu, Tomasz Zok, Giovanni Bussi, Hosna Jabbari, Yaron Orenstein, Sharon Aviran, Michelle M Meyer, Gene W Yeo

SRA Experiments (20) and Runs (20)

Total: 24023 MB
SRX24068016 SRP497980 RNA-Seq SINGLE
GSM8171514: hnRNPA2B1_HepG2_Input_rep1; Homo sapiens; RNA-Seq
Sample: SRS20858732
BioProject: PRJNA1092254
BioSample: SAMN40622134
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: HepG2
cell line: HepG2
cell type: Epithelial-like cells
geo_loc_name: missing
collection_date: missing
Original files (1)
HepG2
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR28464941 23149673 3495600623 1161.83 hnRNPA2B1_HepG2_IN1.fastq.gz, SRR28464941, SRR28464941.lite SRA
SRX24068017 SRP497980 RNA-Seq SINGLE
GSM8171515: hnRNPA2B1_HepG2_Input_rep2; Homo sapiens; RNA-Seq
Sample: SRS20858733
BioProject: PRJNA1092254
BioSample: SAMN40622133
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: HepG2
cell line: HepG2
cell type: Epithelial-like cells
geo_loc_name: missing
collection_date: missing
Original files (1)
HepG2
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR28464940 27062499 4086437349 1367.17 hnRNPA2B1_HepG2_IN2.fastq.gz, SRR28464940, SRR28464940.lite SRA
SRX24068018 SRP497980 RNA-Seq SINGLE
GSM8171516: hnRNPA2B1_HepG2_IP_rep1; Homo sapiens; RNA-Seq
Sample: SRS20858734
BioProject: PRJNA1092254
BioSample: SAMN40622132
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: HepG2
cell line: HepG2
cell type: Epithelial-like cells
geo_loc_name: missing
collection_date: missing
Original files (1)
HepG2
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR28464939 28583604 4316124204 1429.9 hnRNPA2B1_HepG2_IP1.fastq.gz, SRR28464939, SRR28464939.lite SRA
SRX24068019 SRP497980 RNA-Seq SINGLE
GSM8171517: hnRNPA2B1_HepG2_IP_rep2; Homo sapiens; RNA-Seq
Sample: SRS20858735
BioProject: PRJNA1092254
BioSample: SAMN40622131
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: HepG2
cell line: HepG2
cell type: Epithelial-like cells
geo_loc_name: missing
collection_date: missing
Original files (1)
HepG2
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR28464938 29492329 4453341679 1485.6 hnRNPA2B1_HepG2_IP2.fastq.gz, SRR28464938, SRR28464938.lite SRA
SRX24068020 SRP497980 RNA-Seq SINGLE
GSM8171518: hnRNPC_K562_Input_rep1; Homo sapiens; RNA-Seq
Sample: SRS20858736
BioProject: PRJNA1092254
BioSample: SAMN40622130
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: K562
cell line: K562
cell type: lymphoblast
geo_loc_name: missing
collection_date: missing
Original files (1)
K562
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR32612463 28674616 2896136216 880.87 HNRNPC_K562_In1.fastq.gz, SRR32612463, SRR32612463.lite SRA
SRX24068021 SRP497980 RNA-Seq SINGLE
GSM8171519: hnRNPC_K562_Input_rep2; Homo sapiens; RNA-Seq
Sample: SRS20858737
BioProject: PRJNA1092254
BioSample: SAMN40622129
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: K562
cell line: K562
cell type: lymphoblast
geo_loc_name: missing
collection_date: missing
Original files (1)
K562
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR32612464 29270718 2956342518 890.34 HNRNPC_K562_In2.fastq.gz, SRR32612464, SRR32612464.lite SRA
SRX24068022 SRP497980 RNA-Seq SINGLE
GSM8171520: hnRNPC_K562_IP_rep1; Homo sapiens; RNA-Seq
Sample: SRS20858738
BioProject: PRJNA1092254
BioSample: SAMN40622128
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: K562
cell line: K562
cell type: lymphoblast
geo_loc_name: missing
collection_date: missing
Original files (1)
K562
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR32612465 28017485 2829765985 829.7 HNRNPC_K562_IP1.fastq.gz, SRR32612465, SRR32612465.lite SRA
SRX24068023 SRP497980 RNA-Seq SINGLE
GSM8171521: hnRNPC_K562_IP_rep2; Homo sapiens; RNA-Seq
Sample: SRS20858739
BioProject: PRJNA1092254
BioSample: SAMN40622127
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: K562
cell line: K562
cell type: lymphoblast
geo_loc_name: missing
collection_date: missing
Original files (1)
K562
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR32612466 26791104 2705901504 795.74 HNRNPC_K562_IP2.fastq.gz, SRR32612466, SRR32612466.lite SRA
SRX24068024 SRP497980 RNA-Seq SINGLE
GSM8171522: PRPF17_Hek293T_Input_rep1; Homo sapiens; RNA-Seq
Sample: SRS20858740
BioProject: PRJNA1092254
BioSample: SAMN40622126
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: HEK293T
cell line: HEK293T
cell type: human embryonic kidney
geo_loc_name: missing
collection_date: missing
Original files (1)
HEK293T
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR28464933 160080994 16168180394 5255.94 PRPF17_Hek293T_In1.fastq.gz, SRR28464933, SRR28464933.lite SRA
SRX24068025 SRP497980 RNA-Seq SINGLE
GSM8171523: PRPF17_Hek293T_Input_rep2; Homo sapiens; RNA-Seq
Sample: SRS20858741
BioProject: PRJNA1092254
BioSample: SAMN40622125
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: HEK293T
cell line: HEK293T
cell type: human embryonic kidney
geo_loc_name: missing
collection_date: missing
Original files (1)
HEK293T
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR28464932 18038918 1821930718 602.19 PRPF17_Hek293T_In2.fastq.gz, SRR28464932, SRR28464932.lite SRA
SRX24068026 SRP497980 RNA-Seq SINGLE
GSM8171524: PRPF17_Hek293T_IP_rep1; Homo sapiens; RNA-Seq
Sample: SRS20858742
BioProject: PRJNA1092254
BioSample: SAMN40622124
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: HEK293T
cell line: HEK293T
cell type: human embryonic kidney
geo_loc_name: missing
collection_date: missing
Original files (1)
HEK293T
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR28464931 18818781 1900696881 682.72 PRPF17_Hek293T_IP1.fastq.gz, SRR28464931, SRR28464931.lite SRA
SRX24068027 SRP497980 RNA-Seq SINGLE
GSM8171525: PRPF17_Hek293T_IP_rep2; Homo sapiens; RNA-Seq
Sample: SRS20858743
BioProject: PRJNA1092254
BioSample: SAMN40622123
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: HEK293T
cell line: HEK293T
cell type: human embryonic kidney
geo_loc_name: missing
collection_date: missing
Original files (1)
HEK293T
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR28464930 16522751 1668797851 567.93 PRPF17_Hek293T_IP2.fastq.gz, SRR28464930, SRR28464930.lite SRA
SRX24068028 SRP497980 RNA-Seq SINGLE
GSM8171526: PRPF17_HeLa_Input_rep1; Homo sapiens; RNA-Seq
Sample: SRS20858744
BioProject: PRJNA1092254
BioSample: SAMN40622122
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: HeLa
cell line: HeLa
cell type: epithelial cell
geo_loc_name: missing
collection_date: missing
Original files (1)
HeLa
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR28464929 13676148 1381290948 466.03 PRPF17_HeLa_In1.fastq.gz, SRR28464929, SRR28464929.lite SRA
SRX24068029 SRP497980 RNA-Seq SINGLE
GSM8171527: PRPF17_HeLa_Input_rep2; Homo sapiens; RNA-Seq
Sample: SRS20858745
BioProject: PRJNA1092254
BioSample: SAMN40622121
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: HeLa
cell line: HeLa
cell type: epithelial cell
geo_loc_name: missing
collection_date: missing
Original files (1)
HeLa
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR28464928 15392620 1554654620 523.98 PRPF17_HeLa_In2.fastq.gz, SRR28464928, SRR28464928.lite SRA
SRX24068030 SRP497980 RNA-Seq SINGLE
GSM8171528: PRPF17_HeLa_IP_rep1; Homo sapiens; RNA-Seq
Sample: SRS20858746
BioProject: PRJNA1092254
BioSample: SAMN40622120
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: HeLa
cell line: HeLa
cell type: epithelial cell
geo_loc_name: missing
collection_date: missing
Original files (1)
HeLa
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR28464927 27850197 2812869897 991.19 PRPF17_HeLa_IP1.fastq.gz, SRR28464927, SRR28464927.lite SRA
SRX24068031 SRP497980 RNA-Seq SINGLE
GSM8171529: PRPF17_HeLa_IP_rep2; Homo sapiens; RNA-Seq
Sample: SRS20858747
BioProject: PRJNA1092254
BioSample: SAMN40622119
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: HeLa
cell line: HeLa
cell type: epithelial cell
geo_loc_name: missing
collection_date: missing
Original files (1)
HeLa
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR28464926 28924429 2921367329 1032.04 PRPF17_HeLa_IP2.fastq.gz, SRR28464926, SRR28464926.lite SRA
SRX24068032 SRP497980 RNA-Seq SINGLE
GSM8171530: SND1_k562_Input_rep1; Homo sapiens; RNA-Seq
Sample: SRS20858748
BioProject: PRJNA1092254
BioSample: SAMN40622118
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: K562
cell line: K562
cell type: lymphoblast
geo_loc_name: missing
collection_date: missing
Original files (1)
K562
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR28464925 19024043 2872630493 952.92 SND1_k562_IN1.fastq.gz, SRR28464925, SRR28464925.lite SRA
SRX24068033 SRP497980 RNA-Seq SINGLE
GSM8171531: SND1_k562_Input_rep2; Homo sapiens; RNA-Seq
Sample: SRS20858749
BioProject: PRJNA1092254
BioSample: SAMN40622117
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: K562
cell line: K562
cell type: lymphoblast
geo_loc_name: missing
collection_date: missing
Original files (1)
K562
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR28464924 29137586 4399775486 1473.82 SND1_k562_IN2.fastq.gz, SRR28464924, SRR28464924.lite SRA
SRX24068034 SRP497980 RNA-Seq SINGLE
GSM8171532: SND1_k562_IP_rep1; Homo sapiens; RNA-Seq
Sample: SRS20858750
BioProject: PRJNA1092254
BioSample: SAMN40622116
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: K562
cell line: K562
cell type: lymphoblast
geo_loc_name: missing
collection_date: missing
Original files (1)
K562
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR28464923 27524428 4156188628 1373.66 SND1_k562_IP1.fastq.gz, SRR28464923, SRR28464923.lite SRA
SRX24068035 SRP497980 RNA-Seq SINGLE
GSM8171533: SND1_k562_IP_rep2; Homo sapiens; RNA-Seq
Sample: SRS20858751
BioProject: PRJNA1092254
BioSample: SAMN40622115
Platform: ILLUMINA
Instrument: Illumina NovaSeq 6000
Organism: Homo sapiens
Sample attributes
source_name: K562
cell line: K562
cell type: lymphoblast
geo_loc_name: missing
collection_date: missing
Original files (1)
K562
Runs (1)
Run Spots Bases Size (MB) Files Link
SRR28464922 25471892 3846255692 1259.1 SND1_k562_IP2.fastq.gz, SRR28464922, SRR28464922.lite SRA

Linked Publications (1)

Data Files (40)

Accession File Name Stored Type Output Type Mapping Assembly Size Download
hnRNPA2B1_HepG2_IN1.fastq.gz RNA-Seq 1.1 GB link
hnRNPA2B1_HepG2_IN1.fastq.gz RNA-Seq 1.1 GB link
hnRNPA2B1_HepG2_IN2.fastq.gz RNA-Seq 1.3 GB link
hnRNPA2B1_HepG2_IN2.fastq.gz RNA-Seq 1.3 GB link
hnRNPA2B1_HepG2_IP1.fastq.gz RNA-Seq 1.4 GB link
hnRNPA2B1_HepG2_IP1.fastq.gz RNA-Seq 1.4 GB link
hnRNPA2B1_HepG2_IP2.fastq.gz RNA-Seq 1.5 GB link
hnRNPA2B1_HepG2_IP2.fastq.gz RNA-Seq 1.5 GB link
HNRNPC_K562_In1.fastq.gz RNA-Seq 880.9 MB link
HNRNPC_K562_In1.fastq.gz RNA-Seq 880.9 MB link
HNRNPC_K562_In2.fastq.gz RNA-Seq 890.3 MB link
HNRNPC_K562_In2.fastq.gz RNA-Seq 890.3 MB link
HNRNPC_K562_IP1.fastq.gz RNA-Seq 829.7 MB link
HNRNPC_K562_IP1.fastq.gz RNA-Seq 829.7 MB link
HNRNPC_K562_IP2.fastq.gz RNA-Seq 795.7 MB link
HNRNPC_K562_IP2.fastq.gz RNA-Seq 795.7 MB link
PRPF17_Hek293T_In1.fastq.gz RNA-Seq 5.1 GB link
PRPF17_Hek293T_In1.fastq.gz RNA-Seq 5.1 GB link
PRPF17_Hek293T_In2.fastq.gz RNA-Seq 602.2 MB link
PRPF17_Hek293T_In2.fastq.gz RNA-Seq 602.2 MB link
PRPF17_Hek293T_IP1.fastq.gz RNA-Seq 682.7 MB link
PRPF17_Hek293T_IP1.fastq.gz RNA-Seq 682.7 MB link
PRPF17_Hek293T_IP2.fastq.gz RNA-Seq 567.9 MB link
PRPF17_Hek293T_IP2.fastq.gz RNA-Seq 567.9 MB link
PRPF17_HeLa_In1.fastq.gz RNA-Seq 466.0 MB link
PRPF17_HeLa_In1.fastq.gz RNA-Seq 466.0 MB link
PRPF17_HeLa_In2.fastq.gz RNA-Seq 524.0 MB link
PRPF17_HeLa_In2.fastq.gz RNA-Seq 524.0 MB link
PRPF17_HeLa_IP1.fastq.gz RNA-Seq 991.2 MB link
PRPF17_HeLa_IP1.fastq.gz RNA-Seq 991.2 MB link
PRPF17_HeLa_IP2.fastq.gz RNA-Seq 1.0 GB link
PRPF17_HeLa_IP2.fastq.gz RNA-Seq 1.0 GB link
SND1_k562_IN1.fastq.gz RNA-Seq 952.9 MB link
SND1_k562_IN1.fastq.gz RNA-Seq 952.9 MB link
SND1_k562_IN2.fastq.gz RNA-Seq 1.4 GB link
SND1_k562_IN2.fastq.gz RNA-Seq 1.4 GB link
SND1_k562_IP1.fastq.gz RNA-Seq 1.3 GB link
SND1_k562_IP1.fastq.gz RNA-Seq 1.3 GB link
SND1_k562_IP2.fastq.gz RNA-Seq 1.2 GB link
SND1_k562_IP2.fastq.gz RNA-Seq 1.2 GB link