GSE90963
GSE GEOTranscriptome-wide Profiling of Multiple RNA Modifications Simultaneously at Single-base Resolution
Relations
Summary
We report RBS-Seq, a new RNA bisulfite sequencing method enabling the sensitive and simultaneous detection of m5C, pseudouridine, and m1A at single-base resolution transcriptome-wide. For each, we detect âsignatureâ base mismatches (for m5C and m1A), or 1-2 base deletions (for pseudouridine) structurally explained by ribose ring-opened pseudouridine-mono-bisulfite adducts.  Our profiles for pseudouridine reveal clear signatures at known sites in tRNAs and rRNAs, and provide hundreds of new targets in non-coding RNAs and mRNAs. However, our results diverge greatly from prior work, suggesting ~10-fold fewer m5C sites, and the absence of substantial m1A in mRNAs.
Overall Design
Application of a modified RNA bisulfite sequencing approach on RNA samples from HeLa cells for simultaneous detection of m5C, pseudouridine, and m1A at single-base resolution transcriptome-wide.
Analysis (35 steps)
View Data Processing- Reference genome: Novoindex (2.8) (http://www.novocraft.com) was used to create a standard and bisulfite reference index on a combination of hg19 chromosome, scaffold and splice junction sequences.
- Splice junction sequences were generated with USeq (v8.8.8) 1 MakeTranscriptome using Ensembl transcript annotations (build 74).
- Alignment parameters: BS and NBS samples were aligned to the transcriptome indexes described above using Novoalign (v2.08.01) (http://www.novocraft.com) allowing up to 50 alignments and trimming adaptor sequences (TGGAATTCTCGGGTGCCAAGGAACTCCAGTCAC GATCGTCGGACTGTAGAACTCTGAAC).
- Alignment processing : The alignment files were processed with USeq SamTranscriptomeParser, which reports the best alignment location for each read in bam format and converts the coordinates of splice junction alignments to genomic space.
- One alignment location is selected at random if a read aligned to multiple locations in the genome with equal confidence.
- Alignments were reordered using Picard (v1.87) (http://picard.sourceforge.net) ReorderBam and all alignments were set as primary using the set_sam_primary.py (https://github.com/HuntsmanCancerInstitute/RBSSeqTools).
- The alignments were then processed with GATK (v3.4-0) 2 SplitNCigarReads, which splits spliced alignments into multiple alignment segments without N cigar elements.
- Reads containing deletions four base pairs or longer were removed using remove_deletions.py (https://github.com/HuntsmanCancerInstitute/RBSSeqTools).
Supplementary Files (4)
Dataset Citations (1)
SRA Experiments (14) and Runs (14)
Total: 159334 MBSample attributes
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR5058077 | 59380911 | 11994944022 | 7734.23 | 10669X4_140417_D00294_0090_AC49H3ACXX_8_1.txt.gz, 10669X4_140417_D002… | SRA |
Sample attributes
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR5058078 | 85475350 | 17266020700 | 10847.31 | 10669X3_140417_D00294_0090_AC49H3ACXX_7_1.txt.gz, 10669X3_140417_D002… | SRA |
Sample attributes
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR5058079 | 91797553 | 18543105706 | 11711.77 | 10669X2_140417_D00294_0090_AC49H3ACXX_7_1.txt.gz, 10669X2_140417_D002… | SRA |
Sample attributes
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR5058080 | 131844384 | 32961096000 | 15015.72 | 11219X1_141212_D00294_0147_AC6231ANXX_7_1.txt.gz, 11219X1_141212_D002… | SRA |
Sample attributes
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR5058081 | 92727751 | 23181937750 | 10813.85 | 11219X2_141212_D00294_0147_AC6231ANXX_7_1.txt.gz, 11219X2_141212_D002… | SRA |
Sample attributes
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR5058082 | 88322680 | 17841181360 | 11460.98 | 10669X5_140417_D00294_0090_AC49H3ACXX_8_1.txt.gz, 10669X5_140417_D002… | SRA |
Sample attributes
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR5058083 | 116072930 | 29018232500 | 15085.72 | 11838X2_150925_D00550_0293_BC7FW5ANXX_6_1.txt.gz, 11838X2_150925_D005… | SRA |
Sample attributes
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR5058084 | 76266288 | 15405790176 | 9946.69 | 10669X6_140417_D00294_0090_AC49H3ACXX_8_1.txt.gz, 10669X6_140417_D002… | SRA |
Sample attributes
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR5058085 | 7164626 | 2076953045 | 1184.01 | 11737x2_S2_L001_R1_001.fastq.gz, 11737x2_S2_L001_R2_001.fastq.gz, SRR… | SRA |
Sample attributes
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR5058086 | 200678038 | 40536963676 | 26556.36 | 10669X1_140417_D00294_0090_AC49H3ACXX_6_1.txt.gz, 10669X1_140417_D002… | SRA |
Sample attributes
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR5058087 | 9026194 | 2604974297 | 1527.17 | 11737x1_S1_L001_R1_001.fastq.gz, 11737x1_S1_L001_R2_001.fastq.gz, SRR… | SRA |
Sample attributes
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR5058088 | 92398709 | 23099677250 | 10465.68 | 11219X4_141212_D00294_0147_AC6231ANXX_8_1.txt.gz, 11219X4_141212_D002… | SRA |
Sample attributes
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR5058089 | 85424616 | 21356154000 | 9253.89 | 11219X3_141212_D00294_0147_AC6231ANXX_8_1.txt.gz, 11219X3_141212_D002… | SRA |
Sample attributes
Runs (1)
| Run | Spots | Bases | Size (MB) | Files | Link |
|---|---|---|---|---|---|
| SRR5058090 | 137121483 | 34280370750 | 17731.01 | 11838X1_150925_D00550_0293_BC7FW5ANXX_6_1.txt.gz, 11838X1_150925_D005… | SRA |
Linked Publications (1)
Data Files (14)
| Accession | File Name | Stored Type | Output Type | Mapping Assembly | Size | Download | |
|---|---|---|---|---|---|---|---|
| — | 11737x1_S1_L001_R1_001.fastq.gz | AMPLICON | 1.5 GB | link | |||
| — | 11737x2_S2_L001_R1_001.fastq.gz | AMPLICON | 1.2 GB | link | |||
| — | 10669X2_140417_D00294_0090_AC49H3ACXX_7_1.txt.gz | miRNA-Seq | 11.4 GB | link | |||
| — | 10669X5_140417_D00294_0090_AC49H3ACXX_8_1.txt.gz | miRNA-Seq | 11.2 GB | link | |||
| — | 10669X1_140417_D00294_0090_AC49H3ACXX_6_1.txt.gz | RNA-Seq | 25.9 GB | link | |||
| — | 10669X3_140417_D00294_0090_AC49H3ACXX_7_1.txt.gz | RNA-Seq | 10.6 GB | link | |||
| — | 10669X4_140417_D00294_0090_AC49H3ACXX_8_1.txt.gz | RNA-Seq | 7.6 GB | link | |||
| — | 10669X6_140417_D00294_0090_AC49H3ACXX_8_1.txt.gz | RNA-Seq | 9.7 GB | link | |||
| — | 11219X1_141212_D00294_0147_AC6231ANXX_7_1.txt.gz | RNA-Seq | 14.7 GB | link | |||
| — | 11219X2_141212_D00294_0147_AC6231ANXX_7_1.txt.gz | RNA-Seq | 10.6 GB | link | |||
| — | 11219X3_141212_D00294_0147_AC6231ANXX_8_1.txt.gz | RNA-Seq | 9.0 GB | link | |||
| — | 11219X4_141212_D00294_0147_AC6231ANXX_8_1.txt.gz | RNA-Seq | 10.2 GB | link | |||
| — | 11838X1_150925_D00550_0293_BC7FW5ANXX_6_1.txt.gz | RNA-Seq | 17.3 GB | link | |||
| — | 11838X2_150925_D00550_0293_BC7FW5ANXX_6_1.txt.gz | RNA-Seq | 14.7 GB | link |