GSE132971 Processing Pipeline
RNA-Seq
code_examples
4 steps
Publication
Direct RNA sequencing enables m<sup>6</sup>A detection in endogenous transcript isoforms at base-specific resolution.RNA (New York, N.Y.) (2020) — PMID 31624092
Dataset
GSE132971Direct RNA sequencing enables single-nucleotide m6A detection in endogenous transcript isoforms
Warning: Pipeline descriptions and code snippets may be inferred or AI-generated. Use them only as a starting point to guide analysis, and validate before use.
Processing Steps
Generate Jupyter Notebook-
1
Oxford Nanopore's Albacore and Guppy base-callers
Nanopore sequencing v6.5.7$ Bash example
# Install Guppy (example using conda, ensure you have the appropriate GPU drivers if using GPU acceleration) # conda create -n guppy_env python=3.8 # conda activate guppy_env # conda install -c bioconda ont-guppy # Example command for Guppy base-calling # This command processes raw .fast5 files from an input directory and outputs .fastq files. # The configuration file specifies the base-calling model, which depends on the flow cell and chemistry used. # Replace /path/to/raw_fast5_data with the actual directory containing your .fast5 files. # Replace /path/to/output_fastq_data with your desired output directory for .fastq files. # Replace dna_r9.4.1_450bps_hac.cfg with the appropriate configuration file for your specific Nanopore chemistry and flow cell. # The --device auto flag attempts to automatically detect and use available GPUs or fall back to CPU. guppy_basecaller \ -i /path/to/raw_fast5_data \ -s /path/to/output_fastq_data \ -c dna_r9.4.1_450bps_hac.cfg \ --recursive \ --fast5_out \ --compress_fastq \ --device auto
-
2
Reads were aligned and resquiggled with Tombo v1.4
$ Bash example
# Install Tombo (if not already installed) # conda create -n tombo_env python=3.7 # conda activate tombo_env # pip install tombo # Define variables (replace with actual paths) FAST5_DIR="path/to/your/fast5_files" GENOME_FASTA="path/to/your/reference/genome/hg38.fa" # Placeholder: latest assembly (e.g., hg38) OUTPUT_DIR="path/to/output/resquiggle_files" NUM_PROCESSES=8 # Example number of processes # Create output directory if it doesn't exist mkdir -p "${OUTPUT_DIR}" # Align reads and resquiggle with Tombo tombo resquiggle \ --fast5-basedirs "${FAST5_DIR}" \ --genome-fasta "${GENOME_FASTA}" \ --output-path "${OUTPUT_DIR}/resquiggle_results.hdf5" \ --processes "${NUM_PROCESSES}" \ --overwrite -
3
Coverage and fraction modified files were obtained using Tombo's text_output commands.
$ Bash example
# Install Tombo (if not already installed) # conda install -c bioconda tombo # pip install tombo # Placeholder for input Tombo HDF5 file, typically generated by 'tombo detect_modifications' TOMBO_HDF5_FILE="sample_modifications.tombo.hdf5" # Obtain coverage file tombo text_output coverage \ --file-type tombo_stats \ --output-filepath sample_coverage.txt \ "${TOMBO_HDF5_FILE}" # Obtain fraction modified file tombo text_output fraction_modified \ --file-type tombo_stats \ --output-filepath sample_fraction_modified.txt \ "${TOMBO_HDF5_FILE}" -
4
Bedobs was used to convert the wig files to bed files before being interescted with the coverage file using bedtools.
$ Bash example
# Install bedtools (example) # conda install -c bioconda bedtools # Bedobs is mentioned as a tool to convert wig files to bed files. # Assuming 'Bedobs' is an executable or script available in the environment. # If 'Bedobs' is not a standard tool, this step might use a custom script or another tool like UCSC wig2bed. # Example for converting a single wig file: Bedobs input.wig > converted.bed # Intersect the converted bed file with the coverage file using bedtools # Assuming coverage.bed is the coverage file bedtools intersect -a converted.bed -b coverage.bed > intersected_output.bed
Tools Used
Raw Source Text
Oxford Nanopore's Albacore and Guppy base-callers Reads were aligned and resquiggled with Tombo v1.4 Coverage and fraction modified files were obtained using Tombo's text_output commands. Bedobs was used to convert the wig files to bed files before being interescted with the coverage file using bedtools. Genome_build: hg19 and hg38(cDNA) Supplementary_files_format_and_content: bedgraph and wig