GSE125125 Processing Pipeline
RNA-Seq
code_examples
3 steps
Publication
Overriding FUS autoregulation in mice triggers gain-of-toxic dysfunctions in RNA metabolism and autophagy-lysosome axis.eLife (2019) — PMID 30747709
Dataset
GSE125125Overriding FUS autoregulation triggers gain-of-toxic dysfunctions in autophagy-lysosome axis and RNA metabolism
Warning: Pipeline descriptions and code snippets may be inferred or AI-generated. Use them only as a starting point to guide analysis, and validate before use.
Processing Steps
Generate Jupyter Notebook-
1
Sequenced reads filtered for low quality reads with skewer 0.2.2 (-Q 5)
$ Bash example
# Install skewer (if not already installed) # conda install -c bioconda skewer=0.2.2 # Example command for quality filtering with skewer # Replace 'input_reads.fastq.gz' with your actual input file(s) # Replace 'output_prefix' with your desired output file prefix skewer -Q 5 -o output_prefix input_reads.fastq.gz
-
2
RNA-seq read quantification was performed using Kallisto 0.44.0 (-b 50 âsingle -l 200 -s 20)
$ Bash example
# Install kallisto (if not already installed) # conda install -c bioconda kallisto # Placeholder for kallisto index. Replace 'transcriptome.idx' with your actual index file # and 'transcripts.fasta' with your reference transcriptome FASTA file (e.g., from Ensembl, GENCODE). # kallisto index -i transcriptome.idx transcripts.fasta # Perform RNA-seq read quantification using Kallisto 0.44.0 # Replace 'reads.fastq' with your input single-end FASTQ file. # Replace 'output_dir' with your desired output directory. kallisto quant -i transcriptome.idx \ -o output_dir \ --single \ -l 200 \ -s 20 \ -b 50 \ reads.fastq -
3
DEG calling was performed using Sleuth 0.28.1 (Control contrast)
Sleuth v0.28.1$ Bash example
# Install R if not already installed (example for Ubuntu) # sudo apt-get update && sudo apt-get install -y r-base # Install Bioconductor and Sleuth (if not already installed) # R -e 'if (!requireNamespace("BiocManager", quietly = TRUE)) install.packages("BiocManager")' # R -e 'BiocManager::install("sleuth")' # R -e 'install.packages("dplyr")' # Often used with sleuth # R -e 'install.packages("readr")' # Often used with sleuth # --- Placeholder for Kallisto output directories --- # Sleuth requires kallisto (or similar) quantification results. # These directories would typically contain 'abundance.h5' and 'run_info.json'. # For demonstration, we create empty directories. # In a real scenario, these would be generated by running Kallisto against a transcriptome index (e.g., GRCh38). mkdir -p kallisto_output/sample1_control mkdir -p kallisto_output/sample2_control mkdir -p kallisto_output/sample3_treated mkdir -p kallisto_output/sample4_treated # Create a dummy sample-to-condition file (s2c.tsv) # This file links sample IDs to their experimental conditions and kallisto output paths. # The 'control contrast' implies comparing a treatment group against a control group. echo -e "sample\tcondition\tpath" > s2c.tsv echo -e "sample1_control\tcontrol\tkallisto_output/sample1_control" >> s2c.tsv echo -e "sample2_control\tcontrol\tkallisto_output/sample2_control" >> s2c.tsv echo -e "sample3_treated\ttreated\tkallisto_output/sample3_treated" >> s2c.tsv echo -e "sample4_treated\ttreated\tkallisto_output/sample4_treated" >> s2c.tsv # Create an R script to perform DEG calling with Sleuth cat << 'EOF' > run_sleuth.R library(sleuth) library(dplyr) library(readr) # Load sample-to-condition information s2c <- read_tsv("s2c.tsv") # Ensure 'condition' is a factor and set 'control' as the reference level s2c$condition <- factor(s2c$condition, levels = c("control", "treated")) # Initialize sleuth object # 'extra_bootstrap_data = TRUE' and 'read_bootstrap_tpm = TRUE' are recommended for full Sleuth functionality so <- sleuth_prep(s2c, extra_bootstrap_data = TRUE, read_bootstrap_tpm = TRUE) # Define the full model (e.g., ~condition) to test for differences between conditions so <- sleuth_fit(so, ~condition, 'full') # Define the null model (e.g., ~1 for no effect) so <- sleuth_fit(so, ~1, 'null') # Perform Wald test for the 'conditiontreated' coefficient. # This directly tests the difference between 'treated' and 'control' (the 'control contrast'). # The coefficient 'conditiontreated' represents the log2 fold change of 'treated' vs 'control'. so <- sleuth_wt(so, 'conditiontreated', 'full') # Get results for the specified contrast results_table <- sleuth_results(so, 'conditiontreated', test_type = 'wald', show_all = FALSE) # Filter results by q-value (FDR) and arrange by p-value results_table_filtered <- dplyr::filter(results_table, qval <= 0.05) %>% dplyr::arrange(pval) # Write all results and filtered results to TSV files write_tsv(results_table, "sleuth_deg_results_all.tsv") write_tsv(results_table_filtered, "sleuth_deg_results_q0.05.tsv") # Optional: Save the sleuth object for further interactive analysis # save(so, file = "sleuth_object.RData") EOF # Execute the R script Rscript run_sleuth.R
Raw Source Text
Sequenced reads filtered for low quality reads with skewer 0.2.2 (-Q 5) RNA-seq read quantification was performed using Kallisto 0.44.0 (-b 50 âsingle -l 200 -s 20) DEG calling was performed using Sleuth 0.28.1 (Control contrast) Genome_build: GRCh38Â Supplementary_files_format_and_content: expression.tsv: tsv (Gene quantifications) Supplementary_files_format_and_content: base-nontg_FUS-wt-homo_gene.tsv: tsv (DE calls for homozygous vs nontransgenic) Supplementary_files_format_and_content: base-nontg_FUS-wt-het_gene.tsv: tsv (DE calls for heterozygous vs nontransgenic) Supplementary_files_format_and_content: bs-control_fus-ko_wt_gene.tsv: tsv (DE calls for Fus KO vs control)