GSE253825 Processing Pipeline
RNA-Seq
code_examples
1 step
Publication
Epistatic interactions between NMD and TRP53 control progenitor cell maintenance and brain size.Neuron (2024) — PMID 38697111
Dataset
GSE253825Nonsense-mediated mRNA decay counteracts TRP53 to maintain the progenitor cell pool for brain development (snRNA-seq_e15invivo)
Warning: Pipeline descriptions and code snippets may be inferred or AI-generated. Use them only as a starting point to guide analysis, and validate before use.
Processing Steps
Generate Jupyter Notebook-
1
STAR (version 2.7.11a, reference genome: mm10-2020-A<based on Gencode vM23, and filtering out pseudogenes, TEC, miRNA etc. a total of 32285 genes) was used to perform alignment, filtering, barcode counting, and UMI counting for single cell RNA-Seq.
STAR v2.7.11a$ Bash example
# Install STAR (if not already installed) # conda install -c bioconda star # Define variables # Path to the custom STAR index built with mm10, Gencode vM23, and filtered genes GENOME_DIR="path/to/STAR_index_mm10_Gencode_vM23_filtered" # Input FASTQ files (e.g., R1 containing UMI/barcode, R2 containing cDNA) READ1_FASTQ="path/to/sample_R1.fastq.gz" READ2_FASTQ="path/to/sample_R2.fastq.gz" # Output directory prefix OUTPUT_PREFIX="star_alignment_output/" # Path to the cell barcode whitelist (specific to scRNA-seq chemistry, e.g., 10x Genomics) CB_WHITELIST="path/to/cell_barcode_whitelist.txt" # Create output directory mkdir -p "${OUTPUT_PREFIX}" # Run STAR for alignment, filtering, barcode counting, and UMI counting # Note: --solo parameters (soloType, soloCBwhitelist, soloCBstart, soloCBlen, soloUMIstart, soloUMIlen) # must be adjusted based on the specific single-cell RNA-Seq library preparation chemistry. # The example below assumes a common 10x Genomics v3 setup (16bp CB, 12bp UMI in R1). STAR --genomeDir "${GENOME_DIR}" \ --readFilesIn "${READ1_FASTQ}" "${READ2_FASTQ}" \ --readFilesCommand zcat \ --outFileNamePrefix "${OUTPUT_PREFIX}" \ --outSAMtype BAM SortedByCoordinate \ --outSAMunmapped Within \ --outFilterMultimapNmax 1 \ --outFilterScoreMinOverLread 0.3 \ --outFilterMatchNminOverLread 0.3 \ --outFilterMismatchNmax 3 \ --limitBAMsortRAM 60000000000 \ --soloType CB_UMI_Simple \ --soloCBwhitelist "${CB_WHITELIST}" \ --soloCBstart 1 --soloCBlen 16 \ --soloUMIstart 17 --soloUMIlen 12 \ --soloFeatures Gene Velocyto \ --soloCellFilter CellRanks 30 0.99 10000 \ --soloOutFileNames Features.mtx Features.barcodes.txt Features.genes.txt
Tools Used
Raw Source Text
STAR (version 2.7.11a, reference genome: mm10-2020-A<based on Gencode vM23, and filtering out pseudogenes, TEC, miRNA etc. a total of 32285 genes) was used to perform alignment, filtering, barcode counting, and UMI counting for single cell RNA-Seq. Assembly: mm10 Supplementary files format and content: Tab-separated values files and matrix files