GSE95237 Processing Pipeline

ChIP-Seq code_examples 3 steps

Publication

Early transcriptional and epigenetic regulation of CD8<sup>+</sup> T cell differentiation revealed by single-cell RNA sequencing.

Nature immunology (2017) — PMID 28218746

Dataset

GSE95237

Genome-wide maps of chromatin state and chromatin accessibility in CD8 T cell subsets

Warning: Pipeline descriptions and code snippets may be inferred or AI-generated. Use them only as a starting point to guide analysis, and validate before use.
  1. 1

    Reads were aligned to mm10 by BWA

    BWA v0.7.17 GitHub
    $ Bash example
    # Install BWA and Samtools if not already installed
    # conda install -c bioconda bwa samtools
    
    # --- Reference Genome Preparation ---
    # Download the mm10 reference genome (example from UCSC)
    # mkdir -p ref/mm10
    # wget -P ref/mm10 https://hgdownload.soe.ucsc.edu/goldenPath/mm10/bigZips/mm10.fa.gz
    # gunzip ref/mm10/mm10.fa.gz
    
    # Index the reference genome for BWA
    # bwa index ref/mm10/mm10.fa
    
    # --- Alignment Command ---
    # Assuming input reads are paired-end: reads_R1.fastq.gz and reads_R2.fastq.gz
    # And the reference genome is indexed at ref/mm10/mm10.fa
    bwa mem -t 8 ref/mm10/mm10.fa reads_R1.fastq.gz reads_R2.fastq.gz | samtools view -bS -o aligned_reads.bam -
  2. 2

    Reads with low quality (MAPQ < 30) were filtered out.

    samtools (Inferred with models/gemini-2.5-flash) v1.10 (Inferred with models/gemini-2.5-flash) GitHub
    $ Bash example
    # Install samtools (if not already installed)
    # conda install -c bioconda samtools
    
    # Filter reads with mapping quality (MAPQ) less than 30
    samtools view -b -q 30 -o filtered_reads.bam input_reads.bam
  3. 3

    If multiple reads were mapped to the exact same location, only one read was kept.

    samtools markdup (Inferred with models/gemini-2.5-flash) v1.15 GitHub
    $ Bash example
    # Install samtools if not already installed
    # conda install -c bioconda samtools=1.15
    
    # Input: A coordinate-sorted BAM file. This step assumes the input BAM is already sorted.
    # Example input from a previous alignment step:
    INPUT_BAM="aligned_reads.sorted.bam"
    OUTPUT_DEDUP_BAM="aligned_reads.dedup.bam"
    
    # Remove PCR duplicates based on mapping coordinates.
    # If multiple reads map to the exact same genomic location, only one is kept.
    # -r: Remove duplicate reads (rather than just marking them).
    # -s: Output statistics to stderr (optional, but useful for logging).
    samtools markdup -r -s "${INPUT_BAM}" "${OUTPUT_DEDUP_BAM}"
Raw Source Text
Reads were aligned to mm10 by BWA
Reads with low quality (MAPQ < 30) were filtered out. If multiple reads were mapped to the exact same location, only one read was kept.
Genome_build: mm10
Supplementary_files_format_and_content: bigwig
← Back to Analysis