GSE203090 Processing Pipeline
ChIP-Seq
code_examples
3 steps
Publication
The long noncoding RNA Malat1 regulates CD8+ T cell differentiation by mediating epigenetic repression.The Journal of experimental medicine (2022) — PMID 35593887
Dataset
GSE203090The long noncoding RNA Malat1 regulates CD8+ T cell differentiation by mediating epigenetic repression (ChIP-Seq)
Warning: Pipeline descriptions and code snippets may be inferred or AI-generated. Use them only as a starting point to guide analysis, and validate before use.
Processing Steps
Generate Jupyter Notebook-
1
Libraries were filtered and mapped to the mm10 genome using ENCODE Transcription Factor and Histone ChIP-Seq processing pipeline with default parameters for histone marks (github.com/ENCODE-DCC/chip-seq-pipeline2).
$ Bash example
# Install Cromwell (requires Java). Replace X.Y.Z with the actual version. # java -jar cromwell-X.Y.Z.jar install # Clone the ENCODE ChIP-seq pipeline repository if not already present # git clone https://github.com/ENCODE-DCC/chip-seq-pipeline2.git # cd chip-seq-pipeline2 # Create an inputs JSON file specifying the parameters based on the description. # This includes the genome assembly (mm10) and pipeline type (histone). # Replace 'path/to/your/fastq_R1.fastq.gz' and 'path/to/your/fastq_R2.fastq.gz' with actual input FASTQ files. # Replace 'path/to/mm10_genome.tsv' with the path to your genome TSV file for mm10. # The genome TSV file typically contains paths to the genome FASTA, BWA index, blacklist regions, etc., # and can be found in the pipeline's 'genome_data' directory or generated. cat << EOF > inputs.json { "chip.pipeline_type": "histone", "chip.genome_tsv": "path/to/mm10_genome.tsv", "chip.fastqs_rep1_R1": ["path/to/your/fastq_R1.fastq.gz"], "chip.fastqs_rep1_R2": ["path/to/your/fastq_R2.fastq.gz"], "chip.paired_end": true, "chip.auto_detect_adapter": true, "chip.trim_adapter": true, "chip.multimapping": 0, "chip.nthr": 8, "chip.mem_gb": 32 // Other default parameters for histone marks are handled by the pipeline's internal logic } EOF # Execute the ENCODE ChIP-seq pipeline using Cromwell. # Ensure you are in the directory containing 'chip.wdl' and 'inputs.json'. # Replace 'cromwell-X.Y.Z.jar' with the actual Cromwell JAR file name. java -jar cromwell-X.Y.Z.jar run chip.wdl -i inputs.json -
2
Final pooled bigwig files were used for visualization.
$ Bash example
# Install UCSC tools (if not already installed) # conda install -c bioconda ucsc-bedgraphtobigwig # Define input and output files # INPUT_BEDGRAPH: This would be the pooled and normalized bedGraph file generated in a previous step. INPUT_BEDGRAPH="final_pooled.bedGraph" # CHROM_SIZES: Chromosome sizes file for the reference genome (e.g., hg38). CHROM_SIZES="hg38.chrom.sizes" OUTPUT_BIGWIG="final_pooled.bigWig" # Download chrom.sizes for hg38 (if not available) # wget -O ${CHROM_SIZES} http://hgdownload.cse.ucsc.edu/goldenPath/hg38/bigZips/hg38.chrom.sizes # Convert bedGraph to bigWig format for efficient visualization on the UCSC Genome Browser bedGraphToBigWig "${INPUT_BEDGRAPH}" "${CHROM_SIZES}" "${OUTPUT_BIGWIG}" -
3
none provided by the submitter
N/A (No step description provided) (Inferred with models/gemini-2.5-flash) vN/A (No step description provided)$ Bash example
# No step description was provided, so specific tools, parameters, # and reference datasets cannot be inferred. # Please provide a step description to generate a relevant code block.
Tools Used
Raw Source Text
Libraries were filtered and mapped to the mm10 genome using ENCODE Transcription Factor and Histone ChIP-Seq processing pipeline with default parameters for histone marks (github.com/ENCODE-DCC/chip-seq-pipeline2). Final pooled bigwig files were used for visualization. none provided by the submitter Assembly: mm10 Supplementary files format and content: bigwig files from pooled replicates