GSE13483 Processing Pipeline
GSE
code_examples
1 step
Publication
Divergent transcription from active promoters.Science (New York, N.Y.) (2008) — PMID 19056940
Warning: Pipeline descriptions and code snippets may be inferred or AI-generated. Use them only as a starting point to guide analysis, and validate before use.
Processing Steps
Generate Jupyter Notebook-
1
Image analysis and base calling was done using Gerald (Illumina pipeline), and reads aligned to human (hg17) using Megablast.
$ Bash example
# The initial step of image analysis and base calling using Gerald is typically performed on the Illumina sequencing instrument and generates BCL files, which are then converted to FASTQ files. This process does not involve a user-executable bash command. # For alignment using Megablast, the following steps are conceptualized: # 1. Convert FASTQ reads (output from Gerald/bcl2fastq) to FASTA format, as legacy Megablast often prefers FASTA input. # Example for single-end reads: # zcat input_reads.fastq.gz | awk 'BEGIN{RS="@";ORS=""}NR>1{print ">"$1"\n"$2}' > input_reads.fasta # 2. Download the hg17 reference genome and create a BLAST database. # mkdir -p reference/hg17 # wget -P reference/hg17 http://hgdownload.cse.ucsc.edu/goldenPath/hg17/bigZips/chromFa.tar.gz # tar -xzf reference/hg17/chromFa.tar.gz -C reference/hg17/ # cat reference/hg17/*.fa > reference/hg17/hg17.fa # makeblastdb is part of the BLAST+ suite, which is the modern successor to legacy BLAST. # If using legacy megablast, ensure the database format is compatible or use legacy formatdb. # makeblastdb -in reference/hg17/hg17.fa -dbtype nucl -out reference/hg17/hg17_blastdb # 3. Execute Megablast for alignment. # Note: This command assumes the legacy 'megablast' executable is available in the PATH. # For modern BLAST+ installations, 'blastn -task megablast' would be used. # The '-m 8' option outputs a tabular format, which is common for alignments. megablast -i input_reads.fasta -d reference/hg17/hg17_blastdb -o aligned_reads.txt -m 8
Raw Source Text
Image analysis and base calling was done using Gerald (Illumina pipeline), and reads aligned to human (hg17) using Megablast.