GSE33113 Processing Pipeline

GSE code_examples 1 step

Publication

DDX5 promotes oncogene C3 and FABP1 expressions and drives intestinal inflammation and tumorigenesis.

Life science alliance (2020) — PMID 32817263

Dataset

GSE33113

AMC colon cancer AJCCII

Warning: Pipeline descriptions and code snippets may be inferred or AI-generated. Use them only as a starting point to guide analysis, and validate before use.
  1. 1

    Expression data (cel files) were normalized with the MAS5.0 algorithm (target signal = 100) using GCOS software (Affymetrix)

    $ Bash example
    # The original normalization was performed using proprietary Affymetrix GCOS software.
    # This code block demonstrates the equivalent MAS5.0 normalization using the open-source Bioconductor 'affy' package in R.
    
    # Install R and Bioconductor if not already present
    # sudo apt-get update && sudo apt-get install -y r-base
    # R -e 'if (!requireNamespace("BiocManager", quietly = TRUE)) install.packages("BiocManager"); BiocManager::install("affy")'
    
    # Define input and output paths
    INPUT_CEL_DIR="path/to/your/cel_files"
    OUTPUT_DIR="path/to/output_normalized_data"
    mkdir -p "${OUTPUT_DIR}"
    
    # R script for MAS5.0 normalization
    Rscript -e '
      library(affy)
      
      # Set input and output directories
      input_cel_dir <- Sys.getenv("INPUT_CEL_DIR")
      output_dir <- Sys.getenv("OUTPUT_DIR")
      
      # Read CEL files
      cel_files <- list.files(input_cel_dir, pattern = "\\.cel$", full.names = TRUE, ignore.case = TRUE)
      if (length(cel_files) == 0) {
        stop("No CEL files found in the specified directory.")
      }
      
      # Create an AffyBatch object
      raw_data <- ReadAffy(filenames = cel_files)
      
      # Perform MAS5.0 normalization with target signal = 100
      # The "target" parameter in mas5() corresponds to the target signal value.
      normalized_data <- mas5(raw_data, target = 100)
      
      # Extract expression matrix
      expr_matrix <- exprs(normalized_data)
      
      # Save the normalized expression matrix to a CSV file
      output_file <- file.path(output_dir, "normalized_mas5_expression.csv")
      write.csv(expr_matrix, file = output_file, row.names = TRUE)
      
      message(paste("MAS5.0 normalized expression data saved to:", output_file))
    '
    

Tools Used

Raw Source Text
Expression data (cel files) were normalized with the MAS5.0 algorithm (target signal = 100) using GCOS software (Affymetrix)
← Back to Analysis