Skip to content

Pipelines: panoply_main

wcorinne edited this page Feb 6, 2024 · 2 revisions

panoply_main

Description

Performs proteogenomic analysis for a given ome (proteome, phosphoproteome, ubiquitylome, or acetylome). Use panoply_unified_workflow to run this pipeline on multiple ome types in parallel.

This pipeline executes the following modules:

Input

Required inputs:

  • job_identifier: (String) An identifier name given to the job
  • ome_type: (String) Type of omics data being run (ex: "proteome", "phosphoproteome" etc.)
  • input_pome: (.gct file) Input ome data matrix
  • input_rna: (.gct file, default = this.rna_ss) Input rna data matrix (optional)
  • input_cna: (.gct file, default = this.cna_ss) Input cna data matrix (optional)
  • sample_annotation: (File, default = this.annotation_ss) The annotation file for the given ome data
  • groups_file: (File, default = this.groups_ss) The annotation file for the given ome data, subsetted to annotations of interest and used for analyses. Can be manually overriden for relevant modules (below).
  • yaml: (.yaml file) parameters in yaml format

Optional inputs:

  • run_ptmsea: (String, default = false) true if phosphoproteome data is being run and ptmsea analysis is desired or "false" to skip (this module takes additional time and higher cost to run)

  • run_cmap: (String, default = false) true if proteome data is being run and cmap analysis is desired or false to skip (this module takes additional time and higher cost to run)

  • cna_groups: please see groupsFile input parameter in panoply_cna_setup

  • groups_file_association: please see groupsFile input parameter in panoply_association. Overrides default groups_file.

  • groups_file_nmf: please see groupsFile input parameter in panoply_nmf. Overrides default groups_file.

  • normalizeProteomics: please see normalizeProteomics input parameter in panoply_normalize_ms_data

  • cmap_n_permutations: please see n_permutations input parameter in panoply_cmap_analysis

  • groups_file_cmap_enrichment: please see cmap_enrichment_groups input parameter in panoply_cmap_analysis. Overrides default groups_file.

  • subset_list_file: please see subset_list_file input parameter in panoply_cmap_analysis

  • cmap_level5_data: please see cmap_level5_data input parameter in panoply_cmap_analysis

  • annotation_pathway_db: please see annotation_pathway_db input parameter in panoply_cmap_analysis

  • subset_bucket: please see subset_bucket input parameter in panoply_cmap_analysis

  • ndigits: please see ndigits input parameter in panoply_normalize_ms_data

  • na_max: please see naMax input parameter in panoply_normalize_ms_data

  • sample_na_max: please see sample_na_max input parameter in panoply_association

  • min_numratio_fraction: please see minNumratioFraction input parameter in panoply_normalize_ms_data

  • nmiss_factor: please see nmiss_factor input parameter in panoply_association

  • sd_filter_threshold: please see sdFilterThreshold input parameter in panoply_normalize_ms_data

  • duplicate_gene_policy: please see duplicate_gene_policy input parameter in panoply_harmonize

  • gene_id_col: please see gene_id_col input parameter in panoply_harmonize

  • organism:

  • standalone: (String, default = false) set to true to run as a self-contained module; when running panoply_main pipeline use false

  • geneset_db: (String, default = this.gseaDB) please see gene_set_database input parameter in panoply_ssgsea

  • ptm_db: (String, default = this.ptmseaDB) this is the gene_set_database for running ptmsea

Output

panoply_main produces the follow outputs:

Clone this wiki locally