Skip to content
This repository has been archived by the owner on Jul 21, 2020. It is now read-only.

GATK 4.1.4.0 Dev #27

Merged
merged 42 commits into from
Oct 15, 2019
Merged
Changes from 1 commit
Commits
Show all changes
42 commits
Select commit Hold shift + click to select a range
d1c5cbc
README.md
Nov 7, 2018
4b08f6a
Update to gatk4.1
Feb 1, 2019
ab1fccc
Updated forum links in Readme
Feb 1, 2019
f2ec9cc
changed ref file VM path to google bucket path
Feb 1, 2019
ad695aa
Merge branch 'master' into dev
Feb 1, 2019
257166f
updated import link to git tag url
Feb 1, 2019
9b0c86f
Merge branch 'master' of github.com:gatk-workflows/gatk4-somatic-snvs…
Apr 2, 2019
b814bc3
updated to GATK4.1.1.0
Apr 2, 2019
579e36a
minor edits
Apr 2, 2019
f722ae4
changed import to dev for pon
Apr 2, 2019
1594883
updated filter bias variable
Apr 4, 2019
6f58280
updated filter bias variable
Apr 4, 2019
92c1abf
updated output variable names, removed artifact_mode from normal-normal
Apr 4, 2019
0f38435
changed index to idx
Apr 4, 2019
5da3b20
removed artifact_modes
Apr 4, 2019
0ecee17
updated import to next release: 2.4.0
Apr 5, 2019
1719562
specified gatk version in readme
Apr 5, 2019
566fec1
Merge branch 'master' of github.com:gatk-workflows/gatk4-somatic-snvs…
Apr 25, 2019
f055dfc
Added Funcotator note to readme
Apr 25, 2019
0d31a9f
Update to gatk4.1.2.0
Apr 25, 2019
de2d5b3
updated input variable for mutect2
Apr 25, 2019
4054e9b
correctin to input vairalbe
Apr 25, 2019
d7438e1
correction to output variable
Apr 25, 2019
7e22a33
corrected variable name and simplified json
Aug 19, 2019
1494e0b
Merge branch 'master' into dev
Aug 19, 2019
1026dbe
Merge branch 'master' of github.com:gatk-workflows/gatk4-somatic-snvs…
Sep 23, 2019
7dabf3c
renamed input json, removed optional variables from json, updated reg…
Sep 23, 2019
8b690cb
removed nio wdl
Sep 23, 2019
6c8374d
minor update
Sep 23, 2019
9bc5597
Merge branch 'dev' of github.com:gatk-workflows/gatk4-somatic-snvs-in…
Sep 23, 2019
17af7dd
removed comments in json because later version of cromwell throws err…
Sep 23, 2019
0a5b6bd
corrections to import url for pon wdl
Sep 23, 2019
356a43d
mutect2 WDL and pon WDL had inconsistent input names
Sep 26, 2019
98316f3
mutect2 WDL and pon WDL had inconsistent input names
Sep 26, 2019
694372a
added standard_runtime to pon
Sep 26, 2019
c816321
add runtime structs
Sep 26, 2019
085ea22
add runtime structs for mergevcf
Sep 26, 2019
7cfd447
Update mutect2_pon.wdl
Sep 27, 2019
af9cacc
cleaned up input variables
Sep 30, 2019
dc46cd8
Update README.md
Sep 30, 2019
00328d1
updated docker to 4.1.4.0, updated import url to latest release
Oct 11, 2019
bb1c6b2
Merge branch 'dev' of github.com:gatk-workflows/gatk4-somatic-snvs-in…
Oct 11, 2019
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Next Next commit
README.md
  • Loading branch information
bshifaw committed Nov 7, 2018
commit d1c5cbc3ce8bf82cb5f1447c33917c381045a280
69 changes: 39 additions & 30 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -45,38 +45,47 @@ Cromwell version support

### Parameter descriptions :
#### mutect2 (single pair/sample)
- ``Mutect2.gatk4_jar`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.intervals`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.ref_fasta`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.ref_fasta_index`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.ref_dict`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.gatk4_jar`` -- Location *within the docker file* of the GATK4 jar file. If you wish you to use a different jar file, such as one on your local filesystem or a google bucket, specify that location with ``Mutect2_Multi.gatk4_jar_override``. This parameter is ignored if ``Mutect2_Multi.gatk4_jar_override`` is specified.
- ``Mutect2.intervals`` -- A file listing genomic intervals to search for somatic mutations. This should be in the standard GATK4 format.
- ``Mutect2.ref_fasta`` -- reference fasta. For Broad internal VM: ``/seq/references/Homo_sapiens_assembly19/v1/Homo_sapiens_assembly19.fasta``
- ``Mutect2.ref_fasta_index`` -- For Broad internal VM: ``/seq/references/Homo_sapiens_assembly19/v1/Homo_sapiens_assembly19.fasta.fai``
- ``Mutect2.ref_dict`` -- For Broad internal VM: ``/seq/references/Homo_sapiens_assembly19/v1/Homo_sapiens_assembly19.dict``
- ``Mutect2.tumor_bam`` -- File path or storage location (depending on backend) of the tumor bam file.
- ``Mutect2.tumor_bam_index`` -- File path or storage location (depending on backend) of the tumor bam file index.
- ``Mutect2.tumor_bam_index`` -- File path or storage location (depending on backend) of the tumor bam file index.
- ``Mutect2.normal_bam`` -- (optional) File path or storage location (depending on backend) of the normal bam file.
- ``Mutect2.normal_bam_index`` -- (optional, but required if ``Mutect2.normal_bam`` is specified) File path or storage location (depending on backend) of the normal bam file index.
- ``Mutect2.pon`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.pon_index`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.scatter_count`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.gnomad`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.gnomad_index`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.variants_for_contamination`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.variants_for_contamination_index`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.is_run_orientation_bias_filter`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.is_run_oncotator`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.gatk_docker`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.oncotator_docker`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.gatk4_jar_override`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.preemptible_attempts`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.onco_ds_tar_gz`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.onco_ds_local_db_dir`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.artifact_modes`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.picard_jar`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.m2_extra_args`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.m2_extra_filtering_args`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.sequencing_center`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.sequence_source`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.default_config_file`` -- Please see parameter description above in the mutect2_multi_sample.
- ``Mutect2.filter_oncotator_maf`` -- (optional, default true) Whether Oncotator should remove filtered variants when rendering the MAF. Ignored if `run_oncotator` is false.
- ``Mutect2.normal_bam_index`` -- (optional, but required if ``Mutect2.normal_bam`` is specified)  File path or storage location (depending on backend) of the normal bam file index.
- ``Mutect2.pon`` -- (optional) Panel of normals VCF to use for false positive reduction.
- ``Mutect2.pon_index`` -- (optional, but required if ``Mutect2_Multi.pon`` is specified) VCF index for the panel of normals. Please see GATK4 tool ``IndexFeatureFile`` for creation of an index.
- ``Mutect2.scatter_count`` -- Number of executions to split the Mutect2 task into. The more you put here, the faster Mutect2 will return results, but at a higher cost of resources.
- ``Mutect2.gnomad`` -- (optional) gnomAD vcf containing population allele frequencies (AF) of common and rare alleles. Download an exome or genome sites vcf [here](http://gnomad.broadinstitute.org/downloads). Essential for determining possible germline variants in tumor
- ``Mutect2.gnomad_index`` -- (optional, but required if ``Mutect2_Multi.gnomad`` is specified) VCF index for gnomAD. Please see GATK4 tool ``IndexFeatureFile`` for creation of an index.
- ``Mutect2.variants_for_contamination`` -- (optional) vcf containing population allele frequencies (AF) of common SNPs. If omitted, cross-sample contamination will not be calculated and contamination filtering will not be applied. This can be generated from a gnomAD vcf using the GATK4 tool ``SelectVariants`` with the argument ``--select "AF > 0.05"``. For speed, one can get very good results using only SNPs on chromosome 1. For example, ``java -jar $gatk SelectVariants -V gnomad.vcf -L 1 --select "AF > 0.05" -O variants_for_contamination.vcf``.
- ``Mutect2.variants_for_contamination_index`` -- (optional, but required if ``Mutect2_Multi.variants_for_contamination`` is specified) VCF index for contamination variants. Please see GATK4 tool ``IndexFeatureFile`` for creation of an index.
- ``Mutect2.is_run_orientation_bias_filter`` -- ``true``/``false`` whether the orientation bias filter should be run.
- ``Mutect2.is_run_oncotator`` -- ``true``/``false`` whether the command-line version of oncotator should be run. If ``false``, ``Mutect2_Multi.oncotator_docker`` parameter is ignored.
- ``Mutect2.gatk_docker`` -- Docker image to use for Mutect2 tasks. This is only used for backends configured to use docker.
- ``Mutect2.oncotator_docker`` -- (optional) A GATK4 jar file to be used instead of the jar file in the docker image. (See ``Mutect2_Multi.gatk4_jar``) This can be very useful for developers. Please note that you need to be careful that the docker image you use is compatible with the GATK4 jar file given here -- no automated checks are made.
- ``Mutect2.gatk4_jar_override`` -- (optional) A GATK4 jar file to be used instead of the jar file in the docker image. (See ``Mutect2_Multi.gatk4_jar``) This can be very useful for developers. Please note that you need to be careful that the docker image you use is compatible with the GATK4 jar file given here
- ``Mutect2.preemptible_attempts`` -- Number of times to attempt running a task on a preemptible VM. This is only used for cloud backends in cromwell and is ignored for local and SGE backends.
- ``Mutect2.onco_ds_tar_gz`` -- (optional) A tar.gz file of the oncotator datasources -- often quite large (>15GB). This will be uncompressed as part of the oncotator task. Depending on backend used, this can be specified as a path on the local filesystem of a cloud storage container (e.g. gs://...). Typically the Oncotator default datasource can be downloaded at ``ftp://gsapubftp-anonymous@ftp.broadinstitute.org/bundle/oncotator/``. Do not put the FTP URL into the json file.
- ``Mutect2.onco_ds_local_db_dir`` -- "(optional) A direct path to the Oncotator datasource directory (uncompressed). While this is the fastest approach, it cannot be used with docker unless your docker image already has the datasources in it. For cromwell backends without docker, this can be a local filesystem path. *This cannot be a cloud storage location*

Note: If neither ``Mutect2_Multi.onco_ds_tar_gz``, nor ``Mutect2_Multi.onco_ds_local_db_dir``, is specified, the Oncotator task will download and uncompress for each execution.

The following three parameters are useful for rendering TCGA MAFs using oncotator. These parameters are ignored if ``is_run_oncotator`` is ``false``."
- ``Mutect2.artifact_modes`` -- List of artifact modes to search for in the orientation bias filter. For example to filter the OxoG artifact, you would specify ``["G/T"]``. For both the FFPE artifact and the OxoG artifact, specify ``["G/T", "C/T"]``. If you do not wish to search for any artifacts, please set ``Mutect2_Multi.is_run_orientation_bias_filter`` to ``false``.
- ``Mutect2.picard_jar`` -- A direct path to a picard jar for using ``CollectSequencingArtifactMetrics``. This parameter requirement will be eliminated in the future.
- ``Mutect2.m2_extra_args`` -- (optional) a string of additional command line arguments of the form "-argument1 value1 -argument2 value2" for Mutect 2. Most users will not need this.
- ``Mutect2.m2_extra_filtering_args`` -- (optional) a string of additional command line arguments of the form "-argument1 value1 -argument2 value2" for Mutect 2. Most users will not need this.
- ``Mutect2.sequencing_center`` -- (optional) center reporting this variant. Please see ``https://wiki.nci.nih.gov/display/TCGA/Mutation+Annotation+Format+%28MAF%29+Specification+-+v2.4`` for more details.
- ``Mutect2.sequence_source`` -- (optional) ``WGS`` or ``WXS`` for whole genome or whole exome sequencing, respectively. Please note that the controlled vocabulary of the TCGA MAF spec is *not* enforced. Please see ``https://wiki.nci.nih.gov/display/TCGA/Mutation+Annotation+Format+%28MAF%29+Specification+-+v2.4`` for more details.
- ``Mutect2.default_config_file`` -- "(optional) A configuration file that can direct oncotator to use default values for unspecified annotations in the TCGA MAF. This help prevents having MAF files with a lot of ""__UNKNOWN__"" values. An usable example is given below. Here is an example that should work for most users:

```
[manual_annotations]
override:NCBI_Build=37,Strand=+,status=Somatic,phase=Phase_I,sequencer=Illumina,Tumor_Validation_Allele1=,Tumor_Validation_Allele2=,Match_Norm_Validation_Allele1=,Match_Norm_Validation_Allele2=,Verification_Status=,Validation_Status=,Validation_Method=,Score=,BAM_file=,Match_Norm_Seq_Allele1=,Match_Norm_Seq_Allele2=
```
- ``Mutect2.filter_oncotator_maf`` -- (optional, default true) Whether Oncotator should remove filtered variants when rendering the MAF.  Ignored if `run_oncotator` is false.

### Important Note :
- Runtime parameters are optimized for Broad's Google Cloud Platform implementation.
Expand Down