-
Notifications
You must be signed in to change notification settings - Fork 129
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #1429: filter: Add --max-length option
- Loading branch information
Showing
10 changed files
with
106 additions
and
61 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -4,6 +4,7 @@ | |
|
||
SEQUENCE_ONLY_FILTERS = ( | ||
"min_length", | ||
"max_length", | ||
"non_nucleotide", | ||
) | ||
|
||
|
13 changes: 0 additions & 13 deletions
13
tests/functional/filter/cram/filter-min-length-no-sequence-index-error.t
This file was deleted.
Oops, something went wrong.
17 changes: 0 additions & 17 deletions
17
tests/functional/filter/cram/filter-min-length-output-metadata.t
This file was deleted.
Oops, something went wrong.
17 changes: 0 additions & 17 deletions
17
tests/functional/filter/cram/filter-min-length-output-strains.t
This file was deleted.
Oops, something went wrong.
28 changes: 28 additions & 0 deletions
28
tests/functional/filter/cram/filter-no-sequence-index-error.t
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,28 @@ | ||
Setup | ||
|
||
$ source "$TESTDIR"/_setup.sh | ||
|
||
Try to filter using only metadata without a sequence index. | ||
|
||
These should fail because the requested filters rely on sequence information. | ||
|
||
$ ${AUGUR} filter \ | ||
> --metadata "$TESTDIR/../data/metadata.tsv" \ | ||
> --min-length 10000 \ | ||
> --output-strains filtered_strains.txt > /dev/null | ||
ERROR: You need to provide a sequence index or sequences to filter on sequence-specific information. | ||
[2] | ||
|
||
$ ${AUGUR} filter \ | ||
> --metadata "$TESTDIR/../data/metadata.tsv" \ | ||
> --max-length 10000 \ | ||
> --output-strains filtered_strains.txt > /dev/null | ||
ERROR: You need to provide a sequence index or sequences to filter on sequence-specific information. | ||
[2] | ||
|
||
$ ${AUGUR} filter \ | ||
> --metadata "$TESTDIR/../data/metadata.tsv" \ | ||
> --non-nucleotide \ | ||
> --output-strains filtered_strains.txt > /dev/null | ||
ERROR: You need to provide a sequence index or sequences to filter on sequence-specific information. | ||
[2] |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,18 @@ | ||
Setup | ||
|
||
$ source "$TESTDIR"/_setup.sh | ||
|
||
Filter using --min-length and --max-length. | ||
|
||
$ ${AUGUR} filter \ | ||
> --sequence-index "$TESTDIR/../data/sequence_index.tsv" \ | ||
> --metadata "$TESTDIR/../data/metadata.tsv" \ | ||
> --min-length 10500 \ | ||
> --max-length 10700 \ | ||
> --output-strains filtered_strains.txt | ||
7 strains were dropped during filtering | ||
1 had no metadata | ||
1 had no sequence data | ||
2 were dropped because they were shorter than the minimum length of 10500bp when only counting standard nucleotide characters A, C, G, or T (case-insensitive) | ||
3 were dropped because they were longer than the maximum length of 10700bp when only counting standard nucleotide characters A, C, G, or T (case-insensitive) | ||
6 strains passed all filters |