Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIP: Use metadata-only filtering in subsample jobs #571

Closed
wants to merge 3 commits into from

Commits on Apr 14, 2021

  1. Merge exclusion files on the fly in augur filter

    Augur filter's new metadata-only interface will allow users to pass
    multiple inputs to the `--exclude` argument, internally deduplicating
    these strain lists. This new interface eliminates the need for a
    separate Snakemake rule to cat the exclusion files.
    huddlej committed Apr 14, 2021
    Configuration menu
    Copy the full SHA
    9bd5089 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    70707be View commit details
    Browse the repository at this point in the history
  3. Use metadata-only filtering for subsampling

    Replaces FASTA outputs with strain list outputs for the subsample rule
    such that sequence data are not inspected during most subsampling steps.
    The exception to the rule are subsampling jobs that require a priority
    score calculation that depends on the FASTA sequence of another
    subsampled group. To handle this exception, we add a new rule to extract
    just those subsampled sequences.
    
    Finally, we collect subsampled sequences into a single deduplicated
    FASTA output using augur filter's new interface with the `--exclude-all`
    flag and multiple input support for `--include`.
    
    Note that this commit also updates the conda environment to use a GitHub
    branch instead of an official augur release.
    huddlej committed Apr 14, 2021
    Configuration menu
    Copy the full SHA
    102a0e2 View commit details
    Browse the repository at this point in the history