Skip to content

Commit

Permalink
fixup: drop the E_indicator column
Browse files Browse the repository at this point in the history
#36 (comment)

Since we are not using the E_indicator column, drop it.
We have separate steps to calculate the E_coverage column.
  • Loading branch information
j23414 committed Mar 27, 2024
1 parent 8e0157a commit 5d05098
Showing 1 changed file with 2 additions and 3 deletions.
5 changes: 2 additions & 3 deletions ingest/rules/nextclade.smk
Original file line number Diff line number Diff line change
Expand Up @@ -65,13 +65,12 @@ rule concat_nextclade_subtype_results:
nextclade_field=config["nextclade"]["nextclade_field"],
shell:
"""
echo "{params.id_field},{params.nextclade_field},alignmentStart,alignmentEnd,genome_coverage,failedCdses,E_indicator" \
echo "{params.id_field},{params.nextclade_field},alignmentStart,alignmentEnd,genome_coverage,failedCdses" \
| tr ',' '\t' \
> {output.nextclade_subtypes}
tsv-select -H -f "seqName,clade,alignmentStart,alignmentEnd,coverage,failedCdses" {input.nextclade_results_files} \
| awk 'NR>1 {{print}}' \
| awk -F'\t' '$2 && !($6 ~ /E/) {{print $0"\t1"; next}} {{print $0"\t"}}' \
>> {output.nextclade_subtypes}
"""

Expand All @@ -92,7 +91,7 @@ rule append_nextclade_columns:
tsv-join -H \
--filter-file {input.nextclade_subtypes} \
--key-fields {params.id_field} \
--append-fields {params.nextclade_field},alignmentStart,alignmentEnd,genome_coverage,failedCdses,E_indicator \
--append-fields {params.nextclade_field},alignmentStart,alignmentEnd,genome_coverage,failedCdses \
--write-all ? \
{input.metadata} \
> {output.metadata_all}
Expand Down

0 comments on commit 5d05098

Please sign in to comment.