Base level alignment #26

ksahlin · 2022-06-01T10:43:08Z

Note to developer:

The extension step (nucleotide-level alignment) is the bottleneck in strobealign. There are three different ways to reduce this:

Direction 1 (change the alignment module):
1. Change to base-level alignment with WFA (WFA publ), as is done in Accelalign.
Direction 2 (speed up the current module, SSW):
1. By using 8-bit slots in the alignment matrix?
2. By not computing the alignment twice (does SSW do this?)
Direction 3 (partitioned SSW):
1. Finish implementing partitioned SW (split the alignment into several small Hamming or SW alignments) if seeds are in the middle.
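
To make Direction 3 concrete, here is a minimal Python sketch of the partitioning idea. It is not strobealign code: the function names, the `(read_start, ref_start, length)` anchor format, and the `max_mismatch_rate` threshold are all assumptions made for illustration, and a plain edit-distance DP stands in for the SW fallback.

```python
def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def edit_distance(a, b):
    """Classic O(len(a) * len(b)) DP; a stand-in for the SW fallback."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # gap in b
                           cur[j - 1] + 1,             # gap in a
                           prev[j - 1] + (ca != cb)))  # match or mismatch
        prev = cur
    return prev[-1]


def partitioned_cost(read, ref, anchors, max_mismatch_rate=0.2):
    """Cost of aligning read to ref when exact-match seeds are kept fixed.

    anchors: hypothetical list of (read_start, ref_start, length) tuples,
    sorted and non-overlapping. Only the segments between seeds are aligned.
    """
    segments = []
    q = r = 0
    for qs, rs, length in anchors:
        segments.append((read[q:qs], ref[r:rs]))
        q, r = qs + length, rs + length
    segments.append((read[q:], ref[r:]))

    cost = 0
    for seg_q, seg_r in segments:
        if len(seg_q) == len(seg_r):
            mismatches = hamming(seg_q, seg_r)
            # Cheap path: equal lengths and few mismatches, so no DP is needed.
            if mismatches <= max_mismatch_rate * len(seg_q):
                cost += mismatches
                continue
        # Expensive path: only this (small) segment goes through DP.
        cost += edit_distance(seg_q, seg_r)
    return cost
```

The point of the sketch is that the DP matrix is only ever filled for the short stretches between seeds, and not at all when a stretch passes the Hamming check.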

ksahlin · 2022-09-09T09:58:02Z

I think WFA is worth exploring, as we currently rely on SSW, which has its issues as a local aligner, as mentioned in issue #54. The method seems to have great performance; see Table 2 in the WFA paper for timings compared to e.g. SeqAn and ksw2. Furthermore, the WFA2 implementation is mature in terms of providing different penalty models (including dual gap cost penalties!), traceback CIGAR, etc.
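
For readers unfamiliar with the wavefront idea, the following Python sketch computes plain edit distance by tracking, for each score, the furthest-reaching point on every diagonal. It is a simplified unit-cost illustration only: it is not the WFA2 API, it does not implement gap-affine or dual gap penalties, and the function name is made up for this example.

```python
def wavefront_edit_distance(a, b):
    """Global edit distance between a and b, computed wavefront-style.

    wf maps a diagonal k = j - i to the furthest position j in b that is
    reachable on that diagonal with the current score.
    """
    n, m = len(a), len(b)
    final_k = m - n
    wf = {0: 0}
    score = 0
    while True:
        # Extend: slide along each diagonal for free while characters match.
        for k, j in list(wf.items()):
            i = j - k
            while i < n and j < m and a[i] == b[j]:
                i += 1
                j += 1
            wf[k] = j
        if wf.get(final_k, -1) == m:
            return score
        # Next: spend one unit of cost to derive the wavefront for score + 1.
        nxt = {}
        for k in range(min(wf) - 1, max(wf) + 2):
            candidates = []
            if k in wf:
                candidates.append(wf[k] + 1)      # mismatch
            if k - 1 in wf:
                candidates.append(wf[k - 1] + 1)  # insertion (consumes b[j])
            if k + 1 in wf:
                candidates.append(wf[k + 1])      # deletion (consumes a[i])
            # Keep only points that lie inside the alignment matrix.
            valid = [j for j in candidates if 0 <= j <= m and 0 <= j - k <= n]
            if valid:
                nxt[k] = max(valid)
        wf = nxt
        score += 1
```

The work done grows with the final score and not with the full matrix size, which is why this family of methods is attractive for reads that are close to the reference.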

Also, in strobealign the extension step is over 50% of the runtime for many biological datasets; see the 'aln' field for BIO150 and BIO250 in the attached figure for extension with SSW.

ekg · 2023-02-08T15:17:19Z

This will also make it easy to scale to longer sequences, provided your seed chaining can do it. BiWFA uses space on the order of the divergence and is consequently very cache coherent and actually fast even for ~500bp sequences.

ksahlin · 2023-02-08T21:06:35Z

That's a good point. As mentioned in #24, extending to alignment of long reads is one objective. Strobealign's seeds should be very suitable for long reads, so BiWFA may be the way to go right from the start. We are currently exploring WFA (#229), but we have yet to find a way to use the library efficiently compared to SSW.

ksahlin added the optimization label Jun 1, 2022

ksahlin mentioned this issue Jan 29, 2023: Enabling AVX2 #214 (Closed)

marcelm mentioned this issue Feb 6, 2023: Various refactorings #228 (Merged)
