Skip to content

Latest commit

 

History

History
58 lines (29 loc) · 2.23 KB

main.md

File metadata and controls

58 lines (29 loc) · 2.23 KB

Report, created 2023-04-12 08:18

Note: This pipeline is under development, and the results may be wrong or inaccurate.

Accuracy vs read length

Accuracy is measured as the F1 score and shown for different read lengths and single/paired end reads. Further divided into reads containing and not containing genomic variants. A read contains a variant is it has been simulated so that it contains a variant allele not in the reference genome.

Link to plot data

Precision/recall for various MAPQ thresholds

Recall and 1-Precision is shown for different subsets of the reads, depending on the MAPQ-score (shown as dots in the plot). For every MAPQ-score, only reads with that score or higher are included.

Link to plot data

Accuracy for different error profiles

Showing read mapping accuracy for reads simulated with different error profiles (here described as low, medium and high error rate).

Link to plot data

Variant calling accuracy

Link to plot data

Peak calling accuracy

Peaks are simulated using Chips. The accuracy is the number of peaks found after calling peaks by MACS and selecting the top N peaks where N is the same number of peaks as in the simulated data.

Link to plot data

Runtime as a function of number of threads used

Shown for single-end and paired-end reads and different read lengths.

Link to plot data

Memory usage

Memory usage when mapping 2 million reads using 4 threads to hg38.

Link to plot data