Skip to content
/ DNArSim Public

DNA Archive Simulator (DNArSim) is a memory channel model that simulate the entire DNA Data Storage process: from synthesis to basecalling. This simulator is part of the DnarXiv Project.

Notifications You must be signed in to change notification settings

BHam-1/DNArSim

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 

Repository files navigation

DNA_Data_Storage_Channel_Model

Simulator to simulate the entire DNA Data Storage process (from synthesis to basecalling)

This is the first version of the Channel Model with Memory for DNA Data Storage with Nanopore Sequencing based on Julia (https://julialang.org/downloads/). Julia was used with 1.5.3 version and recent ones should also be compatible (tested until v1.6.2).

After setting Julia correctly, simulations can be launched using next command:

./DNA_data_storage_channel.sh -i ./example/ref.txt -n 100 -o ./example/sim.fastq -k 6

Parameters:

  • -i: Path to the input sequence (should be on fasta format) to simulate. An example of such sequence is available on "example" folder with the name 'ref.txt'. [required]
  • -n: Number of reads to simulate. [required]
  • -o: Simulated sequences output path. Will be presented in a fastq format (without included scores). [required]
  • -k: Channel memory length. fixed to k=6 by default [recommended]

Error Profile:

Error profiles were computed using DNA data storage data which went through multiple steps including:

  • Synthesis: chemical technique, with oligonucleotide assembling called "GeneArt Strings DNA Fragments" and done by Thermofisher.
  • Sequencing: MinIon sequencer with "R9.4_180mv_450bps" nanopore model
  • Basecalling: Guppy v5.0.7 using "super-accurate" mode.

More documentation will be added as soon as possible. Meanwhile feel free to reach me by e-mail (belaid.hamoum@gmail.com) for more details.

About

DNA Archive Simulator (DNArSim) is a memory channel model that simulate the entire DNA Data Storage process: from synthesis to basecalling. This simulator is part of the DnarXiv Project.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published