Skip to content

Scripts and files relevant to reproduce Lubock et al. 2017

License

Notifications You must be signed in to change notification settings

Gaodelike/errorCorrect

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A systematic comparison of error correction enzymes by next-generation sequencing

This repository contains everything needed (sans data) to generate our paper, which is openly availible on NAR. The requisite NGS data is availible at the Sequencing Read Archive

Running make all in analysis will process the raw data and generate the manuscript. The Makefile is also compatible with parallel builds.

Please note that our pipeline will use a significant amount of RAM, and will generate ~8Gb of intermediate files. Much of this comes from running the pipeline in parallel (especially the filtering steps).

Subdirectories

analysis: contains polished scripts, markdown documents, and figures

rawData: location of sequencing reads and scripts to pull them from the relevant repos

explorations: early exploratory data analysis and outdated/dead end scripts

Necessary Tools

  • R and (tidyverse, broom, stringr, magrittr, data.table, dtplyr, scale, grid, gridExtra)
  • Python 2.7
  • uta-align or equivalently
  • GNU Make
  • latexmk and a reasonable LaTeX installation (ex TexLive)
  • BBTools
  • Bowtie2 (although not critical)

About

Scripts and files relevant to reproduce Lubock et al. 2017

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • TeX 56.7%
  • Python 34.3%
  • Makefile 6.6%
  • Shell 2.4%