GitHub - gifford-lab/deepsea-predict-docker: Docker image for running DeepSEA for predicting

Docker image for running DeepSEA (v0.94, June 1, 2016) for predicting, without retraining. As the default in the original code, it runs on the CPU.

If you are looking for a Docker image that trains a new DeepSEA model on customized data, check out our other repo.

Dependencies

Docker

Predict sequence-derived features

Here we show how to predict whether a list of 1001 bp sequences (in FASTA format) reflect the sequence preferences learned from 919 histone mark, DNase-seq, and TF ChIP-seq experiments.

To run on the example data the DeepSEA authors provided:

 docker pull haoyangz/deepsea-predict-docker
 docker run -v $(pwd)/output:/output --rm haoyangz/deepsea-predict-docker python rundeepsea.py examples/deepsea/example.fasta /output

The output will be saved under a folder output in the current directory.

To predict on your own FASTA-formatted sequences:
```
 docker pull haoyangz/deepsea-predict-docker
 docker run -v $FULL_PATH_TO_FA_FILE$:/infile.fasta  -v $FULL_PATH_TO_OUTPUTDIR$:/output --rm haoyangz/deepsea-predict-docker python rundeepsea.py /infile.fasta /output
```
- FULL_PATH_TO_FA_FILE: the full path to input FASTA file
- FULL_PATH_TO_OUTPUTDIR: the full path to output directory
This mounts FULL_PATH_TO_FA_FILE as /infile.fasta and FULL_PATH_TO_OUTPUTDIR as /output to the docker container, then executes rundeepsea.py with /infile.fasta and /output as arguments. We include a example.fasta in the directory for you to try.

Other usage

For more usage, please refer to the original README in DeepSEA-0.94. Simply change the part of the code above after haoyangz/deepsea-predict-docker to match with the functionality.

Note that the DeepSEA code will determine what to do based on the suffix of the input file. If the DeepSEA functionality you use takes file format other than FASTA, for instance VCF file for scoring sequence variants, you will need to change both infile.fasta in the previous example to match the suffix. For instance if you are scoring a VCF file, you might want to run:

docker run -v $FULL_PATH_TO_FA_FILE$:/infile.vcf  -v $FULL_PATH_TO_OUTPUTDIR$:/output --rm haoyangz/deepsea-predict-docker python rundeepsea.py /infile.vcf /output

Issues

The Torch library is compiled for modern systems may not compatible with all target machines. If you want to run this image on a machine that (for instance) does not support AVX instructions, you must rebuild the image on your machine using these steps:

cd rebuild_torch
docker build -t deepsea-predict-docker .

You can then use the local image deepsea-predict-docker as detailed in the earlier examples. You can also accomplish this in one step with a wrapper script we have provided, but the (slow) rebuilding process will not be reused from one run to another. Here is an example of this usage:

docker run --rm haoyangz/deepsea-predict-docker /root/torch/rebuild_torch.sh python rundeepsea.py examples/deepsea/example.fasta /output

License

The original code is from the Troyanskaya lab, see here.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
rebuild_torch		rebuild_torch
Dockerfile		Dockerfile
README.md		README.md
README_ori		README_ori
example.fasta		example.fasta

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dependencies

Predict sequence-derived features

Other usage

Issues

License

About

Releases

Packages

Languages

gifford-lab/deepsea-predict-docker

Folders and files

Latest commit

History

Repository files navigation

Dependencies

Predict sequence-derived features

Other usage

Issues

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages