Name		Name	Last commit message	Last commit date
parent directory ..
demo_coding_vs_intergenomic_seqs		demo_coding_vs_intergenomic_seqs
demo_human_or_worm		demo_human_or_worm
drosophila_enhancers_stark		drosophila_enhancers_stark
dummy_mouse_enhancers_ensembl		dummy_mouse_enhancers_ensembl
human_enhancers_cohn		human_enhancers_cohn
human_enhancers_ensembl		human_enhancers_ensembl
human_ensembl_regulatory		human_ensembl_regulatory
human_nontata_promoters		human_nontata_promoters
human_ocr_ensembl		human_ocr_ensembl
README.md		README.md

README.md

Datasets documentation

This folder contains scripts and notebooks that have been used for creation of benchmarks in dataset folder. The format and the process of contribution a new benchmark is specified there.

To make the process of benchmarks creation reproducible, please, try to stick to the following principles:

Fix the random seeds so your script produce the same dataset when calling repeatedly
Clean up temporary files, especially if they were created inside the package structure (so they will not be accidentally pushed to GitHub)
It is ok to import packages that are not contained in requirements.txt but avoid adding unnecessary dependencies
For Jupyter Notebook, it might be a good idea to rerun everything at the end using Kernel -> Restart & Run All
Make sure that the created benchmark can be read, i.e. run the following code

    from genomic_benchmarks.loc2seq import download_dataset
    download_dataset("YOUR_BENCHMARK_NAME")

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs

docs

README.md

Datasets documentation

Files

docs

Directory actions

More options

Directory actions

More options

Latest commit

History

docs

Folders and files

parent directory

README.md

Datasets documentation