DKRZ-CERA

This package provides an interface to the CERA database of the DKRZ (Deutsches Klimarechenzentrum). This allows the user to scrape the database for CMIP data, for example, and prepare files for the remote download via Jblob, a program written in Java and provided by the DKRZ.

Installation

Clone this repo via

git clone https://github.com/markusritschel/dkrz-cera

Then, in the new directory (cd dkrz-cera/) install the package via:

pip install .

or via

pip install -e .

if you plan on making changes to the code.

Alternatively, install directly from GitHub via

pip install 'git+https://github.com/markusritschel/dkrz-cera.git'

Usage

The database can be scraped by creating an instance of the Cera class and using it's search method:

from dkrz_cera import Cera
cera = Cera()
cera.search(variable_s='tas', model_s='ACCESS1-0', qc_experiment_s='historical')

This yields a CeraQuery object, which itself provides a tabular view of the request results by the .df attribute, as well as a method to_jblob(), which creates a bash file executable by Jblob for eventually downloading the datasets from CERA. When running this bash file, a directory structure gets created according to the CMIP standards, i.e.

<activity>/
    <product>/
        <institute>/
            <model>/
                <experiment>/
                    <frequency>/
                        <modeling realm>/
                            <MIP table>/
                                <ensemble member>/
                                    <version number>/
                                        <variable name>/

This structure is created by hands of the entry_name_s provided in each dataset. (However, in some exceptions it can happen that this entry_name_s value is not complying the CMIP standards.)

The data sets get automatically downloaded into the respective directory. While some of them are already downloaded as netCDF files, others exist primarily as zip files and still need to be extracted. This can be done by using the function unzip_files() which takes the root path (parent of the <activity> directory) as a mandatory argument.

Testing

Run make tests in the source directory to test the code. This will execute both the unit tests and docstring examples (using pytest).

Run make lint to check code style consistency.

ToDos

Routine for scraping the CERA database based on multiple keywords
sort files depending on configuration file => creates directory structure automatically during jblob download
create intake-esm catalog files => this will be implemented in another package
Try to validate CERA credentials if present
Retrieve CERA credentials either from .env file or from environment variable via os.getenv('CERA_USER')
implement click for command line tooling

Maintainer

markusritschel

Contact & Issues

For any questions or issues, please contact me via git@markusritschel.de or open an issue.

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
.github		.github
docs		docs
notebooks		notebooks
src/dkrz_cera		src/dkrz_cera
tests		tests
.cruft.json		.cruft.json
.gitignore		.gitignore
AUTHORS.md		AUTHORS.md
CHANGELOG.md		CHANGELOG.md
CITATION.cff		CITATION.cff
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
environment.yml		environment.yml
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
test_environment.py		test_environment.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DKRZ-CERA

Installation

Usage

Testing

ToDos

Maintainer

Contact & Issues

About

Releases

Packages

Languages

License

markusritschel/dkrz-cera

Folders and files

Latest commit

History

Repository files navigation

DKRZ-CERA

Installation

Usage

Testing

ToDos

Maintainer

Contact & Issues

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages