Skip to content

Latest commit

 

History

History
 
 

seedDB

cBioPortal Seed Database

These files are MySQL database dump files for seeding a new instance of the cBioPortal database. They contain all the necessary background data for a properly functioning cBioPortal website, including cancer types, genes, uniprot-mappings, drug and network data.

The database schema and cBioPortal release follows different numbering cycles since cBioPortal 1.5.0 and database schema 2.1.0. This means that the version numbers won't be identical. cBioPortal 1.9.0 with database schema 2.4.0 removed PDB annotations from the database.

Latest seed database

Seed database schema 2.8.2

This schema is required for cBioPortal release versions:

  • 2.2.0

When using a cBioPortal release version > 2.2.0, a migration step to a new database schema might be required. The migration process is described here.

Schema 2.8.2: SQL file with create table statements
Seed database: seed-cbioportal_hg19_v2.8.2.sql.gz
md5sum f070b735324560e45af91f2737e99546

Contents of seed database:

  • Entrez Gene IDs, HGNC symbols and gene aliases updated in December 2018 from NCBI (miRNA removed)
  • Gene lengths retrieved from Gencode Release 29 (mapped to GRCh37)
  • Pfam graphics fetched in August 2017
  • Gene Sets from MSigDB 6.1
  • Cancer Types from OncoTree (fetched December 2018 from http://oncotree.mskcc.org)

Previous seed databases

Seed database schema 2.7.3

This schema is required for cBioPortal release versions:

  • 2.0.0

When using a release version > 2.0.0, a migration step to a new database schema might be required. The migration process is described here.

Schema 2.7.3: SQL file with create table statements
Seed database: seed-cbioportal_hg19_v2.7.3.sql.gz
md5sum 85444ce645104dbc00610fc1f15e8c7a

Contents of seed database:

Seed database schema 2.7.2

This schema is required for cBioPortal release versions:

  • 1.18.0

When using a release version > 1.18.0, a migration step to a new database schema might be required. The migration process is described here.

Schema 2.7.2: SQL file with create table statements
Seed database: seed-cbioportal_hg19_v2.7.2.sql.gz
md5sum b0a4e11b94d00a7291129c30ee4e0f70

Contents of seed database:

Seed database schema 2.6.0

This schema is required for cBioPortal release versions:

  • 1.12.x
  • 1.13.x
  • 1.14.0

When using a release version > 1.14.0, a migration step to a new database schema might be required. The migration process is described here.

Schema 2.6.0: SQL file with create table statements
Seed database: seed-cbioportal_hg19_v2.6.0.sql.gz
md5sum aafc9da7b72a29f3978ddca31004b8f5

Contents of seed database:

Seed database schema 2.4.0

This schema is required for cBioPortal release versions:

  • 1.9.0

When using a release version > 1.9.0, a migration step to a new database schema might be required. The migration process is described here.

Schema 2.4.0: SQL file with create table statements
Seed database : seed-cbioportal_hg19_v2.4.0.sql.gz
md5sum 1014ed1f9d72103f2b46e5615aacbc2f

Contents of seed database:

  • Entrez Gene IDs, HGNC symbols and aliases updated in August 2017 from NCBI
  • Gene lengths retrieved from Gencode Release 26 (mapped to GRCh37)
  • Pfam graphics fetched in August 2017

Seed database schema 2.3.1

This schema is required for cBioPortal release versions:

  • 1.7.1
  • 1.7.2
  • 1.7.3
  • 1.8.0

When using a release version > 1.8.0, a migration step to a new database schema might be required. The migration process is described here.

Schema 2.3.1: SQL file with create table statements
Seed database part1 (no PDB tables): seed-cbioportal_hg19_v2.3.1.sql.gz
md5sum 324be3d975d22019ee0c82ce0542bcc3
Seed database part2 (optional, only PDB tables): seed-cbioportal_hg19_v2.3.1_only-pdb.sql.gz
md5sum 5774a7947cdf5ef78fd737f1bea688cc

Contents of seed database:

  • Entrez Gene Ids, Hugo symbols and aliases updated in August 2017 from NCBI
  • Gene lengths retrieved from Gencode Release 26 (mapped to GRCh37)
  • Pfam graphics fetched in August 2017

Seed database schema 2.1.0

This schema is required for older cBioPortal release versions:

  • 1.5.0
  • 1.5.1
  • 1.5.2

When using this older seed database with a release version > 1.5.2, a migration step to a new database schema is required. The migration process is described here.

Schema 2.1.0: SQL file with create table statements
Seed database part1 (no PDB tables): seed-cbioportal_hg19_v2.1.0.sql.gz
md5sum fe4e8502034f72f182733a72b50dbbc8
Seed database part2 (optional, only PDB tables): seed-cbioportal_hg19_v2.1.0_only-pdb.sql.gz
md5sum 5774a7947cdf5ef78fd737f1bea688cc

Contents of seed database:

  • Entrez Gene Ids, Hugo symbols and aliases updated in September 2016 from NCBI
  • Gene lengths retrieved from Gencode Release 25 (mapped to GRCh37)
  • Pfam graphics fetched in September 2016

For developers

Updating the seed database for Datahub is described here.