Feature request: more options for filtering database during creation #25

joelnitta · 2022-05-19T06:22:33Z

Currently, the only options for limiting the size of the database are:

preselection argument of db_download(), which allows for selection of a GenBank "division" (e.g., plant, bacterial, invertebrate)
min_length argument of db_create()
max_length argument of db_create()

This does not work well if someone is only interested in a clade (e.g., ferns) within one of the larger divisions (e.g., plants; ca. 800 GB), because the local database is then much larger than needed and therefore slow to query.

I propose a way to limit the database during creation by ID (i.e., GenBank accession number). This is very similar to the example given for extracting data from the database, but instead of returning data to R, it would reduce the size of the external database that is created. Future queries would then run on a smaller database and be faster.
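To make the proposal concrete, here is a minimal, language-agnostic sketch (written in Python, not the package's R code) of what filtering by accession during creation could look like. The function name `filter_records`, the record layout, and the placeholder accessions are all hypothetical. The `acc_filter` and `invert` names follow the commit title referenced in this issue, but their behavior here is an assumption, not the actual implementation.

```python
from typing import Iterable, Iterator, Optional


def filter_records(
    records: Iterable[dict],
    acc_filter: Optional[Iterable[str]] = None,
    invert: bool = False,
    min_length: int = 0,
    max_length: Optional[int] = None,
) -> Iterator[dict]:
    """Yield only the records that should be written to the local database.

    Hypothetical sketch: each record is a dict with "accession" and
    "sequence" keys, standing in for one parsed GenBank entry.
    """
    wanted = set(acc_filter) if acc_filter is not None else None
    for rec in records:
        n = len(rec["sequence"])
        # Length limits, analogous to the existing min_length / max_length options.
        if n < min_length or (max_length is not None and n > max_length):
            continue
        if wanted is not None:
            listed = rec["accession"] in wanted
            # Assumed semantics: invert=False keeps listed accessions,
            # invert=True drops them and keeps everything else.
            if listed == invert:
                continue
        yield rec


# Placeholder records; the accessions are made up for illustration.
records = [
    {"accession": "ACC0001", "sequence": "ACGTACGT"},
    {"accession": "ACC0002", "sequence": "ACGT"},
    {"accession": "ACC0003", "sequence": "ACGTACGTACGT"},
]

kept = [r["accession"] for r in filter_records(records, acc_filter=["ACC0001", "ACC0003"])]
dropped = [r["accession"] for r in filter_records(records, acc_filter=["ACC0001"], invert=True)]
```

Because the filter runs while records are being parsed, entries that are filtered out are never written, so the resulting database only holds the clade of interest.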

Another idea would be to limit the database by taxonomic level, but I am not sure if that is possible with the information available during parsing of the files downloaded from GenBank.

joelnitta changed the title ~~Feature request: allow manipulation of database~~ Feature request: more options for filtering database during creation May 19, 2022

joelnitta self-assigned this May 27, 2022

joelnitta added a commit that referenced this issue May 30, 2022

Add acc_filter and invert args to db_create

ba253a4

Fixes #25

joelnitta mentioned this issue May 30, 2022

Joelnitta/issue25 #27

Closed

joelnitta added a commit that referenced this issue Jun 2, 2022

Add acc_filter and invert args to db_create

0e88e9e

Fixes #25

joelnitta mentioned this issue Jun 2, 2022

Joelnitta/issue25 #29

Merged

joelnitta closed this as completed in #29 Jun 2, 2022
