Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature request: more options for filtering database during creation #25

Closed
joelnitta opened this issue May 19, 2022 · 0 comments · Fixed by #29
Closed

Feature request: more options for filtering database during creation #25

joelnitta opened this issue May 19, 2022 · 0 comments · Fixed by #29
Assignees

Comments

@joelnitta
Copy link
Contributor

Currently, the only options for limiting the size of the database are:

  1. preselection argument of db_download(), which allows for selection of GenBank "division" (i.e., plant, bacterial, invertebrate, etc)
  2. min_length argument of db_create()
  3. max_length argument of db_create()

This does not work well if someone is only interested in a clade (e.g., ferns) within one of the larger divisions (e.g., plants; ca. 800gb), as the local database is much larger than needed and therefore slow.

So what I propose is a way to limit the database during creation by ID (i.e., GenBank accession number). This is very similar to the example given for extracting data from the database, but instead of returning data to R, it would reduce the size of the external database created. That way, future queries would run on a smaller database and go faster.

Another idea would be to limit the database by taxonomic level, but I am not sure if that is possible with the information available during parsing of the files downloaded from GenBank.

@joelnitta joelnitta changed the title Feature request: allow manipulation of database Feature request: more options for filtering database during creation May 19, 2022
@joelnitta joelnitta self-assigned this May 27, 2022
joelnitta added a commit that referenced this issue May 30, 2022
joelnitta added a commit that referenced this issue Jun 2, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant