Name		Name	Last commit message	Last commit date
Latest commit History 3,034 Commits
.github		.github
api		api
ingestion_server		ingestion_server
nginx		nginx
postgres		postgres
readme_assets		readme_assets
sample_data		sample_data
.flake8		.flake8
.git-blame-ignore-revs		.git-blame-ignore-revs
.gitattributes		.gitattributes
.gitignore		.gitignore
.isort.cfg		.isort.cfg
.pre-commit-config.yaml		.pre-commit-config.yaml
.prettierignore		.prettierignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
CONTRIBUTORS.md		CONTRIBUTORS.md
DOCUMENTATION_GUIDELINES.md		DOCUMENTATION_GUIDELINES.md
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
justfile		justfile
load_sample_data.sh		load_sample_data.sh
prettier.config.js		prettier.config.js

Repository files navigation

Openverse API

Purpose

This repository is primarily concerned with back end infrastructure like datastores, servers, and APIs. The pipeline that feeds data into this system can be found in the Openverse Catalog repository. A front end web application that interfaces with the API can be found at the Openverse frontend repository.

Getting started

Our quickstart guide and other documentation can be found in our developer docs (or within the repo at ./api/docs/guides/quickstart.md). Our API documentation can also be found at https://api.openverse.engineering.

System architecture

Basic flow of data

Search data is ingested from upstream sources provided by the data pipeline. As of the time of writing, this includes data from Common Crawl and multiple 3rd party APIs. Once the data has been scraped and cleaned, it is transferred to the upstream database, indicating that it is ready for production use.

Every week, the latest version of the data is automatically bulk copied ("ingested") from the upstream database to the production database by the Ingestion Server. Once the data has been downloaded and indexed inside of the database, the data is indexed in Elasticsearch, at which point the new data can be served up from the Openverse API servers.

Description of subprojects

api/: a Django Rest Framework API server For a full description of its capabilities, please see the browsable documentation.
ingestion_server/: a service for downloading and indexing search data once it has been prepared by the Openverse Catalog

Contributing

Pull requests are welcome! Feel free to join us on Slack and discuss the project with the engineers and community members on #openverse.

You are welcome to take any open issue in the tracker labeled help wanted or good first issue; there's no need to ask for permission in advance. Other issues are open for contribution as well, but may be less accessible or well-defined in comparison to those that are explicitly labeled.

See the CONTRIBUTING file for details.

Acknowledgments

Openverse, previously known as CC Search, was conceived and built at Creative Commons. We thank them for their commitment to open source and openly licensed content, with particular thanks to previous team members @ryanmerkley, @janetpkr, @lizadaly, @sebworks, @pa-w, @kgodey, @annatuma, @mathemancer, @aldenstpage, @brenoferreira, and @sclachar, along with their community of volunteers.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Openverse API

Purpose

Getting started

System architecture

Basic flow of data

Description of subprojects

Contributing

Acknowledgments

About

Releases 46

Packages

Contributors 53

Languages

License

WordPress/openverse-api

Folders and files

Latest commit

History

Repository files navigation

Openverse API

Purpose

Getting started

System architecture

Basic flow of data

Description of subprojects

Contributing

Acknowledgments

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases 46

Packages 0

Contributors 53

Languages

Packages