Aggregator

	This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 857201. The European Commission assumes no responsibility for any content of this repository.

Aggregator

Aggregates content metadata and stream URLs from various sources.

What is this?

The Aggregator service is part of the platform for media caching on trains. Based on given criteria, the Aggregator retrieves content metadata and resource locations of media items in video on demand (VoD) catalogues. Criteria are determined by a human user, e.g. editiorial staff of the VoD service, through a user interface for configuration. For adaptive bitrate streams the Aggregator will retrieve locations of stream segements of all qualities. From all retrieved content information, the Aggregator compiles a list of media items, which it passes to the State API. The State API will initialise a new cache state, which is used to capture the caching status of the individual media items. The State API will than notify subscribers (Cache Monitor and Prefetcher) that a new cache state has been instantiated.

How does it work?

Below architecture diagram shows the software modules that implement the essential functionalities of the Aggregator. These include clients for HTTP communication with the State API and communication with the Message Streamer. A core logic module, which is implemented in aggregator.service.ts, manages different crawler modules, which search different sources for content according to the rules given in the configuration. Currently, two crawlers are implemented:

ard-core-crawler.ts: searches the ARD core database for the latest publications. Note: the ARD-Core service is under constand development and not stable. Its API is subject to changes, the service does not always respond as documented and is sometimes not reachable at all.
ard-mediathek-crawler.ts: searches the homepage of the ARD-Mediathek. It follows links on the home page, as well as links on the corresponding sub-pages, until the number of videos specified by the configuration rules is found. For this purpose the script parses the JSON files loaded by the ARD-Mediathek application. JSON files contain information about the playback media including descriptive meta data, as well as resource locations.

The basic program flow of the core logic in aggregator.service.ts is as follows:

Set listener for 'new-aggregator-config' messages
On reception of a 'new-aggregator-config' message:
- Cancel running crawl tasks
- Load new configuration
- Find information on media items that match configured rules (task of crawlers)
- Remove duplicate media items
- Remove stream URLs for unwanted video formats (currently only URLs to HLS streams are kept)
- For each media item, parse the manifest file of adaptive bitrate streams (currently only HLS) and segement URLs to the list of stream URLs of the media item
- Send list of media items to State API in order for it to instantiate a new cache state

Technologie used

Nest.js

Adapt to other VoD-Services

At the moment the Aggregator is specialised to find content of the ARD-Mediathek. In order to adapt the Aggregator for other VoD services, appropriate crawlers must be implemented, which than need to be integrated in aggregator.service.ts in order to serve a given configuration.

Install, build, run

Note: Typically you would use the up.sh script from the Platform project to install, build and run this service as part of a composite of docker services. Read on if you intend to run the service directly on your host system.

Prerequestits: Following software needs to be installed on your host machine in order to execute the subsequent steps.

Node.js
NPM

First, git clone this project and change into its root directory. Than run the following command to install its dependencies:

$ npm install

You can than run the service in three different modes.

# development
$ npm run start

# watch mode
$ npm run start:dev

# production mode
$ npm run start:prod

With following command you can build a docker image for this service. But again, typically you use the startup script up.sh of the Platform project to do the job.

$ docker build -t 5gv-aggregator .

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
doc/images		doc/images
src		src
test		test
.gitignore		.gitignore
.prettierrc		.prettierrc
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
nest-cli.json		nest-cli.json
package-lock.json		package-lock.json
package.json		package.json
tsconfig.build.json		tsconfig.build.json
tsconfig.json		tsconfig.json
tslint.json		tslint.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Aggregator

What is this?

How does it work?

Technologie used

Adapt to other VoD-Services

Install, build, run

About

Releases

Packages

Languages

IRT-Open-Source/5gv-aggregator

Folders and files

Latest commit

History

Repository files navigation

Aggregator

What is this?

How does it work?

Technologie used

Adapt to other VoD-Services

Install, build, run

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages