- git clone https://github.com/ian-jooble/jooble-ir-2018-rock.git
- pip install requrements.txt
- Run services mentioned above python "server_name".py
- python client.py
Port: port 13565
Input paramets:
- query - query from user
- max_docs - maximum number of documents returned by the search engine
Actions:
- import analyzer -> create prepared query (list of terms).
- request to service ReverseIndex -> receive json with key "processed_data" - dictionary with docID's and texts.
- request to service Snippets -> receive json with key "processed_data" - dictionary with docID's and snippets.
Return:
- documents - dictionary with docID's and snippets.
Port: 13538
Input:
- data - list of terms
- max_docs - maximum number of documents returned by the search engine.
Actions:
- You need to create inverted_index from documents, idf and normilized_tf_idf before starting to work with a search engine.
- request to service Ranking -> receive json with key "ranked" - list with ranked docIDs.
Return: processed_data - dictionary with docID's and texts (number of docID == max_docs). {'id': docID, 'text': text}
Port: 13541
Before using this service for ranking you must create idf and normalized tf_idf from inverted index by using ReverseIndex Service.
Input:
- documents - list with relevant docID
- words - list with tokens from the query.
Return: ranked - list with ranked docIDs
Port: 13542
Input:
- data - dictionary with docID's and texts (number of docID == max_docs).
Return: processed_data - dictionary with docID's and snippets. {'id': docID, 'snippet': snippet}
Port: 13560
You can change 'max_docs' here. Actions:
- import SpellChecker -> checks query for mistakes and returns correct query if any were found.
- request to service Search -> recieve json with key "documents" - dictionary with docID's and snippets.