exact_match
CMakeLists.txt
README.md
binding.cpp
eval.cpp
eval.h
eval.py
eval_models.py
requirements.txt
run.sh
run_model.sh

README.md

Accuracy Evaluation

xFasterTransformer uses a different model format than Hugging Face. This module evaluates the accuracy of the xFasterTransformer inference engine on CPU against standard datasets.

Step 1: Compile xFT with the evaluation module enabled.

    cmake -DXFT_BUILD_EVALUATION=ON ..

Step 2: Download the datasets locally (in case the Hugging Face Hub cannot be reached).

    wget https://openaipublic.blob.core.windows.net/gpt-2/data/lambada_test.jsonl
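Before pointing the scripts at a downloaded file, it can help to confirm that the download is complete and parses cleanly. The following is a minimal sketch, not part of this module: `load_jsonl` is a hypothetical helper, and it assumes only that the file is in JSON Lines format (one JSON object per line).

```python
import json
from pathlib import Path


def load_jsonl(path):
    """Parse a JSON Lines file into a list of objects, skipping blank lines.

    Hypothetical helper for sanity-checking a downloaded dataset file.
    Raises ValueError naming the first line that is not valid JSON.
    """
    records = []
    with Path(path).open(encoding="utf-8") as f:
        for line_no, line in enumerate(f, start=1):
            line = line.strip()
            if not line:
                continue  # tolerate empty lines, e.g. a trailing newline
            try:
                records.append(json.loads(line))
            except json.JSONDecodeError as exc:
                raise ValueError(
                    f"{path}:{line_no}: invalid JSON ({exc.msg})"
                ) from exc
    return records
```

Calling `load_jsonl("lambada_test.jsonl")` and printing the length of the result gives a quick check that the file was not truncated in transit.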

Step 3: Run the test script corresponding to the model.

Modify the configuration parameters in the script, then run it. The accuracy report and dump files are written to the output directory.

Parameter	Use
TOKEN_NAME	Config files and tokenizer models from Hugging Face
MODEL_NAME	Model weights in xFT format
TRUST_REMOTE_CODE	Set to True for ChatGLM and Baichuan family models
TASKS	Dataset names, such as lambada_openai or boolq
DATA_FILES	Local path of the JSON-type data files corresponding to TASKS
LIMIT	For testing only. Set to 0 when measuring real metrics
BATCH_SIZE	Batch size

    sh run.sh 1 48 run_model.sh
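A quick pre-flight check of the parameters can catch common mistakes, such as a missing data file or a leftover non-zero LIMIT, before a long run starts. The sketch below is illustrative only: `EvalConfig` and `check_config` are hypothetical names that mirror the parameter table above, and they are not part of the scripts in this directory. The defaults for the optional fields are assumptions as well.

```python
import os
from dataclasses import dataclass


@dataclass
class EvalConfig:
    """Hypothetical mirror of the script parameters (not the module's API)."""
    token_name: str                  # TOKEN_NAME
    model_name: str                  # MODEL_NAME
    tasks: str                       # TASKS, e.g. "lambada_openai"
    data_files: str                  # DATA_FILES, local path to the data file
    trust_remote_code: bool = False  # TRUST_REMOTE_CODE (assumed default)
    limit: int = 0                   # LIMIT, 0 for real metrics
    batch_size: int = 1              # BATCH_SIZE (assumed default)


def check_config(cfg):
    """Return (errors, warnings) as two lists of strings."""
    errors, warnings = [], []
    if not os.path.isfile(cfg.data_files):
        errors.append(f"DATA_FILES not found: {cfg.data_files}")
    if cfg.batch_size < 1:
        errors.append("BATCH_SIZE must be at least 1")
    if cfg.limit < 0:
        errors.append("LIMIT must not be negative")
    elif cfg.limit > 0:
        # A non-zero LIMIT is documented as being for testing only.
        warnings.append("LIMIT is non-zero: results are for testing only")
    return errors, warnings
```

An empty `errors` list means none of these basic checks failed; it does not guarantee that the model and tokenizer paths are valid.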
