word_representations

This repository contains code to evaluate machine-generated word representations against human behavior, specifically the Nelson assocaition norms. This repository does not contain code to generate such models (see instead https://github.com/smeylan/batch_w2v), or to choose among such models the one that is the best fit to human data.

We intend for this code to be run under Python3, with or without a virtual environment. To install the requirements, pip install -r requirements.txt

Rather than using command line arguments, the main analysis script eval_parallel.py takes a .json control file as input. This control file specifies the paths to the models that are to be evaluated, the appropriate directory to place the results, and whether intermediate files (either for the models or for the sets of similarity judgments being compared) are to be cached. Caching will use a relatively large amount of disk space (e.g., 11 gb for 10 models).

Name		Name	Last commit message	Last commit date
Latest commit History 143 Commits
data/nelson_norms		data/nelson_norms
.gitattributes		.gitattributes
README.md		README.md
compare_cached_pickles.py		compare_cached_pickles.py
eval_parallel.py		eval_parallel.py
evaluate.py		evaluate.py
example.ctrl		example.ctrl
kevin-ji-dec2017-2.ctrl		kevin-ji-dec2017-2.ctrl
kevin-ji-dec2017.ctrl		kevin-ji-dec2017.ctrl
latex_table.py		latex_table.py
merged.ctrl		merged.ctrl
plot_te.py		plot_te.py
process.py		process.py
requirements.txt		requirements.txt
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

word_representations

About

Releases

Packages

Languages

jcpeterson/eval-word-rep

Folders and files

Latest commit

History

Repository files navigation

word_representations

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages