GitHub - layer6ai-labs/CMLMC: Code for the ICLR'22 paper "Improving Non-Autoregressive Translation Models Without Distillation"

ICLR'22 Improving Non-Autoregressive Translation Models Without Distillation

Authors: Xiao Shi Huang, Felipe Perez, Maksims Volkovs

Introduction

This repository contains a full implementation of the CMLMC implemented with the fairseq library, and includes both training and evaluation routines on the IWSLT'14 De-En dataset.

Environment

The python code is developed and tested on the following environment:

Python 3.7.9
Pytorch 1.10.0

Experiments on IWSLT'14 De-En and En-De datasets (included in this repo) were run on NVIDIA V100 GPU with 32GB GPU memory; all other experiments were run on an IBM server with 160 POWER9 CPUs, 600GB RAM and 4 Tesla V100 GPUs

Dataset

The IWSLT'14 De-En and En-De dataset were included in this repo; for the WMT'14 En-De and WMT'16 En-Ro datasets refer to the fairseq's instructions here

Running The Code

./trainNAT.sh will train and evaluate both the CMLM benchmark and the CMLMC model on IWSLT'14 De-En raw dataset.
(Optionally) launch tensorboard to monitor progress by tensorboard --logdir=<log_path>

This script runs the 512-1024-4 Transformer NAR model (see paper for details). By default all avialable GPUs are used, but parameters such as batchsize are set for for 1 GPU. If multiple GPUs are avaialbe, either point the script to only one GPU or adjust model parameters accordingly.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
config		config
docs		docs
examples		examples
fairseq		fairseq
fairseq_cli		fairseq_cli
pip-wheel-metadata/fairseq.dist-info		pip-wheel-metadata/fairseq.dist-info
scripts		scripts
tests		tests
.gitignore		.gitignore
.gitmodules		.gitmodules
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
InferenceIWSLT.py		InferenceIWSLT.py
InferenceIWSLT_valid.py		InferenceIWSLT_valid.py
LICENSE		LICENSE
README.md		README.md
compound_split_bleu.sh		compound_split_bleu.sh
generate.py		generate.py
generate_cmlm.py		generate_cmlm.py
hubconf.py		hubconf.py
preprocess.py		preprocess.py
pyproject.toml		pyproject.toml
removecheckpoints.py		removecheckpoints.py
score.py		score.py
setup.py		setup.py
train.py		train.py
trainNAT.sh		trainNAT.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ICLR'22 Improving Non-Autoregressive Translation Models Without Distillation

Introduction

Environment

Dataset

Running The Code

About

Releases

Packages

Contributors 2

Languages

License

layer6ai-labs/CMLMC

Folders and files

Latest commit

History

Repository files navigation

ICLR'22 Improving Non-Autoregressive Translation Models Without Distillation

Introduction

Environment

Dataset

Running The Code

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages