This repo contains the code and resources to reproduce the paper *Can We Guide a Multi-Hop Reasoning Language Model to Incrementally Learn at Each Single-Hop?*.
Download the data for the multi-hop reasoning experiments and the downstream tasks from the following tables.
Dataset | Hops Number | Description |
---|---|---|
SHINET | 1 | |
SHINET | 2 | Distractors included |
RuleTakers | 0 - 5 | Filtered by single hop; inoculation files |
RACE | - | Inoculation files |
Dataset | Source |
---|---|
LeapOfThought | Leap-Of-Thought - Talmor et al. |
Semantic Fragments | What Does My QA Model Know? - Richardson et al. |
Model | Description | Link |
---|---|---|
RoBERTa | Trained on SHINet 2-hop (g-inf) | link |
XLNet | Trained on SHINet 2-hop (g-inf) | link |
BERT | Trained on SHINet 2-hop (g-inf) | link |
To load the models:

```python
from transformers import AutoModel

path = "YOUR_MODEL_PATH"  # local path to the downloaded checkpoint
model = AutoModel.from_pretrained(path)
```
We used the AllenNLP framework to perform the experiments.
- Python 3.8
- AllenNLP 2.8.0
```shell
git clone
cd inc_reasoning
conda env create -f env.yml
conda activate inchop
```
Using the config files in `config` and the code in `inc_reason`, you can obtain results similar to Tables 2 and 3 of our paper.
- For BERT

```shell
python -m allennlp train config/bert.jsonnet -s output/test_bert --include-package inc_reason
```

- For RoBERTa

```shell
python -m allennlp train config/roberta.jsonnet -s output/test_roberta --include-package inc_reason
```

- For XLNet

```shell
python -m allennlp train config/xlnet.jsonnet -s output/test_xlnet --include-package inc_reason
```
To change the input files for training and dev, edit the `train_data_path` and `validation_data_path` fields in the jsonnet files.
- Modify the config file (e.g., `config/roberta.jsonnet`):

```jsonnet
{
  ...
  //"datasets_for_vocab_creation": [],
  "train_data_path": "/data/shinet-1hop/shinet_h1_sinf.train.jsonl",
  "validation_data_path": "/data/shinet-1hop/shinet_h1_sinf.dev.jsonl",
  ...
}
```
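Alternatively, AllenNLP's `train` command accepts a `-o`/`--overrides` JSON string that patches fields of the jsonnet config, so you can swap datasets without editing the file. A small sketch that assembles such a command (the paths are the same example files as above; adapt them to your data):

```python
import json

# Build the overrides string for AllenNLP's -o/--overrides option.
# The paths below are the example SHINet 1-hop files; replace with your own.
overrides = json.dumps({
    "train_data_path": "/data/shinet-1hop/shinet_h1_sinf.train.jsonl",
    "validation_data_path": "/data/shinet-1hop/shinet_h1_sinf.dev.jsonl",
})

cmd = (
    "python -m allennlp train config/roberta.jsonnet"
    " -s output/test_roberta --include-package inc_reason"
    f" -o '{overrides}'"
)
print(cmd)
```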
- Evaluate the model on SHINET-H1 (SH1):

```shell
python -m allennlp evaluate YOUR_OUTPUT_FOLDER/model.tar.gz /data/shinet-1hop/shinet_h1_sinf.test.jsonl --include-package inc_reason
```
- Evaluate the model on SHINET-H2 (SH2):

```shell
python -m allennlp evaluate YOUR_OUTPUT_FOLDER/model.tar.gz /data/shinet-2hop/shinet_h2_sinf.test.jsonl --include-package inc_reason
```
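Before evaluating, it can help to sanity-check a data file. The data files above are JSON Lines (one JSON object per line); a minimal stdlib-only sketch to inspect the top-level fields of a file (the sample record here is illustrative only, not the real SHINet schema):

```python
import json
import os
import tempfile

def peek_jsonl(path, n=3):
    """Return the sorted top-level keys of the first n records of a JSONL file."""
    keys = []
    with open(path, encoding="utf-8") as f:
        for i, line in enumerate(f):
            if i >= n:
                break
            keys.append(sorted(json.loads(line)))
    return keys

# Illustrative record only -- the real SHINet fields may differ.
sample = {"question": "Is a robin a bird?", "label": True}
with tempfile.NamedTemporaryFile("w", suffix=".jsonl", delete=False) as f:
    f.write(json.dumps(sample) + "\n")
    tmp_path = f.name

print(peek_jsonl(tmp_path))  # [['label', 'question']]
os.remove(tmp_path)
```

Point `peek_jsonl` at the real `*.test.jsonl` files to confirm they parse before launching an evaluation run.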
The trained models are tested using the code from Semantic Fragments (Allen AI):

```shell
python main.py --config config/test1.yaml
```
The input data, output directory, and algorithm parameters can be modified in the `config/test1.yaml` file or directly in `config/default.yaml`.
If you use this work, please cite:

```bibtex
@inproceedings{lovon-2022-guide,
    title = "Can We Guide a Multi-Hop Reasoning Language Model to Incrementally Learn at Each Single-Hop?",
    author = "Lovon-Melgarejo, Jesus and
      Moreno, Jose G. and
      Besan{\c{c}}on, Romaric and
      Ferret, Olivier and
      Tamine, Lynda",
    booktitle = "Proceedings of the 29th International Conference on Computational Linguistics",
    year = "2022",
    address = "Gyeongju, Republic of Korea",
    publisher = "International Committee on Computational Linguistics",
    url = "https://aclanthology.org/2022.coling-1.125",
    pages = "1455--1466",
}
```