Selective Copying Task with Mamba Model

This repository contains a simple implementation for reproducing the selective copying task with the Mamba model.

Files

config.py: Contains the configuration for training, dataset, and the Mamba model.
data_generator.py: Defines torch_copying_data, which generates data for the selective copying task, and generate_dataset, which builds a dataset from the provided configuration.
selective_copying_mamba.py: Contains the main script for training and validating the Mamba model.
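
To make the task concrete, here is a minimal, framework-free sketch of how one selective copying example could be built. It is not the code in data_generator.py: the function name make_selective_copy_example, the token ids for blank and marker, and the default sizes are all assumptions chosen for illustration. The idea is that a few data tokens are scattered at random positions among blank tokens, and the model must output them in order once it sees the marker tokens.

```python
import random


def make_selective_copy_example(seq_len=20, n_memorize=4, vocab_size=8, rng=None):
    """Build one (inputs, target) pair for a selective copying task.

    Hypothetical layout (an assumption, not the repository's format):
      token 0              -> blank / noise
      tokens 1..vocab-2    -> data tokens to memorize
      token vocab-1        -> marker asking the model to emit its memory
    """
    if rng is None:
        rng = random.Random()
    blank, marker = 0, vocab_size - 1

    # Tokens the model has to remember, in the order they will appear.
    target = [rng.randrange(1, vocab_size - 1) for _ in range(n_memorize)]

    # Random, sorted positions: the spacing between data tokens varies per example.
    positions = sorted(rng.sample(range(seq_len), n_memorize))

    inputs = [blank] * seq_len
    for pos, tok in zip(positions, target):
        inputs[pos] = tok

    # Trailing markers give the model one output slot per memorized token.
    inputs += [marker] * n_memorize
    return inputs, target


if __name__ == "__main__":
    example_inputs, example_target = make_selective_copy_example(rng=random.Random(0))
    print("inputs:", example_inputs)
    print("target:", example_target)
```

Because the positions change from example to example, a model cannot solve the task with a fixed offset; it has to decide from token content what to keep and what to ignore.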

Usage

Configure your training, dataset, and model parameters in config.py.
Run selective_copying_mamba.py to train and validate the model.
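
As a rough illustration of what a configuration grouped into training, dataset, and model sections can look like, here is a sketch using dataclasses. Every class and field name below is hypothetical and may not match config.py. Only the step count (400000) and the sequence length (100) come from the Results section of this README; the remaining defaults are placeholders.

```python
from dataclasses import dataclass, field, replace


@dataclass
class TrainConfig:
    steps: int = 400_000       # step count shown in the sample log
    batch_size: int = 64       # placeholder value (assumption)
    learning_rate: float = 1e-3  # placeholder value (assumption)


@dataclass
class DatasetConfig:
    seq_len: int = 100         # sequence length used for the README results
    n_memorize: int = 16       # placeholder value (assumption)
    vocab_size: int = 16       # placeholder value (assumption)


@dataclass
class ModelConfig:
    d_model: int = 64          # placeholder value (assumption)
    n_layer: int = 2           # placeholder value (assumption)


@dataclass
class ExperimentConfig:
    train: TrainConfig = field(default_factory=TrainConfig)
    dataset: DatasetConfig = field(default_factory=DatasetConfig)
    model: ModelConfig = field(default_factory=ModelConfig)


if __name__ == "__main__":
    cfg = ExperimentConfig()
    # Switch to the longer sequences mentioned in the Results section.
    long_cfg = replace(cfg, dataset=replace(cfg.dataset, seq_len=4096))
    print(long_cfg)
```

Keeping the three groups separate means a longer-sequence run only needs the dataset section changed.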

Running the Scripts

To run the main script, use the following command:

python selective_copying_mamba.py

Results

After training, the results of the selective copying task are printed to the terminal. Sample output from the end of a run:

2024-06-03 16:03:06,983 - Step [399995/400000], Loss: 0.0000, Accuracy: 100.00%
2024-06-03 16:03:06,988 - Step [399996/400000], Loss: 0.0000, Accuracy: 100.00%
2024-06-03 16:03:06,993 - Step [399997/400000], Loss: 0.0000, Accuracy: 100.00%
2024-06-03 16:03:06,999 - Step [399998/400000], Loss: 0.0000, Accuracy: 100.00%
2024-06-03 16:03:07,005 - Step [399999/400000], Loss: 0.0000, Accuracy: 100.00%
2024-06-03 16:03:07,010 - Step [400000/400000], Loss: 0.0000, Accuracy: 100.00%
2024-06-03 16:03:07,010 - Training completed in: 34.91 minutes
2024-06-03 16:03:07,013 - Validation Accuracy: 100.00%

The results above were obtained with sequences of length 100 for demonstration purposes. Similar results can be achieved with sequences of length 4096, but training takes longer.
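
If you want to plot or compare runs, the step lines in the sample output can be parsed with a short script. The sketch below assumes the log format is exactly the one shown in the sample output; the name parse_step_line is made up for this example and is not part of the repository.

```python
import re
from datetime import datetime

# Matches lines such as:
# 2024-06-03 16:03:07,010 - Step [400000/400000], Loss: 0.0000, Accuracy: 100.00%
STEP_RE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3}) - "
    r"Step \[(?P<step>\d+)/(?P<total>\d+)\], "
    r"Loss: (?P<loss>[\d.]+), Accuracy: (?P<acc>[\d.]+)%$"
)


def parse_step_line(line):
    """Return a dict for a step line, or None for any other log line."""
    match = STEP_RE.match(line.strip())
    if match is None:
        return None
    return {
        "time": datetime.strptime(match["ts"], "%Y-%m-%d %H:%M:%S,%f"),
        "step": int(match["step"]),
        "total": int(match["total"]),
        "loss": float(match["loss"]),
        "accuracy": float(match["acc"]),
    }


if __name__ == "__main__":
    sample = (
        "2024-06-03 16:03:07,010 - Step [400000/400000], "
        "Loss: 0.0000, Accuracy: 100.00%"
    )
    print(parse_step_line(sample))
```

Lines that are not step lines, such as the training-time and validation summaries, return None and can be skipped or handled separately.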

Acknowledgments

We thank the authors, Gu and Dao, for their work on Mamba, as referenced in this paper, and for their model implementation.

Repository: MinhZou/selective-copying-mamba
