Added example with Workflow interface for finetuning Llama2 LLM #905

Open · wants to merge 7 commits into base: develop

Conversation

@manuelhsantana (Collaborator) commented on Dec 30, 2023

This PR introduces a new example of fine-tuning the Llama2 large language model (LLM) using the workflow interface.

The main objective of this PR is to provide users and developers with a practical guide on how to fine-tune the Llama2 LLM for their specific use cases.

The added example includes:

  • Steps on how to load the Llama2 LLM and prepare it for fine-tuning.
  • Instructions on how to set up the training and validation datasets.
  • Use of the OpenFL workflow interface to fine-tune the model on a specific dataset (a minimal sketch of these setup steps follows this list).
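
As a rough sketch of those setup steps (not the exact code from the notebook; the model ID, dataset, and split sizes below are placeholder assumptions), loading the model and preparing train/validation splits with the Hugging Face libraries looks roughly like this:

# Illustrative sketch only: the model ID, dataset, and split sizes are assumptions.
from transformers import AutoModelForCausalLM, AutoTokenizer
from datasets import load_dataset

model_name = "meta-llama/Llama-2-7b-hf"   # gated checkpoint; requires approved access
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Prepare training and validation splits for fine-tuning.
dataset = load_dataset("imdb", split="train[:1000]")
splits = dataset.train_test_split(test_size=0.1)
train_dataset, valid_dataset = splits["train"], splits["test"]

In the actual example these steps are orchestrated through the OpenFL workflow interface rather than run as a flat script.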

A few things to keep in mind:

  • Access to the Llama2 model must be requested from Meta; registration and approval are prerequisites for this example (see the sketch after this list for how the gated checkpoint is typically accessed).
  • Libraries from lvwerra and Hugging Face are used.
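
For the access prerequisite, a minimal sketch of authenticating against the Hugging Face Hub once Meta has approved the request (the token placeholder and model ID below are assumptions, not values taken from the example):

# Sketch of the gated-model access step; replace the token with your own.
from huggingface_hub import login
from transformers import AutoTokenizer

login(token="hf_xxx")  # or run `huggingface-cli login` once in a terminal

# After logging in, from_pretrained can download the gated Llama2 repository.
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")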

This tutorial serves as a basic example, and users are encouraged to adapt and expand upon it to suit their specific needs and requirements.

Users are invited to explore this example and provide feedback, which will be invaluable in refining and expanding our set of examples.

Please review the changes and provide your valuable feedback.

@psfoley (Contributor) left a comment

Thanks @manuelhsantana. This PR is very close. Three small changes to address, then this is ready to merge.

@kta-intel (Collaborator)

I am trying to run it on CPU, but I am running into an error:
NameError: name 'str2optimizer32bit' is not defined

@manuelhsantana did you happen to run into this? I can try to triage a bit more, but from a quick investigation it seems like it is expecting CUDA drivers, which I think is currently a requirement for bitsandbytes.

For easier testing, I reduced the dataset size by replacing the data cell with this:

# Reuse a small ten-example slice for all three splits to speed up testing.
from datasets import load_dataset, DatasetDict

dataset = load_dataset(dataset_name, split='train[10:20]')

dataset = DatasetDict({
    'train': dataset,
    'test': dataset,
    'valid': dataset})

@manuelhsantana (Collaborator, Author), replying to the comment above:

I changed the optimizer to avoid the warning on CPU
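
One way such a change can be made (a sketch under the assumption that training is configured through transformers' TrainingArguments; the exact settings used in the notebook may differ) is to select the pure-PyTorch AdamW instead of a CUDA-dependent bitsandbytes optimizer:

# Illustrative sketch: use the pure-PyTorch AdamW so training also runs on CPU.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="./results",
    optim="adamw_torch",   # instead of a bitsandbytes option such as "paged_adamw_32bit"
    per_device_train_batch_size=1,
    num_train_epochs=1,
)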

@kta-intel (Collaborator), replying to the optimizer change above:

Thanks - this resolved the issue on my end. Everything looks good to me. Great contribution!
