Implement SD3 loss weighting #8528

Slickytail · 2024-06-13T10:18:49Z

The loss-weighting schemes for SD3 training were not implemented correctly, causing all of them to be non-functional. I went ahead and implemented the lognorm and cosmap schemes, just by using the density at those timesteps. Potentially, a better approach would be to sample the timestep according to that density in the first place.

The Mode scheme is much harder to implement -- there's a reason that they didn't include an explicit form for the density in the paper (I couldn't find one...), so I put in an error message if you try to use it for now.

@sayakpaul @kashif

kashif · 2024-06-13T10:24:34Z

thanks @Slickytail checking

HuggingFaceDocBuilderDev · 2024-06-13T11:32:07Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

bghira · 2024-06-13T15:54:14Z

previous code

new code

kashif · 2024-06-13T15:55:40Z

@Slickytail can you kindly make style in the root dir of diffusers?

@Slickytail

…tribution thanks to @Slickytail

examples/dreambooth/train_dreambooth_lora_sd3.py

examples/dreambooth/train_dreambooth_sd3.py

examples/dreambooth/train_dreambooth_lora_sd3.py

examples/dreambooth/train_dreambooth_sd3.py

chenbaiyujason · 2024-06-13T17:56:19Z

I get the following error：
timesteps = noise_scheduler_copy.timesteps[indices].to(device=model_input.device)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^
RuntimeError: indices should be either on cpu or on the same device as the indexed tensor (cpu)

sayakpaul · 2024-06-13T17:58:33Z

@kashif I will review the changes a bit later but could you test the scripts and see if they are running without errors?

chenbaiyujason · 2024-06-13T18:01:22Z

This avoids errors
1476 indices = (u * noise_scheduler_copy.config.num_train_timesteps).long()
fix:
indices = (u * noise_scheduler_copy.config.num_train_timesteps).long().to(device="cpu")

examples/dreambooth/train_dreambooth_lora_sd3.py

examples/dreambooth/train_dreambooth_sd3.py

xiaohu2015 · 2024-06-14T06:17:05Z

a question about the loss: why here we firstly convert the model prediction to x_0 to compute loss?

Slickytail · 2024-06-14T09:45:51Z

Hi @kashif, it looks like you commited the necessary formatting/style changes, so I'm assuming make style isn't needed from me anymore. Is there anything else that needs a fix in here?

kashif · 2024-06-14T09:53:08Z

@Slickytail yes let's keep all the u tensors on the CPU as the indices are on the CPU side so there is no need to set the device to gpu

sayakpaul

Thanks a lot for these!

@asomoza has confirmed that these are working nicely.

In a follow up PR, I will add your PR in the comments to honor your contributions :)

Will also make it a little utility function and move it to training_utils.py.

Appreciate your help here!

sayakpaul · 2024-06-16T19:15:43Z

Okay I can confirm that the failing example tests are only with the latest version of datasets. With the 2.19.1 version, they don't appear. So, will merge the PR. Requesting @lhoestq @albertvillanova to help.

albertvillanova · 2024-06-17T06:06:33Z

Thanks for the ping, @sayakpaul.

In the latest datasets release, we had to introduce a breaking change for security reasons: huggingface/transformers#31406

Datasets with a Python loading script now require passing trust_remote_code=True to be used

lhoestq · 2024-06-17T11:22:34Z

I merged https://huggingface.co/datasets/hf-internal-testing/fill10/discussions/1 to fix the CI

Add lognorm and cosmap weighting

45b64ec

Implement mode sampling

5c3f755

bghira pushed a commit to bghira/SimpleTuner that referenced this pull request Jun 13, 2024

implement huggingface/diffusers#8528 changes to timestep sampling dis…

9bece0d

…tribution thanks to @Slickytail

sayakpaul mentioned this pull request Jun 13, 2024

[SD3] the training script of SD3 dreambooth has wrong logit-normal weighting #8534

Closed

kashif reviewed Jun 13, 2024

View reviewed changes

examples/dreambooth/train_dreambooth_lora_sd3.py Outdated Show resolved Hide resolved

kashif reviewed Jun 13, 2024

View reviewed changes

examples/dreambooth/train_dreambooth_lora_sd3.py Outdated Show resolved Hide resolved

kashif reviewed Jun 13, 2024

View reviewed changes

examples/dreambooth/train_dreambooth_sd3.py Outdated Show resolved Hide resolved

kashif reviewed Jun 13, 2024

View reviewed changes

examples/dreambooth/train_dreambooth_sd3.py Outdated Show resolved Hide resolved

kashif reviewed Jun 13, 2024

View reviewed changes

examples/dreambooth/train_dreambooth_sd3.py Outdated Show resolved Hide resolved

kashif added 5 commits June 13, 2024 18:30

Update examples/dreambooth/train_dreambooth_lora_sd3.py

77305e3

Update examples/dreambooth/train_dreambooth_lora_sd3.py

41803fd

Update examples/dreambooth/train_dreambooth_sd3.py

3a428be

Update examples/dreambooth/train_dreambooth_sd3.py

9473589

Update examples/dreambooth/train_dreambooth_sd3.py

6e23139

kashif reviewed Jun 13, 2024

View reviewed changes

examples/dreambooth/train_dreambooth_lora_sd3.py Outdated Show resolved Hide resolved

Update examples/dreambooth/train_dreambooth_lora_sd3.py

994da3d

kashif reviewed Jun 13, 2024

View reviewed changes

examples/dreambooth/train_dreambooth_sd3.py Outdated Show resolved Hide resolved

kashif added 2 commits June 13, 2024 18:48

Update examples/dreambooth/train_dreambooth_sd3.py

f360c2d

Merge branch 'main' into main

5bf8163

kashif approved these changes Jun 13, 2024

View reviewed changes

Merge branch 'main' into main

54093d9

kashif reviewed Jun 13, 2024

View reviewed changes

examples/dreambooth/train_dreambooth_lora_sd3.py Show resolved Hide resolved

kashif reviewed Jun 13, 2024

View reviewed changes

examples/dreambooth/train_dreambooth_sd3.py Show resolved Hide resolved

kashif added 2 commits June 13, 2024 21:16

Update examples/dreambooth/train_dreambooth_sd3.py

0c09c91

Update examples/dreambooth/train_dreambooth_lora_sd3.py

9feda1c

keep timestamp sampling fully on cpu

7318729

sayakpaul approved these changes Jun 15, 2024

View reviewed changes

sayakpaul added 2 commits June 15, 2024 06:49

Merge branch 'main' into main

8c1eca7

Merge branch 'main' into main

feff605

sayakpaul merged commit 6946fac into huggingface:main Jun 16, 2024
6 of 8 checks passed

sayakpaul mentioned this pull request Jun 16, 2024

[SD3 training] refactor the density and weighting utilities. #8591

Merged

albertvillanova mentioned this pull request Jun 17, 2024

Set datasets temporary upper version 2.20.0 #8597

Closed

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement SD3 loss weighting #8528

Implement SD3 loss weighting #8528

Slickytail commented Jun 13, 2024

kashif commented Jun 13, 2024

HuggingFaceDocBuilderDev commented Jun 13, 2024

bghira commented Jun 13, 2024

kashif commented Jun 13, 2024

chenbaiyujason commented Jun 13, 2024

sayakpaul commented Jun 13, 2024

chenbaiyujason commented Jun 13, 2024

xiaohu2015 commented Jun 14, 2024

Slickytail commented Jun 14, 2024

kashif commented Jun 14, 2024

sayakpaul left a comment

sayakpaul commented Jun 16, 2024

albertvillanova commented Jun 17, 2024

lhoestq commented Jun 17, 2024

Implement SD3 loss weighting #8528

Implement SD3 loss weighting #8528

Conversation

Slickytail commented Jun 13, 2024

kashif commented Jun 13, 2024

HuggingFaceDocBuilderDev commented Jun 13, 2024

bghira commented Jun 13, 2024

kashif commented Jun 13, 2024

chenbaiyujason commented Jun 13, 2024

sayakpaul commented Jun 13, 2024

chenbaiyujason commented Jun 13, 2024

xiaohu2015 commented Jun 14, 2024

Slickytail commented Jun 14, 2024

kashif commented Jun 14, 2024

sayakpaul left a comment

Choose a reason for hiding this comment

sayakpaul commented Jun 16, 2024

albertvillanova commented Jun 17, 2024

lhoestq commented Jun 17, 2024