update modules #59

lifeiteng · 2023-03-20T12:19:13Z

Add continual inference
Support train AR Decoder and NAR Decoder separately
Copy transformer modules from pytorch for Implementing Faster Inference

This maybe the last breaking modification!

zjwang21 · 2023-03-21T04:01:42Z

valle/models/valle.py

-            self.predict_layers[j].weight = self.nar_embeddings[j + 1].weight
+        for j in range(0, 6):
+            self.nar_predict_layers[j].weight = self.nar_audio_embeddings[
+                j + 2


这里的embedding share为什么只有6层，而且是j+2，我看论文里好像是说，predict_layer只有7层，embedding_layer是8层，所以predict的第1层和embedding的第2层共享，是这样嘛

zjwang21

Good Work

lifeiteng · 2023-04-06T07:18:37Z

demo page https://lifeiteng.github.io/valle/index.html

one GPU with 24GB memory

exp_dir=exp_0331/valle_Prefix1_Dim1024H16L12
python3 bin/trainer.py --max-duration 80 --filter-min-duration 0.5 --filter-max-duration 14 --train-stage 1 \
      --num-buckets 6 --dtype "bf16" --save-every-n 10000 \
      --model-name valle --share-embedding true --norm-first true --add-prenet false \
      --decoder-dim 1024 --nhead 16 --num-decoder-layers 12 --prefix-mode 1 \
      --base-lr 0.05 --warmup-steps 200 --average-period 0 \
      --num-epochs 20 --start-epoch 1 --start-batch 0 --accumulate-grad-steps 4 \
      --exp-dir ${exp_dir}

cp ${exp_dir}/best-valid-loss.pt ${exp_dir}/epoch-2.pt

python3 bin/trainer.py --max-duration 40 --filter-min-duration 0.5 --filter-max-duration 14 --train-stage 2 \
      --num-buckets 6 --dtype "float32" --save-every-n 10000 \
      --model-name valle --share-embedding true --norm-first true --add-prenet false \
      --decoder-dim 1024 --nhead 16 --num-decoder-layers 12 --prefix-mode 1 \
      --base-lr 0.05 --warmup-steps 200 --average-period 0 \
      --num-epochs 20 --start-epoch 3 --start-batch 0 --accumulate-grad-steps 4 \
      --exp-dir ${exp_dir}

lifeiteng added 5 commits March 19, 2023 16:25

VALLE add continual inference

c8d6f86

separate text embedding & position of AR and NAR Decoders

91ecd50

Separate Modules of AR and NAR Decoders

e34c101

Support train AR Decoder and NAR Decoder separately

486898d

Copy transformer modules from pytorch

1297357

lifeiteng requested a review from zjwang21 March 20, 2023 12:19

lifeiteng added 4 commits March 20, 2023 21:26

update trainer.py

b6a824c

Implement InputStrategy PromptedPrecomputedFeatures

aced965

VALL-E Add prefix_mode=4

7afedd5

Fix InputStrategy PromptedPrecomputedFeatures

fbb3fbc

zjwang21 reviewed Mar 21, 2023

View reviewed changes

zjwang21 approved these changes Mar 21, 2023

View reviewed changes

lifeiteng added 12 commits March 21, 2023 22:39

Fix InputStrategy PromptedPrecomputedFeatures

4c05d68

LibriTTS update README

cfe4965

use load_manifest_lazy

5c4f85f

Fix index of PromptedPrecomputedFeatures

0f0c7fd

Trainer - Add config --filter-min-duration

db5997c

Unify Prefix Mode 2 and 4

e7162e5

update trainer

751c226

Add Hparam --share-embedding

637c476

Merge branch 'prefix4' into stage

a50b5b4

Fix Hparam --share-embedding

f6f3017

Fix MultiGPU load_checkpoint

140a0b9

Tune prefix_mode 1

7657ef6

lifeiteng mentioned this pull request Mar 31, 2023

After 100 epochs training, the model can synthesize natural speech on LibriTTS #58

Open

lifeiteng added 4 commits March 31, 2023 12:25

valid every epoch

a952f95

update --train-stage logic

51a6955

set NUM_TEXT_TOKENS=512 for multi-language models

e55582f

VALLF support --train-stage

d34b025

lifeiteng added 6 commits March 31, 2023 21:13

VALLF support --prefix-mode

8a8facf

Fix VALl-F test

7e3bb2f

Fix DDP --train-stage

7d6b721

Add model hparam --scale-factor

9acece1

VALL-E & F update embedding sharing and inference sampling

5154048

egs rename run.sh to prepare.sh and simplify README

cf9f26c

lifeiteng merged commit 1ad7e13 into main Apr 6, 2023

lifeiteng deleted the stage branch October 16, 2023 14:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

update modules #59

update modules #59

lifeiteng commented Mar 20, 2023 •

edited

Loading

zjwang21 Mar 21, 2023

zjwang21 left a comment

lifeiteng commented Apr 6, 2023 •

edited

Loading

update modules #59

update modules #59

Conversation

lifeiteng commented Mar 20, 2023 • edited Loading

zjwang21 Mar 21, 2023

Choose a reason for hiding this comment

zjwang21 left a comment

Choose a reason for hiding this comment

lifeiteng commented Apr 6, 2023 • edited Loading

one GPU with 24GB memory

lifeiteng commented Mar 20, 2023 •

edited

Loading

lifeiteng commented Apr 6, 2023 •

edited

Loading