Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP] MATERIAL scripts #2165

Merged
merged 113 commits into from
Jun 6, 2019
Merged
Changes from 1 commit
Commits
Show all changes
113 commits
Select commit Hold shift + click to select a range
3b4428b
basic directory structure
jtrmal Oct 30, 2017
467cdeb
basic data setup ready
jtrmal Oct 31, 2017
ae370ad
adding scoring script
jtrmal Oct 31, 2017
92bd9eb
resolve utf-8 encoding and some other details
jtrmal Nov 6, 2017
33d5f36
do fix_data_dir after parametrixation
jtrmal Nov 6, 2017
0a0436a
make 1A language default again
jtrmal Nov 6, 2017
e826675
material:add text filter for scoring
jtrmal Nov 21, 2017
34ff3c1
material: fix path.sh
jtrmal Nov 21, 2017
8e3481a
script changes up to triphone training
freewym Nov 6, 2017
04dc97e
tuning of triphone systems
freewym Nov 22, 2017
7a2114c
added recipe for tagalog
freewym Dec 13, 2017
51768ed
added more
freewym Dec 13, 2017
4755373
RNNLM for material
hainan-xv Jan 6, 2018
389dcde
change chain model path
hainan-xv Jan 6, 2018
258f7ba
minor change
hainan-xv Jan 13, 2018
0061f07
minor change
hainan-xv Jan 13, 2018
daf8e56
reoriganized the scripts structures to allow to specifying language n…
freewym Jan 16, 2018
8b3e34c
create one single rnnlm script for all material languages
freewym Jan 17, 2018
c05b977
fix stage numbers in the tdnn-lstm recipe
freewym Jan 19, 2018
78c821b
remove $language subdir in /exp and /data
freewym Jan 20, 2018
2abbfa2
fix issues related to num of params checks for some scripts
freewym Feb 7, 2018
b5af256
added decoding scripts for ANALYSIS1
Jan 25, 2018
1a8d80e
added scripts to compute WER for decoding ANALYSIS1
Jan 31, 2018
60c6eed
removed exit 0
Feb 4, 2018
b0b098f
remove in exp/ and data/
Feb 7, 2018
0145421
minor fixes
Feb 9, 2018
64165fe
added audio_path to conf
Feb 13, 2018
10360e6
added scripts that produce the results for the site visit and cleanup
freewym Feb 9, 2018
c0ae6b1
added support to decode test dev/eval1
freewym Mar 1, 2018
2016d8c
added sentence segmentation
Apr 24, 2018
2e5d018
bug fixes for the path to tagalog DEV data
freewym Apr 6, 2018
1b17c1f
adds eval2 decoding; adds tdnn1b recipes
freewym Apr 21, 2018
442e6bb
adds analysis2 decoding
Jun 14, 2018
eb225b1
material scripts
hainan-xv Jun 14, 2018
711198e
clean up src a bit
hainan-xv Jun 14, 2018
0ca089f
clean up src a bit2
hainan-xv Jun 14, 2018
3e1c880
material: Temporarily fixed scoring
vimalmanohar Jul 10, 2018
9cd33e6
material: Cleanup scoring scripts
vimalmanohar Jul 10, 2018
399496a
material scripts
hainan-xv Jul 17, 2018
d705fe3
merge with latest base
hainan-xv Jul 17, 2018
a42b09f
t push origin material_basicMerge branch 'hainan-xv-material_basic' i…
Jul 18, 2018
b98b248
remove some of the _2 affixes
hainan-xv Jul 20, 2018
1f9fa82
add decoding for eval3
hainan-xv Jul 21, 2018
9ffb000
Update convert_lexicon.pl
jtrmal Aug 27, 2018
4cd2ea5
added semisupervised training scripts (might need changes according t…
freewym Sep 5, 2018
187b18c
removed files with _2 suffix
Sep 11, 2018
212a8f7
Merge branch 'master' of https://github.com/kaldi-asr/kaldi into mate…
hainan-xv Sep 15, 2018
cc3cd1d
add mono data
hainan-xv Sep 16, 2018
250ba57
fix a bug for decoding; officially working
hainan-xv Sep 17, 2018
28dc919
basic directory structure
jtrmal Oct 30, 2017
bf66894
basic data setup ready
jtrmal Oct 31, 2017
e3db74e
adding scoring script
jtrmal Oct 31, 2017
b06655e
resolve utf-8 encoding and some other details
jtrmal Nov 6, 2017
0599652
do fix_data_dir after parametrixation
jtrmal Nov 6, 2017
fa75313
make 1A language default again
jtrmal Nov 6, 2017
de1b53d
material:add text filter for scoring
jtrmal Nov 21, 2017
93ebc80
material: fix path.sh
jtrmal Nov 21, 2017
bdc70ec
script changes up to triphone training
freewym Nov 6, 2017
2c46288
tuning of triphone systems
freewym Nov 22, 2017
d281ede
added recipe for tagalog
freewym Dec 13, 2017
dbc0723
added more
freewym Dec 13, 2017
eab09c7
RNNLM for material
hainan-xv Jan 6, 2018
a7f73a3
change chain model path
hainan-xv Jan 6, 2018
77ce4dc
minor change
hainan-xv Jan 13, 2018
0b0e47d
minor change
hainan-xv Jan 13, 2018
c9b3e75
reoriganized the scripts structures to allow to specifying language n…
freewym Jan 16, 2018
a8d77fc
create one single rnnlm script for all material languages
freewym Jan 17, 2018
ffba239
fix stage numbers in the tdnn-lstm recipe
freewym Jan 19, 2018
9ee662d
remove $language subdir in /exp and /data
freewym Jan 20, 2018
a80382b
fix issues related to num of params checks for some scripts
freewym Feb 7, 2018
6cd4049
added decoding scripts for ANALYSIS1
Jan 25, 2018
809e3f3
added scripts to compute WER for decoding ANALYSIS1
Jan 31, 2018
5e06fe4
removed exit 0
Feb 4, 2018
d7cf604
remove in exp/ and data/
Feb 7, 2018
63c2b93
minor fixes
Feb 9, 2018
bb97a3f
added audio_path to conf
Feb 13, 2018
38d3445
added scripts that produce the results for the site visit and cleanup
freewym Feb 9, 2018
1601312
added support to decode test dev/eval1
freewym Mar 1, 2018
cc32463
added sentence segmentation
Apr 24, 2018
c85ddf3
bug fixes for the path to tagalog DEV data
freewym Apr 6, 2018
22109d7
adds eval2 decoding; adds tdnn1b recipes
freewym Apr 21, 2018
6452a5f
adds analysis2 decoding
Jun 14, 2018
ec7ec63
material scripts
hainan-xv Jun 14, 2018
b37c754
clean up src a bit
hainan-xv Jun 14, 2018
f1e44da
clean up src a bit2
hainan-xv Jun 14, 2018
170da49
material: Temporarily fixed scoring
vimalmanohar Jul 10, 2018
9c61ab2
material: Cleanup scoring scripts
vimalmanohar Jul 10, 2018
63022e6
material scripts
hainan-xv Jul 17, 2018
46f34a4
remove some of the _2 affixes
hainan-xv Jul 20, 2018
e6e21bb
add decoding for eval3
hainan-xv Jul 21, 2018
bb06a69
Update convert_lexicon.pl
jtrmal Aug 27, 2018
d380c46
added semisupervised training scripts (might need changes according t…
freewym Sep 5, 2018
c5b58f8
removed files with _2 suffix
Sep 11, 2018
e80d7b1
adding monodata to material
hainan-xv Sep 25, 2018
d80f022
Merge branch 'material_basic' into material_fix2
mahsa7823 Sep 25, 2018
902fd89
Merge branch 'hainan-xv-material_fix4' into material_basic
Sep 25, 2018
865b507
updated run.sh with the instrictions
Oct 1, 2018
b1d9ef0
changing how LM preparation was done for material
hainan-xv Oct 8, 2018
8014b51
changing how LM preparation was done for material, merge with latest …
hainan-xv Oct 8, 2018
aef4bf3
added Somali config
Oct 22, 2018
3d07b4e
Merge branch 'hainan-xv-material_new_lm2' into material_basic
Oct 22, 2018
df46dc9
updated somali.cong with mono and number_mapping paths
Nov 19, 2018
22d03b5
added local/normalize_numbers.py
Nov 23, 2018
7a83d7e
support for mono2, create output_nbest directory
mahsa7823 Dec 12, 2018
e4b77fe
updated WER results in local/chain/tuning/run_tdnn_1b.sh and local/rn…
mahsa7823 Feb 20, 2019
54840c8
clean-up semisupervised training scripts for material
freewym May 26, 2019
042c974
clean up
mahsa7823 May 27, 2019
682d483
clean up semisup
mahsa7823 May 27, 2019
0f428ba
change configs
mahsa7823 May 27, 2019
7f07182
anonymize paths
mahsa7823 May 27, 2019
f92372b
Merge branch 'master' into material_basic
mahsa7823 May 27, 2019
2de2e42
added README and RESULTS. De-anonymization.
mahsa7823 May 29, 2019
aac8c9a
Merge branch 'material_basic' of https://github.com/mahsa7823/kaldi i…
mahsa7823 May 29, 2019
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
minor change
  • Loading branch information
hainan-xv authored and Mahsa Yarmohammadi committed Apr 30, 2018
commit 258f7ba9a467ee58f8d67edc3e49c57022b9bd36
2 changes: 1 addition & 1 deletion scripts/rnnlm/prepare_rnnlm_dir.sh
Original file line number Diff line number Diff line change
Expand Up @@ -58,7 +58,7 @@ if [ $stage -le 1 ]; then
done
fi

cat $dir/config/words.txt | grep '<' | rnnlm/get_special_symbol_opts.py > $dir/special_symbol_opts.txt
cat $dir/config/words.txt | rnnlm/get_special_symbol_opts.py > $dir/special_symbol_opts.txt
fi

if [ $stage -le 2 ]; then
Expand Down