question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

bug during training with new voice data

See original GitHub issue

Hi, I replaced the slt_demo data with a new voice data, following the instruction given here. All the scripts for data prepreration went smoothly. However, during the Step2 : training duration model, I got the following error info. Any idea or advice? Thanks a lot!

2017-01-02 00:34:49,253 DEBUG acoustic_comp: wrote 45 frames of features 2017-01-02 00:34:49,869 INFO main : normalising acoustic (output) features using method MVN /home/merlin/src/frontend/mean_variance_norm.py:69: FutureWarning: comparison to None will result in an elementwise object comparison in the future. if self.mean_vector == None: /home/merlin/src/frontend/mean_variance_norm.py:71: FutureWarning: comparison to None will result in an elementwise object comparison in the future. if self.std_vector == None: 2017-01-02 00:34:49,940 INFO main : saved MVN vectors to /home/merlin/egs/slt_arctic/s1/experiments/slt_arctic_demo/duration_model/data/norm_info_dur_5_MVN.dat 2017-01-02 00:34:49,941 INFO main : saved dur variance vector to /home//merlin/egs/slt_arctic/s1/experiments/slt_arctic_demo/duration_model/data/var/dur_5 2017-01-02 00:34:49,958 DEBUG labels : HTS-derived input feature dimension is 416 + 0 = 416 2017-01-02 00:34:49,958 INFO main : label dimension is 416 2017-01-02 00:34:49,959 INFO main : training DNN 2017-01-02 00:34:49,959 DEBUG main.train_DNN: Starting train_DNN 2017-01-02 00:34:49,959 DEBUG main.train_DNN: Creating training data provider 2017-01-02 00:34:49,960 DEBUG main.train_DNN: Creating validation data provider 2017-01-02 00:34:49,960 CRITICALListDataProvider: first list is empty 2017-01-02 00:34:49,960 CRITICAL main : train_DNN threw an exception Traceback (most recent call last): File “/home//merlin/src/run_merlin.py”, line 1175, in <module> main_function(cfg) File “/home/merlin/src/run_merlin.py”, line 838, in main_function cmp_mean_vector = cmp_mean_vector, cmp_std_vector = cmp_std_vector) File “/home//merlin/src/run_merlin.py”, line 209, in train_DNN n_ins = n_ins, n_outs = n_outs, buffer_size = buffer_size, sequential = sequential_training, shuffle = False) File “/home/merlin/src/utils/providers.py”, line 88, in init assert len(x_file_list) > 0 AssertionError

Issue Analytics

  • State:closed
  • Created 7 years ago
  • Comments:7 (2 by maintainers)

github_iconTop GitHub Comments

2reactions
zhizhengwucommented, Jan 4, 2017

(train_file_number + valid_file_number + test_file_number) should be smaller than the total number of files.

On 4 January 2017 at 14:01, Jun ZHANG notifications@github.com wrote:

yes. I actually followed the demo example, in the duration_slt_arctic_demo.conf, I specified the following " [Data] train_file_number: 417 valid_file_number: 5 test_file_number: 5"

note the total number of files in the new voice dataset is 417. Very confused how the validation data became “empty”? Any further advice for the bug ? I highly appreciate the effort 😃

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/CSTR-Edinburgh/merlin/issues/69#issuecomment-270499685, or mute the thread https://github.com/notifications/unsubscribe-auth/AFRxxpl3kkpGSE4C5e7ZrQU6xLxGliGQks5rPBbKgaJpZM4LZXF0 .

1reaction
ikhfacommented, Jan 4, 2017

change the train_file_number to 407

Read more comments on GitHub >

github_iconTop Results From Across the Web

Too Few Observation Sequences while training new voice for ...
I'm trying to build a new voice for MaryTTS in German for a while now, but didn't succeed so far.
Read more >
Training from scratch · Issue #126 · CorentinJ/Real ... - GitHub
I'm working on adding TEDLIUM_release-3 which would add 1,925 new speakers and potentially SLR68 which would add 1,017 Chinese speakers but ...
Read more >
CSCvr89224 - Authenticated VOICE device traffic is ... - Cisco Bug
Symptom: VOICE device is sending untagged traffic onto the DATA VLAN and getting a DHCP address from the DATA domain Conditions: Voice device...
Read more >
How to prepare data for Custom Voice - Speech service
Select Custom Voice > Your project name > Prepare training data > Upload data. In the Upload data wizard, choose a data type...
Read more >
Common Voice training data - Coqui STT 1.4.0 documentation
This document gives some information about using Common Voice data with STT. If you're in need of training data, the Common Voice corpus...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found