Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

`unknown error` in TTS Model Loading

See original GitHub issue

Is your feature request related to a problem? Please describe. I am trying out text-to-speech pipeline and I pushed files to hub based similar to this one except I am not using hifigan as vocoder, but using the default griffin-lim.

When I try to load in interface, I am getting following error: Screenshot from 2022-01-24 23-52-01

Here is how my fs look like:

.
├── config.yaml
├── gcmvn_stats.npz
├── model.pt
├── README.md
├── speakers.txt
├── spm_char.model
├── spm_char.txt
└── spm_char.vocab

Describe alternatives you’ve considered I tried to replicate how @kahne created the config and added a hub key to config.yml and it did not help:

...
hub:
  lang: mn
  speaker: 0

I wonder if you guys have any idea why it would throw index out of range in self?

Issue Analytics

State:
Created 2 years ago
Reactions:1
Comments:8 (8 by maintainers)

Top GitHub Comments

4reactions

kahnecommented, Jan 26, 2022

Hi all, thanks very much for the discussions here! src_dict.encode_line() adds OOV tokens to the dictionary by default instead of replacing them with the UNK token (needs to set add_if_not_exist=False). I will update fairseq with this fallback to UNK.

2reactions

bayartsogt-yacommented, Jan 25, 2022

Thank you so much @osanseviero @Narsil @patrickvonplaten

I just found that when I use only characters that are present in spm_char.txt, then it is working fine. In my case, I just needed to lowercase 😅

I also could have specified the uncased or cased scenario in sentencepiece training.

I guess the root cause was sentencepiece encoding (src_dict.encode_line(tokenized)) which caused embedding index out of range issue. I automatically assumed that sentencepiece model would do the out of vocab handling (falling back to <unk> token), but obviously it does not.

Anyways, thank you again for figuring this out!

Top Results From Across the Web

WWW IMAGE ERROR - Tabletop Simulator - Steam Community

WWW IMAGE ERROR: UNKNOWN ERROR. HI there im having this error pop up when the game tries to load up certain images (that...

WWW Image Error: Cannot connect to destination host - Reddit

First, load up the mod in Tabletop Simulator, don't change or move anything, and as soon as it's done loading, create a new...

Troubleshooting - Tabletop Simulator Knowledge Base

This can be set in Steam properties for the game. Change your monitor's resolution and run TTS. Try changing it back to normal...

Problems training a model with Common Voice

Hello, I want to use Mozilla TTS with Mozilla Common Voice (German). When I start the training I get the following error: Pastebin:...

Error messages | Cloud Speech-to-Text Documentation

Learn how to resolve some errors raised by Speech-to-Text. This topic discusses errors whose resolutions require more steps than can be easily described...