`unknown error` in TTS Model Loading
See original GitHub issueIs your feature request related to a problem? Please describe. I am trying out text-to-speech pipeline and I pushed files to hub based similar to this one except I am not using hifigan as vocoder, but using the default griffin-lim.
When I try to load in interface, I am getting following error:
Here is how my fs look like:
.
├── config.yaml
├── gcmvn_stats.npz
├── model.pt
├── README.md
├── speakers.txt
├── spm_char.model
├── spm_char.txt
└── spm_char.vocab
Describe alternatives you’ve considered
I tried to replicate how @kahne created the config and added a hub
key to config.yml and it did not help:
...
hub:
lang: mn
speaker: 0
I wonder if you guys have any idea why it would throw index out of range in self
?
Issue Analytics
- State:
- Created 2 years ago
- Reactions:1
- Comments:8 (8 by maintainers)
Top Results From Across the Web
WWW IMAGE ERROR - Tabletop Simulator - Steam Community
WWW IMAGE ERROR: UNKNOWN ERROR. HI there im having this error pop up when the game tries to load up certain images (that...
Read more >WWW Image Error: Cannot connect to destination host - Reddit
First, load up the mod in Tabletop Simulator, don't change or move anything, and as soon as it's done loading, create a new...
Read more >Troubleshooting - Tabletop Simulator Knowledge Base
This can be set in Steam properties for the game. Change your monitor's resolution and run TTS. Try changing it back to normal...
Read more >Problems training a model with Common Voice
Hello, I want to use Mozilla TTS with Mozilla Common Voice (German). When I start the training I get the following error: Pastebin:...
Read more >Error messages | Cloud Speech-to-Text Documentation
Learn how to resolve some errors raised by Speech-to-Text. This topic discusses errors whose resolutions require more steps than can be easily described...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Hi all, thanks very much for the discussions here!
src_dict.encode_line()
adds OOV tokens to the dictionary by default instead of replacing them with the UNK token (needs to setadd_if_not_exist=False
). I will update fairseq with this fallback to UNK.Thank you so much @osanseviero @Narsil @patrickvonplaten
I just found that when I use only characters that are present in
spm_char.txt
, then it is working fine. In my case, I just needed to lowercase 😅I also could have specified the uncased or cased scenario in sentencepiece training.
I guess the root cause was sentencepiece encoding (
src_dict.encode_line(tokenized)
) which caused embedding index out of range issue. I automatically assumed that sentencepiece model would do the out of vocab handling (falling back to<unk>
token), but obviously it does not.Anyways, thank you again for figuring this out!