question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

`unknown error` in TTS Model Loading

See original GitHub issue

Is your feature request related to a problem? Please describe. I am trying out text-to-speech pipeline and I pushed files to hub based similar to this one except I am not using hifigan as vocoder, but using the default griffin-lim.

When I try to load in interface, I am getting following error: Screenshot from 2022-01-24 23-52-01

Here is how my fs look like:

.
├── config.yaml
├── gcmvn_stats.npz
├── model.pt
├── README.md
├── speakers.txt
├── spm_char.model
├── spm_char.txt
└── spm_char.vocab

Describe alternatives you’ve considered I tried to replicate how @kahne created the config and added a hub key to config.yml and it did not help:

...
hub:
  lang: mn
  speaker: 0

I wonder if you guys have any idea why it would throw index out of range in self?

Issue Analytics

  • State:closed
  • Created 2 years ago
  • Reactions:1
  • Comments:8 (8 by maintainers)

github_iconTop GitHub Comments

4reactions
kahnecommented, Jan 26, 2022

Hi all, thanks very much for the discussions here! src_dict.encode_line() adds OOV tokens to the dictionary by default instead of replacing them with the UNK token (needs to set add_if_not_exist=False). I will update fairseq with this fallback to UNK.

2reactions
bayartsogt-yacommented, Jan 25, 2022

Thank you so much @osanseviero @Narsil @patrickvonplaten

I just found that when I use only characters that are present in spm_char.txt, then it is working fine. In my case, I just needed to lowercase 😅

I also could have specified the uncased or cased scenario in sentencepiece training.

I guess the root cause was sentencepiece encoding (src_dict.encode_line(tokenized)) which caused embedding index out of range issue. I automatically assumed that sentencepiece model would do the out of vocab handling (falling back to <unk> token), but obviously it does not.

Anyways, thank you again for figuring this out!

Read more comments on GitHub >

github_iconTop Results From Across the Web

WWW IMAGE ERROR - Tabletop Simulator - Steam Community
WWW IMAGE ERROR: UNKNOWN ERROR. HI there im having this error pop up when the game tries to load up certain images (that...
Read more >
WWW Image Error: Cannot connect to destination host - Reddit
First, load up the mod in Tabletop Simulator, don't change or move anything, and as soon as it's done loading, create a new...
Read more >
Troubleshooting - Tabletop Simulator Knowledge Base
This can be set in Steam properties for the game. Change your monitor's resolution and run TTS. Try changing it back to normal...
Read more >
Problems training a model with Common Voice
Hello, I want to use Mozilla TTS with Mozilla Common Voice (German). When I start the training I get the following error: Pastebin:...
Read more >
Error messages | Cloud Speech-to-Text Documentation
Learn how to resolve some errors raised by Speech-to-Text. This topic discusses errors whose resolutions require more steps than can be easily described...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found