Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

tts_models/en/ljspeech/tacotron2-DDC model does not stop decoding if there is no punctuation at the end of the sentence.

See original GitHub issue

Describe the bug In some cases the spoken text is messed up.

To Reproduce Steps to reproduce the behavior:

Run the following command tts-server --model_name tts_models/en/ljspeech/tacotron2-DDC --vocoder_name vocoder_models/en/ljspeech/hifigan_v2
open webinterface and generate audio http://127.0.0.1:5002/
text and audio examples:

banana

https://soundcloud.com/davidak-de/ai-with-tourette-syndrome-struggles-to-say-banana

popular tourette youtuber tells the banana tic story. it’s pretty close 😄 https://youtu.be/Q5MrVcpq-a8?t=74

dada

https://soundcloud.com/davidak-de/dada-ai-singing

what have you done??? i’m literally screaming. ha

https://soundcloud.com/davidak-de/disturbed-ai-is-literally-screaming

shit, fuck! ha

https://soundcloud.com/davidak-de/ai-with-tourette-syndrome-swearing

what’s going on

https://soundcloud.com/davidak-de/ai-total-breakdown

cyber space mastodon coqui tts cocktail noise linus G H A

well, every single letter produces 12 seconds noise

the problem seem to be that it does not know where the end is when there is no ., ! or ? at the end. it just continues speaking what comes to it’s mind…

you can try to generate the audio multiple times to get different results. sometimes it’s completely confused, but most times only the last word loops

when this issue occur, you see this in terminal:

 > Model input: what's going on
 > Text splitted to sentences.
["what's going on"]
   | > Decoder stopped with 'max_decoder_steps
 > Processing time: 7.120903015136719
 > Real-time factor: 0.584658592060488
[INFO] ::ffff:127.0.0.1 - - [13/May/2021 20:45:27] "GET /api/tts?text=what%27s%20going%20on HTTP/1.1" 200 -

Related to:

Expected behavior read the text like a human would

Environment (please complete the following information):

OS Platform and Distribution (e.g., Linux Ubuntu 16.04): NixOS GNU/Linux 21.05pre288911.65d6153aec8 (Okapi)
PyTorch or TensorFlow version (use command below): which command???
Python version: 3.8.9
CUDA/cuDNN version:
GPU model and memory:
Exact command to reproduce:
TTS version: 0.0.12

Additional context I hope it’s OK to make this joke that the AI has a mental disorder and it’s not too offensive.

Issue Analytics

State:
Created 2 years ago
Comments:7 (4 by maintainers)

Top GitHub Comments

1reaction

erogolcommented, May 14, 2021

just updated the title to be more informative. Hope it is fine with you.

0reactions

stale[bot]commented, Jun 13, 2021

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look our discussion channels.

Top Results From Across the Web

8. Punctuation

The general principles governing the use of punctuation are: If it does not clarify ... The possessive case of a singular or plural...

Please Don't Do That!

"quote within a quote," and if both come at the end of a sentence, the sentence ends with three marks outside the period...

End Punctuation [pdf]

End punctuation tells the reader how to interpret each sentence. There are three types of end punctuation: periods, question marks, and exclamation points....

End Stop Punctuation, Fragments, and Commas

Use a period after a sentence that makes a statement or a request. Make sure that your statements have an independent clause (see...

Why Prosody Matters: The Importance of Reading Aloud ...

While punctuation provides some cues to prosody, young readers can be misled by it. For instance, they may pause at every comma, even...