tts_models/en/ljspeech/tacotron2-DDC model does not stop decoding if there is no punctuation at the end of the sentence.

Describe the bug
In some cases the spoken text is messed up.

To Reproduce
Steps to reproduce the behavior:

  1. Run the following command: tts-server --model_name tts_models/en/ljspeech/tacotron2-DDC --vocoder_name vocoder_models/en/ljspeech/hifigan_v2
  2. open the web interface at http://127.0.0.1:5002/ and generate audio
  3. text and audio examples:

banana

https://soundcloud.com/davidak-de/ai-with-tourette-syndrome-struggles-to-say-banana

popular tourette youtuber tells the banana tic story. it’s pretty close 😄 https://youtu.be/Q5MrVcpq-a8?t=74

dada

https://soundcloud.com/davidak-de/dada-ai-singing

what have you done??? i’m literally screaming. ha

https://soundcloud.com/davidak-de/disturbed-ai-is-literally-screaming

shit, fuck! ha

https://soundcloud.com/davidak-de/ai-with-tourette-syndrome-swearing

what’s going on

https://soundcloud.com/davidak-de/ai-total-breakdown

cyber space mastodon coqui tts cocktail noise linus G H A

well, every single letter produces 12 seconds of noise

the problem seems to be that it does not know where the end is when there is no ., ! or ? at the end. it just continues speaking whatever comes to its mind…

you can try to generate the audio multiple times to get different results. sometimes it’s completely confused, but most of the time only the last word loops
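Since the report ties the runaway decoding to input that does not end in ., ! or ?, one possible client-side workaround is to append a terminal mark before sending the text to the server. The sketch below is only an assumption about what might help, not a fix confirmed by the maintainers; the helper names `ensure_terminal_punctuation` and `build_tts_url` are hypothetical, and the `/api/tts?text=` endpoint and port are taken from the server log in this report.

```python
from urllib.parse import quote

# Assumption: the three sentence-final marks named in the report are the
# only ones that matter for stopping the decoder.
TERMINAL_MARKS = (".", "!", "?")


def ensure_terminal_punctuation(text: str, mark: str = ".") -> str:
    """Append `mark` when the text does not already end in ., ! or ?."""
    stripped = text.rstrip()
    if not stripped or stripped.endswith(TERMINAL_MARKS):
        return stripped
    return stripped + mark


def build_tts_url(text: str, base: str = "http://127.0.0.1:5002") -> str:
    """Build the GET URL seen in the server log, with the text normalized."""
    normalized = ensure_terminal_punctuation(text)
    return f"{base}/api/tts?text={quote(normalized)}"


if __name__ == "__main__":
    # Inputs from the report that lack end punctuation.
    for sample in ["banana", "dada", "what's going on"]:
        print(build_tts_url(sample))
```

Opening the printed URL in a browser, or fetching it with any HTTP client while tts-server is running, should show whether the added period changes the result for a given input.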

when this issue occurs, you see this in the terminal:

 > Model input: what's going on
 > Text splitted to sentences.
["what's going on"]
   | > Decoder stopped with 'max_decoder_steps
 > Processing time: 7.120903015136719
 > Real-time factor: 0.584658592060488
[INFO] ::ffff:127.0.0.1 - - [13/May/2021 20:45:27] "GET /api/tts?text=what%27s%20going%20on HTTP/1.1" 200 -
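To check which inputs are affected without listening to every clip, the server output can be scanned for the `max_decoder_steps` line shown above. This is a minimal sketch that assumes the log keeps the layout from this report (a `> Model input:` line followed later by the decoder message); the function name `truncated_inputs` is hypothetical.

```python
def truncated_inputs(log_lines):
    """Return the model inputs whose decoding hit max_decoder_steps."""
    hits = []
    current = None  # most recent "Model input" not yet matched to a stop message
    for raw in log_lines:
        line = raw.strip()
        if line.startswith("> Model input:"):
            current = line.split(":", 1)[1].strip()
        elif "max_decoder_steps" in line and current is not None:
            hits.append(current)
            current = None
    return hits


if __name__ == "__main__":
    sample_log = [
        " > Model input: what's going on",
        " > Text splitted to sentences.",
        "[\"what's going on\"]",
        "   | > Decoder stopped with 'max_decoder_steps",
        " > Processing time: 7.120903015136719",
    ]
    print(truncated_inputs(sample_log))
```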

Related to:

Expected behavior
read the text like a human would

Environment (please complete the following information):

  • OS Platform and Distribution (e.g., Linux Ubuntu 16.04): NixOS GNU/Linux 21.05pre288911.65d6153aec8 (Okapi)
  • PyTorch or TensorFlow version (use command below): which command???
  • Python version: 3.8.9
  • CUDA/cuDNN version:
  • GPU model and memory:
  • Exact command to reproduce:
  • TTS version: 0.0.12

Additional context
I hope it’s OK to make this joke that the AI has a mental disorder and it’s not too offensive.

Issue Analytics

  • State: closed
  • Created 2 years ago
  • Comments: 7 (4 by maintainers)

Top GitHub Comments

erogol commented, May 14, 2021 (1 reaction)

just updated the title to be more informative. Hope it is fine with you.

stale[bot] commented, Jun 13, 2021 (0 reactions)

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look our discussion channels.
