Is there a REST API example for getting TTS Word Boundary?
I am using Python requests to get audio through the Cognitive Services Speech REST API. Is there an example somewhere that I can refer to, so that I can save time figuring it out?
https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/rest-text-to-speech
requests.post("https://westus2.tts.speech.microsoft.com/cognitiveservices/v1", data=data, headers=headers)
I don’t need most of the functions from the sdk so I prefer not to install the sdk if possible to keep the cloud function light.
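For reference, here is a minimal sketch of what the full REST call above might look like. The voice name, output format, User-Agent string and the helper names `build_tts_request` and `synthesize_to_file` are assumptions chosen for illustration, not something taken from this issue. Note that the response body of this call is only audio bytes; going by the replies in this thread, word boundary events appear to be delivered through the SDK rather than this endpoint.

```python
from xml.sax.saxutils import escape


def build_tts_request(text, subscription_key, region="westus2",
                      voice="en-US-JennyNeural",
                      output_format="audio-16khz-32kbitrate-mono-mp3"):
    """Return (url, headers, body) for a text-to-speech REST call.

    The voice and output format defaults are assumptions; use whatever
    your Speech resource supports.
    """
    url = f"https://{region}.tts.speech.microsoft.com/cognitiveservices/v1"
    headers = {
        "Ocp-Apim-Subscription-Key": subscription_key,
        "Content-Type": "application/ssml+xml",
        "X-Microsoft-OutputFormat": output_format,
        "User-Agent": "tts-rest-sketch",  # hypothetical application name
    }
    # Escape the text so characters such as & and < do not break the SSML.
    ssml = (
        "<speak version='1.0' xml:lang='en-US'>"
        f"<voice name='{voice}'>{escape(text)}</voice>"
        "</speak>"
    )
    return url, headers, ssml.encode("utf-8")


def synthesize_to_file(text, path, subscription_key, **kwargs):
    """Send the request and write the returned audio bytes to `path`."""
    # Imported here so build_tts_request can be used without requests installed.
    import requests

    url, headers, body = build_tts_request(text, subscription_key, **kwargs)
    response = requests.post(url, data=body, headers=headers, timeout=30)
    response.raise_for_status()
    with open(path, "wb") as audio_file:
        audio_file.write(response.content)
```

Usage would be something like `synthesize_to_file("Hello world", "hello.mp3", subscription_key="...")`.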
Does it cost more to get word boundary events?
Thank you
Issue Analytics
- State:
- Created 2 years ago
- Comments: 12 (6 by maintainers)
Top GitHub Comments
see here
yes
We don’t, but you can detect if it’s the last word by textOffset.
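One way to read the textOffset suggestion, as a rough sketch: compare the end of the reported word against the input text and see whether anything other than whitespace or punctuation follows. The helper name `is_last_word` is hypothetical, and this assumes plain-text input where the offset and length are counted in characters of the string you submitted (with SSML input the offsets may refer to the SSML string instead).

```python
import string


def is_last_word(text, text_offset, word_length):
    """Return True if only whitespace/punctuation follows the given word.

    `text_offset` and `word_length` are meant to come from a word boundary
    event; they are assumed to be character counts into `text`.
    """
    remainder = text[text_offset + word_length:]
    return all(ch.isspace() or ch in string.punctuation for ch in remainder)
```

For the input `"Hello brave world."`, the event for "world" (offset 12, length 5) leaves only `"."` as the remainder, so the helper treats it as the last word.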
Word boundary events are always sent before the corresponding audio.
You don’t need to care about the websocket connection. You can just use SpeechSynthesizer and the SDK will handle the connection.
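A minimal sketch of that approach with the Python Speech SDK (`azure-cognitiveservices-speech`) might look like the following. The function names are hypothetical, the key and region are placeholders you would supply, and the SDK import is deferred into the function only so the small conversion helper can be used on its own. Passing `audio_config=None` is assumed here to keep the audio in memory instead of playing it on a speaker.

```python
def ticks_to_ms(ticks):
    """Convert an audio offset in 100-nanosecond ticks to milliseconds."""
    return ticks / 10_000


def synthesize_with_boundaries(text, key, region):
    """Return (audio_bytes, boundaries) for `text`.

    Each entry in `boundaries` records where a word starts in the audio
    and in the input text, as reported by the SDK's word boundary events.
    """
    import azure.cognitiveservices.speech as speechsdk

    speech_config = speechsdk.SpeechConfig(subscription=key, region=region)
    synthesizer = speechsdk.SpeechSynthesizer(
        speech_config=speech_config, audio_config=None
    )

    boundaries = []

    def on_word_boundary(evt):
        # audio_offset is reported in ticks (100 ns units).
        boundaries.append({
            "audio_offset_ms": ticks_to_ms(evt.audio_offset),
            "text_offset": evt.text_offset,
            "word_length": evt.word_length,
        })

    # The SDK manages the underlying connection; we only subscribe to events.
    synthesizer.synthesis_word_boundary.connect(on_word_boundary)

    result = synthesizer.speak_text_async(text).get()
    return result.audio_data, boundaries
```

Collecting the events into a list keeps the handler trivial; any heavier processing can then happen after `speak_text_async(...).get()` returns.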