Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Is there a Rest API example for getting TTS Word Boundary?

See original GitHub issue

I am using Python requests to get audio through the cognitoService speech REST API. Is there an example somewhere that I refer to so that I can save time figuring it out?

https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/rest-text-to-speech

requests.post("https://westus2.tts.speech.microsoft.com/cognitiveservices/v1", data=data ,headers=headers)

I don’t need most of the functions from the sdk so I prefer not to install the sdk if possible to keep the cloud function light.

Does it cost more to get word boundary?

Thank you

Issue Analytics

State:
Created 2 years ago
Comments:12 (6 by maintainers)

Top GitHub Comments

1reaction

yulin-licommented, Mar 8, 2022

see here

1reaction

yulin-licommented, Mar 1, 2022

Are they guaranteed to come in the correct sequence?

yes

It doesn’t seem to have ‘last word’ event, does it?

We don’t, but you can detect if it’s the last word by textOffset

Are word boundary events always sent before audio data is sent?

Word boundary events are always sent before the corresponding audio

Should my server reset the websocket connection for every request? Does it to disconnect and reconnect the websocket connection every now and then?

You don’t need to care about the websocket connection. You can just use SpeechSynthesizer and SDK will handle the connection.

Top Results From Across the Web

Text to Speech WordBoundary with REST? - Microsoft Q&A

Text to Speech WordBoundary with REST? ... Is it possible to get word information with the REST call? It would be the WordBoundary...

Does Azure Text to Speech API support <mark> tags?

3 Answers. The <mark> tag is not currently supported. However, we do support word boundary events now; an example can be found here....

Text to speech REST API - Azure

Learn how to use the REST API to convert text into synthesized speech.

Enable read-aloud for your application with Azure neural TTS

Receives the audio stream and the TTS events (e.g., word boundary events) from Azure TTS, then makes postprocessing if necessary, and sends them ......

How to increase TTS request limits?

By default, The REST API limits requests to 25 per 5 seconds. Over the limit, it will be throttled. To increase the limit,...