Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

First Bookmark's audio_offset Incorrectly Includes Following Break Time Duration

See original GitHub issue

Describe the bug When a bookmark is placed at the very beginning of the SSML string and is followed by a <break> tag, the evt.audio_offset value of the bookmark incorrectly includes the break time. This does not occur when the bookmark is placed elsewhere in the SSML string.

To Reproduce Steps to reproduce the behavior:

Generate SSML that starts with a bookmark followed by a break, e.g., <bookmark mark="1"/> <break time="2s"/>.
Process the SSML with the Azure Text to Speech service.
In the bookmark_reached callback, log the evt.audio_offset value.

Example SSML:

<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="es-ES"><voice name="es-ES-AlvaroNeural"><prosody rate="1"><bookmark mark="1"/> <break time="2s"></break> ¡Hola! ¿Qué tal? <bookmark mark="2"/> <break time="2s"></break></prosody></voice></speak>

Expected behavior The evt.audio_offset value of the bookmark should not include the subsequent break time. As the bookmark is placed at the very start of the SSML, the evt.audio_offset should be 0.

Version of the Cognitive Services Speech SDK azure-cognitiveservices-speech 1.29.0

Platform, Operating System, and Programming Language

OS: Any
Programming language: Python

Issue Analytics

State:
Created 3 months ago
Comments:11 (2 by maintainers)

Top GitHub Comments

1reaction

Kerry-LinZhangcommented, Jul 27, 2023

Tracked as long-term fix, we have put it into backlog for future plan. Will continue follow up with it.

0reactions

Kerry-LinZhangcommented, Aug 4, 2023

Hi @anton-vidext currently we have 2 workaround for you to try,

 According to your workaround, you can also try this one without 'rate'

Using ‘<voice name="es-ES-AlvaroNeural"><bookmark mark="1"/> <mstts:audioduration value=“2s”><prosody volume="0">hola hola hola hola hola</prosody></mstts:audioduration></voice>’ to replace break

Top Results From Across the Web

Audio out of sync AFTER importing - Adobe Support Community

It looks like there is a problem when conforming the audio. But even the time duration is different from the original, there is...

Solved: [0S 12.0] Audio out of sync on Xfinity channel

Solved: Hi, I mainly use the Xfinity app on my Roku Express device. I tried your suggestions of uninstalling and reinstalling the app...

[vlc-devel] [PATCH] Subsync: added an easier way to sync ...

Added the following shortcuts (and made them configurable): - Shift-H: sets a ... _("Sub sync: bookmarked audio timestamp")); + break; ...

Solving Audio Sync Drift

Imported audio-only files in a sequence with PAL video may drift out of sync in some cases. Products Affected Final Cut Pro Resolution...

Synchronization problems and solutions in Logic Pro for Mac

This table describes common synchronization problems and solutions. If Logic Pro is synchronized to external word clock (Audio Sync Mode: External or Free), ......