First Bookmark's audio_offset Incorrectly Includes Following Break Time Duration

Describe the bug

When a bookmark is placed at the very beginning of the SSML string and is followed by a <break> tag, the evt.audio_offset value of the bookmark incorrectly includes the break time. This does not occur when the bookmark is placed elsewhere in the SSML string.

To Reproduce

Steps to reproduce the behavior:

  1. Generate SSML that starts with a bookmark followed by a break, e.g., <bookmark mark="1"/> <break time="2s"/>.
  2. Process the SSML with the Azure Text to Speech service.
  3. In the bookmark_reached callback, log the evt.audio_offset value.

Example SSML:

<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="es-ES"><voice name="es-ES-AlvaroNeural"><prosody rate="1"><bookmark mark="1"/> <break time="2s"></break> ¡Hola! ¿Qué tal? <bookmark mark="2"/> <break time="2s"></break></prosody></voice></speak>
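The reproduction steps above can be sketched in Python roughly as follows. This is a minimal sketch, not the reporter's actual script: the function names `ticks_to_ms` and `collect_bookmark_offsets` are hypothetical, the subscription key and region are placeholders you must supply, and it assumes `evt.audio_offset` is reported in 100-nanosecond ticks, as the Speech SDK documentation describes.

```python
# Sketch of the repro. Requires `pip install azure-cognitiveservices-speech`
# and a valid Speech resource key/region to actually call the service.

TICKS_PER_MS = 10_000  # assumption: audio_offset is in 100-nanosecond ticks

# The example SSML from the issue, split across lines for readability.
SSML = (
    '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="es-ES">'
    '<voice name="es-ES-AlvaroNeural"><prosody rate="1">'
    '<bookmark mark="1"/> <break time="2s"></break> ¡Hola! ¿Qué tal? '
    '<bookmark mark="2"/> <break time="2s"></break>'
    '</prosody></voice></speak>'
)


def ticks_to_ms(ticks: int) -> float:
    """Convert an offset in 100 ns ticks to milliseconds."""
    return ticks / TICKS_PER_MS


def collect_bookmark_offsets(subscription_key: str, region: str, ssml: str = SSML) -> dict:
    """Synthesize `ssml` and return {bookmark name: offset in ms}."""
    # Imported lazily so the helpers above work without the SDK installed.
    import azure.cognitiveservices.speech as speechsdk

    speech_config = speechsdk.SpeechConfig(subscription=subscription_key, region=region)
    # audio_config=None keeps the audio in memory instead of playing it.
    synthesizer = speechsdk.SpeechSynthesizer(speech_config=speech_config, audio_config=None)

    offsets = {}

    def on_bookmark(evt):
        # evt.text carries the bookmark's mark attribute.
        offsets[evt.text] = ticks_to_ms(evt.audio_offset)
        print(f"bookmark {evt.text}: audio_offset={evt.audio_offset} ticks")

    synthesizer.bookmark_reached.connect(on_bookmark)
    synthesizer.speak_ssml_async(ssml).get()
    return offsets
```

Calling `collect_bookmark_offsets("<your-key>", "<your-region>")` and inspecting the value logged for bookmark `1` should show whether the leading break is being counted in its offset.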

Expected behavior

The evt.audio_offset value of the bookmark should not include the subsequent break time. As the bookmark is placed at the very start of the SSML, the evt.audio_offset should be 0.

Version of the Cognitive Services Speech SDK

azure-cognitiveservices-speech 1.29.0

Platform, Operating System, and Programming Language

  • OS: Any
  • Programming language: Python

Issue Analytics

  • State: open
  • Created 3 months ago
  • Comments: 11 (2 by maintainers)

Top GitHub Comments

1 reaction
Kerry-LinZhang commented, Jul 27, 2023

This is tracked as a long-term fix and has been added to our backlog for future planning. We will continue to follow up on it.

0 reactions
Kerry-LinZhang commented, Aug 4, 2023

Hi @anton-vidext, we currently have two workarounds for you to try.

Based on your workaround, you can also try this one without 'rate':
  1. Replace the break with: <voice name="es-ES-AlvaroNeural"><bookmark mark="1"/> <mstts:audioduration value="2s"><prosody volume="0">hola hola hola hola hola</prosody></mstts:audioduration></voice>
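A rough sketch of how the SSML for this workaround might be assembled in Python is shown below. The helper name `build_workaround_ssml` is hypothetical, and the `xmlns:mstts` namespace URI on the `<speak>` element is an assumption taken from Microsoft's SSML examples rather than from this thread; the snippet only checks that the result is well-formed XML and has not been verified against the service.

```python
import xml.etree.ElementTree as ET


def build_workaround_ssml(duration: str = "2s",
                          filler: str = "hola hola hola hola hola") -> str:
    """Build SSML where the leading break is replaced by muted filler speech
    stretched or compressed to `duration` via mstts:audioduration."""
    return (
        '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        # Assumption: the mstts prefix must be declared for the element to parse.
        'xmlns:mstts="http://www.w3.org/2001/mstts" xml:lang="es-ES">'
        '<voice name="es-ES-AlvaroNeural"><bookmark mark="1"/> '
        f'<mstts:audioduration value="{duration}">'
        f'<prosody volume="0">{filler}</prosody>'
        '</mstts:audioduration></voice></speak>'
    )


ssml = build_workaround_ssml()
# Raises xml.etree.ElementTree.ParseError if the string is not well-formed.
root = ET.fromstring(ssml)
```

The resulting string can then be passed to `speak_ssml_async` in place of the original SSML to compare the offset reported for bookmark `1`.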

