TranscribeStreamingClient - bad transcription for StartStreamTranscriptionCommand
See original GitHub issueDescribe the bug
The transcription of a short sample wav file is wrong and different from the results obtained by the AWS Transcribe Console.
Your environment
SDK version number
{
"@aws-sdk/client-transcribe": "^3.6.1",
"@aws-sdk/client-transcribe-streaming": "^3.6.1",
"aws-sdk": "^2.850.0"
}
Is the issue in the browser/Node.js/ReactNative?
Node.js
Details of the browser/Node.js/ReactNative version
$ node --version
v12.16.1
Steps to reproduce
Run the code example in the package aws-transcribe-streaming-node-example:
git clone https://github.com/loretoparisi/aws-transcribe-streaming-node-example
cd aws-transcribe-streaming-node-example
npm install
node streamingclient.js sample.wav
Observed behavior
Transcription results:
7.81: Closure.
13.9: Uh huh. For the man. You there? Yeah, sure. Sure.
Expected behavior
The transcription should at least match the results from the AWS Transcribe Console tool:
{
"transcript": "Hi. My name is John McCollum. I'm a writer at large for The New York Times Magazine. And in 2016, I wrote a piece for the magazine about clouds. Yeah, it's a story about clouds. It doesn't sound very important, and I"
}
Full output is available here.
Screenshots
Additional context
Issue Analytics
- State:
- Created 3 years ago
- Comments:5 (1 by maintainers)
Top Results From Across the Web
Calling .destroy() doesn't destroy the transcribe client #3922
Describe the bug. TranscribeStreamingClient from @aws-sdk/client-transcribe-streaming has a destroy method that, as per the source code, ...
Read more >aws/aws-sdk-js-v3 - Gitter
i'm trying to use the TranscribeStreamingClient but i'm having some issues const { TranscribeStreamingClient, StartStreamTranscriptionCommand ...
Read more >Announcing Amazon Transcribe streaming transcription ...
Amazon Transcribe streaming transcription enables you to send an audio stream, and with a single API call, receive a stream of text in...
Read more >Transcribing with the AWS SDKs
const { TranscribeStreamingClient, StartStreamTranscriptionCommand, } = require("@aws-sdk/client-transcribe-streaming"); const { createReadStream } ...
Read more >Amazon Transcribe Streaming API without SDK - Stack Overflow
The new documentation for streaming transcription is also bad. When using its content-type the HTTP request return 404 Not Found with ...
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Hey @loretoparisi thanks for opening this, unfortunately I am not able to reproduce this I used the code and example found here: https://github.com/ajredniwja/transcribe-example.
Is it possible for you to share code and the file?
In the original code they mention they use a RAW PCM file, and you used a WAV, so I think the problem is likely there.