question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

suggestion: adding a more detailed data pipeline instruction on fine-tuning colab tutorial

See original GitHub issue

It’s been great to see more tutorials being added to learn speechbrain. For this old tutorial Pretrained Models and Fine-Tuning with huggface, I notice there is slight inconsistency between the loading data procedure to use pre-trained model and to fine-tune the model.

In using pre-trained model to transcribe the file, the audio is loaded using the EncoderDecoderASR interface, which will 1) load the audio using torchaudio 2) normalize the audio using an audio normalizer.

In the fine-tuning section, the audio is loaded using dataset.add_dynamic_item(sb.dataio.dataio.read_audio, takes="file_path", provides="signal"), and no normalizer is used within the function.

Since the decoding relies on normalized audio, wouldn’t it be more natural to fine-tune the model based on normalized audio rather than raw audio?

It might also be useful to add a section about how to use the fine-tuned model for transcribing file.

Issue Analytics

  • State:closed
  • Created 2 years ago
  • Comments:6

github_iconTop GitHub Comments

1reaction
popcornellcommented, May 5, 2021

I am going to fix this today, thank you @ziz19 for pointing this out

0reactions
ziz19commented, May 7, 2021

@popcornell Thank you for much for the explanation. Github username is perfectly fine! Speechbrain is so much easier to use compared to other frameworks. Thank you all for the hard work! I’m keep using Speechbrain now and in the future.

Read more comments on GitHub >

github_iconTop Results From Across the Web

The guide to fine-tuning Stable Diffusion with your own images
Easy guide to generate your own images with Dreambooth. Learn how to fine-tune Stable Diffusion.
Read more >
A Full Guide to Finetuning T5 for Text2Text and Building a ...
Fine-tune a pre-trained model for title generation on Colab, monitoring the chosen metric on the validation set using TensorBoard, ...
Read more >
Tutorial: Fine-Tuning a Model on Your Own Data - Haystack
Once you have collected training data, you can fine-tune your base models. We initialize a reader as a base model and fine-tune it...
Read more >
Using Tutorial Data from Google Drive in Colab - PyTorch
To do this, you'll first need to be logged into Google Drive. (For a full description of how to access data in Colab,...
Read more >
Classify structured data with feature columns | TensorFlow Core
Classify structured data with feature columns · Load a CSV file using Pandas. · Build an input pipeline to batch and shuffle the...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found