How to pretrain wav2vec2 on my own audio using an existing model as a base?

I’m calling pretraining using fairseq-hydra-train --run --config-name myconfig --config-dir examples/wav2vec/config/pretraining

which ends up getting stuck on the same loss of 6.658. It never improves on that no matter what I’ve tried.

Someone in another issue suggested using one of the base models for pretraining. I haven’t found where the docs explain how to do that. Any help appreciated.

Issue Analytics

  • State: open
  • Created 3 years ago
  • Comments: 8

Top GitHub Comments

2 reactions
olafthiele commented, Mar 10, 2021

Try @mailong25's repo for finetuning. It has pretty good instructions to get you going. As for the parameters, if you don’t share information on them or what you are doing, it is hard to give any advice.

1 reaction
jubick1337 commented, Mar 9, 2021

Hi @tensorfoo! You can change it in your config.yaml by setting restore_file in the checkpoint attribute. If the checkpoint is stored outside of ./checkpoints (which is the default dir for checkpoints), you should also set save_dir to the proper dir.
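
As a rough sketch of what that suggestion might look like in the pretraining config, the fragment below sets restore_file and save_dir under the checkpoint section. Both paths are hypothetical placeholders, not values from the thread. The reset_* lines are an assumption on my part rather than something the comment mentions: they are fairseq checkpoint options that may be useful when starting from someone else's released checkpoint, so the optimizer and learning-rate schedule begin fresh on the new data.

```yaml
# Fragment of a pretraining config (e.g. myconfig.yaml); only the
# checkpoint section is shown, the rest of the config stays as it was.
checkpoint:
  # Hypothetical path to the existing base model to continue from.
  restore_file: /path/to/base_model.pt
  # Hypothetical directory where new checkpoints will be written.
  save_dir: /path/to/my_checkpoints
  # Assumption, not from the thread: start with fresh optimizer state,
  # LR schedule, data iterator, and meters instead of the saved ones.
  reset_optimizer: true
  reset_lr_scheduler: true
  reset_dataloader: true
  reset_meters: true
```

Because fairseq-hydra-train accepts Hydra-style overrides, the same keys can presumably also be passed on the command line as checkpoint.restore_file=... and checkpoint.save_dir=... instead of editing the file.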
