question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Pre-trained models take 2 - tracker

See original GitHub issue

On each of the datasets provided, we must train a Deepspeech model and document exactly how to reproduce these results! The overall architecture is encompassed in this command:

python train.py  --rnn_type gru --hidden_size 800 --hidden_layers 5 --checkpoint --visdom --train_manifest /path/to/train_manifest.csv --val_manifest /path/to/val_manifest.csv --epochs 100 --num_workers $(nproc) --cuda

In the above command you must replace the manifests paths with the correct paths to the dataset. A few notes:

  • Train till convergence (should get a nice smooth training curve hopefully!)
  • For smaller datasets, you may need to reduce the learning rate annealing by adding the flag --learning anneal and setting it to a smaller value, like 1.01. For larger datasets, the default is fine (up to around 4.5k hours from internal testing on the deepspeech.torch version)

If you use augmentation or noise injection give as much details as possible for me to wrap up it all nicely.

A release will be cut from the DeepSpeech package that will have the models, and a reference to the latest release added to the README to find latest models!

Progress tracker for datasets:

  • AN4
  • TEDLium
  • LibriSpeech

Let me know if you plan on working on running any of these, and I’ll update the ticket with details!

Issue Analytics

  • State:closed
  • Created 6 years ago
  • Comments:11 (7 by maintainers)

github_iconTop GitHub Comments

4reactions
oplatekcommented, Feb 21, 2018

Hi guys, as above is there any unidirectional model? If yes is there a streaming interface for it?

Hints appreciated in what needs to be done to have both a) the model b) the streaming decoding interface

3reactions
yyang29commented, Jan 17, 2018

Hi guys, just to confirm, is there any unidirectional model released yet?

Read more comments on GitHub >

github_iconTop Results From Across the Web

Introduction to the model—ArcGIS pretrained models
This model automates the object tracking process significantly, which speeds up motion imagery analysis workflows. It can be used in the Full Motion...
Read more >
Fast-Track Your AI with NVIDIA Pretrained Models - YouTube
The pretrained models span a wide range of computer vision use cases, such as detection, segmentation, and classification.
Read more >
tensorlayer/pretrained-models - GitHub
Model Code Parameter Top‑1 Accuracy Top‑5 Accuracy VGG 16 code model 71.5 89.8 VGG 19 code model (from machrisaa/tensorflow‑vgg) 71.1 89.8 ResNet V1 50 75.2 92.2...
Read more >
Fine-tune a pretrained model - Hugging Face
When you use a pretrained model, you train it on a dataset specific to your task. This is known as fine-tuning, an incredibly...
Read more >
Hugging Face Pre-trained Models: Find the Best One for Your ...
These models take up a lot of space and when you run the above code for the first ... 2. Import the model....
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found