
Pre-trained models take 2 - tracker


For each of the datasets provided, we must train a DeepSpeech model and document exactly how to reproduce the results! The overall architecture is captured by this command:

python train.py  --rnn_type gru --hidden_size 800 --hidden_layers 5 --checkpoint --visdom --train_manifest /path/to/train_manifest.csv --val_manifest /path/to/val_manifest.csv --epochs 100 --num_workers $(nproc) --cuda

In the above command you must replace the manifest paths with the correct paths for the dataset. A few notes:

Train till convergence (should get a nice smooth training curve hopefully!)
For smaller datasets, you may need to reduce the learning rate annealing by adding the flag --learning_anneal and setting it to a smaller value, like 1.01. For larger datasets, the default is fine (up to around 4.5k hours, based on internal testing with the deepspeech.torch version)
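
To see why a smaller annealing value matters on small datasets, here is a minimal sketch. It assumes the learning rate is divided by the annealing factor once per epoch; the helper name annealed_lr and the starting learning rate are illustrative and not taken from train.py.

```python
def annealed_lr(initial_lr: float, anneal: float, epoch: int) -> float:
    """Learning rate after `epoch` completed epochs.

    Assumption: the learning rate is divided by `anneal` once per epoch.
    """
    return initial_lr / (anneal ** epoch)


if __name__ == "__main__":
    initial_lr = 3e-4  # illustrative starting value, not necessarily the script's default
    for anneal in (1.1, 1.01):
        for epoch in (0, 10, 50, 100):
            lr = annealed_lr(initial_lr, anneal, epoch)
            print(f"anneal={anneal:<5} epoch={epoch:>3} lr={lr:.3e}")
```

Under this assumption, a factor of 1.1 shrinks the learning rate by roughly 13,800x over 100 epochs, while 1.01 shrinks it by only about 2.7x, so a run that needs many epochs on a small dataset keeps a usable learning rate for longer.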

If you use augmentation or noise injection, give as much detail as possible so I can wrap it all up nicely.

A release containing the models will be cut from the DeepSpeech package, and a reference to the latest release will be added to the README so the latest models are easy to find!

Progress tracker for datasets:

AN4
TEDLium
LibriSpeech

Let me know if you plan on running any of these, and I’ll update the ticket with details!

Issue Analytics

State:
Created 6 years ago
Comments: 11 (7 by maintainers)

Top GitHub Comments

4 reactions

oplatek commented, Feb 21, 2018

Hi guys, as above, is there any unidirectional model? If yes, is there a streaming interface for it?

Hints appreciated on what needs to be done to have both a) the model and b) the streaming decoding interface.

3 reactions

yyang29 commented, Jan 17, 2018

Hi guys, just to confirm, is there any unidirectional model released yet?
