Pre-trained models take 2 - tracker
See original GitHub issueOn each of the datasets provided, we must train a Deepspeech model and document exactly how to reproduce these results! The overall architecture is encompassed in this command:
python train.py --rnn_type gru --hidden_size 800 --hidden_layers 5 --checkpoint --visdom --train_manifest /path/to/train_manifest.csv --val_manifest /path/to/val_manifest.csv --epochs 100 --num_workers $(nproc) --cuda
In the above command you must replace the manifests paths with the correct paths to the dataset. A few notes:
- Train till convergence (should get a nice smooth training curve hopefully!)
- For smaller datasets, you may need to reduce the learning rate annealing by adding the flag
--learning anneal
and setting it to a smaller value, like1.01
. For larger datasets, the default is fine (up to around 4.5k hours from internal testing on the deepspeech.torch version)
If you use augmentation or noise injection give as much details as possible for me to wrap up it all nicely.
A release will be cut from the DeepSpeech package that will have the models, and a reference to the latest release added to the README to find latest models!
Progress tracker for datasets:
- AN4
- TEDLium
- LibriSpeech
Let me know if you plan on working on running any of these, and I’ll update the ticket with details!
Issue Analytics
- State:
- Created 6 years ago
- Comments:11 (7 by maintainers)
Top Results From Across the Web
Introduction to the model—ArcGIS pretrained models
This model automates the object tracking process significantly, which speeds up motion imagery analysis workflows. It can be used in the Full Motion...
Read more >Fast-Track Your AI with NVIDIA Pretrained Models - YouTube
The pretrained models span a wide range of computer vision use cases, such as detection, segmentation, and classification.
Read more >tensorlayer/pretrained-models - GitHub
Model Code Parameter Top‑1 Accuracy Top‑5 Accuracy
VGG 16 code model 71.5 89.8
VGG 19 code model (from machrisaa/tensorflow‑vgg) 71.1 89.8
ResNet V1 50 75.2 92.2...
Read more >Fine-tune a pretrained model - Hugging Face
When you use a pretrained model, you train it on a dataset specific to your task. This is known as fine-tuning, an incredibly...
Read more >Hugging Face Pre-trained Models: Find the Best One for Your ...
These models take up a lot of space and when you run the above code for the first ... 2. Import the model....
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Hi guys, as above is there any unidirectional model? If yes is there a streaming interface for it?
Hints appreciated in what needs to be done to have both a) the model b) the streaming decoding interface
Hi guys, just to confirm, is there any unidirectional model released yet?