WER increases with increasing beam size, and with including LM
See original GitHub issueHello,
I am running experiments with the WSJ corpus using PyTorch. I only use the seq2seq part of the code with --mtlalpha set to 0. During decoding, even without a language model, the CER, WER with greedy decoding is best and increasing the beam size only leads to increased errors.
Upon LM training on the WSJ corpus, the best model perplexity reached is:
2018-04-04 21:44:27,006 (lm_pytorch:248) INFO: iteration: 79400
2018-04-04 21:44:27,006 (lm_pytorch:249) INFO: training perplexity: 2.81606144442938
2018-04-04 21:45:16,637 (lm_pytorch:255) INFO: epoch: 24
2018-04-04 21:45:16,638 (lm_pytorch:256) INFO: validation perplexity: 4.354437210237004
And decoding with including such a LM, worsens CER, WER results further.
I believe this might be due to some issue in the decoder or the beam search? Please let me know if you need any further details.
Issue Analytics
- State:
- Created 5 years ago
- Comments:21 (11 by maintainers)
Top Results From Across the Web
Electron beam sizes and lifetimes at MAX II and MAX III - lucris
and bunch length at higher currents as well as the lower than design lifetime in MAX III. The increase in horizontal beam size...
Read more >Towards Ultimate High-Power Scaling: Coherent Beam ...
With some considerations such as expanding the initial beam size and pointing of each beam (controlling the beams overlap with steering mirror, adaptive...
Read more >Electron modulated arc therapy (EMAT) using photon MLC for ...
The arc beam has a higher bremsstrahlung dose than the static beam at the isocenter due to crossfire, but choosing a field size...
Read more >Chapter 8 ELECTRON BEAMS: PHYSICAL AND CLINICAL ...
depends on field size and electron beam energy. 8.1.2. Electron interactions with ... The range of electrons increases with increasing electron energy. The....
Read more >High-resolution dispersion-based measurement of the ...
the time-resolved beam size of the electrons in a dispersive location for a single electron beam ... The initial energy spread is increased....
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Hello, @sw005320 @ShigekiKarita @m-wiesner,
Here are the results for varying beam size with WSJ where error increases by increasing the beam size. The vertical axis shows character or word error rate and the horizontal axis shows beam size. My experiments parameters are:
backend = pytorch mtl alpha = 0 etype = blstmp bs = 48 ls = unigram 0.05 penalty = 0.0 max min ratio = 0.0, 0.0 ctc weight = 0 rnn_lm weights = 0.0
Ok, Thanks for your immediate response.