[enhancement] Polyak Averaging could be done faster
See original GitHub issueThis is rather minor, but polyak averaging in DQN/SAC/TD3 could be done faster with far fewer intermediate tensors using torch.addcmul_
https://pytorch.org/docs/stable/torch.html#torch.addcmul.
Issue Analytics
- State:
- Created 3 years ago
- Comments:5 (5 by maintainers)
Top Results From Across the Web
Lecture 6 Optimization for Deep Neural Networks - CMSC 35246
Polyak Averaging. • On Slides but for self study: Newton and Quasi Newton. Methods (BFGS, L-BFGS, Conjugate Gradient).
Read more >Ensemble Neural Network Model Weights in Keras (Polyak ...
This is called Polyak-Ruppert averaging and can be further improved by using a linearly or exponentially decreasing weighted average of the ...
Read more >Optimal non-asymptotic analysis of the Ruppert–Polyak ...
Abstract. This paper is devoted to the non-asymptotic analysis of the Ruppert–Polyak averaging method introduced in Polyak and Juditsky (1992) ...
Read more >arXiv:2106.02613v3 [stat.ML] 1 Feb 2022
may converge faster than its symmetric alternative, residual ... stance, while Polyak's averaging (Lillicrap et al., 2015), a.
Read more >Polyak Averaging Explained | Papers With Code
Polyak Averaging is an optimization technique that sets final parameters to ... Fast Neural Architecture Search of Compact Semantic Segmentation Models via ...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
I tested these on sac, there is a good 1.5-1.8 speedup here. More on the GPU than the cpu because of data transfers.
This is actually quite large, at 1Million polyak updates, this shaves off 28 minutes for cpu and 2 hours 11 minutes on GPU.
@PartiallyTyped Could you quickly try on cpu but with
num_threads=1
?That’s the only case where I did not see an improvement yet.