question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

What's the baseline resnet50 model used in spare_learning experiments?

See original GitHub issue

image Table3 in paper shows the accuracy of resnet50 and does comparsion with other approches. The proposed method Sparse momentum achevied 74.9% top1 accuracy. Does the baseline model have 79.3% top1? Ifso, the sparse model has 4.4% drop compared to baseline dense model.

Issue Analytics

  • State:closed
  • Created 4 years ago
  • Comments:5 (3 by maintainers)

github_iconTop GitHub Comments

2reactions
TimDettmerscommented, Aug 13, 2019

Thank you for pointing this out, this is a common question that I get!

I misreported multi-crop top1/top5 for the baseline and this will be corrected in the new version of the paper (you can expect it on Wednesday). The scores in this table come from a ResNet-50 baseline with cosine learning rate schedule with warmup and label smoothing but fully sparse layers which yield 77.0% accuracy for 100% weights. You can find the code for this in the fully sparse subfolder. I also replicated on the dynamic sparse codebase with dense first convolution and downsample convolutional layers (as done by dynamic sparse) which can be found in the partially dense subfolder. This ResNet-50 has a baseline performance of 74.9% accuracy with 100% weights. In the latter case, sparse momentum still retains state-of-the-art performance with 72.4 and 74.2 Top-1 for 10% and 20% weights. Does this answer your question?

Let me know if something is still unclear or if you have more questions.

0reactions
TimDettmerscommented, Aug 23, 2019

I just submitted the paper to arXiv. It will be released on Monday.

Read more comments on GitHub >

github_iconTop Results From Across the Web

ResNet-50: The Basics and a Quick Tutorial - Datagen
This tutorial implements a Restnet-50 model to use transfer learning and solve TensorFlow's demo image classification problem. It will build a classifier to ......
Read more >
Beginners' Guide to Image Classification: VGG-19, Resnet 50 ...
VGG-19 is an improvement of the model VGG-16. It is a convolution neural network model with 19 layers. It is built by stacking...
Read more >
Efficient deep learning training - using the practical example of ...
The ResNet-50 model used for this experiment consists of 48 convolutional layers, as well as a MaxPool and an Average Pool layer (48+1+1=50...
Read more >
Microsoft Vision Model ResNet-50 combines web-scale data ...
We evaluate Microsoft Vision Model ResNet-50 against the state-of-the-art pretrained ResNet-50 models and the baseline PyTorch implementation of ...
Read more >
Comparison of Baseline models ResNet-50/101 - ResearchGate
In this study, a review of the dataset within the fashion world and the DL methods applied on, it is presented to help...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found