Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

What's the baseline resnet50 model used in spare_learning experiments?

See original GitHub issue

Table3 in paper shows the accuracy of resnet50 and does comparsion with other approches. The proposed method Sparse momentum achevied 74.9% top1 accuracy. Does the baseline model have 79.3% top1? Ifso, the sparse model has 4.4% drop compared to baseline dense model.

Issue Analytics

State:
Created 4 years ago
Comments:5 (3 by maintainers)

Top GitHub Comments

2reactions

TimDettmerscommented, Aug 13, 2019

Thank you for pointing this out, this is a common question that I get!

I misreported multi-crop top1/top5 for the baseline and this will be corrected in the new version of the paper (you can expect it on Wednesday). The scores in this table come from a ResNet-50 baseline with cosine learning rate schedule with warmup and label smoothing but fully sparse layers which yield 77.0% accuracy for 100% weights. You can find the code for this in the fully sparse subfolder. I also replicated on the dynamic sparse codebase with dense first convolution and downsample convolutional layers (as done by dynamic sparse) which can be found in the partially dense subfolder. This ResNet-50 has a baseline performance of 74.9% accuracy with 100% weights. In the latter case, sparse momentum still retains state-of-the-art performance with 72.4 and 74.2 Top-1 for 10% and 20% weights. Does this answer your question?

Let me know if something is still unclear or if you have more questions.

0reactions

TimDettmerscommented, Aug 23, 2019

I just submitted the paper to arXiv. It will be released on Monday.

Top Results From Across the Web

ResNet-50: The Basics and a Quick Tutorial - Datagen

This tutorial implements a Restnet-50 model to use transfer learning and solve TensorFlow's demo image classification problem. It will build a classifier to ......

Beginners' Guide to Image Classification: VGG-19, Resnet 50 ...

VGG-19 is an improvement of the model VGG-16. It is a convolution neural network model with 19 layers. It is built by stacking...

Efficient deep learning training - using the practical example of ...

The ResNet-50 model used for this experiment consists of 48 convolutional layers, as well as a MaxPool and an Average Pool layer (48+1+1=50...

Microsoft Vision Model ResNet-50 combines web-scale data ...

We evaluate Microsoft Vision Model ResNet-50 against the state-of-the-art pretrained ResNet-50 models and the baseline PyTorch implementation of ...

Comparison of Baseline models ResNet-50/101 - ResearchGate

In this study, a review of the dataset within the fashion world and the DL methods applied on, it is presented to help...