"pre_encoded" directory in Pascal dataset
See original GitHub issueCould you please give me a hint what the pre_encoded
folder/option in pascal_voc_loader.py
refers to?
I do not have that folder or any .mat
file that the loader wants to read from it.
Thanks.
Issue Analytics
- State:
- Created 6 years ago
- Comments:6 (2 by maintainers)
Top Results From Across the Web
"pre_encoded" directory in Pascal dataset · Issue #26 - GitHub
The idea behind having a pre_encoded directory is to avoid this mapping from RGB to classwise (single channel ground truth) images during ...
Read more >The PASCAL Visual Object Classes Homepage
The PASCAL VOC project: Provides standardised image data sets for object class recognition; Provides a common set of tools for accessing the data...
Read more >US20160078874A1 - Data carriage in encoded and pre-encoded ...
A method for a machine or group of machines to carry watermark data in an encoded audio data frame of an audio signal...
Read more >PASCAL-Context Dataset - Stanford Computer Science
Class Name # of Images Average Area (% of image)
empty 0
accordion 2 5.26%
aeroplane 597 17.41%
Read more >ChangeMan ZMF Customization Guide - Micro Focus
down menu to select the folder containing the ZMF documentation suite. ... Provide values for component names, data set names, parameters, subparameters,.
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
The pascal voc dataset provide
RGB
labelpre_encoded
folder will be created for savinglabelid
labelThe splits across the pascal VOC 2012 and SBD datasets are:
It can be a bit confusing because both
voc_train
andvoc_val
have some overlap withsbd_train
. It is made more confusing by the fact that different research papers use different combinations of the data. For example, CRF-as-RNN uses 11,685 training images (the images fromvoc_train
+sbd_train
+sbd_val
), and only uses the images fromvoc_val
that do not occur in eithersbd_train
orsbd_val
(leaving 346 images in total) for validation.The original FCN paper used the 2011 VOC training and validation and data (without SBD). However, the updated FCN PAMI version included experiments that used
voc_train
+sbd_train
for training and used the images fromvoc_val
that were not in SBD train for validation (736 in total since they were usingVOC 2011
- ifVOC 2012
is used, as it is in this repo, this split has 904 images). See the footnote on page 7 of the paper for more details.As a result of these differences, there are several ways to select the data splits (see e.g. this implementation for some commonly used setups).
How much difference does the extra data make? There is a helpful ablation study in the FCN PAMI paper that shows that moving from
voc_train
(2011) tovoc_train + SBD_train
with FCN-32s improved the validation score from57.7 mIoU
to63.6 mIoU
.The number of masks (9733 in total) in the
pre_encoded
directory is the number of unique images across both thetrain_aug
andval
sets. I.e. inside thepascalVOCLoader
class you should find that:This repo follows the data splits described in the FCN PAMI paper. However, in the current implementation, (as far as I understand it), some of the training images are repeated in the
train_aug
split. I’ll submit a PR for that.