Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

"pre_encoded" directory in Pascal dataset

See original GitHub issue

Could you please give me a hint what the pre_encoded folder/option in pascal_voc_loader.py refers to? I do not have that folder or any .mat file that the loader wants to read from it. Thanks.

Issue Analytics

State:
Created 6 years ago
Comments:6 (2 by maintainers)

Top GitHub Comments

1reaction

jetxacommented, Sep 15, 2017

The pascal voc dataset provide RGB label pre_encoded folder will be created for saving labelid label

0reactions

albaniecommented, Jan 3, 2018

The splits across the pascal VOC 2012 and SBD datasets are:

    voc_train: 1464 images
    voc_val: 1449 images
    sbd_train: 8498 images
    sbd_val: 2857 images

It can be a bit confusing because both voc_train and voc_val have some overlap with sbd_train. It is made more confusing by the fact that different research papers use different combinations of the data. For example, CRF-as-RNN uses 11,685 training images (the images from voc_train + sbd_train + sbd_val), and only uses the images from voc_val that do not occur in either sbd_train or sbd_val (leaving 346 images in total) for validation.

The original FCN paper used the 2011 VOC training and validation and data (without SBD). However, the updated FCN PAMI version included experiments that used voc_train + sbd_train for training and used the images from voc_val that were not in SBD train for validation (736 in total since they were using VOC 2011 - if VOC 2012 is used, as it is in this repo, this split has 904 images). See the footnote on page 7 of the paper for more details.

As a result of these differences, there are several ways to select the data splits (see e.g. this implementation for some commonly used setups).

How much difference does the extra data make? There is a helpful ablation study in the FCN PAMI paper that shows that moving from voc_train (2011) to voc_train + SBD_train with FCN-32s improved the validation score from 57.7 mIoU to 63.6 mIoU.

The number of masks (9733 in total) in the pre_encoded directory is the number of unique images across both the train_aug and val sets. I.e. inside the pascalVOCLoader class you should find that:

len(np.unique(self.files['train_aug'] + self.files['val'])) # gives 9733

This repo follows the data splits described in the FCN PAMI paper. However, in the current implementation, (as far as I understand it), some of the training images are repeated in the train_aug split. I’ll submit a PR for that.

Top Results From Across the Web

"pre_encoded" directory in Pascal dataset · Issue #26 - GitHub

The idea behind having a pre_encoded directory is to avoid this mapping from RGB to classwise (single channel ground truth) images during ...

The PASCAL Visual Object Classes Homepage

The PASCAL VOC project: Provides standardised image data sets for object class recognition; Provides a common set of tools for accessing the data...

US20160078874A1 - Data carriage in encoded and pre-encoded ...

A method for a machine or group of machines to carry watermark data in an encoded audio data frame of an audio signal...

PASCAL-Context Dataset - Stanford Computer Science

Class Name # of Images Average Area (% of image) empty 0 accordion 2 5.26% aeroplane 597 17.41%

ChangeMan ZMF Customization Guide - Micro Focus

down menu to select the folder containing the ZMF documentation suite. ... Provide values for component names, data set names, parameters, subparameters,.