How to change VGG16.py to wire conv4_3 instead of conv5_3?

I have trained Faster R-CNN on a custom dataset in Pascal VOC format. Now I want to use the 3rd or 4th convolutional layers of VGG16 to deal with objects of a specific size, but I don't know exactly how to change the VGG16 net. I tried removing the final lines of this section:

  def _image_to_head(self, is_training, reuse=False):
    with tf.variable_scope(self._scope, self._scope, reuse=reuse):
      net = slim.repeat(self._image, 2, slim.conv2d, 64, [3, 3],
                        trainable=False, scope='conv1')
      net = slim.max_pool2d(net, [2, 2], padding='SAME', scope='pool1')
      net = slim.repeat(net, 2, slim.conv2d, 128, [3, 3],
                        trainable=False, scope='conv2')
      net = slim.max_pool2d(net, [2, 2], padding='SAME', scope='pool2')
      net = slim.repeat(net, 3, slim.conv2d, 256, [3, 3],
                        trainable=is_training, scope='conv3')
      net = slim.max_pool2d(net, [2, 2], padding='SAME', scope='pool3')
      net = slim.repeat(net, 3, slim.conv2d, 512, [3, 3],
                        trainable=is_training, scope='conv4')
      net = slim.max_pool2d(net, [2, 2], padding='SAME', scope='pool4')
      net = slim.repeat(net, 3, slim.conv2d, 512, [3, 3],
                        trainable=is_training, scope='conv5')

But it didn't work. What else needs to be modified to connect earlier layers of VGG16 to the RPN?

I would appreciate some help from experts on this. Thanks.
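One thing worth checking when moving the tap point: every pooling layer that is dropped halves the cumulative downsampling factor (the feature stride), so any stride that the RPN anchors or RoI pooling assume for conv5_3 no longer holds. The sketch below is plain shape arithmetic for the snippet above, assuming 2x2 pools with stride 2 and padding='SAME' and size-preserving 3x3 convolutions. The helper names are hypothetical and not part of the repository, and where the repository stores its stride is not shown in this issue.

```python
import math

# Number of 2x2/stride-2 pools applied before the END of each conv block,
# read off the snippet above (pool1 follows conv1, pool2 follows conv2, ...).
POOLS_BEFORE = {"conv1": 0, "conv2": 1, "conv3": 2, "conv4": 3, "conv5": 4}

# Output channels of each block, as written in the snippet above.
CHANNELS = {"conv1": 64, "conv2": 128, "conv3": 256, "conv4": 512, "conv5": 512}


def feat_stride(tap):
    """Cumulative downsampling factor of the feature map at the end of `tap`."""
    return 2 ** POOLS_BEFORE[tap]


def feat_size(height, width, tap):
    """Spatial size at the end of `tap`; a SAME-padded stride-2 pool
    maps a side of length n to ceil(n / 2)."""
    for _ in range(POOLS_BEFORE[tap]):
        height, width = math.ceil(height / 2), math.ceil(width / 2)
    return height, width


if __name__ == "__main__":
    # Hypothetical 600x800 input image, chosen only for illustration.
    for tap in ("conv3", "conv4", "conv5"):
        print(tap, "stride", feat_stride(tap),
              "size", feat_size(600, 800, tap),
              "channels", CHANNELS[tap])
```

Under these assumptions, tapping the end of conv4 (with pool4 and conv5 removed) gives a stride of 8 instead of 16, and tapping conv3 gives a stride of 4 with 256 channels instead of 512. Any downstream code that hard-codes the conv5 values would need to be updated to match.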

Top GitHub Comments

yaoqi-zd commented, Jan 13, 2018

@zqdeepbluesky You can check the feature fusion methods used in HyperNet or FPN (Feature Pyramid Network).

yaoqi-zd commented, Aug 13, 2017

@hadi-ghnd Hmm… I think that although a lower feature map (like conv4_3) has higher resolution, it carries weaker semantic information, which may harm localization performance. Have you tried adding a deconv layer on top of conv5_3 to bring it to the same size as conv4_3 (call this feature map deconv5_3), and then combining conv4_3 and deconv5_3 to get a feature map that has both higher resolution and strong semantics?
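To make the shapes in this suggestion concrete, here is a minimal sketch of the size bookkeeping only, not a trained layer. It assumes SAME-padded stride-2 pools, a stride-2 transposed convolution with SAME padding (which doubles each side), 512 output channels for the deconv, and channel-wise concatenation as the way of "combining". The comment does not specify any of these, and the function names are hypothetical.

```python
import math


def same_pool(n):
    """Output length of a stride-2 pool with padding='SAME'."""
    return math.ceil(n / 2)


def fused_shape(height, width, c4=512, c_deconv=512):
    """Return ((H, W, C) of the fused map, (rows, cols) to crop from deconv5_3)."""
    # conv4_3 sits after three pools (pool1..pool3).
    h4, w4 = height, width
    for _ in range(3):
        h4, w4 = same_pool(h4), same_pool(w4)

    # conv5_3 sits after one more pool (pool4).
    h5, w5 = same_pool(h4), same_pool(w4)

    # Assumed: a stride-2 transposed conv with SAME padding doubles each side.
    hd, wd = 2 * h5, 2 * w5

    # ceil() in the pool can make deconv5_3 one row/column larger than
    # conv4_3, so it has to be cropped before the two maps are combined.
    crop = (hd - h4, wd - w4)

    # Assumed: concatenation along the channel axis.
    return (h4, w4, c4 + c_deconv), crop


if __name__ == "__main__":
    # Hypothetical 600x800 input image, chosen only for illustration.
    print(fused_shape(600, 800))
```

For a 600x800 input this gives a fused map of 75x100 with 1024 channels, and deconv5_3 comes out at 76x100, so one row has to be cropped. Whatever consumes the fused map would also need to accept the larger channel count, or a 1x1 convolution could reduce it first.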
