Conv implementation does not support grouped convolutions for now. GPU / CPU

Hi, I am trying to train locally on my Mac without a GPU. Thank you Experiencor, I can train successfully on the raccoon dataset.

This is my config.json file:

`{
"model" : {
    "min_input_size":       1000,
    "max_input_size":       1920,
    "anchors":              [21,68, 27,83, 63,64, 77,77, 96,87, 111,101, 161,98, 178,131, 209,110],
    "labels":               ["1", "2", "3", "4"]
},

"train": {
    "train_image_folder":   "/Users/moritz.b/Desktop/keras-yolo3-master/yolo/",
    "train_annot_folder":   "/Users/moritz.b/Desktop/keras-yolo3-master/yolo/outputs/",
    "cache_name":           "localizer.pkl",
    "train_times":          6,
    "batch_size":           2,
    "learning_rate":        1e-4,
    "nb_epochs":            10,
    "warmup_epochs":        2,
    "ignore_thresh":        0.5,
    "gpus":                 "0",

    "grid_scales":          [1,1,1],
    "obj_scale":            5,
    "noobj_scale":          1,
    "xywh_scale":           1,
    "class_scale":          1,

    "tensorboard_dir":      "logs",
    "saved_weights_name":   "localisations.h5",
    "debug":                true
},

"valid": {
    "valid_image_folder":   "",
    "valid_annot_folder":   "",
    "cache_name":           "",
    "valid_times":          1
}

}`
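A config like this can be sanity-checked before training. The following is a minimal sketch, not part of the repository: `summarize_yolo_config` is a hypothetical helper, and it assumes the standard YOLOv3 head layout, in which each output scale uses 3 anchors and each anchor predicts 4 box coordinates, 1 objectness score, and one score per class.

```python
import json


def summarize_yolo_config(config_text):
    """Parse a config JSON string and derive the sizes the model head depends on.

    Hypothetical helper for illustration; assumes the standard YOLOv3 layout.
    """
    config = json.loads(config_text)
    model = config["model"]

    anchors = model["anchors"]
    # Anchors are stored as a flat list of width,height pairs.
    if len(anchors) % 2 != 0:
        raise ValueError("anchors must contain an even number of values")

    nb_classes = len(model["labels"])
    return {
        "anchor_pairs": len(anchors) // 2,
        "nb_classes": nb_classes,
        # Assumption: 3 anchors per scale, each with 4 box values,
        # 1 objectness score and nb_classes class scores.
        "filters_per_output": 3 * (5 + nb_classes),
    }


sample = """
{
  "model": {
    "anchors": [21,68, 27,83, 63,64, 77,77, 96,87, 111,101, 161,98, 178,131, 209,110],
    "labels": ["1", "2", "3", "4"]
  }
}
"""
summary = summarize_yolo_config(sample)
print(summary)
```

Under that assumption, four labels give 27 filters per output layer, so any cached data or weights prepared for a different number of classes would no longer match the head.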

When I start training, the error message says:

Traceback (most recent call last):
  File "train.py", line 290, in <module>
    _main_(args)
  File "train.py", line 267, in _main_
    max_queue_size = 8
  File "/Users/moritz.b/opt/anaconda3/envs/yolo3/lib/python3.5/site-packages/keras/legacy/interfaces.py", line 91, in wrapper
    return func(*args, **kwargs)
  File "/Users/moritz.b/opt/anaconda3/envs/yolo3/lib/python3.5/site-packages/keras/engine/training.py", line 1732, in fit_generator
    initial_epoch=initial_epoch)
  File "/Users/moritz.b/opt/anaconda3/envs/yolo3/lib/python3.5/site-packages/keras/engine/training_generator.py", line 220, in fit_generator
    reset_metrics=False)
  File "/Users/moritz.b/opt/anaconda3/envs/yolo3/lib/python3.5/site-packages/keras/engine/training.py", line 1514, in train_on_batch
    outputs = self.train_function(ins)
  File "/Users/moritz.b/opt/anaconda3/envs/yolo3/lib/python3.5/site-packages/tensorflow_core/python/keras/backend.py", line 3476, in __call__
    run_metadata=self.run_metadata)
  File "/Users/moritz.b/opt/anaconda3/envs/yolo3/lib/python3.5/site-packages/tensorflow_core/python/client/session.py", line 1472, in __call__
    run_metadata_ptr)
tensorflow.python.framework.errors_impl.UnimplementedError: Fused conv implementation does not support grouped convolutions for now.
  [[{{node conv_81/BiasAdd}}]]

Does anybody have an idea?

Issue Analytics

  • State:open
  • Created 4 years ago
  • Reactions:1
  • Comments:10

Top GitHub Comments

1 reaction
pushkarjain commented, May 22, 2020

@YoonSungLee: Yes, I was able to solve the problem. I had to use a different cache_name for the data in config.json. This error occurs when you have updated the output layer to accommodate new classes, but the existing pickle cache file was created with the old class list.

Hope that helps.
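A stale cache of this kind can be detected before training starts. The sketch below is only an illustration: `cache_is_stale` is a hypothetical helper, and it assumes the cache is a pickled dict whose `seen_labels` entry holds the class names found when the cache was built (an assumption about the cache layout, not something confirmed in this thread).

```python
import os
import pickle


def cache_is_stale(config_labels, cache_path, labels_key="seen_labels"):
    """Return True if the cached labels differ from the labels in the config.

    Hypothetical helper; assumes the cache is a pickled dict and that
    cache[labels_key] is a dict or list of class names.
    """
    if not os.path.exists(cache_path):
        # No cache yet, so there is nothing that could be out of date.
        return False

    with open(cache_path, "rb") as handle:
        cache = pickle.load(handle)

    cached_labels = set(cache.get(labels_key, {}))
    # Conservative check: any difference between the two label sets
    # is treated as a reason to rebuild the cache.
    return cached_labels != set(config_labels)
```

If the check returns True, deleting the cache file or pointing `cache_name` at a new file name, as described above, forces the annotations to be parsed again. Only load pickle files you created yourself, since unpickling untrusted data can execute arbitrary code.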

0 reactions
YoonSungLee commented, May 22, 2020

@YoonSungLee Do you have backend.h5 in the same location as config.json? It contains the pretrained weights for the model.

Wow, by changing the h5 file, I was able to get past the error. But now I have another error. I'm very exhausted. Could you help me? The error is as follows:

Loading pretrained weights.

/usr/local/lib/python3.6/dist-packages/keras/callbacks/callbacks.py:998: UserWarning: epsilon argument is deprecated and will be removed, use min_delta instead.
  warnings.warn('epsilon argument is deprecated and '

WARNING:tensorflow:From /usr/local/lib/python3.6/dist-packages/keras/backend/tensorflow_backend.py:422: The name tf.global_variables is deprecated. Please use tf.compat.v1.global_variables instead.

WARNING:tensorflow:From /usr/local/lib/python3.6/dist-packages/keras/backend/tensorflow_backend.py:431: The name tf.is_variable_initialized is deprecated. Please use tf.compat.v1.is_variable_initialized instead.

WARNING:tensorflow:From /usr/local/lib/python3.6/dist-packages/keras/backend/tensorflow_backend.py:438: The name tf.variables_initializer is deprecated. Please use tf.compat.v1.variables_initializer instead.

WARNING:tensorflow:From /usr/local/lib/python3.6/dist-packages/keras/callbacks/tensorboard_v1.py:200: The name tf.summary.merge_all is deprecated. Please use tf.compat.v1.summary.merge_all instead.

WARNING:tensorflow:From /usr/local/lib/python3.6/dist-packages/keras/callbacks/tensorboard_v1.py:203: The name tf.summary.FileWriter is deprecated. Please use tf.compat.v1.summary.FileWriter instead.

Epoch 1/100
src/tcmalloc.cc:283] Attempt to free invalid pointer 0x696e69617274160a


Top Results From Across the Web

Fused conv implementation does not support grouped ...
I had a similar error, the problem was with the number of channels for my image and the number of channels I specified...

Error message for grouped convolution backprop on CPU is ...
Hello, I recently learned that gradients and backprop for grouped convolution is not supported on CPU, as discussed in the following github ...

NVIDIA Deep Learning TensorRT Documentation
This NVIDIA TensorRT Developer Guide demonstrates how to use the C++ and Python APIs for implementing the most common deep learning layers.

Optimizing Grouped Convolutions on Edge Devices - arXiv
convolutions (G) on the CPU of the Hikey 970 board for three common deep learning frameworks. No framework shows the expected behavior: i) ......

Tensorflow Conv2d Error - Rusty Robotics
Fused conv implementation does not support grouped convolutions for now. I noticed the “Not found: No algorithm worked!” error after changing ...
