ClearML is not saving scalars/images when using TensorFlow Object Detection API - TF2.2
This issue is related to this thread: https://clearml.slack.com/archives/CTK20V944/p1610457717141400
To reproduce:
git clone https://github.com/glemarivero/raccoon_dataset.git
Set up a virtualenv
Run sh autoinstall.sh
Generate tfrecords:
python generate_tfrecord.py --output_path images.tfrecord --csv_input raccoon_labels.csv --image_dir images
Run training:
python model_main_tf2.py --model_dir=models/ --pipeline_config_path=pipeline.config
Issue Analytics
- State:
- Created 3 years ago
- Comments:20 (13 by maintainers)
Just to add my +1: the scenario just described by @bmartinn is the same one I am experiencing.
Hi @glemarivero, thank you again for the code! I am able to run the training session. I think it reproduces the issue: I’m only getting the “steps_per_sec” scalar graph, and I’m missing “Loss/BoxClassifierLoss/localization_loss” and the other loss scalars. Let me dig deeper and see what’s going on.
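While the automatic capture is being investigated, one possible stopgap is to report the missing scalars to ClearML explicitly. The sketch below is only an illustration and not the fix for this issue. The helper names (split_tag, report_scalars, get_clearml_logger) are hypothetical, and it assumes you have access to a dict of metric values at each step, which the stock model_main_tf2.py does not expose directly. The only ClearML calls used are Task.init, Task.get_logger and Logger.report_scalar.

```python
from typing import Mapping, Tuple


def split_tag(tag: str) -> Tuple[str, str]:
    """Split a TensorBoard-style tag into (title, series) at the last '/'."""
    title, sep, series = tag.rpartition("/")
    if not sep:
        # No '/' in the tag: use the tag for both the plot title and the series.
        return tag, tag
    return title, series


def report_scalars(logger, metrics: Mapping[str, float], step: int) -> None:
    """Forward each metric to a logger exposing report_scalar().

    `logger` is assumed to behave like clearml.Logger, i.e. to accept
    report_scalar(title=..., series=..., value=..., iteration=...).
    """
    for tag, value in metrics.items():
        title, series = split_tag(tag)
        logger.report_scalar(
            title=title, series=series, value=float(value), iteration=step
        )


def get_clearml_logger(project_name: str, task_name: str):
    """Create a ClearML task and return its logger.

    Requires the clearml package and a configured ClearML server. The import
    is done lazily so the helpers above can be used without clearml installed.
    """
    from clearml import Task

    task = Task.init(project_name=project_name, task_name=task_name)
    return task.get_logger()
```

In a custom training loop this could be called as report_scalars(logger, {"Loss/BoxClassifierLoss/localization_loss": loss_value}, step), where loss_value and step come from your own code. Splitting at the last "/" groups related losses under one plot title, which is a design choice of this sketch rather than anything ClearML requires.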