Extracting coordinates instead of drawing a box
EDIT: Nevermind, got it to work!
Hey, first off, great tutorial, thank you so much.
I got it to run on Ubuntu 16.04 as well with ease, but I have a problem. I’m running on a CLI-only Ubuntu server, so instead of producing an image as output, I’d just like to get the coordinates of the boxes.
I looked into Object_detection_image.py and found where the boxes are being drawn, but it uses a function named visualize_boxes_and_labels_on_image_array to draw them. If I try to output np.squeeze(boxes), it returns this:
[[0.5897823 0.35585764 0.87036747 0.5124078 ]
[0.6508235 0.13419046 0.85757935 0.2114587 ]
[0.64070517 0.14992228 0.8580698 0.23488007]
...
[0. 0. 0. 0. ]
[0. 0. 0. 0. ]
[0. 0. 0. 0. ]]
Is there a way to just get the coordinates from that?
Thank you for your time!
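The all-zero rows at the end of that array look like padding for unused detection slots. One way to keep only the meaningful rows is to filter by the matching scores array, as in the minimal sketch below. The helper name filter_boxes, the 0.6 threshold, and the sample values are assumptions made up for illustration; they are not taken from the tutorial.

```python
import numpy as np

def filter_boxes(boxes, scores, min_score=0.6):
    """Return only the boxes (and scores) whose score is at least min_score.

    Hypothetical helper: the name and the 0.6 default are assumptions.
    """
    # Flatten the leading batch dimension: boxes -> (N, 4), scores -> (N,)
    boxes = np.asarray(boxes, dtype=float).reshape(-1, 4)
    scores = np.asarray(scores, dtype=float).reshape(-1)
    keep = scores >= min_score
    return boxes[keep], scores[keep]

# Made-up detector output with a batch dimension of 1
boxes = np.array([[[0.59, 0.36, 0.87, 0.51],
                   [0.65, 0.13, 0.86, 0.21],
                   [0.0, 0.0, 0.0, 0.0]]])
scores = np.array([[0.98, 0.75, 0.0]])

kept_boxes, kept_scores = filter_boxes(boxes, scores)
print(kept_boxes)
```

The kept rows are still in the same units as the input, so any scaling to pixels has to happen separately.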
EDIT:
Okay, I added a new function to the visualization_utils.py that returns the “ymin, ymax, xmin, xmax” variables, used in other functions of that file to draw the boxes.
The problem is, they look like this:
[[0.5897822976112366, 0.8703674674034119, 0.35585764050483704, 0.5124077796936035], [0.6508234739303589, 0.8575793504714966, 0.13419045507907867, 0.2114586979150772]]
I was expecting pixel coordinates. These seem like percentages.
EDIT: Okay, I got it to work.
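The values look like percentages because the boxes are normalized: each number is a fraction of the image height (for y) or width (for x). Multiplying by the image size gives pixel coordinates. Below is a minimal sketch of that conversion, assuming rows in the [ymin, xmin, ymax, xmax] order printed by np.squeeze(boxes) above. The function name to_pixel_boxes and the 640x480 image size are made up for illustration.

```python
import numpy as np

def to_pixel_boxes(norm_boxes, image_height, image_width):
    """Scale [ymin, xmin, ymax, xmax] rows from the 0..1 range to integer pixels.

    Hypothetical helper, not part of visualization_utils.py.
    """
    norm_boxes = np.asarray(norm_boxes, dtype=float).reshape(-1, 4)
    # y values scale with the height, x values with the width
    scale = np.array([image_height, image_width, image_height, image_width])
    return np.round(norm_boxes * scale).astype(int)

# Assumed image size of 640x480 (width x height)
pixel_boxes = to_pixel_boxes([[0.5, 0.25, 1.0, 0.75]],
                             image_height=480, image_width=640)
print(pixel_boxes)  # [[240 160 480 480]]
```

In a real script the height and width would come from the loaded image, for example from image.shape[:2] on a NumPy image array.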
Add this to utils/visualization_utils.py:
Add this to Object_detection_dir.py:
As well as this:
I think this should be all.
This was very helpful, thank you so much. If anyone needs to access each coordinate separately, change the third-to-last line of the newly added code in utils/visualization_utils.py, which is
coordinates_list.append([ymin, ymax, xmin, xmax, (box_to_score_map[box]*100)])
into coordinates_list=[ymin, ymax, xmin, xmax, (box_to_score_map[box]*100)]
and you can then access the ymin, ymax, xmin and xmax values separately using ymin=coordinates_list[0] etc. in your object detection file. Note that with plain assignment the list holds a single box, so if that line runs once per detected box, only the last box is kept.
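An alternative that keeps every box is to leave the append in place and unpack each entry when reading the list. The sketch below assumes each entry has the [ymin, ymax, xmin, xmax, score] layout of the line quoted above. The numbers are made up for illustration.

```python
# Hypothetical return value: one [ymin, ymax, xmin, xmax, score_percent] entry per box
coordinates_list = [
    [283, 417, 227, 327, 99.0],
    [312, 411, 85, 135, 87.5],
]

# Unpack every box in turn
for ymin, ymax, xmin, xmax, score in coordinates_list:
    print(f"box: x={xmin}..{xmax}, y={ymin}..{ymax}, score={score:.1f}%")

# Or pick out a single box by index and unpack it
ymin, ymax, xmin, xmax, score = coordinates_list[0]
```

With this layout, coordinates_list[0][0] is the ymin of the first box, so the individual values stay accessible without dropping the other detections.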