Unable to understand the logic behind the `object_detection_demo_yolov3_async.py` demo
See original GitHub issueI noticed that the recent pull for the file object_detection_demo_yolov3_async.py
did try to remove the otherwise confusing flattening of blob but I still can’t figure out how most of the results are parsed from this blob.
Is there some YOLO guide that you guys (@eizamaliev ) followed (except the official paper which is not so clear)?
Some example snippets that I don’t understand include
-
if is_proportional:
in the line131. What was the purpose behind introducing this additional parameter to pass around the functions? -
In the part
# Process raw value
x = (col + x) / params.side
y = (row + y) / params.side
I can’t understand how the x
,y
(values) are added with row
and col
is right. I mean what is the logic behind it?
Please help me get a clear understanding of the YOLO decoding 😕
Issue Analytics
- State:
- Created 3 years ago
- Comments:8 (8 by maintainers)
Top Results From Across the Web
Python for Beginners [20 of 44] Demo: Conditional Logic
Demo : Using if statements. Full 'Intro to Python' course on Microsoft Learn: https://aka.ms/MSLearnPython.
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Hi, @pra-dan. Thanks for your questions, I’ll try to answer them all.
Demo supports two resize type: with aspect ratio kept and without. Because initially was used simple resize, this behavior is saved as default. So
is_proportional
parameter tells to the function how it should rescale output boxes.As you can know, YOLO generate local coordinates relative to the cell. To get coordinates relative to image, you should add cell coordinates. By dividing by cell count across corresponding direction, you normalize coordinates.
Yeah, we are good to go 👍🏽