Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The network predicts absolute value of xy instead of offset to the grid cell as specified in paper. Why is it so? #20

Open
meet-minimalist opened this issue Apr 28, 2019 · 3 comments

Comments

@meet-minimalist
Copy link

No description provided.

@Jinming-Su
Copy link

Refer to the code, and I find the network predicts the values of center pixel and the width and height, which is not as what you say.

@meet-minimalist
Copy link
Author

@Jinming-Su I think you are correct. He has encoded the vector of 7x7x30 in dataset.py line no. 20 to 25 as offset of grid cell.

@jasonwjw
Copy link

Refer to the predict.py the function of decoder reveals the network predicts the offsets to the grid cell , while in the loss.py we can find that box1_xyxy and the box2_xyxy they are not the truely center_x and center_y for the image size, they are the offsets to the specific grid cell, and they need to add the location of the grid cell, but here we want to compute the iou for the predicted box and the target box,so we can treat them as a shift transformation of the center_x and center_y and the final iou won't change.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants