question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Incorporate BEVFusion's BEV Pooling operation into BEVDet

See original GitHub issue

Hello, I have been trying to replace BEVDet’s QuickCumSum operation with BEVFusion’s BEV Pooling operation. https://github.com/mit-han-lab/bevfusion/tree/main/mmdet3d/ops/bev_pool

To do so, I simply have replaced https://github.com/HuangJunJie2017/BEVDet/blob/e2f4b40dd7d267a81a0ee1c7453a1229381a6f02/mmdet3d/models/necks/view_transformer.py#L164-L169 with

x = bev_pool(x, geom_feats, B, self.nx[2], self.nx[0], self.nx[1])

Where bev_pool is BEVFusion’s bev_pool cuda operation.

However, I find that although there is significant speed up, the loss is not decreasing as expected (around 14 at end of epoch 5, while it should be around 9.5).

Looking at the papers, they seem to be equivalent pooling operations, but I was hoping for some guidance in case I missed something.

Thank you!

Issue Analytics

  • State:closed
  • Created a year ago
  • Comments:9 (8 by maintainers)

github_iconTop GitHub Comments

1reaction
Divadicommented, Jul 8, 2022

@HuangJunJie2017 I understand; I will look into it as well.

0reactions
HuangJunJie2017commented, Jul 13, 2022

@Divadi I start trying mixed-precision today. Just a few hours ago… Where to put @force_fp32 is a good problem. I will refer to other algorithms for some suggestions.

Read more comments on GitHub >

github_iconTop Results From Across the Web

BEVFusion: A Simple and Robust LiDAR-Camera Fusion ...
Our framework comprises two separate streams that encode raw camera and LiDAR sensor inputs into features in the same BEV space, followed by...
Read more >
A Simple and Robust LiDAR-Camera Fusion Framework
In contrast, we propose a surprisingly simple yet novel fusion framework, dubbed BEVFusion, whose camera stream does not depend on the input of...
Read more >
A Gentle Introduction to Pooling Layers for Convolutional ...
In this tutorial, you will discover how the pooling operation works and how to implement it in convolutional neural networks.
Read more >
A Simple and Robust LiDAR-Camera Fusion Framework
yet novel fusion framework, dubbed BEVFusion, whose camera stream does not depend on the input of LiDAR data, thus addressing the downside ...
Read more >
3D-PointCloud/README.md at master - GitHub
Papers and Datasets about Point Cloud. Contribute to zhulf0804/3D-PointCloud development by creating an account on GitHub.
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found