Incorporate BEVFusion's BEV Pooling operation into BEVDet
See original GitHub issueHello, I have been trying to replace BEVDet’s QuickCumSum operation with BEVFusion’s BEV Pooling operation. https://github.com/mit-han-lab/bevfusion/tree/main/mmdet3d/ops/bev_pool
To do so, I simply have replaced https://github.com/HuangJunJie2017/BEVDet/blob/e2f4b40dd7d267a81a0ee1c7453a1229381a6f02/mmdet3d/models/necks/view_transformer.py#L164-L169 with
x = bev_pool(x, geom_feats, B, self.nx[2], self.nx[0], self.nx[1])
Where bev_pool is BEVFusion’s bev_pool cuda operation.
However, I find that although there is significant speed up, the loss is not decreasing as expected (around 14 at end of epoch 5, while it should be around 9.5).
Looking at the papers, they seem to be equivalent pooling operations, but I was hoping for some guidance in case I missed something.
Thank you!
Issue Analytics
- State:
- Created a year ago
- Comments:9 (8 by maintainers)
Top Results From Across the Web
BEVFusion: A Simple and Robust LiDAR-Camera Fusion ...
Our framework comprises two separate streams that encode raw camera and LiDAR sensor inputs into features in the same BEV space, followed by...
Read more >A Simple and Robust LiDAR-Camera Fusion Framework
In contrast, we propose a surprisingly simple yet novel fusion framework, dubbed BEVFusion, whose camera stream does not depend on the input of...
Read more >A Gentle Introduction to Pooling Layers for Convolutional ...
In this tutorial, you will discover how the pooling operation works and how to implement it in convolutional neural networks.
Read more >A Simple and Robust LiDAR-Camera Fusion Framework
yet novel fusion framework, dubbed BEVFusion, whose camera stream does not depend on the input of LiDAR data, thus addressing the downside ...
Read more >3D-PointCloud/README.md at master - GitHub
Papers and Datasets about Point Cloud. Contribute to zhulf0804/3D-PointCloud development by creating an account on GitHub.
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found

@HuangJunJie2017 I understand; I will look into it as well.
@Divadi I start trying mixed-precision today. Just a few hours ago… Where to put @force_fp32 is a good problem. I will refer to other algorithms for some suggestions.