BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision
Abstract
We present a novel bird's-eye-view (BEV) detector with perspective supervision, which converges faster and better suits modern image backbones. Existing state-of-the-art BEV detectors are often tied to certain depth-pretrained backbones like VoVNet, hindering the synergy between booming image backbones and BEV detectors. To address this limitation, we prioritize easing the optimization of BEV detectors by introducing perspective view supervision. To this end, we propose a two-stage BEV detector, where proposals from the perspective head are fed into the bird's-eye-view head for final predictions. To evaluate the effectiveness of our model, we conduct extensive ablation studies focusing on the form of supervision and the generality of the proposed detector. The proposed method is verified with a wide spectrum of traditional and modern image backbones and achieves new SoTA results on the large-scale nuScenes dataset. The code shall be released soon.