RGB-D Crowd Counting With Cross-Modal Cycle-Attention Fusion and Fine-Coarse Supervision
Citations: top 10% of 2022 papers
Abstract
To tackle the negative effect of arbitrary crowd distribution on the counting task, in this article we propose a novel RGB-D crowd counting approach comprising a cross-modal cycle-attention fusion (CmCaF) model and a fine-coarse (FC) supervision scheme. At the feature level, the CmCaF model combines RGB and depth features in a cycle-attention manner to model the crowd distribution effectively. At the supervision level, FC supervision optimizes the counting model at both the fine, pixel-aware level and the coarse, region-aware level, enhancing its sensitivity to the overall crowd distribution and to instance locations. Extensive evaluations on benchmarks demonstrate the feasibility of the proposed approach for RGB-D crowd counting, as well as for RGB and RGB-T counting. The ablation study demonstrates the effectiveness of its main components for both the feature representation of cross-modal data and the accurate estimation of the crowd distribution.
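The abstract does not give the exact form of the FC supervision, so the following is only a minimal sketch of the general idea of supervising a density map at two granularities. It assumes a pixel-level mean squared error as the fine term and a mean absolute count error over non-overlapping square regions as the coarse term. The names `region_sums` and `fine_coarse_loss`, the `block` size, and the `weight` parameter are hypothetical and are not taken from the paper.

```python
from typing import List

Grid = List[List[float]]


def region_sums(density: Grid, block: int) -> Grid:
    """Sum a density map over non-overlapping block x block regions."""
    rows, cols = len(density), len(density[0])
    return [
        [
            sum(
                density[r][c]
                for r in range(r0, min(r0 + block, rows))
                for c in range(c0, min(c0 + block, cols))
            )
            for c0 in range(0, cols, block)
        ]
        for r0 in range(0, rows, block)
    ]


def fine_coarse_loss(pred: Grid, target: Grid, block: int = 2, weight: float = 1.0) -> float:
    """Assumed loss: pixel-level MSE plus weighted region-level mean absolute count error."""
    # Fine term: compares the two density maps pixel by pixel.
    n_pixels = len(pred) * len(pred[0])
    fine = sum(
        (p - t) ** 2
        for pred_row, target_row in zip(pred, target)
        for p, t in zip(pred_row, target_row)
    ) / n_pixels

    # Coarse term: compares the head counts accumulated inside each region.
    pred_regions = region_sums(pred, block)
    target_regions = region_sums(target, block)
    n_regions = len(pred_regions) * len(pred_regions[0])
    coarse = sum(
        abs(p - t)
        for pred_row, target_row in zip(pred_regions, target_regions)
        for p, t in zip(pred_row, target_row)
    ) / n_regions

    return fine + weight * coarse


# Toy 4x4 ground truth with two annotated heads.
target = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
    [0.0, 0.0, 0.0, 0.0],
]
# One head shifted by a pixel but still inside the same 2x2 region.
near_miss = [
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
    [0.0, 0.0, 0.0, 0.0],
]
# The same head shifted into a neighbouring region.
far_miss = [
    [0.0, 0.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
    [0.0, 0.0, 0.0, 0.0],
]
```

In this toy setup the two shifted predictions have the same pixel-level error, but only the one that moves a head across a region boundary is penalized by the coarse term. That is one way a region-aware term can add sensitivity to the overall distribution on top of a pixel-aware term.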