MUGGLE: MUlti-Stream Group Gaze Learning and Estimation
Citations Over TimeTop 13% of 2019 papers
Abstract
Being able to accurately predict the common gaze point of a group of persons is of particular interest to precise marketing and automatic group attention assessment. Group gaze estimation faces challenges including small face/head size and outlier observers. To address these challenges, we proposed a novel framework called Multi-stream Group Gaze Learning and Estimation (MUGGLE). The MUGGLE infrastructure includes two inference streams: 1) a holistic stream which utilizes fused attention map as input to a global deep convolutional structure to explore the global geometric configurations and contexts of interesting persons in the scene; and 2) an aggregative stream which robustly aggregates individual gazes via a recurrent structure (e.g., LSTM) to obtain outlier-tolerant estimation. Both streams are seamlessly integrated via a fusion network. Extensive experiments are performed on a fully annotated group gaze image dataset with 8,000+ images and 100,000+ faces (which is publicly releasable). The results demonstrate the effectiveness of the proposed MUGGLE framework in group gaze estimation.
Related Papers
- → Messages embedded in gaze of interface agents --- impression management with agent's gaze(2002)114 cited
- → The third gaze: de-constructing the host gaze in the psychoanalysis of tourism.(2012)8 cited
- → Direct Gaze Triggers Higher Frequency of Gaze Change: An Automatic Analysis of Dyads in Unstructured Conversation(2021)5 cited
- → Messages embedded in gaze of interface agents --- impression management with agent's gaze(2002)11 cited
- → Emergence of Agent Gaze Behavior using Interactive Kinetics-Based Gaze Direction Model(2020)1 cited