Emotion Recognition Based on Joint Visual and Audio Cues | doi.page