Detecting Events and Key Actors in Multi-person Videos | doi.page