Antoine Miech
Google (United States), DeepMind (United Kingdom), Google DeepMind (United Kingdom)
Research Areas
Multimodal Machine Learning Applications, Human Pose and Action Recognition, Domain Adaptation and Few-Shot Learning, Video Analysis and Summarization, Topic Modeling
Most-Cited Works
- → Flamingo: a Visual Language Model for Few-Shot Learning (2022), 1,238 citations
- → HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips (2019), 878 citations
- → End-to-End Learning of Visual Representations From Uncurated Instructional Videos (2020), 579 citations
- → Learnable pooling with Context Gating for video classification (2017), 242 citations
- → Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning (2023), 198 citations
- → Learning a Text-Video Embedding from Incomplete and Heterogeneous Data (2018), 176 citations
- → Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with Transformers (2021), 130 citations
- → HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips (2019), 117 citations
- → TubeDETR: Spatio-Temporal Video Grounding with Transformers (2022), 87 citations
- → Leveraging the Present to Anticipate the Future in Videos (2019), 71 citations