Antoine Yang
Research Areas
Multimodal Machine Learning Applications, Domain Adaptation and Few-Shot Learning, Human Pose and Action Recognition, Video Analysis and Summarization, Advanced Image and Video Retrieval Techniques
Most-Cited Works
- Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning (2023), 198 citations
- NAS evaluation is frustratingly hard (2019), 110 citations
- TubeDETR: Spatio-Temporal Video Grounding with Transformers (2022), 87 citations
- Zero-Shot Video Question Answering via Frozen Bidirectional Language Models (2022), 64 citations
- CoVR: Learning Composed Video Retrieval from Web Video Captions (2024), 31 citations
- Learning to Answer Visual Questions From Web Videos (2022), 25 citations
- MANAS: Multi-Agent Neural Architecture Search (2019), 19 citations
- Just Ask: Learning to Answer Questions from Millions of Narrated Videos (2021), 14 citations