a public good project by the
Synthesis
Company
of California

© 2026

Antoine Yang | doi.page

0 works0 citations0 h-index

Google Scholar OpenAlex

Antoine Yang

Publications by Year

Research Areas

Multimodal Machine Learning Applications, Domain Adaptation and Few-Shot Learning, Human Pose and Action Recognition, Video Analysis and Summarization, Advanced Image and Video Retrieval Techniques

Most-Cited Works

→ Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning(2023)198 cited
→ NAS evaluation is frustratingly hard(2019)110 cited
→ TubeDETR: Spatio-Temporal Video Grounding with Transformers(2022)87 cited
→ Zero-Shot Video Question Answering via Frozen Bidirectional Language Models(2022)64 cited
→ CoVR: Learning Composed Video Retrieval from Web Video Captions(2024)31 cited
→ Learning to Answer Visual Questions From Web Videos(2022)25 cited
→ MANAS: Multi-Agent Neural Architecture Search(2019)19 cited
→ Just Ask: Learning to Answer Questions from Millions of Narrated Videos(2021)14 cited