The Synthesis Company of San Francisco Mountain Logo
COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval | doi.page