Linear Spaces of Meanings: Compositional Structures in Vision-Language Models
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)(2023)
Key words
Word Representation,Representation Learning,Language Understanding,Visual Question Answering,Image Captioning
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined