Improving Vision-and-Language Reasoning Via Spatial Relations Modeling
IEEE/CVF Winter Conference on Applications of Computer Vision(2024)
Key words
Algorithms,Image recognition and understanding,Algorithms,Machine learning architectures,formulations,and algorithms,Algorithms,Vision + language and/or other modalities
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined