ALLaVA: Harnessing GPT4V-synthesized Data for A Lite Vision-Language Model

CoRR (2024)

Cited 8 | Views 37

Keywords

Visual Question Answering, Language Understanding, Language Modeling, Syntax-based Translation Models
