CARMA: Context-Aware Situational Grounding of Human-Robot Group Interactions by Combining Vision-Language Models with Object and Action Recognition

Joerg Deigmoeller, Stephan Hasler, Nakul Agarwal, Daniel Tanneberg, Anna Belardinelli, Reza Ghoddoosian, Chao Wang, Felix Ocker, Fan Zhang, Behzad Dariush, Michael Gienger

arXiv (2025)