Chrome Extension
WeChat Mini Program
Use on ChatGLM

A Multimodal Generative AI Copilot for Human Pathology

Nature(2024)SCI 1区

Harvard Med Sch | Ohio State Univ | Mayo Clin

Cited 17|Views26
Abstract
The field of computational pathology[1,2] has witnessed remarkable progress in the development of both task-specific predictive models and task-agnostic self-supervised vision encoders[3,4]. However, despite the explosive growth of generative artificial intelligence (AI), there has been limited study on building general purpose, multimodal AI assistants and copilots[5] tailored to pathology. Here we present PathChat, a vision-language generalist AI assistant for human pathology. We build PathChat by adapting a foundational vision encoder for pathology, combining it with a pretrained large language model and finetuning the whole system on over 456,000 diverse visual language instructions consisting of 999,202 question-answer turns. We compare PathChat against several multimodal vision language AI assistants and GPT4V, which powers the commercially available multimodal general purpose AI assistant ChatGPT-4[7]. PathChat achieved state-of-the-art performance on multiple-choice diagnostic questions from cases of diverse tissue origins and disease models. Furthermore, using open-ended questions and human expert evaluation, we found that overall PathChat produced more accurate and pathologist-preferable responses to diverse queries related to pathology. As an interactive and general vision-language AI Copilot that can flexibly handle both visual and natural language inputs, PathChat can potentially find impactful applications in pathology education, research, and human-in-the-loop clinical decision making.
More
Translated text
PDF
Bibtex
AI Read Science
Video&Figures
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Related Papers

Foundation Model for Advancing Healthcare: Challenges, Opportunities, and Future Directions

Yuting He, Fuxiang Huang, Xinrui Jiang,Yuxiang Nie, Minghao Wang,Jiguang Wang,Hao Chen
IEEE REVIEWS IN BIOMEDICAL ENGINEERING 2025

被引用3

A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine

Hanguang Xiao, Feizhong Zhou, Xingyue Liu,Tianqi Liu, Zhipeng Li,Xin Liu, Xiaoxuan Huang
INFORMATION FUSION 2025

被引用0

Large Language Models in Pathology: A Comparative Study of ChatGPT and Bard with Pathology Trainees on Multiple-Choice Questions

Wei Du, Xueting Jin,Jaryse Carol Harris, Alessandro Brunetti, Erika Johnson, Olivia Leung, Xingchen Li, Selemon Walle, Qing Yu, Xiao Zhou, Fang Bian, Kajanna Mckenzie,
ANNALS OF DIAGNOSTIC PATHOLOGY 2024

被引用0

The Age of Foundation Models

Jana Lipkova,Jakob Nikolas Kather
NATURE REVIEWS CLINICAL ONCOLOGY 2024

被引用0

Review of Deep Learning-Based Pathological Image Classification: from Task-Specific Models to Foundation Models

Haijing Luan, Kaixing Yang, Taiyuan Hu, Jifang Hu, Siyao Liu, Ruilin Li, Jiayin He,Rui Yan,Xiaobing Guo, Niansong Qian,Beifang Niu
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE 2025

被引用0

MEDSQ: Towards Personalized Medical Education Via Multi-Form Interaction Guidance

Yong Ouyang, Wenjin Gao,Huanwen Wang, Lingyu Chen, Jing Wang,Yawen Zeng
EXPERT SYSTEMS WITH APPLICATIONS 2025

被引用0

A Vision-Language Foundation Model for Precision Oncology

Jinxi Xiang,Xiyue Wang,Xiaoming Zhang, Yinghua Xi, Feyisope Eweje, Yijiang Chen, Yuchen Li, Colin Bergstrom, Matthew Gopaulchan, Ted Kim,Kun-Hsing Yu, Sierra Willens,
NATURE 2025

被引用0

Evaluating ChatGPT’s Diagnostic Potential for Pathology Images

Liya Ding, Lei Fan, Miao Shen, Yawen Wang, Kaiqin Sheng, Zijuan Zou, Huimin An,Zhinong Jiang
FRONTIERS IN MEDICINE 2025

被引用0

Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn
Chat Paper

要点】:本研究提出了PathChat,一种针对病理学领域的多功能视觉-语言通用AI助手,其在多选诊断问题和开放性问题中表现出优于现有系统的性能。

方法】:PathChat通过适配病理学的基础视觉编码器,结合预训练的大型语言模型,并在超过456,000个包含999,202个问答回合的多样化视觉语言指令上进行微调构建而成。

实验】:研究者在多个数据集上对PathChat进行了测试,包括开放数据集和商业可用的多模态通用AI助手ChatGPT-4所使用的GPT4V,结果显示PathChat在多项任务上达到最佳性能。