Med-R2: Perception and Reflection-driven Complex Reasoning for Medical Report Generation - 专知论文

会员服务 ·

0

有向 · Automator · Processing（编程语言） · 知识 (knowledge) · Guidance ·

Med-R2: Perception and Reflection-driven Complex Reasoning for Medical Report Generation

翻译：暂无翻译

Hao Wang,Shuchang Ye,Jinghao Lin,Usman Naseem,Jinman Kim

from arxiv, 28 pages, 3 figures, 1 table

Automated medical report generation (MRG) is increasingly used to reduce the burden of manual reporting and for decision support. Large vision-language models (LVLMs) hold great promise for automated MRG due to their fine-grained image-text alignment and advanced text-generation capabilities. Currently, state-of-the-art MRGs primarily focus on adapting pre-trained LVLMs with direct supervised fine-tuning (SFT), a fine-tuning strategy with medical image-report pairs. However, several factors limit the performance of these LVLMs. Firstly, direct SFT enables LVLMs to generate medical reports directly without an intermediate thinking process of pathological feature perception and diagnostic reasoning. This causes a potential failure to perceive pathological features and thus leads to misdiagnosis. Secondly, direct SFT lacks the incorporation of radiology-specific knowledge guidance, causing LVLMs to misinterpret perceived pathological features and make incorrect diagnoses. To address these gaps, we propose a novel fine-tuning strategy named Med-R2. We introduce a perception-driven long reasoning process that precedes report generation and incorporates radiology-specific knowledge as guidance. Additionally, to alleviate potential perceptual errors in complex reasoning, a reflection mechanism is introduced to refine the perception of pathological features and the generated report. Our experiments demonstrate that Med-R2 effectively enhances the capability of pathological features perception and diagnosis accuracy for MRG via fine-tuned LVLMs.

翻译：暂无翻译

0

相关内容

《基于随机优化提升军事医疗后送系统效能》最新165页博士论文

《基于随机优化提升军事医疗后送系统效能》最新165页博士论文

专知会员服务

19+阅读 · 2025年9月9日

【CMU博士论文】分析多模态机器学习模型性能及其在医学报告生成中的评估指标

【CMU博士论文】分析多模态机器学习模型性能及其在医学报告生成中的评估指标

专知会员服务

23+阅读 · 2024年12月16日

ICML 2024 | Med-ST：解锁时空信息在医学多模态预训练中的能力

ICML 2024 | Med-ST：解锁时空信息在医学多模态预训练中的能力

专知会员服务

13+阅读 · 2024年7月10日

医学图像描述综述：编码、解码及最新进展

医学图像描述综述：编码、解码及最新进展

专知会员服务

20+阅读 · 2023年7月31日

【CVPR2023】基于动态图增强对比学习的胸部X光报告生成

【CVPR2023】基于动态图增强对比学习的胸部X光报告生成

专知会员服务

21+阅读 · 2023年3月23日

港科大最新《深度学习医学图像分割MedISeg》综述论文，21页pdf涵盖212篇文献阐述MedISeg技巧、挑战和未来方向

港科大最新《深度学习医学图像分割MedISeg》综述论文，21页pdf涵盖212篇文献阐述MedISeg技巧、挑战和未来方向

专知会员服务

42+阅读 · 2022年9月22日

Nature Medicine | AI与临床相结合，最新DECIDE-AI指南助力临床人工智能从开发到实施

Nature Medicine | AI与临床相结合，最新DECIDE-AI指南助力临床人工智能从开发到实施

专知会员服务

29+阅读 · 2022年5月22日

【2020关键词提取】医学报告的关键词提取和结构化，Keyword extraction and structuralization of medical reports

【2020关键词提取】医学报告的关键词提取和结构化，Keyword extraction and structuralization of medical reports

专知会员服务

33+阅读 · 2020年5月2日

【MICCAI 2019 】Generative adversarial networks and adversarial methods in biomedical image analysis（基于生成对抗网络和对抗方法的生物医学图像分析），附223页PPT免费下载

【MICCAI 2019 】Generative adversarial networks and adversarial methods in biomedical image analysis（基于生成对抗网络和对抗方法的生物医学图像分析），附223页PPT免费下载

专知会员服务

32+阅读 · 2019年11月4日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

哈佛大学｜构建知识图谱PrimeKG以实现精准医疗--数据与代码全部公开，帮你从零开始复现知识图谱

哈佛大学｜构建知识图谱PrimeKG以实现精准医疗--数据与代码全部公开，帮你从零开始复现知识图谱

GenomicAI

29+阅读 · 2022年5月4日

【AI与军事】《军事情报中应用机器学习：人机协作的未来方法研究》中文版，2022最新论文

【AI与军事】《军事情报中应用机器学习：人机协作的未来方法研究》中文版，2022最新论文

专知

53+阅读 · 2022年4月24日

美国埃默里大学医学院发布最新「医学图像配准深度学习」综述论文

美国埃默里大学医学院发布最新「医学图像配准深度学习」综述论文

专知

15+阅读 · 2020年1月7日

医学图像分析最新综述：走向深度

医学图像分析最新综述：走向深度

炼数成金订阅号

36+阅读 · 2019年2月20日

Nature Medicine连发9篇论文，Jeff Dean、吴恩达等最新研究入列

Nature Medicine连发9篇论文，Jeff Dean、吴恩达等最新研究入列

新智元

15+阅读 · 2019年1月14日

Jeff Dean等发文《Nature Medicine》，综述深度学习在医疗领域的应用

Jeff Dean等发文《Nature Medicine》，综述深度学习在医疗领域的应用

机器之心

13+阅读 · 2019年1月13日

AI+医疗真正落地？Nature Medicine同时刊登9篇论文，聚焦人工智能在医学领域的应用

AI+医疗真正落地？Nature Medicine同时刊登9篇论文，聚焦人工智能在医学领域的应用

专知

14+阅读 · 2019年1月12日

【论文推荐】最新八篇生成对抗网络相关论文—BRE、图像合成、多模态图像生成、非配对多域图、注意力、对抗特征增强、深度对抗性训练

【论文推荐】最新八篇生成对抗网络相关论文—BRE、图像合成、多模态图像生成、非配对多域图、注意力、对抗特征增强、深度对抗性训练

专知

16+阅读 · 2018年5月14日

Relation Networks for Object Detection 论文笔记

Relation Networks for Object Detection 论文笔记

统计学习与视觉计算组

16+阅读 · 2018年4月18日

【前沿】自动从CT医疗影像中生成诊断报告，卡内基梅隆大学CMU邢波教授团队最新基于深度学习的医疗影像研究成果

【前沿】自动从CT医疗影像中生成诊断报告，卡内基梅隆大学CMU邢波教授团队最新基于深度学习的医疗影像研究成果

专知

18+阅读 · 2017年11月24日

功能选择性beta2肾上腺素受体激动剂的发现

国家自然科学基金

0+阅读 · 2016年12月31日

miR-223在腹主动脉瘤发展和干预中的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

HERC2P2调控DNA损伤修复及胶质瘤TMZ化疗增敏：ceRNA作用的新机制

国家自然科学基金

0+阅读 · 2015年12月31日

脑胶质瘤中Hedgehog通路介导的长链非编码RNA-MEG3作用机制的研究

国家自然科学基金

0+阅读 · 2015年12月31日

miR-143调控MSC及通过MSC来源囊泡参与肿瘤抑制的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

miR-98/PDGF-BB/Sp7反馈调控环路在骨质疏松症发病机制中的作用研究

国家自然科学基金

0+阅读 · 2015年12月31日

miR-223调控血小板活化在动脉粥样硬化中的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

上市后药品不良反应信号检测中双稳健方法的构建

国家自然科学基金

0+阅读 · 2015年12月31日

胶质瘤侵袭过程中DNMT1沉默miR-134与ERK信号通路自激活的表观新机制

国家自然科学基金

0+阅读 · 2015年12月31日

缺氧促进心肌细胞exosome分泌介导miR-22调控血管新生的作用机制

国家自然科学基金

0+阅读 · 2014年12月31日

Precision Recall Controllable Radiology Report Generation via Hybrid Natural Language and Clinical Reward Learning

Arxiv

0+阅读 · 6月23日

MMed-Bench-IR: A Heterogeneous Benchmark for Multilingual Medical Information Retrieval

Arxiv

0+阅读 · 6月23日

CADRE: Stable, Parameter Efficient Adaptation of Medical Vision Language Models with Bounded Forgetting and Prior Drift

CADRE: Stable, Parameter Efficient Adaptation of Medical Vision Language Models with Bounded Forgetting and Prior Drift

Arxiv

0+阅读 · 6月22日

Bridging Single Distortion Artifacts and Multifactorial Clinical Quality: Few-shot Biparametric MRI Quality Assessment via Distortion-trained Prototypical Networks

Arxiv

0+阅读 · 6月22日

MedTS-TTT: Test-Time Training for Medical Time Series Classification

Arxiv

0+阅读 · 6月19日

MEDLAYXPLAIN: Benchmarking the Expert-Lay Gap in Medical Vision-Language Models

Arxiv

0+阅读 · 6月19日

MedRLM: Recursive Multimodal Health Intelligence for Long-Context Clinical Reasoning, Sensor-Guided Screening, Evidence-Grounded Decision Support, and Community-to-Tertiary Referral Optimization

MedRLM: Recursive Multimodal Health Intelligence for Long-Context Clinical Reasoning, Sensor-Guided Screening, Evidence-Grounded Decision Support, and Community-to-Tertiary Referral Optimization

Arxiv

0+阅读 · 6月18日

Bridging Single Distortion Artifacts and Mmultifactorial Clinical Quality: Few-shot Biparametric MRI Quality Assessment via Distortion-trained Prototypical Networks

Arxiv

0+阅读 · 6月17日

MedicalAgentsBench for Complex Medical Reasoning: Comparing Internalized Reasoning Models versus Externalized Agent-based Frameworks

Arxiv

0+阅读 · 6月16日

Consensus Based Medical Image Segmentation Using Semi-Supervised Learning And Graph Cuts

Arxiv

11+阅读 · 2018年5月21日

VIP会员

文章信息

相关主题

Processing（编程语言）

知识 (knowledge)

最新内容

综述 | 从问答到任务完成：Agent系统与Harness设计

综述 | 从问答到任务完成：Agent系统与Harness设计

专知会员服务

1+阅读 · 今天16:54

Agentic RL：框架、实践与长程智能体训练

Agentic RL：框架、实践与长程智能体训练

专知会员服务

1+阅读 · 今天16:52

反无人机拦截器训练与运用课程：对美国陆军部队发展的启示

反无人机拦截器训练与运用课程：对美国陆军部队发展的启示

专知会员服务

6+阅读 · 今天8:00

重新思考无人机时代的生存能力

重新思考无人机时代的生存能力

专知会员服务

5+阅读 · 今天7:44

装甲突击旅：现代战争思考、战斗与组织

装甲突击旅：现代战争思考、战斗与组织

专知会员服务

4+阅读 · 今天7:28

在人工智能加速决策环境中拓展OODA循环

在人工智能加速决策环境中拓展OODA循环

专知会员服务

4+阅读 · 今天7:18

《廉价自杀式无人机战争的军事战略影响：乌克兰与伊朗案例研究》

《廉价自杀式无人机战争的军事战略影响：乌克兰与伊朗案例研究》

专知会员服务

5+阅读 · 今天7:07

军事欺骗：供作战战术指挥官使用的工具

军事欺骗：供作战战术指挥官使用的工具

专知会员服务

4+阅读 · 今天7:03

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

专知会员服务

4+阅读 · 6月23日

综述 | 世界动作模型：少做梦，多行动

综述 | 世界动作模型：少做梦，多行动

专知会员服务

6+阅读 · 6月23日

美以伊冲突：无人机与人工智能的运用

美以伊冲突：无人机与人工智能的运用

专知会员服务

10+阅读 · 6月23日

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

专知会员服务

4+阅读 · 6月23日

《特种部队在透明战场中的生存力》最新报告

《特种部队在透明战场中的生存力》最新报告

专知会员服务

5+阅读 · 6月23日

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

专知会员服务

8+阅读 · 6月23日

《人工智能生成的零日漏洞：对未来作战的影响》

《人工智能生成的零日漏洞：对未来作战的影响》

专知会员服务

7+阅读 · 6月23日

相关VIP内容

《基于随机优化提升军事医疗后送系统效能》最新165页博士论文

《基于随机优化提升军事医疗后送系统效能》最新165页博士论文

专知会员服务

19+阅读 · 2025年9月9日

【CMU博士论文】分析多模态机器学习模型性能及其在医学报告生成中的评估指标

【CMU博士论文】分析多模态机器学习模型性能及其在医学报告生成中的评估指标

专知会员服务

23+阅读 · 2024年12月16日

ICML 2024 | Med-ST：解锁时空信息在医学多模态预训练中的能力

ICML 2024 | Med-ST：解锁时空信息在医学多模态预训练中的能力

专知会员服务

13+阅读 · 2024年7月10日

医学图像描述综述：编码、解码及最新进展

医学图像描述综述：编码、解码及最新进展

专知会员服务

20+阅读 · 2023年7月31日

【CVPR2023】基于动态图增强对比学习的胸部X光报告生成

【CVPR2023】基于动态图增强对比学习的胸部X光报告生成

专知会员服务

21+阅读 · 2023年3月23日

港科大最新《深度学习医学图像分割MedISeg》综述论文，21页pdf涵盖212篇文献阐述MedISeg技巧、挑战和未来方向

港科大最新《深度学习医学图像分割MedISeg》综述论文，21页pdf涵盖212篇文献阐述MedISeg技巧、挑战和未来方向

专知会员服务

42+阅读 · 2022年9月22日

Nature Medicine | AI与临床相结合，最新DECIDE-AI指南助力临床人工智能从开发到实施

Nature Medicine | AI与临床相结合，最新DECIDE-AI指南助力临床人工智能从开发到实施

专知会员服务

29+阅读 · 2022年5月22日

【2020关键词提取】医学报告的关键词提取和结构化，Keyword extraction and structuralization of medical reports

【2020关键词提取】医学报告的关键词提取和结构化，Keyword extraction and structuralization of medical reports

专知会员服务

33+阅读 · 2020年5月2日

【MICCAI 2019 】Generative adversarial networks and adversarial methods in biomedical image analysis（基于生成对抗网络和对抗方法的生物医学图像分析），附223页PPT免费下载

【MICCAI 2019 】Generative adversarial networks and adversarial methods in biomedical image analysis（基于生成对抗网络和对抗方法的生物医学图像分析），附223页PPT免费下载

专知会员服务

32+阅读 · 2019年11月4日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

热门VIP内容

开通专知VIP会员享更多权益服务

Agentic RL：框架、实践与长程智能体训练

重新思考无人机时代的生存能力

综述 | 从问答到任务完成：Agent系统与Harness设计

反无人机拦截器训练与运用课程：对美国陆军部队发展的启示

相关资讯

哈佛大学｜构建知识图谱PrimeKG以实现精准医疗--数据与代码全部公开，帮你从零开始复现知识图谱

哈佛大学｜构建知识图谱PrimeKG以实现精准医疗--数据与代码全部公开，帮你从零开始复现知识图谱

GenomicAI

29+阅读 · 2022年5月4日

【AI与军事】《军事情报中应用机器学习：人机协作的未来方法研究》中文版，2022最新论文

【AI与军事】《军事情报中应用机器学习：人机协作的未来方法研究》中文版，2022最新论文

专知

53+阅读 · 2022年4月24日

美国埃默里大学医学院发布最新「医学图像配准深度学习」综述论文

美国埃默里大学医学院发布最新「医学图像配准深度学习」综述论文

专知

15+阅读 · 2020年1月7日

医学图像分析最新综述：走向深度

医学图像分析最新综述：走向深度

炼数成金订阅号

36+阅读 · 2019年2月20日

Nature Medicine连发9篇论文，Jeff Dean、吴恩达等最新研究入列

Nature Medicine连发9篇论文，Jeff Dean、吴恩达等最新研究入列

新智元

15+阅读 · 2019年1月14日

Jeff Dean等发文《Nature Medicine》，综述深度学习在医疗领域的应用

Jeff Dean等发文《Nature Medicine》，综述深度学习在医疗领域的应用

机器之心

13+阅读 · 2019年1月13日

AI+医疗真正落地？Nature Medicine同时刊登9篇论文，聚焦人工智能在医学领域的应用

AI+医疗真正落地？Nature Medicine同时刊登9篇论文，聚焦人工智能在医学领域的应用

专知

14+阅读 · 2019年1月12日

【论文推荐】最新八篇生成对抗网络相关论文—BRE、图像合成、多模态图像生成、非配对多域图、注意力、对抗特征增强、深度对抗性训练

【论文推荐】最新八篇生成对抗网络相关论文—BRE、图像合成、多模态图像生成、非配对多域图、注意力、对抗特征增强、深度对抗性训练

专知

16+阅读 · 2018年5月14日

Relation Networks for Object Detection 论文笔记

Relation Networks for Object Detection 论文笔记

统计学习与视觉计算组

16+阅读 · 2018年4月18日

【前沿】自动从CT医疗影像中生成诊断报告，卡内基梅隆大学CMU邢波教授团队最新基于深度学习的医疗影像研究成果

【前沿】自动从CT医疗影像中生成诊断报告，卡内基梅隆大学CMU邢波教授团队最新基于深度学习的医疗影像研究成果

专知

18+阅读 · 2017年11月24日

相关论文

Precision Recall Controllable Radiology Report Generation via Hybrid Natural Language and Clinical Reward Learning

Arxiv

0+阅读 · 6月23日

MMed-Bench-IR: A Heterogeneous Benchmark for Multilingual Medical Information Retrieval

Arxiv

0+阅读 · 6月23日

CADRE: Stable, Parameter Efficient Adaptation of Medical Vision Language Models with Bounded Forgetting and Prior Drift

CADRE: Stable, Parameter Efficient Adaptation of Medical Vision Language Models with Bounded Forgetting and Prior Drift

Arxiv

0+阅读 · 6月22日

Bridging Single Distortion Artifacts and Multifactorial Clinical Quality: Few-shot Biparametric MRI Quality Assessment via Distortion-trained Prototypical Networks

Arxiv

0+阅读 · 6月22日

MedTS-TTT: Test-Time Training for Medical Time Series Classification

Arxiv

0+阅读 · 6月19日

MEDLAYXPLAIN: Benchmarking the Expert-Lay Gap in Medical Vision-Language Models

Arxiv

0+阅读 · 6月19日

MedRLM: Recursive Multimodal Health Intelligence for Long-Context Clinical Reasoning, Sensor-Guided Screening, Evidence-Grounded Decision Support, and Community-to-Tertiary Referral Optimization

MedRLM: Recursive Multimodal Health Intelligence for Long-Context Clinical Reasoning, Sensor-Guided Screening, Evidence-Grounded Decision Support, and Community-to-Tertiary Referral Optimization

Arxiv

0+阅读 · 6月18日

Bridging Single Distortion Artifacts and Mmultifactorial Clinical Quality: Few-shot Biparametric MRI Quality Assessment via Distortion-trained Prototypical Networks

Arxiv

0+阅读 · 6月17日

MedicalAgentsBench for Complex Medical Reasoning: Comparing Internalized Reasoning Models versus Externalized Agent-based Frameworks

Arxiv

0+阅读 · 6月16日

Consensus Based Medical Image Segmentation Using Semi-Supervised Learning And Graph Cuts

Arxiv

11+阅读 · 2018年5月21日

相关基金

功能选择性beta2肾上腺素受体激动剂的发现

国家自然科学基金

0+阅读 · 2016年12月31日

miR-223在腹主动脉瘤发展和干预中的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

HERC2P2调控DNA损伤修复及胶质瘤TMZ化疗增敏：ceRNA作用的新机制

国家自然科学基金

0+阅读 · 2015年12月31日

脑胶质瘤中Hedgehog通路介导的长链非编码RNA-MEG3作用机制的研究

国家自然科学基金

0+阅读 · 2015年12月31日

miR-143调控MSC及通过MSC来源囊泡参与肿瘤抑制的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

miR-98/PDGF-BB/Sp7反馈调控环路在骨质疏松症发病机制中的作用研究

国家自然科学基金

0+阅读 · 2015年12月31日

miR-223调控血小板活化在动脉粥样硬化中的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

上市后药品不良反应信号检测中双稳健方法的构建

国家自然科学基金

0+阅读 · 2015年12月31日

胶质瘤侵袭过程中DNMT1沉默miR-134与ERK信号通路自激活的表观新机制

国家自然科学基金

0+阅读 · 2015年12月31日

缺氧促进心肌细胞exosome分泌介导miR-22调控血管新生的作用机制

国家自然科学基金

0+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员