Benchmarking Direct Preference Optimization for Medical Large Vision–Language Models

Full metadata record
DC Field | Value | Language
dc.contributor.author | Kim, Dain | -
dc.contributor.author | Lee, Jiwoo | -
dc.contributor.author | Yun, Jaehoon | -
dc.contributor.author | Koo, Yong Hoe | -
dc.contributor.author | Chen, Qingyu | -
dc.contributor.author | Kim, Hyunjae | -
dc.contributor.author | Kang, Jaewoo | -
dc.date.accessioned | 2026-06-01T07:30:31Z | -
dc.date.available | 2026-06-01T07:30:31Z | -
dc.date.issued | 2026-03 | -
dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/212923 | -
dc.description.abstract | Large vision-language models (LVLMs) are gaining traction in clinical tasks such as diagnostic support, report generation, and medical question answering. Among post-training techniques, Direct Preference Optimization (DPO) has shown promise in aligning model outputs with human preferences, yet its effectiveness in high-stakes medical contexts remains underexplored. In this work, we present the first systematic evaluation of nine DPO variants applied to two leading medical LVLMs, LLaVA-Med and HuatuoGPT-Vision. We benchmark these models on five curated datasets covering diverse clinical tasks. Evaluations include both automated metrics and expert assessments. Our results show that while DPO improves alignment and reduces severe hallucinations, it yields inconsistent gains over supervised fine-tuning. We further introduce a DPO variant that better handles visual misinterpretations and enhances clinical understanding. These findings reveal both the potential and limitations of DPO in medical AI. To support future research, we will release all DPO training data, model checkpoints, and expert annotations upon acceptance. | -
dc.format.extent | 16 | -
dc.language | English | -
dc.language.iso | ENG | -
dc.publisher | Association for Computational Linguistics (ACL) | -
dc.title | Benchmarking Direct Preference Optimization for Medical Large Vision–Language Models | -
dc.type | Article | -
dc.identifier.doi | 10.18653/v1/2026.findings-eacl.267 | -
dc.identifier.scopusid | 2-s2.0-105038865684 | -
dc.identifier.bibliographicCitation | 19th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2026, pp 5052 - 5067 | -
dc.citation.title | 19th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2026 | -
dc.citation.startPage | 5052 | -
dc.citation.endPage | 5067 | -
dc.type.docType | Conference paper | -
dc.description.isOpenAccess | Y | -
dc.description.journalRegisteredClass | scopus | -
dc.subject.keywordPlus | Computational linguistics | -
dc.subject.keywordPlus | Computer vision | -
dc.subject.keywordPlus | Natural language processing systems | -
dc.identifier.url | https://aclanthology.org/2026.findings-eacl.267/ | -
Appears in Collections
Seoul College of Medicine > Seoul Department of Internal Medicine > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Yoon, Jai Hoon
Seoul College of Medicine (DEPARTMENT OF INTERNAL MEDICINE)
