Show simple item record

| Field | Value | Language |
| --- | --- | --- |
| dc.contributor.author | Wang, Pengyu | |
| dc.date.accessioned | 2026-05-21T23:54:31Z | |
| dc.date.available | 2026-05-21T23:54:31Z | |
| dc.date.issued | 2026 | en_AU |
| dc.identifier.uri | https://hdl.handle.net/2123/35335 | |
| dc.description.abstract | Medical large vision-language models have enabled automatic medical report generation, yet two challenges still limit diagnostic utility: normality bias and token-level training misalignment. Models often under-detect abnormalities and miss clinically important findings, while token-level imitation captures writing style rather than report-level clinical correctness. To address these challenges, this thesis proposes two complementary approaches: (i) MRGAgents, a disease-specific multi-agent system that decomposes reporting into condition-focused subtasks for more balanced and comprehensive coverage; and (ii) MRG-R1, a fine-tuning paradigm based on semantic-driven reinforcement learning (SRL) that directly optimizes report-level clinical correctness and factual alignment. MRGAgents uses specialized agents trained on curated disease-specific subsets of IU X-Ray and MIMIC-CXR, giving each agent stronger discrimination and descriptive ability for its target conditions. At inference, their outputs are aggregated to better balance normal and abnormal findings and provide more complete diagnostic descriptions. Empirically, MRGAgents improved coverage and abnormality reporting over strong baselines, reducing missed findings. MRG-R1 introduces SRL with Group Relative Policy Optimization and a margin CheXbert cosine similarity (MCCS) reward on key radiologic findings. This directly optimizes report-level clinical-label agreement and semantic consistency beyond surface fluency. Evaluated on IU X-Ray and MIMIC-CXR with clinical efficacy (CE) metrics, MRG-R1 achieved state-of-the-art CE-F1. Ablation studies showed that MCCS provided finer-grained supervision than CE-F1-based objectives, while an explicit reasoning-to-report process encouraged structured generation and improved diagnostic accuracy with minimal computational overhead. Overall, these architectural and training contributions improve report comprehensiveness, abnormality sensitivity, and clinical correctness for chest X-ray report generation. | en_AU |
| dc.language.iso | en | en_AU |
| dc.subject | Medical Report Generation | en_AU |
| dc.subject | Chest X-ray | en_AU |
| dc.subject | Multi-agent System | en_AU |
| dc.subject | Reinforcement Learning | en_AU |
| dc.title | Multi-agent System and Reinforcement Learning in Medical Report Generation | en_AU |
| dc.type | Thesis | |
| dc.type.thesis | Masters by Research | en_AU |
| dc.rights.other | The author retains copyright of this thesis. It may only be used for the purposes of research and study. It must not be used for any other purposes and may not be transmitted or shared with others without prior permission. | en |
| usyd.faculty | SeS faculties schools::Faculty of Engineering::School of Computer Science | en_AU |
| usyd.degree | Master of Philosophy M.Phil | en_AU |
| usyd.awardinginst | The University of Sydney | en_AU |
| usyd.advisor | Kim, Jinman | |
| usyd.include.pub | No | en_AU |
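
The abstract describes a reward built from cosine similarity over CheXbert finding labels and optimized with Group Relative Policy Optimization. The record does not give the thesis's exact reward formula, so the following is only a minimal sketch under stated assumptions: the margin form (similarity below `margin` earns zero reward, the remainder is rescaled to [0, 1]), the function names, and the toy 4-finding label vectors are all hypothetical, and real label vectors would come from the CheXbert labeler rather than hand-written lists. Only the group-relative normalization (reward minus group mean, divided by group standard deviation) follows the standard Group Relative Policy Optimization formulation.

```python
import numpy as np


def cosine_similarity(a, b, eps=1e-8):
    """Cosine similarity between two label vectors (eps avoids division by zero)."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + eps))


def margin_cosine_reward(pred_labels, ref_labels, margin=0.5):
    """Hypothetical margin reward: zero below the margin, rescaled to [0, 1] above it.

    This is an assumed form, not the thesis's definition of its reward.
    """
    sim = cosine_similarity(pred_labels, ref_labels)
    return max(0.0, sim - margin) / (1.0 - margin)


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against the mean and std of its sampled group."""
    r = np.asarray(rewards, dtype=float)
    return (r - r.mean()) / (r.std() + eps)


# Toy label vectors over 4 hypothetical findings (1 = finding reported).
reference = [1, 0, 1, 0]
group = [
    [1, 0, 1, 0],  # matches the reference labels
    [1, 0, 0, 0],  # misses one abnormality
    [0, 1, 0, 1],  # reports only wrong findings
    [1, 1, 1, 0],  # adds one false positive
]

rewards = [margin_cosine_reward(p, reference) for p in group]
advantages = group_relative_advantages(rewards)

for labels, r, a in zip(group, rewards, advantages):
    print(labels, f"reward={r:.3f}", f"advantage={a:+.3f}")
```

In this sketch, a sampled report whose labels match the reference receives the largest advantage within its group, and one that reports only wrong findings receives the smallest, which is the sense in which a label-level reward can supervise report-level correctness rather than surface wording.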



There are no previous versions of the item available.