paper · arXiv

CORTEX: A Structured Reasoning Benchmark for Trustworthy 3D Chest CT MLLMs

Reasoning in multimodal large language models (MLLMs) has shown strong promise in medical imaging. However, this reasoning is usually free-form text judged only by its final answer, making it hard to interpret and verify, especially in 3D radiology, where a diagnosis should be traceable to evidence in the scan. Existing chest CT question-answering datasets compound this by reducing expert radiology reports to answer-only pairs, dropping the reasoning that links findings to conclusions and omitting the patient history clinicians rely on. As a result, reasoning-capable 3D chest CT MLLMs remain o

Want the primary source?View original →

newsUsing AI to help physicians diagnose rare genetic diseases affecting children

articleRetrieval is underrated companyNorthwind AI

newsUsing AI to help physicians diagnose rare genetic diseases affecting children companyNorthwind AI articleRetrieval is underrated

cs.CV