한빛사논문
울산대학교 의과대학, 서울아산병원
Hye Hyeon Moon 1#, Jiheon Jeong 2,3#, Ji Eun Park 1*, Namkug Kim 1,2*, Changyong Choi 2,3, Young-Hoon Kim 4, Sang Woo Song 4, Chang-Ki Hong 4, Jeong Hoon Kim 4, Ho Sung Kim 1
1Department of Radiology and Research Institute of Radiology, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Korea.
2Department of Convergence Medicine, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Korea.
3Department of Biomedical Engineering, Asan Medical Institute of Convergence Science of Technology, Asan Medical Center, College of Medicine, University of Ulsan, Seoul, Korea.
4Department of Neurosurgery, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Korea.
#These authors contributed equally to this work.
*Corresponding author: Ji Eun Park, Namkug Kim
Abstract
Background: This study evaluated whether generative artificial intelligence-based augmentation (GAA) can provide diverse and realistic imaging phenotypes and improve deep learning-based classification of isocitrate dehydrogenase (IDH) type in glioma compared with neuroradiologists.
Methods: For model development, 565 patients (346 IDH-wildtype, 219 IDH-mutant) with paired contrast-enhanced T1 and FLAIR MRI scans were collected from tertiary hospital and The Cancer Imaging Archive. Performance was tested on internal (119, 78 IDH-wildtype, 41 IDH-mutant [IDH1 and 2]) and external test sets (108, 72 IDH-wildtype, 36 IDH-mutant). GAA was developed using score-based diffusion model and ResNet50 classifier. The optimal GAA was selected in comparison with null model. Two neuroradiologists (R1, R2) assessed realism, diversity of imaging phenotypes, and predicted IDH mutation. The performance of a classifier trained with optimal GAA was compared with that of neuroradiologists using area under the receiver operating characteristics curve (AUC). The effect of tumor size and contrast enhancement on GAA performance was tested.
Results: Generated images demonstrated realism (Turing's test: 47.5%-50.5%) and diversity indicating IDH type. Optimal GAA was achieved with augmentation with 110 000 generated slices (AUC: 0.938). The classifier trained with optimal GAA demonstrated significantly higher AUC values than neuroradiologists in both the internal (R1, P=.003; R2, P<.001) and external test sets (R1, P<.01; R2, P<.001). GAA with large-sized tumors or predominant enhancement showed comparable performance to optimal GAA (internal test: AUC 0.956 and 0.922; external test: 0.810 and 0.749).
Conclusions: Application of generative AI with realistic and diverse images provided better diagnostic performance than neuroradiologists for predicting IDH type in glioma.
논문정보
관련 링크
연구자 키워드
관련분야 연구자보기
소속기관 논문보기
관련분야 논문보기