Multimodal AI for risk stratification in autism spectrum disorder: integrating voice and screening toolsopen access
- Authors
- Bae, Sookyung; Hong, Junho; Ha, Sungji; Moon, Jiwoo; Yu, Jaeeun; Choi, Hangnyoung; Lee, Junghan; Do, Ryemi; Sim, Hewoen; Kim, Hanna; Kim, Johanna Inhyang; Sung, Haneul; Kim, Hwiyoung; Kim, Bung-Nyun; Cheon, Keun-Ah
- Issue Date
- Aug-2025
- Publisher
- NATURE PUBLISHING GROUP
- Keywords
- Deep Learning; Diagnosis; Risk Assessment; Audio Features; Audio Tools; Autism Spectrum Disorders; Gold Standards; Mobile Applications; Multi-modal; Parent-child Interactions; Risk Categories; Risk Stratification; Screening Tool; Diseases; Adaptive Behavior; Adult; Analytical Error; Article; Artificial Intelligence; Autism; Child; Child Parent Relation; Deep Learning; Diagnostic Accuracy; Diagnostic Test Accuracy Study; Disease Severity; False Positive Result; Female; Follow Up; High Risk Patient; Human; Language Delay; Major Clinical Study; Male; Prediction; Screening; Self Care; Symptom; Videorecording; Voice
- Citation
- npj Digital Medicine, v.8, no.1, pp 1 - 15
- Pages
- 15
- Indexed
- SCIE
SCOPUS
- Journal Title
- npj Digital Medicine
- Volume
- 8
- Number
- 1
- Start Page
- 1
- End Page
- 15
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208721
- DOI
- 10.1038/s41746-025-01914-6
- ISSN
- 2398-6352
2398-6352
- Abstract
- Early Autism Spectrum Disorder (ASD) identification is crucial but resource-intensive. This study evaluated a novel two-stage multimodal AI framework for scalable ASD screening using data from 1242 children (18–48 months). A mobile application collected parent-child interaction audio and screening tool data (MCHAT, SCQ-L, SRS). Stage 1 differentiated typically developing from high-risk/ASD children, integrating MCHAT/SCQ-L text with audio features (AUROC 0.942). Stage 2 distinguished high-risk from ASD children by combining task success data with SRS text (AUROC 0.914, Accuracy 0.852). The model’s predicted risk categories strongly agreed with gold-standard ADOS-2 assessments (79.59% accuracy) and correlated significantly (Pearson r = 0.830, p < 0.001). Leveraging mobile data and deep learning, this framework demonstrates potential for accurate, scalable early ASD screening and risk stratification, supporting timely interventions.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 의과대학 > 서울 정신건강의학교실 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.