Development and evaluation of a machine learning-based point-of-care screening tool for genetic syndromes in children: a multinational retrospective study

Antonio R Porras; Kenneth Rosenbaum; Carlos Tor-Diez; Marshall Summar; Marius George Linguraru

doi:10.1016/S2589-7500(21)00137-0

Development and evaluation of a machine learning-based point-of-care screening tool for genetic syndromes in children: a multinational retrospective study

Lancet Digit Health. 2021 Oct;3(10):e635-e643. doi: 10.1016/S2589-7500(21)00137-0. Epub 2021 Sep 1.

Authors

Antonio R Porras¹, Kenneth Rosenbaum², Carlos Tor-Diez³, Marshall Summar², Marius George Linguraru⁴

Affiliations

¹ Sheikh Zayed Institute for Pediatric Surgical Innovation, Children's National Hospital, Washington, DC, USA; Department of Biostatistics & Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA. Electronic address: antonio.porras@cuanschutz.edu.
² Rare Disease Institute, Department of Genetics and Metabolism, Children's National Hospital, Washington, DC, USA.
³ Sheikh Zayed Institute for Pediatric Surgical Innovation, Children's National Hospital, Washington, DC, USA.
⁴ Sheikh Zayed Institute for Pediatric Surgical Innovation, Children's National Hospital, Washington, DC, USA; Departments of Radiology and Pediatrics, School of Medicine, Department of Biomedical Engineering, School of Engineering and Applied Science, George Washington University, Washington, DC, USA. Electronic address: mlingura@childrensnational.org.

PMID: 34481768
DOI: 10.1016/S2589-7500(21)00137-0

Abstract

Background: Delays in the diagnosis of genetic syndromes are common, particularly in low and middle-income countries with limited access to genetic screening services. We, therefore, aimed to develop and evaluate a machine learning-based screening technology using facial photographs to evaluate a child's risk of presenting with a genetic syndrome for use at the point of care.

Methods: In this retrospective study, we developed a facial deep phenotyping technology based on deep neural networks and facial statistical shape models to screen children for genetic syndromes. We trained the machine learning models on facial photographs from children (aged <21 years) with a clinical or molecular diagnosis of a genetic syndrome and controls without a genetic syndrome matched for age, sex, and race or ethnicity. Images were obtained from three publicly available databases (the Atlas of Human Malformations in Diverse Populations of the National Human Genome Research Institute, Face2Gene, and the dataset available from Ferry and colleagues) and the archives of the Children's National Hospital (Washington, DC, USA), in addition to photographs taken on a standard smartphone at the Children's National Hospital. We designed a deep learning architecture structured into three neural networks, which performed image standardisation (Network A), facial morphology detection (Network B), and genetic syndrome risk estimation, accounting for phenotypic variations due to age, sex, and race or ethnicity (Network C). Data were divided randomly into 40 groups for cross validation, and the performance of the model was evaluated in terms of accuracy, sensitivity, and specificity in both the total population and stratified by race or ethnicity, age, and sex.

Findings: Our dataset included 2800 facial photographs of children (1318 [47%] female and 1482 [53%] male; 1576 [56%] White, 432 [15%] African, 430 [15%] Hispanic, and 362 [13%] Asian). 1400 children with 128 genetic conditions were included (the most prevalent being Williams-Beuren syndrome [19%], Cornelia de Lange syndrome [17%], Down syndrome [16%], 22q11.2 deletion [13%], and Noonan syndrome [12%] syndrome) in addition to 1400 photographs of matched controls. In the total population, our deep learning-based model had an accuracy of 88% (95% CI 87-89) for the detection of a genetic syndrome, with 90% sensitivity (95% CI 88-92) and 86% specificity (95% CI 84-88). Accuracy was greater in White (90%, 89-91) and Hispanic populations (91%, 88-94) than in African (84%, 81-87) and Asian populations (82%, 78-86). Accuracy was also similar in male (89%, 87-91) and female children (87%, 85-89), and similar in children younger than 2 years (86%, 84-88) and children aged 2 years or older (eg, 89% [87-91] for those aged 2 years to <5 years).

Interpretation: This genetic screening technology could support early risk stratification at the point of care in global populations, which has the potential accelerate diagnosis and reduce mortality and morbidity through preventive care.

Funding: Children's National Hospital and Government of Abu Dhabi.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Africa
Asia
Face
Facial Expression
Female
Genetic Diseases, Inborn / diagnosis*
Hispanic or Latino
Humans
Infant
Internationality
Machine Learning*
Male
Phenotype*
Photography*
Point-of-Care Systems*
Reproducibility of Results
Retrospective Studies
Risk Assessment
Sensitivity and Specificity
White People