
Received 13 December 2022, accepted 25 December 2022, date of publication 26 December 2022, date of current version 16 February 2023.


Digital Object Identifier 10.1109/ACCESS.2022.3232490

A Machine Learning Framework for Early-Stage Detection of Autism Spectrum Disorders
S. M. MAHEDY HASAN1 , MD PALASH UDDIN 2,3 , (Member, IEEE),
MD AL MAMUN1 , (Senior Member, IEEE), MUHAMMAD IMRAN SHARIF4 ,
ANWAAR ULHAQ 5 , AND GOVIND KRISHNAMOORTHY6
1 Department of Computer Science and Engineering, Rajshahi University of Engineering and Technology, Rajshahi 6204, Bangladesh
2 Department of Computer Science and Engineering, Hajee Mohammad Danesh Science and Technology University, Dinajpur 5200, Bangladesh
3 School of Information Technology, Deakin University, Geelong, VIC 3220, Australia
4 Department of Computer Science, COMSATS University Islamabad, Wah Campus, Punjab 47040, Pakistan
5 School of Computing, Mathematics and Engineering, Charles Sturt University, Port Macquarie, NSW 2444, Australia
6 School of Psychology and Wellbeing, University of Southern Queensland, Ipswich, QLD 4305, Australia

Corresponding author: Anwaar Ulhaq ([email protected])


This work was supported by the Regional Australia Mental Health Research and Training Institute, Manna Institute, NSW, Australia, under
Grant 0000103935.

ABSTRACT Autism Spectrum Disorder (ASD) is a type of neurodevelopmental disorder that affects patients' everyday lives. Though the disorder is considered hard to cure completely, its severity can be mitigated through early intervention. In this paper, we propose an effective framework for
the evaluation of various Machine Learning (ML) techniques for the early detection of ASD. The proposed
framework employs four different Feature Scaling (FS) strategies i.e., Quantile Transformer (QT), Power
Transformer (PT), Normalizer, and Max Abs Scaler (MAS). Then, the feature-scaled datasets are classified
through eight simple but effective ML algorithms like Ada Boost (AB), Random Forest (RF), Decision Tree
(DT), K-Nearest Neighbors (KNN), Gaussian Naïve Bayes (GNB), Logistic Regression (LR), Support Vector
Machine (SVM) and Linear Discriminant Analysis (LDA). Our experiments are performed on four standard
ASD datasets (Toddlers, Adolescents, Children, and Adults). Comparing the classification outcomes using
various statistical evaluation measures (Accuracy, Receiver Operating Characteristic: ROC curve, F1-score,
Precision, Recall, Matthews Correlation Coefficient: MCC, Kappa score, and Log loss), the best-performing
classification methods, and the best FS techniques for each ASD dataset are identified. After analyzing the
experimental outcomes of different classifiers on feature-scaled ASD datasets, it is found that AB predicted
ASD with the highest accuracy of 99.25%, and 97.95% for Toddlers and Children, respectively and LDA
predicted ASD with the highest accuracy of 97.12% and 99.03% for Adolescents and Adults datasets,
respectively. These highest accuracies are achieved while scaling Toddlers and Children with normalizer
FS and Adolescents and Adults with the QT FS method. Afterward, the ASD risk factors are calculated, and
the most important attributes are ranked according to their importance values using four different Feature
Selection Techniques (FSTs) i.e., Info Gain Attribute Evaluator (IGAE), Gain Ratio Attribute Evaluator
(GRAE), Relief F Attribute Evaluator (RFAE), and Correlation Attribute Evaluator (CAE). These detailed
experimental evaluations indicate that proper finetuning of the ML methods can play an essential role in
predicting ASD in people of different ages. We argue that the detailed feature importance analysis in this
paper will guide the decision-making of healthcare practitioners while screening ASD cases. The proposed
framework has achieved promising results compared to existing approaches for the early detection of ASD.

INDEX TERMS Autism spectrum disorder, machine learning, classification, feature scaling, feature
selection technique.

This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/

The associate editor coordinating the review of this manuscript and approving it for publication was Santosh Kumar.

I. INTRODUCTION
Autism Spectrum Disorder (ASD) is a neurodevelopmental condition associated with brain development that starts at an early

stage of life, impacting a person's social relationships and interactions [1], [2]. ASD involves restricted and repetitive behavioral patterns, and the word spectrum encompasses a wide range of symptoms and intensities [3], [4], [5]. Even though there is no cure for ASD, early intervention and proper medical care can make a significant difference in a child's development by improving the child's behaviors and communication skills [6], [7], [8]. Even so, the identification and diagnosis of ASD using traditional behavioral science are difficult and complex. Usually, autism is most commonly diagnosed at about two years of age, though it can also be diagnosed later based on its severity [9], [10], [11]. A variety of screening strategies are available to detect ASD as quickly as possible, but these diagnostic procedures are not always widely used in practice until there is already a serious risk of developing ASD. The authors in [12] provided a short and observable checklist that can be applied at different stages of a person's life, including toddlers, children, teens, and adults. Subsequently, the authors in [13] constructed the ASDTests mobile app for ASD identification as fast as possible, based on a range of questionnaire surveys and the Q-CHAT and AQ-10 methods. Consequently, they also created an open-source dataset utilizing the mobile phone app information and published the datasets on the publicly accessible University of California, Irvine (UCI) machine learning repository and Kaggle for further development in this area of study.

Over the past few years, several studies have been conducted incorporating various Machine Learning (ML) approaches to analyze and diagnose ASD as well as other diseases, such as diabetes, stroke, and heart failure, as quickly as possible [14], [15], [16]. The authors in [17] analyzed the ASD attributes utilizing Rule-based ML (RML) techniques and confirmed that RML helps classification models boost classification accuracy. The authors in [18] combined the Random Forest (RF) and Iterative Dichotomiser 3 (ID3) algorithms and produced predictive models for children, adolescents, and adults. The authors in [19] introduced a new evaluation tool, integrating the ADI-R and ADOS ML methods, and implemented different attribute encoding approaches to resolve data insufficiency, non-linearity, and inconsistency issues. Another study conducted by the authors in [13] demonstrates feature-to-class and feature-to-feature correlation values utilizing cognitive computing and implemented Support Vector Machines (SVM), Decision Tree (DT), and Logistic Regression (LR) as ASD diagnostic and prognosis classifiers [17]. In addition, the authors in [20] explored typically developing (TD) (N = 19) and ASD (N = 11) cases, in which a correlation-based attribute selection was used to determine the importance of the attributes. In 2015, the authors in [21] investigated ASD and TD children and recognized 15 preschool ASDs using only seven features. Besides that, they conveyed that cluster analysis might effectively analyze complex patterns to predict ASD phenotype and diversity. The authors in [22] contrasted the classifier accuracy of K-Nearest Neighbors (KNN), LR, Linear Discriminant Analysis (LDA), Classification and Regression Trees (CART), Naive Bayes (NB), and SVM for adult ASD prediction. In [23], an ML model via induction of rules was proposed for autism detection, which includes testing on only one dataset and limited comparison. The authors in [17] used LR analysis to build an ML autism classification approach, which also lacks extensive validation and comparison. The authors in [24] scrutinized autism data and observed that 5 of the overall 65 characteristics are sufficient to distinguish ASD from attention deficit hyperactivity disorder (ADHD). In 2019, the authors in [25] constructed an RF-based model for the prediction of ASD utilizing behavioral features. In addition, the authors in [26] used LDA and KNN methods to identify ASD children between the ages of 4 and 11 years. In 2018, the authors in [27] suggested an ASD model based on the RF classifier for children between the ages of 4 and 11. The authors in [28] evaluated the predictive performance of the Deep Neural Network (DNN) in the diagnosis of ASD utilizing two distinct Adult datasets. In 2019, the authors in [18] constructed a smartphone application programming interface on RF-CART and RF-ID3 for the diagnosis of ASD at all ages. The authors in [29] assessed the performance of multiple SVM kernels in classifying ASD data for children and found that the polynomial kernel worked much better. The authors in [1] performed several feature selection techniques on four ASD datasets and found that the SVM classifier performed better for the RIPPER-based toddler subset, the correlation-based feature selection (CFS) and Boruta CFS intersect (BIC) method-based child subsets, and the CFS-based adult subset. Furthermore, they applied the Shapley Additive Explanations (SHAP) method to the various feature subsets that achieved the highest accuracy and ranked their features based on performance. The authors in [30] carried out ensemble ML approaches of Fuzzy K-Nearest Neighbor (FKNN), Kernel Support Vector Machines (KSVM), Fuzzy Convolution Neural Network (FCNN), and Random Forest (RF) to classify Parkinson's disease and ASD. Finally, the classification results are verified utilizing Leave-One-Person-Out Cross Validation (LOPOCV). The authors in [31] performed an evolutionary cultural optimization algorithm to optimize the weights of Artificial Neural Networks (ANN) in classifying three benchmark autism screening datasets of Toddlers, Children, and Adults. The authors in [32] performed an experimental analysis using 16 different ML models; among them, four bio-inspired algorithms, namely Gray Wolf Optimization (GWO), the Flower Pollination Algorithm (FPA), the Bat Algorithm (BA), and Artificial Bee Colony (ABC), were employed for optimizing the wrapper feature selection method in order to select the most informative features and to increase the accuracy of the classification models on genetic and personal characteristics datasets. Another study conducted by the authors in [33] combined three benchmark datasets (Toddlers, Adolescents, and Adults) and applied a Light Gradient Boosting Machine (LGBM) classifier to classify ASD. The authors in [34] utilized Extreme Learning Machines (ELM) and Random Vector Functional Link (RVFL) generalization


techniques to classify the Toddlers, Adolescents, and Adults datasets.

TABLE 1. Datasets description.

This study gathers four standard ASD datasets (Toddlers, Children, Adolescents, and Adults) and initially preprocesses the datasets (handling of missing values and encoding). Then, four Feature Scaling (FS) methods, including Quantile Transformer (QT), Power Transformer (PT), Normalizer, and Max Abs Scaler (MAS), are employed to map the datasets into an appropriate format for further assessment. Thereafter, the feature-scaled datasets are classified by eight simple but effective classification approaches (AB, RF, DT, KNN, Gaussian Naive Bayes (GNB), LR, SVM, and LDA), and the best classification models are identified. Meanwhile, we also explore the significance of the FS methods on each dataset by analyzing the experimental outcomes of the transformed datasets. Afterward, four Feature Selection Techniques (FSTs), i.e., Info Gain Attribute Evaluator (IGAE), Gain Ratio Attribute Evaluator (GRAE), Relief F Attribute Evaluator (RFAE), and Correlation Attribute Evaluator (CAE), are implemented to calculate the risk factors of ASD and rank the most important features of the feature-scaled Toddlers, Children, Adolescents, and Adults datasets. Accordingly, this study suggests that ML methods can be applied to help identify the most significant features for ASD detection based on the FST-based feature importance analysis, which will help physicians diagnose ASD cases accurately. Notice that the work presented in [35] may seem somewhat similar to ours. However, the notable differences are as follows. (i) We consider four promising FS methods (QT, PT, Normalizer, and MAS), whereas the three FS methods (Logarithmic, ZScore, and Sine) used in [35] are obsolete nowadays. (ii) After applying each FS method, we find the best FST from a list of IGAE, GRAE, RFAE, and CAE for each dataset to train the ML models, whereas [35] did not consider any such tuning of the FST methods. (iii) We consider eight simple but effective ML models for the prediction, whereas the ML models used in [35] are archaic in this domain. (iv) Finally, we compare more recent works with our proposed model, in contrast to [35]. To this end, the key contributions of this paper are summarized as follows.
• We develop a generalized ML framework for early-stage detection of ASD in people of different ages.
• We solve the imbalanced class distribution issue through Random Over Sampler to avoid the ML models being biased towards the majority class samples.
• We select the best Feature Scaling (FS) method to map each individual ASD dataset's feature values to improve the prediction performance.
• We investigate eight simple but effective ML approaches on each feature-scaled ASD dataset, analyze their classification performance, and identify the best FS technique for each ASD dataset.
• Furthermore, we calculate and analyze the feature importance values on each best feature-scaled ASD dataset based on four FSTs to identify the risk factors for ASD prediction.
• Finally, we perform extensive experiments and comparisons using four different standard ASD datasets.
The remaining part of the paper is organized as follows. Section 2 demonstrates the proposed research methodology and the materials used in the study. Section 3 analyzes the detailed experimental outcomes, while Section 4 discusses the comparative results of related works in this domain. Finally, Section 5 summarizes and concludes the observations and findings.

II. MATERIALS & METHODS
A. DATASET DESCRIPTION
We collect the four ASD datasets (Toddlers, Adolescents, Children, and Adults) from the publicly available repositories Kaggle and UCI ML [36], [37], [38], [39]. The authors in [13] created the ASDTests smartphone app for Toddlers, Children, Adolescents, and Adults ASD screening using QCHAT-10 and AQ-10. The application computes a score of 0 to 10 for every individual, where a final score of at least 6 out of 10 indicates a positive ASD screening. In addition, the ASD data obtained through the ASDTests app has been released as open-source databases in order to facilitate research in this area. The detailed descriptions of the Toddlers, Children, Adolescents, and Adults ASD datasets are given in Table 1 and Table 2.

B. METHOD OVERVIEW
This research aims to create an effective prediction model using different types of ML methods to detect autism in people of different ages. First of all, the datasets are collected, and then the preprocessing is accomplished via missing value imputation, feature encoding, and oversampling. The Mean Value Imputation (MVI) method is used to impute the missing values of the datasets. Then, the categorical feature values are converted to their equivalent numerical values using the One Hot Encoding (OHE) technique. Table 1 shows that all four datasets used in this work have an imbalanced class distribution problem. As such, a Random Over Sampler strategy is used to alleviate this issue. After completing the initial preprocessing, the datasets' feature values are scaled using four different FS techniques, i.e., QT, PT, Normalizer, and MAS (see their detailed operations in Table 3). The feature-scaled datasets are then classified using eight different ML classification techniques, i.e., AB, RF, DT, KNN, GNB, LR, SVM, and LDA. Comparing the classification outcomes of the classifiers on different feature-scaled


FIGURE 1. Sequential workflow for detecting ASD at an early stage.

TABLE 2. Feature description of the ASD datasets.

ASD datasets, the best-performing classification methods and the best FS techniques for each ASD dataset are identified. After those analyses, the ASD risk factors are calculated, and the most important attributes are ranked according to their importance values using four different FSTs, i.e., IGAE, GRAE, RFAE, and CAE (see the detailed operations in Table 4). To this end, Fig. 1 represents the proposed research pipeline to analyze the ASD datasets and calculate the risk factors that are most responsible for ASD detection.

C. MACHINE LEARNING METHOD
1) ADA BOOST (AB)
AB is a tree-based ensemble classifier that incorporates many weak classifiers to reduce misclassification errors [41].
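As a concrete sketch of the scaling-plus-classification pipeline evaluated under 10-fold cross-validation, the combination of one FS method with one classifier can be expressed with scikit-learn, the library used for the experiments in Section III. The synthetic data, the pairing of Normalizer with AB, and all parameter values below are illustrative assumptions, not the paper's datasets or tuned settings:

```python
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import Normalizer
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import cross_val_score

# Synthetic stand-in for a preprocessed (imputed, encoded, oversampled)
# ASD dataset -- illustrative only.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))
y = (X[:, 0] + X[:, 1] > 0).astype(int)

# One FS method (Normalizer) feeding one classifier (AB), evaluated with
# the same 10-fold cross-validation protocol used in the paper.
model = make_pipeline(Normalizer(),
                      AdaBoostClassifier(n_estimators=50, random_state=0))
scores = cross_val_score(model, X, y, cv=10, scoring="accuracy")
print(round(scores.mean(), 3))
```

Swapping Normalizer for QuantileTransformer, PowerTransformer, or MaxAbsScaler, and AB for any of the other seven classifiers, reproduces the grid of FS-classifier combinations the framework compares.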


TABLE 3. Detailed description of the different FS methods [40].

TABLE 4. Detailed description of the different FST methods [40].

It selects the training set and iteratively assigns the weights depending on the previous training precision for retraining the algorithm. In order to train any weak classifier, an arbitrary subset of the full training set is used, and AB assigns weights to each instance and classifier. The following equation defines the combination of the several weak classifiers:

H(x) = sign( Σ_{t=1}^{T} αt · ht(x) )   (9)

where H(x) defines the output of the final model through combining the weak classifiers, ht(x) represents the output of classifier t for input x, and αt specifies the weight assigned to that classifier. αt is calculated as follows:

αt = 0.5 · ln( (1 − E) / E )   (10)

where E denotes the error rate. The following equation is utilized to update the weight of each training sample-label pair (xi, yi):

Dt+1(i) = Dt(i) · exp(−αt · yi · ht(xi)) / Zt   (11)

where Dt+1 denotes the updated weight, Dt specifies the weight at the previous iteration, and Zt is the sum of all updated weights (a normalization factor).
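One boosting round of Eqs. (10)-(11) can be sketched in a few lines of plain Python; the function name and the toy ±1 predictions/labels below are illustrative, not from the paper:

```python
import math

def adaboost_round(weights, preds, labels):
    """One AB round: weighted error E, classifier weight alpha_t (Eq. 10),
    and the renormalized sample weights (Eq. 11)."""
    # Weighted error rate E of the weak classifier on this round.
    E = sum(w for w, p, y in zip(weights, preds, labels) if p != y)
    alpha = 0.5 * math.log((1 - E) / E)                 # Eq. (10)
    # Misclassified samples (y * h(x) = -1) gain weight, the rest lose it.
    updated = [w * math.exp(-alpha * y * p)
               for w, p, y in zip(weights, preds, labels)]
    Z = sum(updated)                                    # normalizer Z_t
    return alpha, [w / Z for w in updated]              # Eq. (11)
```

For example, starting from uniform weights [0.25] * 4 with one misclassified sample, the misclassified sample's normalized weight rises to 0.5, so the next weak learner concentrates on it.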


2) RANDOM FOREST (RF)
RF is a decision tree-based ensemble classification method and follows a divide-and-conquer technique on the input dataset to create multiple decision-making trees (known as the forest) [42]. It works in two phases. At first, it creates a forest by combining 'N' decision trees, and in the second phase, it makes predictions with each tree generated in the first phase. The working process of the RF algorithm is illustrated below:
1) Select random samples from the training dataset.
2) Construct a decision tree for each training sample.
3) Select the value of 'N' to define the number of decision trees.
4) Repeat Steps 1 and 2.
5) For each test sample, find the predictions of each decision tree, and assign the test sample a class value based on majority voting.

3) DECISION TREE (DT)
DT follows a top-down approach to build a predictive model for class values by inducing decision-making rules from the training data [43]. This research utilized the information gain method to select the best attribute. Assume Pi is the probability that an instance xi ∈ D belongs to a class Ci, estimated by |Ci,D|/|D|. To classify instances in the dataset D, the required information is calculated by the following equation:

Info(D) = − Σ_{i=1}^{m} Pi · log2(Pi)   (12)

where Info(D) is the average amount of information needed to identify the class Ci of an instance xi ∈ D, and the objective of DT is to repeatedly divide D into sub-datasets D1, D2, ..., Dn. The following equation estimates InfoA(D):

InfoA(D) = Σ_{j=1}^{v} ( |Dj| / |D| ) · Info(Dj)   (13)

Finally, the following equation calculates the information gain value:

Gain(A) = Info(D) − InfoA(D)   (14)

4) K-NEAREST NEIGHBORS (KNN)
KNN classifies the test data by utilizing the training data directly, based on the value K indicating the number of nearest neighbors [43]. For each instance, it computes the distances to all the training instances and sorts them. Then, a majority voting technique is employed to assign the final class label to the test data. This research applies the Euclidean distance to calculate the distances among instances. The following equation represents the Euclidean distance calculation:

De = sqrt( Σ_{i=1}^{n} (Xi − Yi)² )   (15)

where De indicates the Euclidean distance, Xi denotes the testing sample values, Yi specifies the training sample values, and n represents the total number of sample values.

5) GAUSSIAN NAÏVE BAYES (GNB)
The GNB algorithm assumes a normal distribution and is used for classification when all the data values of a dataset are numeric [43]. To compute the probability values of any instance with respect to a class value, the mean and standard deviation are calculated for each attribute of the dataset. Consequently, for testing, when any instance arrives, it utilizes the mean and standard deviation values to calculate the probability of the test instance. The necessary equations are given below:

µ = (1/n) Σ_{i=1}^{n} xi   (16)

δ = sqrt( (1/(n−1)) Σ_{i=1}^{n} (xi − µ)² )   (17)

f(x) = ( 1 / (sqrt(2π) · δ) ) · e^{−(x−µ)² / (2δ²)}   (18)

where µ indicates the mean, δ represents the standard deviation, xi denotes the samples in a particular column, n indicates the total number of samples, and f(x) presents the conditional probability of the class value.

6) LOGISTIC REGRESSION (LR)
Based on a given dataset of independent variables, logistic regression calculates the likelihood that an event will occur, such as voting or not voting. Given that the result is a probability, the dependent variable's range is 0 to 1. In logistic regression, the odds, that is, the likelihood of success divided by the probability of failure, are transformed using the logit formula. The following formula expresses this logistic function, which is sometimes referred to as the log odds or the natural logarithm of odds [43]:

p = 1 / (1 + e^{−x})   (19)

where p denotes the probability of instance x. At the time of model training, for each instance x1, x2, x3, ..., xn the logistic coefficients will be b0, b1, b2, ..., bn. The stochastic gradient descent method estimates and updates the values of the coefficients:

v = b0·x0 + b1·x1 + ... + bn·xn   (20)

p = 1 / (1 + e^{−v})   (21)

Now, the following equation is used to update the values of the coefficients:

b = b + l · (y − p) · (1 − p) · p · x   (22)

Initially, all the coefficient values are 0, and y is the output value for each training sample, where l denotes the learning rate and x represents the biased input for b0 and is always 1. It updates the


values of the coefficients until it predicts the correct output at the training stage.

7) SUPPORT VECTOR MACHINE (SVM)
SVM is used to classify both linear and non-linear data and mostly works well for high-dimensional data with non-linear mapping. It explores the decision boundary or optimal hyperplane to separate one class from another. This study used the Radial Basis Function (RBF) as the kernel function; SVM automatically defines centers, weights, and thresholds, and reduces an upper bound of the expected test error [29], [44]. The following equation represents the RBF function:

K(x, x′) = exp( −||x − x′||² / (2δ²) )   (23)

where ||x − x′||² defines the squared Euclidean distance between the two feature samples and δ is a free parameter.

8) LINEAR DISCRIMINANT ANALYSIS (LDA)
LDA is a dimensionality reduction technique but can be used for classification by exploring a linear combination of features [45]. LDA uses the Bayes theorem to estimate the probability. Let us consider k classes and n training samples defined as {x1, x2, ..., xn} with classes zi ∈ {1, ..., k}. The distribution within each class is assumed to be Gaussian, φ(x|µk, Σ). The model estimation is defined as follows:

ak = ( Σ_{i=1}^{n} l(zi = k) ) / n   (24)

µk = ( Σ_{i=1}^{n} xi · l(zi = k) ) / ( Σ_{i=1}^{n} l(zi = k) )   (25)

Σ = ( Σ_{i=1}^{n} (xi − µ_{zi})(xi − µ_{zi})^T ) / n   (26)

where ak denotes the prior probability of class k, µk defines the mean of class k, l(·) is the indicator function, and Σ indicates the pooled sample covariance of the classes.

III. EXPERIMENTAL RESULTS ANALYSIS
A. EXPERIMENTAL SETUP
In order to conduct the experiments, an open-source cloud-based service named Google Colaboratory provided by Google is utilized. The scikit-learn package of the Python programming language is used to complete the data preprocessing, feature scaling, feature selection, and classification tasks. In this work, a 10-fold cross-validation technique [46], [47], [48] is utilized to construct prediction models using the four different ASD (Toddlers, Children, Adolescents, and Adults) datasets. In 10-fold cross-validation, the dataset is randomly divided into 10 equal folds. During model building, nine folds are used for training and the remaining one is used for testing. This procedure is repeated 10 times, and finally the results are averaged. Here, due to the lack of enough samples in the datasets, 10-fold cross-validation is used to prevent the model from overfitting, reduce the variance during model building, and generalize the model with a small amount of data. If we performed hold-out validation with a fixed test set, there would be a possibility of overfitting during model building, which would increase the variance, and thus the prediction model could not generalize to unseen test data. Various statistical evaluation measures, including accuracy, the Receiver Operating Characteristics (ROC) curve, F1-score, precision, recall, the Matthews Correlation Coefficient (MCC), the Kappa score, and Log loss, are considered to justify the experimental outcomes. The evaluation measures are calculated using the following formulae:

Accuracy = (TN + TP) / (TN + TP + FN + FP)   (27)

Precision = TP / (FP + TP)   (28)

Recall = TP / (FN + TP)   (29)

F1-Score = 2TP / (FN + FP + 2TP)   (30)

MCC = (TP · TN − FP · FN) / ( (TP+FP)(TP+FN)(TN+FP)(TN+FN) )^{1/2}   (31)

Kappa = (po − pe) / (1 − pe)   (32)

LogLoss = −( y · log(y′) + (1 − y) · log(1 − y′) )   (33)

The following terms represent the above equations: TP = True Positive; TN = True Negative; FP = False Positive; FN = False Negative; po is the relative observed agreement among raters; pe is the hypothetical probability of chance agreement; y is the actual/true value; and y′ is the prediction probability of each observation.

B. ANALYSIS ON ACCURACY
Accuracy represents the actual prediction performance of any classifier; a higher accuracy value indicates better prediction and less misclassification. The accuracy values of various classifiers on different feature-scaled datasets are presented in Table 5.
In this case, LDA delivers the best accuracy of 97.12% for the normalizer-scaled Adolescent dataset. Moreover, while investigating the results of the feature-scaled Adult dataset, it is seen that both the QT- and normalizer-scaled datasets perform better than the other FS methods. In both of these cases, LDA achieves the best accuracy value of 99.03%. Additionally, the accuracy values of various ML classifiers on the feature-scaled Toddlers, Children, Adolescents, and Adult datasets are contrasted in Fig. 2.

C. ANALYSIS ON PRECISION
Precision represents the positive predictive value; a higher precision means the true positives are high and the false positives are low. The precision values of various classifiers on different feature-scaled datasets are presented


FIGURE 2. Accuracy of the classifiers on different feature-scaled datasets.

TABLE 5. Accuracy of different ML classifiers on ASD datasets.

in Table 6. Analyzing the precision values of the Toddler dataset, it is found that the AB classifier provides the best precision of 99.95% while PT is used as the FS method. While reviewing the feature-scaled Children dataset, it is noticed that the LR classifier obtains the highest precision of 96.16% for MAS in classifying ASD. Furthermore, inspecting the feature-scaled Adolescent dataset, we observe that DT delivers the best precision of 97.25% while using PT as the FS method. Moreover, while investigating the results of the feature-scaled Adult dataset, it is seen that the QT-transformed dataset performs better than the other FS methods. In that case, SVM achieves the best precision value of 98.16%. Additionally,


FIGURE 3. Precision of the classifiers on different feature-scaled datasets.

TABLE 6. Precision of the different ML classifiers on ASD datasets.

Additionally, the precision values of the various ML classifiers on the feature-scaled Toddlers, Children, Adolescents, and Adult datasets are contrasted in Fig. 3.

D. ANALYSIS ON RECALL
Recall represents the true positive rate; a higher value of recall means the true positive value is high and the false negative value is low, which indicates a better prediction. The recall values of the various ML classifiers on the different feature-scaled datasets are presented in Table 7. While reviewing the recall results of the feature-scaled Toddler dataset, it is observed that AB obtains the highest recall of 98.45% for the normalizer-scaled Toddler dataset.
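The precision and recall definitions above can be checked on a toy confusion matrix; the labels below are illustrative, not taken from the ASD datasets:

```python
# precision = TP / (TP + FP); recall = TP / (TP + FN)
from sklearn.metrics import confusion_matrix, precision_score, recall_score

y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0]   # 3 TP, 1 FN, 1 FP, 3 TN

tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print(tp / (tp + fp))                   # 0.75 (precision by hand)
print(tp / (tp + fn))                   # 0.75 (recall by hand)
print(precision_score(y_true, y_pred))  # 0.75
print(recall_score(y_true, y_pred))     # 0.75
```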


FIGURE 4. Recall of the classifiers on different feature-scaled datasets.

TABLE 7. Recall of the different ML classifiers on ASD datasets.

Investigating the feature-scaled Children datasets, we find that LR delivers the best recall value of 97.72% while using normalizer as the FS method. Moreover, inspecting the recall results of the feature-scaled Adolescent datasets, it is noticed that AB achieves the highest recall of 97.36% for the normalizer-scaled Adolescent datasets. Finally, we analyze the outcomes of the feature-scaled Adult datasets and find that RF, KNN, and LR deliver the highest recall of 100.00% for PT, while DT and KNN obtain the best recall of 100.00% for PT, and KNN and LR also obtain a 100.00% recall value for the MAS-scaled Adult datasets.


FIGURE 5. ROC of the classifiers on different feature-scaled datasets.

TABLE 8. ROC of the different ML classifiers on ASD datasets.

Besides, we also compare the recall values of the various ML classifiers on the feature-scaled Toddlers, Children, Adolescents, and Adult datasets in Fig. 4.

E. ANALYSIS ON ROC
The ROC value indicates the ability of any classifier to distinguish between positive and negative classes. The ROC values of the various ML classifiers on the different feature-scaled datasets are presented in Table 8. While reviewing the ROC results of the feature-scaled Toddler dataset, it is observed that LR obtains the highest ROC of 99.99% for both QT and PT, and AB achieves 99.99% for the normalizer method. Investigating the feature-scaled Children dataset, it is found that GNB delivers the best ROC value of 99.73% using normalizer as the FS method.
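As a small illustration of what the ROC (AUC) value measures, scikit-learn's `roc_auc_score` can be applied to made-up probability scores (1.0 means perfect separation of the two classes, 0.5 means chance level); the values below are invented for the example:

```python
# ROC AUC: probability that a random positive is scored above a
# random negative by the classifier.
from sklearn.metrics import roc_auc_score

y_true = [0, 0, 1, 1]
y_score = [0.1, 0.4, 0.35, 0.8]  # predicted probability of class 1

print(roc_auc_score(y_true, y_score))  # 0.75
```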


FIGURE 6. F1-score of the classifiers on different feature-scaled datasets.

TABLE 9. F1-score of the different ML classifiers on ASD datasets.

Moreover, inspecting the ROC results of the feature-scaled Adolescent dataset, we notice that both AB and LDA achieve the highest ROC of 99.72% for the QT and MAS-scaled datasets. Finally, we analyze the outcomes of the feature-scaled Adult datasets and find that LDA delivers the highest ROC value of 99.99% while using PT and normalizer as the FS methods. We compare the ROC values of the various ML classifiers on the feature-scaled Toddlers, Children, Adolescents, and Adult datasets in Fig. 5.

F. ANALYSIS ON THE F1-SCORE
The F1-score takes the harmonic mean of the precision and recall values, and a higher value indicates a better prediction.
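The harmonic-mean definition above can be verified on toy numbers; the values are illustrative, not from the paper's tables:

```python
# F1 = harmonic mean of precision and recall
#    = 2 * P * R / (P + R) = 2*TP / (2*TP + FP + FN)
from sklearn.metrics import f1_score

precision, recall = 0.9, 0.6
harmonic_mean = 2 * precision * recall / (precision + recall)
print(round(harmonic_mean, 4))  # 0.72

y_true = [1, 1, 1, 0, 0]
y_pred = [1, 1, 0, 1, 0]        # TP=2, FN=1, FP=1
print(f1_score(y_true, y_pred))  # 2*2 / (2*2 + 1 + 1) = 4/6
```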


FIGURE 7. Kappa of the classifiers on different feature-scaled datasets.

TABLE 10. Kappa of the different ML classifiers on ASD datasets.

The F1-score values of the various ML classifiers on the different feature-scaled datasets are presented in Table 9. While reviewing the F1-score results of the feature-scaled Toddler dataset, we observe that AB obtains the highest F1-score of 99.14% for the normalizer-scaled Toddler dataset. Investigating the feature-scaled Children dataset, it is found that AB delivers the best F1-score value of 97.02% while using QT and normalizer as the FS methods. Moreover, inspecting the F1-score results of the feature-scaled Adolescent datasets, we notice that AB achieves the highest F1-score of 97.69% for the QT-scaled Adolescent dataset. Finally, we analyze the outcomes of the feature-scaled Adult dataset and notice that LDA delivers the highest F1-score value of 99.11% while using PT as the FS method.


FIGURE 8. Log loss of the classifiers on different feature-scaled datasets.

TABLE 11. Log loss of the different ML classifiers on ASD datasets.

We compare the F1-score values of the various ML classifiers on the feature-scaled Toddlers, Children, Adolescents, and Adult datasets in Fig. 6.

G. ANALYSIS ON KAPPA
The kappa score measures the degree of agreement between the true class and the predicted class; a higher kappa value means a better prediction and indicates a higher degree of agreement between the actual and predicted values. The kappa values of the various ML classifiers on the different feature-scaled datasets are presented in Table 10. While reviewing the kappa results of the feature-scaled Toddler dataset, it is observed that both the normalizer and MAS-scaled datasets provide the best kappa values and outperform the other FS methods. Consequently, both LR and LDA obtain the highest kappa value of 99.31% for the normalizer and MAS-scaled Toddler datasets.
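The chance-corrected agreement that kappa measures can be computed on toy labels (illustrative, not from the ASD datasets):

```python
# Cohen's kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed
# agreement and p_e is the agreement expected by chance.
from sklearn.metrics import cohen_kappa_score

y_true = [1, 1, 1, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0]

# Here p_o = 5/6 (five matching labels) and p_e = 1/2 from the
# marginals, so kappa = (5/6 - 1/2) / (1 - 1/2) = 2/3.
print(round(cohen_kappa_score(y_true, y_pred), 4))  # 0.6667
```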


FIGURE 9. MCC of the classifiers on different feature-scaled datasets.

TABLE 12. MCC of the different ML classifiers on ASD datasets.

Investigating the feature-scaled Children datasets, it is found that AB delivers the best kappa value of 93.78% using normalizer as the FS method. Moreover, inspecting the kappa results of the feature-scaled Adolescent datasets, we notice that LDA achieves the highest kappa value of 94.02% for both the QT and PT-scaled datasets. Finally, we analyze the outcomes of the feature-scaled Adult datasets and see that both LR and LDA deliver the highest kappa value of 99.02% while using QT and normalizer as the feature scaling methods. Besides, we also compare the kappa values of the various ML classifiers on the feature-scaled Toddlers, Children, Adolescents, and Adult datasets in Fig. 7.

H. ANALYSIS ON LOG LOSS
The log loss value indicates how close the prediction probability is to the true values; the lower the log loss value, the better the prediction. The log loss values of the various ML classifiers on the different feature-scaled datasets are presented in Table 11.


TABLE 13. Feature importance for the normalizer-scaled toddlers.

TABLE 14. Feature importance for the normalizer-scaled children.

While reviewing the log loss results of the feature-scaled Toddler and Children datasets, we observe that AB obtains the lowest log loss of 0.0802% and 0.98% for the normalizer-scaled Toddler and the QT and PT-scaled Children datasets, respectively. Furthermore, it is noticed that LDA achieves the lowest log loss of 1.12% for the QT, PT, and MAS-scaled Adolescent datasets. Finally, we analyze the outcomes of the feature-scaled Adult datasets and see that both LR and LDA deliver the lowest log loss value of 0.16% while using QT and normalizer as the feature scaling methods. Besides, we also compare the log loss values of the various ML classifiers on the feature-scaled Toddlers, Children, Adolescents, and Adult datasets in Fig. 8.

I. ANALYSIS ON MCC
MCC takes all the coefficients of the confusion matrix, i.e., TP, TN, FN, and FP, into consideration to calculate the degree of correlation; a higher MCC represents a better prediction and a stronger correlation between the actual and predicted classes. While reviewing the MCC results of the feature-scaled Toddler dataset, we observe that both LR and LDA obtain the highest MCC of 99.31% for the normalizer and MAS-scaled Toddler datasets. Investigating the feature-scaled Children datasets, it is found that AB delivers the best MCC value of 93.88% using normalizer as the FS method. Moreover, inspecting the MCC results of the feature-scaled Adolescent datasets, we notice that LDA achieves the highest MCC of 94.25% for both the QT and PT-scaled datasets. Finally, we analyze the outcomes of the feature-scaled Adult datasets and find that both LR and LDA deliver the highest MCC value of 99.03% while using QT as the feature scaling method. Besides, we also compare the MCC values of the various ML classifiers on the feature-scaled Toddlers, Children, Adolescents, and Adult datasets in Fig. 9.
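The four confusion-matrix terms combine into MCC as shown in this toy computation (labels are illustrative, not from the paper's data):

```python
# MCC = (TP*TN - FP*FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN))
from sklearn.metrics import matthews_corrcoef

y_true = [1, 1, 1, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0]   # TP=2, TN=3, FP=0, FN=1

# By hand: (2*3 - 0*1) / sqrt(2 * 3 * 3 * 4) = 6 / sqrt(72)
print(round(matthews_corrcoef(y_true, y_pred), 4))  # 0.7071
```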


TABLE 15. Feature importance for the QT-scaled adolescents.

TABLE 16. Feature importance for the QT-scaled adults.

IV. DISCUSSION AND EXTENDED COMPARISON
In the previous section, we analyzed four different ASD datasets to build prediction models for people at different stages of life. In order to do this, we applied various FS methods to those ASD datasets, classified them utilizing eight different simple but effective ML classifiers, and determined how the FS methods affect the classification performance. Furthermore, we also employed four different FSTs to compute the importance of the features that are most responsible for ASD prediction. Inspecting the experimental findings, the best-performing classifier models predicted ASD on the Toddlers, Children, Adolescents, and Adults datasets, respectively, with accuracies of AB (99.25%), AB (97.95%), LDA (97.12%), and LDA (99.03%); ROC values of AB, LR (99.99%), GNB (99.73%), AB, LDA (99.72%), and LDA (99.99%); F1-scores of AB (99.14%), AB (97.02%), AB (97.69%), and LDA (99.11%); precision values of AB (99.95%), LR (96.16%), DT (97.25%), and SVM (98.16%); recall values of AB (98.45%), LR (97.72%), AB (97.36%), and RF, DT, KNN, LR (100%); MCC values of LR, LDA (99.31%), AB (93.88%), LDA (94.25%), and LR, LDA (99.03%); kappa values of LR, LDA (99.31%), AB (93.78%), LDA (94.02%), and LR, LDA (99.02%); and log loss values of AB (0.0802%), AB (0.98%), LDA (1.12%), and LR, LDA (0.16%). After analyzing the experimental outcomes of the different classifiers on the feature-scaled ASD datasets, it is found that AB for Toddlers and Children and LDA for Adolescents and Adults outperformed the other ML classifiers in terms of classification performance. Besides, the experimental outcomes implied that the normalizer FS method for Toddlers, the normalizer FS method for Children, the QT FS method for Adolescents, and the QT FS method for Adults showed better performance. Additionally, we calculated the feature importance using the IGAE, GRAE, RFAE, and CAE FST methods


TABLE 17. Comparison with other works.

on the normalizer-scaled Toddlers, normalizer-scaled Children, QT-scaled Adolescents, and QT-scaled Adults to enumerate the risk factors for ASD prediction. The quantitative results are provided in Table 13, Table 14, Table 15, and Table 16. This feature importance analysis helps healthcare practitioners decide on the most important features while screening ASD cases. To this end, we provide the comparative results of our work with other recent studies in Table 17.

V. CONCLUSION
In this work, we proposed a machine-learning framework for ASD detection in people of different ages (Toddlers, Children, Adolescents, and Adults). We show that predictive models based on ML techniques are useful tools for this task. After completing the initial data processing, the ASD datasets were scaled using four different feature scaling techniques (QT, PT, normalizer, MAS) and classified using eight different ML classifiers (AB, RF, DT, KNN, GNB, LR, SVM, LDA). We then analyzed each feature-scaled dataset's classification performance and identified the best-performing FS and classification approaches. We considered different statistical evaluation measures, such as accuracy, ROC, F1-score, precision, recall, Matthews correlation coefficient (MCC), kappa score, and log loss, to justify the experimental findings. Consequently, our proposed prediction models based on ML techniques can be utilized as an alternative or even a helpful tool for physicians to accurately identify ASD cases in people of different ages. Additionally, the feature importance values were calculated to identify the most prominent features for ASD prediction by employing four different FSTs (IGAE, GRAE, RFAE, and CAE). Therefore, the experimental analysis of this research will allow healthcare practitioners to take into account the most important features while screening ASD cases. The limitation of our research work is that the amount of data was not sufficient to build a generalized model for people of all stages. In the future, we intend to collect more data related to ASD and construct a more generalized prediction model for people of any age to improve the detection of ASD and other neuro-developmental disorders.




MUHAMMAD IMRAN SHARIF received the B.S. and M.S. degrees in computer science from COMSATS University Islamabad, Wah Campus, Pakistan, in 2019 and 2021, respectively. His research interests include medical imaging, machine learning, computer vision, artificial intelligence, and pattern recognition.
S. M. MAHEDY HASAN received the B.Sc.
degree in computer science and engineering from
the Rajshahi University of Engineering and Tech-
nology (RUET), Bangladesh. He is currently serv-
ing as an Assistant Professor with the Department
of Computer Science and Engineering, RUET.
Before joining RUET, he was a Lecturer at the
Department of Computer Science and Engineer-
ing, Bangabandhu Sheikh Mujibur Rahman Sci-
ence and Technology University (BSMRSTU),
Bangladesh, in 2019. His research interests include computer vision, pattern
recognition, machine learning, deep learning, transfer learning, biomedical
engineering, bioinformatics, natural language processing, text mining, and
pedagogy.

ANWAAR ULHAQ received the Ph.D. degree in artificial intelligence from Monash University, Australia. He is currently working as a Senior Lecturer (AI) with the School of Computing, Mathematics, and Engineering, Charles Sturt University, Australia. He has developed national and international recognition in computer vision and image processing. His research has been featured 16 times in national and international news venues, including ABC News and IFIP (UNESCO). He is an Active Member of IEEE, ACS, and the Australian Academy of Sciences. As the Deputy Leader of the Machine Vision and Digital Health Research Group (MaViDH), he provides leadership in artificial intelligence research and leverages his leadership vision and strategy to promote AI research by mentoring junior researchers in AI and supervising HDR students, devising plans to increase research impact.

MD PALASH UDDIN (Member, IEEE) received the B.Sc. degree in computer science and engineering from Hajee Mohammad Danesh Science and Technology University (HSTU), Bangladesh, and the M.Sc. degree in computer science and engineering from the Rajshahi University of Engineering and Technology, Bangladesh. He is currently pursuing the Ph.D. degree with the School of Information Technology, Deakin University, Australia. He is also an Academic Faculty Member with HSTU. His research interests include machine learning, federated learning, blockchain, and remote sensing image analysis.
and leverages his leadership vision and strategy to promote AI research by
mentoring junior researchers in AI and supervising HDR students devising
plans to increase research impact.

MD AL MAMUN (Senior Member, IEEE) received the B.Sc. degree in computer science and engineering from the Rajshahi University of Engineering and Technology (RUET), Bangladesh, in 2005, and the Ph.D. degree in computer science from the University of New South Wales (UNSW), Canberra, Australia, in 2011. He is currently working as a Professor with the Department of Computer Science and Engineering, RUET. He has published more than 50 publications in several international journals and conferences. His research interests include satellite image mining (image compression, change detection, prediction and forecasting, and adaptive linear and non-linear modeling), computer vision (pattern recognition and image classification, object recognition, feature extraction, and nonlinear image classification), machine learning, and data mining. He has served as a Reviewer for various IEEE-sponsored conferences and journals, such as IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, and IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING. He attended several national and international conferences and served as a member of the organizing committee or technical program committee (TPC). He is also serving as the Director of the ICT Cell, RUET; an Executive Member of the IEEE Computer Society Bangladesh Chapter; an Adviser of the IEEE Computer Society Student Branch, RUET; and an Executive Member of the Robotic Foundation, Eastern Region, Bangladesh.

GOVIND KRISHNAMOORTHY is currently a Clinical Psychologist and a Senior Lecturer with the School of Psychology and Wellbeing, University of Southern Queensland, Australia. His research and clinical practice focus on improving mental health and educational outcomes for children and adolescents. He has collaborated with health services, schools, and community services in implementing place-based and systems approaches to support developmental disorders and mental health concerns in children, adolescents, and their families.
