Cit 5
Cit 5
Original Article
Received: 17 November 2021 Revised: 03 March 2022 Accepted: 10 April 2022 Published: 30 April 2022
Abstract - Autism spectrum disorder (ASD) is a neuro- persons. Still, individuals with ASD could converse,
developmental complaint that influences an individual’s interrelate, function, perform and hear in the approach part
communication, announcement, and knowledge talents. from mainly individual people. The knowledge, thoughts,
Analysis of Autism can be completed at any age-group level. and difficulty-resolving facility of persons with ASD can
Autism patients look at diverse kinds of disputes learning provide a choice from talented to more confronted. An
disabilities, and complexity with meditation. Mental health analysis of ASD contains a lot of provisions that applied to
problems, motor difficulties, and sensory problems are some be analyzed alone as autistic disorder syndrome. These
of the problems faced by Autism patients. Earlier diagnosis situations in named ASD.
and proper medication at the early stage are essential to
control ASD. The ASD prediction framework is built to ASD is a state connected to mind growth that forces how
support a behavioral aspect-based analysis model without a person recognizes and meet peoples, obtaining difficulty in
any device in this research. The ASD prediction process is public contact and dealings. The disorder also contains
focused on the childhood and adolescent analysis model genetic factors, environmental aspects, biological factors,
utilized in the system. The behavioral parameters are and inadequate and recurring behavior patterns. In the United
collected with the support of the Autism Query collections. States, more children's affected by Autism. Each person with
The decision tree (DT) and Support Vector Machine (SVM) Autism has different issues, like anxiety, seizures, and
techniques, K-Nearest Neighbors (KNN), and Artificial depression, as shown in Figure 1. Autism can be obtained at
Neural Network (ANN) are applied for the ASD prediction two or three and diagnosed as early as before 18 months.
process. The Correlated Feature selection based Random Early detection gives a good impact on the life of persons.
Forest (CFS-RT) algorithm is applied for the ASD prediction Persons with ASD may perform work, interconnect,
process, giving an accuracy of 93.03%, and ANN produces interrelate, and acquire behaviors that are changed from
97.68% and outperformance other methods. person to person. The skills of a person with ASD can differ
significantly. The person with ASD are issues with the social
Keywords - Autism Spectrum Disorder, Decision Tree, announcement and limited behaviors. The children have
Machine Learning, Data Mining, Support Vector Machine. varying habits of culture and give less attention. Diagnosing
ASD is also a problem because there is no proper medical
1. Introduction test or blood test to identify the disorder. Doctors see the
Autism spectrum disorder (ASD) is a progressive child's behavior and can start a diagnosis to increase the
disability that can benefit important basis community, quality of life.
contact, and behavioral issues. There are frequent tells how a
community with ASD looks that sets them together from new
Seizures
Depression Immune system function
ASD disturbs how an individual reacts, study, and states themselves. For parents of a child with ASD, significantly which
level the child has can help prepare them for issues their child might face in daily life. Figure 2 shows the level of ASD. The
first level needs help for every work, and the second level indicates the need for sustainable help. Three-level indicate the need
for more sustainable help and very problem to changing focus and actions.
As shown in Figure 1, a representative controlled learning difficulty includes attribute space W called features in the data
set, a label space Z, and a prediction space Z’. Classifier map each feature w to one of the already defined class labels z.
A training sample set is illustrated as Y = ((w1,z1),…,(wm,zm)) Є (W ×Z)m that consists of prearranged class labels, where
each wi Є Z. The output is a model Hs: W Z’ that learns from the sample set These approaches find the class of new
instances, Z'= Z = {±1} and input samples are a map with two labels as Z = {±1} = {-1, +1} in binary classification.
344
Dr. R. Surendiran et al. / IJETT, 70(4), 343-359, 2022
345
Dr. R. Surendiran et al. / IJETT, 70(4), 343-359, 2022
obtain 87.50%. This model used the naïve Bayes algorithm discover ASD with behavioral features from different age
and did not train more than one model. Chelsea et al. [29] levels.
reviewed the state of MLCand ASD with the help of some
questions and answered for quires. The authors use the ML Specific Objectives
model with MRI and fMRI attributes to know the neuronal • To design a system for Autism Spectrum Disorder
behavior changes in ASD to predict the disease earlier [30- (ASD) discovery with behavioral analysis.
31]. • To perform age group based analysis on the ASD
discovery process
From existing research, Autism screening tests are a • To design the Correlation-based Feature Selection
huge cost and not a time-saving one. The behavioral with Random Forest (CFS-RT) technique for the
differences are not measured in the device-dependent ASD ASD prediction process
prediction process. Technical people are required to handle • To eliminate device and technician dependency in
the data collection operations. Age group-based analysis is the ASD detection process
not supported in the ASD prediction process. • To minimize cost and time in the ASD prediction
operations
3. Problem Definition
The AI and ML techniques are used in autism detection. 4. Proposed System Methodology
The neuroimaging and kinematic data are used to discover 4.1 ASD Detection through Behavioral Features using
the Autism Spectrum ASD levels. The ASD classification is Decision Trees
carried out with electroencephalogram (EEG) signal Adult autism detection is designed to analyze the
dispensation and knowledge approaches. The EEG is a behavioral attributes of patients above 18 years. The
consistent biomarker for diagnosing ASD abnormalities. The behavioral features are gathered with the help of a
electrodes attached to the scalp are used to detention the EEG questionnaire. The Adult Questioner (AQ10) data set is get
signals. The mobile applications collect answers from the from the University of California, Irwin ML repository. Each
people for the Autism Questions. The MLg-based methods person is diagnosed with 10 questions. The ASD is identified
are applied to predict ASD at any age-group level. The using the responses given by the individual. The decision
Random Forest-CART (Classification and Regression Trees) tree classification is applied to discover the disease levels.
algorithm predicts ASD traits. The Random Forest-ID3 Figure 4 shows the proposed system architecture, and a
(Iterative Dichotomiser 3) algorithm is applied to perform the detailed view is shown in Fig. 5.
ASD prediction process. The Correlation Feature Selection
based Random Forest (CFS-RF) technique is built to
Adult ASD Data
Childhood ASD Data
Prediction Results
346
Dr. R. Surendiran et al. / IJETT, 70(4), 343-359, 2022
The children dataset and the adolescent dataset is stored in the data repository. It contains missing and incorrect values.
This can be preprocessed and get relevant data to build the ML classification models after preprocessing, splitting the operation
as testing and training to generate the classifier to detect the Autism of children and adolescents. The proposed system uses DT,
SVM, and Random Forest models to generate a good model to predict Autism in the early stage. In classification, 10-fold
cross-validation is applied to estimate the model.
ASD
Dataset
Preprocessing
(Filling Missing Values)
Splitting operation
Autism discovery
347
Dr. R. Surendiran et al. / IJETT, 70(4), 343-359, 2022
Algorithm steps to create the decision tree in R- print("Class probability rate analysis for ASD training data
Environment set")
For creating decision tree, first load from database usin R prop.table(table(TrainData$V21)) * 100
tool as follows print("Class probability rate analysis for ASD testing data
# Load the ASD Data from datasets set")
# Attribute Relationship File Format (ARFF) data prop.table(table(TestData$V21)) * 100
ASDData <- read.csv("ASDData.arff", header = FALSE, TrainData[["V21"]] <- factor(TrainData[["V21"]])
comment.char = "@") #Matrix construction with penalty
print("All ASD data") penalty.matrix <- matrix(c(0,1,10,0), byrow=TRUE,
print(paste("Number of Rows : ",nrow(ASDData))) nrow=2)
print(paste("Number of Columns : ",ncol(ASDData))) # Building the decision tree
print(" Attribute Details") #Classification Tree construction
print("1. Answer for Question - 1") tree <- rpart(TrainData$V21~., data=TrainData, parms =
print("2. Answer for Question - 2") list(loss = penalty.matrix), method = "class")
.................................................. # visualization Tree process
.................................................... rpart.plot(tree, nn=TRUE)
print("19. Used the screening app befosinre") # ASD prdiction using the tree model
print("20. Screening Method Type") TestPredict <- predict(object=tree,TestData,type="class")
("21. Screening Score") TestPredict
View the data by the command as summary(ASDData) # Performance of the aid
Missing value analyzed using the following coding # Build Confussion Matrix
ASDData[, 13][ASDData[, 13] == '?'] <- NA confusionMatrix(table(TestPredict, TestData$V21))
ASDData[, 20][ASDData[, 20] == '?'] <- NA print(paste("Time : ",Sys.time()))
missmap(ASDData) Options (warn=0)
Next step is data cleaning process to remove inconsistent
data Figure 6 shows the workflow of the decision tree to
# Data cleaning process discover Autism in children and adults data. The input
Assign alternate values to missing values dataset is given to the classifier for modeling. Split the
<- sum(is.na(ASDData[1])) dataset using information gain. The highest information gain
(paste("Missing values in Variable 1 : ",mcount)) gives the best solution. Its result is positive, and it predicts
mcount <- sum(is.na(ASDData[2])) that the patient may have ASD based on reading the given
................ questionnaires. Otherwise patient doesn’t suffer from any
----------------- autistic spectrum disorder. Figure 7 shows the flow of the
Then data optimization process by the following command SVM classifier for autism discovery. In this model, "C “ is
ASDData <- ASDData[c(1:12,14,32,21)] the number of misclassified samples. It contains a small
# Data Factorization - Categorical data coversion for class value and no weightage for misclassification. Compute the
attribute loss function using equation 4. Figure 8 shows the autism
# Data spliting for training and testing process detection by the Random Forest classifier of the machine
DataMat <- createDataPartition(y = ASDData$V21, p = 0.5, learning model. It combines all the results and aggregate
list = FALSE) results based on majority voting.
TrainData <- ASDData[DataMat, ]
TestData <- ASDData[-DataMat, ] L=1/2||W||^2+C (4)
# Dimensions of the split
print("Class probability rate analysis for ASD data set")
prop.table(table(ASDData$V21)) * 100
348
Dr. R. Surendiran et al. / IJETT, 70(4), 343-359, 2022
Start
Present the user with the option of answering 10 questions based on the extracted
features to be used for Autism Detection
Yes
Predict that patient may Is the result classified as
have Autism positive for Autism?
No
The patient mostly does not suffer from any Autistic Spectrum Disorder
End
349
Dr. R. Surendiran et al. / IJETT, 70(4), 343-359, 2022
Autism Datasets
350
Dr. R. Surendiran et al. / IJETT, 70(4), 343-359, 2022
Training
data 1 Training
data n
Training Training
data 2 data 3
1 1 0
1
Voting
[Averaging]
Predict
The ML algorithms are practiced to perform the ASD prediction process. The Correlated Feature selection (CFS) based
Random Forest algorithm is applied for the disease prediction process. The prediction scheme is associated with the Naïve
Bayes and Support Vector Machine (SVM) classifiers.
351
Dr. R. Surendiran et al. / IJETT, 70(4), 343-359, 2022
352
Dr. R. Surendiran et al. / IJETT, 70(4), 343-359, 2022
• tp – Quantity of optimistic instances in the trial or experimental dataset that are categorized as positive
• fp – Quantity of undesirable instances in the trial or experimental dataset that are categorized as positive
• tn – Quantity of undesirable instances in the trial or experimental dataset that are categorized as negative
• fn – Quantity of optimistic instances in the trial or experimental dataset that are categorized as negative
Accuracy of the classifier: Accuracy is repeatedly used result metric for prediction. Obtaining more accuracy in the model in
the ML approach is the highest aspect of education (5).
𝑓𝑝+𝑓𝑛
Classification Error = (5)
𝑡𝑝+𝑡𝑛+𝑓𝑝+𝑓𝑛
353
Dr. R. Surendiran et al. / IJETT, 70(4), 343-359, 2022
Table 3. FPR Analysis between Decision Tree Classifier (DTC), SVM, and CFS-RF Techniques
Records DTC (%) SVM (%) CFS-RF (%)
220 9.48 7.92 4.74
440 8.88 7.64 4.44
660 8.36 7.14 4.18
880 7.64 6.56 3.82
1100 6.92 6.28 3.46
False Positive Rate Analysis between Decision Tree Classifier (DTC), SVM and CFS-RF
Techniques
9.48
10 7.92
False Positive Rate ( %)
8.88 7.64
8.36 7.14
8 4.74
4.44 7.64 6.56
4.18 6.28
6 3.82 6.92
3.46
DTC
4
SVM
2
CFS-RF
0
220
440
660
880
1100
Records
Fig. 11 FPR Analysis between Decision Tree Classifier (DTC), SVM, and CFS-RF Techniques
354
Dr. R. Surendiran et al. / IJETT, 70(4), 343-359, 2022
False Negative Rate Analysis between Decision Tree Classifier (DTC), SVM and
CFS-RF Techniques
12
10.04
False Negative Rate (%)
10 7.56 9.32
7.34 8.61 7.04
5.02
8 4.66 7.96 6.72
4.32 6.38
3.98 7.08
6 DTC
3.52
4 SVM
CFS-RF
2
0
220
440
660
880
1100
Records
Fig. 12 False Negative Rate Analysis between Decision Tree Classifier (DTC), SVM, and CFS-RF Techniques
Table 5. Prediction Accuracy Analysis between Decision Tree Classifier (DTC), SVM, and CFS-RF Techniques
Records DTC (%) SVM (%) CFS-RF (%)
220 80.48 84.52 90.24
440 82.19 85.01 90.89
660 83.03 85.82 91.49
880 84.41 86.72 92.21
1100 86.02 87.34 93.03
355
Dr. R. Surendiran et al. / IJETT, 70(4), 343-359, 2022
Prediction Accuracy Analysis between Decision Tree Classifier (DTC), SVM and
CFS-RF Techniques
75 CFS-RF
70
220
440
660
880
1100
Records
Fig. 13 Prediction Accuracy Analysis between Decision Tree Classifier (DTC), SVM, and CFS-RF Techniques
356
Dr. R. Surendiran et al. / IJETT, 70(4), 343-359, 2022
12
11.5
Prediction Time (s)
0
220
440
660
880
1100
Records
Fig. 15 Prediction Time Analysis between DTC, SVM, and CFS-RF Techniques
357
Dr. R. Surendiran et al. / IJETT, 70(4), 343-359, 2022
References
[1] M. Thangamani, R. Vijayalakshmi, M. Ganthimathi, Rajita, P. Malarkodi, S. Nallusamy, Efficient classification of heart disease using
K-Means clustering Algorithm, International Journal of Engineering Trends and Technology, 68(12) (2020) 48-53.
[2] N. Suresh Kumar, M. Thangamani, V. Sasikumar S. Nallusamy, An Improved Machine Learning Approach for Predicting Ischemic
Stroke, An Improved Machine Learning Approach for Predicting Ischemic Stroke, 68(1)(2021) 111-115.
[3] Thangamani M., Ganthimathi M., Sridhar S.R., Akila M., and Keerthana R. Engineering, Detecting Coronavirus Contact using Internet
of things., International Journal of Pervasive Computing and Communications, 16(5) (2020) 447-456. https://doi.org/10.1108/IJPCC-
07-2020-0074.
[4] Dilantha Haputhanthri, Gunavaran Brihadiswaran, Sahan Gunathilaka, Dulani Meedeniya, and Sampath Jayarathna, An EEG based
Channel Optimized Classification Approach for Autism Spectrum Disorder, Moratuwa Engineering Research Conference (Mercon),
(2019).
[5] Surendiran, R., and Alagarsamy,K., 2010. Skin Detection Based Cryptography in Steganography (SDBCS) .International Journal of
Computer Science and Information Technologies (IJCSIT), 1(4) (2010) 221-225.
[6] Supriya H S, Sumanth Alva R*, Suprith K P, Vikas Kumar L, and P Kyshan Neheeth, applying supervised learning technique to
diagnose autism spectrum disorder (asd), International Journal of Recent Scientific Research , 11(5b) (2020) 38439-38441.
DOI: http://dx.doi.org/10.24327/ijrsr.2020.1105.5312.
[7] Inon Wiratsin, L. Narupiyakul, Feature Selection Technique for Autism Spectrum Disorder, Proceedings of the 5th International
Conference on Control Engineering and Artificial Intelligence, DOI, 10.1145/3448218.3448241, (2021).
[8] Surendiran, R., 2017. Development of Multi-Criteria Recommender System, SSRG International Journal of Economics and
Management Studies (IJEMS) ISSN: 2393 - 9125, 4(1) (2017) 28-33.
[9] Daniel Bone, Somer Bishop, Matthew P. Black, and Shrikanth S. Narayanan, Use of machine learning to improve autism screening and
diagnostic instruments: effectiveness, efficiency, and multi-instrument fusion, Journal of Child Psychology and Psychiatry, (2016).
[10] Lucia Billeci, Antonio Narzisi, Alessandro Tonacci, Beatrice Sbriscia-Fioretti, Luca Serasini, Francesca Fulceri, Fabio Apicella,
Federico Sicca, Sara Calderoni & Filippo Muratori An integrated EEG and eye-tracking approach for the study of responding and
initiating joint attention in Autism Spectrum Disorders, Scientific Reports, 7 (2017) 1-13.
[11] Filippo Muratori, Lucia Billeci, Sara Calderoni, Maria Boncoddo, Caterina Lattarulo, Valeria Costanzo, Marco Turi, Costanza Colombi
and Antonio Narzisi, How Attention to Faces and Objects Changes Over Time in Toddlers with Autism Spectrum Disorders:
Preliminary Evidence from An Eye-Tracking Study, Brain science, MDPI, 9(344) (2019) 1-11.doi:10.3390/brainsci9120344.
[12] S. Jayarathna, Y. Jayawardana, M. Jaime, and S. Thapaliya, Electroencephalogram (EEG) for Delineating Objective Measure of Autism
Spectrum Disorder, in Computational Models for Biomedical Reasoning and Problem Solving, IGI Global, (2019) 34-65.
[13] S. Thapaliya, S. Jayarathna, and M. Jaime, Evaluating the EEG and Eye Movements for Autism Spectrum Disorder, in IEEE
International Conference on Big Data (Big Data), (2018) 2328-2336.
[14] Surendiran, R., Similarity Matrix Approach in Web Clustering .Journal of Applied Science and Computations (JASC),5(1) (2018) 267-
272.
[15] Narzisi, A. Posada, M. Barbieri, F. Chericoni, N. Ciuffolini, D. Pinzino, M. Romano, R. Scattoni, M.L.Tancredi, R. Calderoni, S.,
Prevalence of autism spectrum disorder in a large Italian catchment area: A school-based population study within the ASDEU project.
Epidemiol. Psychiatr. Sci. (2018).
[16] E. Grossi, C. Olivieri, and M. Buscema, Diagnosis of autism through EEG processed by advanced computational algorithms: A pilot
study, Comput. Methods Programs Biomed., 142 (2017) 73-79.
[17] W. J. Bosl, H. Tager-Flusberg, and C. A. Nelson, EEG Analytics for Early Detection of Autism Spectrum Disorder: A data-driven
approach, Scientific Reports, 8(1) (2018) 1-20.
[18] Zhong Zhao, Xiaobin Zhang, Wenzhou Li, Xinyao Hu, Xingda Qu, Xiaolan Cao, Yanru Liu, and Jianping Lu, Applying Machine
Learning to Identify Autism With Restricted Kinematic Features, IEEE Access, (2019).
[19] J. Baio et al., Prevalence of Autism Spectrum Disorder Among Children Aged 8 Years - Autism and Developmental Disabilities
Monitoring Network, 11 Sites, United States, 2014, MMWR. Surveillance Summaries, 67(6) (2018) 1-23.
[20] Surendiran, R., Rajan, K.P. and Sathish Kumar, M., Study on the Customer targeting using Association Rule Mining. International
Journal on Computer Science and Engineering, 2(7) (2010) 2483-2484.
[21] Charlotte Küpper, Sanna Stroth, Nicole Wolff, Florian Hauck, Tanja Schad-Hansjosten, Luise poustka, Veit Roessner, Katharina
Schultebraucks & Stefan Roepke, identifying predictive features of autism spectrum disorders in a clinical sample of adolescents and
[22] Adults using machine learning, Scientific report, nature research, 10(4805) (2020). doi.org/10.1038/s41598-020-61607-w.
[23] NaziaJassim, Simon Baron-Cohen, and John Suckling, Meta-analytic evidence of differential prefrontal and early sensory cortex activity
during non-social sensory perception in Autism, Neuroscience and Biobehavioural Reviews, Elsevier, 127 (2021) 146-157.
[24] Suman Raj and Sarfaraz Masood, Analysis and Detection of Autism Spectrum Disorder Using Machine Learning Techniques, Procedia
Computer Science, Elsevier, 167 (2020) 994-1004.
[25] Judith Charpentier, Marianne Latinus, Frederic Andersson, Agathe Saby, Jean-Philippe Cottier, Frederique Bonnet-Brilhault,
EmmanuelleHouy-Durand, and Marie Gomot, Brain Correlates of Emotional Prosodic Change Detection in Autism Spectrum Disorder,
Neuro Image: Clinical, Elsevier, 28 (2020) 102512.
[26] Surendiran, R. and Alagarsamy, K., PCA-based geometric modeling for automatic face detection. Int. J. Comput. Sci. Inform.
Technol, 1(4) (2010) 221-225.
[27] Jennie Hayes, Tamsin Ford, Hateem Rafeeque, Ginny Russell, Clinical practice guidelines for the Diagnosis of Autism Spectrum
Disorder in Adults and Children in The UK: a narrative review, BMC Psychiatry, 13(18) (2018) doi: 10.1186/s12888-018-1800-1.
358
Dr. R. Surendiran et al. / IJETT, 70(4), 343-359, 2022
[28] Mengyi Liao, Hengyao Duan, and Guangshuai Wang, Application of Machine Learning Techniques to Detect the Children with Autism
Spectrum Disorder, Hindawi Journal of Healthcare Engineering, 2022 (2022) 1-10,
Article ID 9340027.https://doi.org/10.1155/2022/9340027.
[29] Chelsea M. Parlett-Pelleriti, Elizabeth Stevens, Dennis Dixon, Erik J. Linstead, Applications of Unsupervised Machine Learning in
Autism Spectrum Disorder Research: a Review, Journal of Autism and Developmental Disorders, (2022) 1-17.
https://doi.org/10.1007/s40489-021-00299.
[30] Cooper J. Mellema 1,2, Kevin P. Nguyen 1,2, Alex Treacher 1 & Albert Montillo, Reproducible neuroimaging features for diagnosis of
Autism Spectrum Disorder with Machine Learning, Scientific Reports, 0123456789 (2022) 1-13.https://doi.org/10.1038/s41598-022-
06459-2.
[31] Yinghan Zhu, Hironori Nakatani, Walid Yassin, Norihide Maikusa, Naohiro Okada, Akira Kunimatsu, Osamu Abe, Hitoshi Kuwabara,
Hidenori Yamasue, Kiyoto Kasai, Kazuo Okanoya, and Shinsuke Koike, Application of a Machine Learning Algorithm for Structural
Brain Images in Chronic Schizophrenia to Earlier Clinical Stages of Psychosis and Autism Spectrum Disorder: A Multiprotocol Imaging
Dataset Study, Schizophrenia Bulletin, 10 (2022) 1-12.
[32] N. Sciaraffa, G. Borghini, P. Aricò, Joint Analysis of Eye Blinks and Brain Activity to Investigate Attentional Demand during a Visual
Search Task, Brain Science, MDPI, 11(562) (20210) 1-20. DOI:10.3390/brainsci11050562.
[33] Bram van den Bekerom, Using Machine Learning for Detection of Autism Spectrum Disorder, IEEE, (2017).
[34] Fadi Thabtah, Autism Spectrum Disorder Screening: Machine Learning Adaptation and DSM-5 Fulfillment, ACM, (2017).
[35] Rasool Azeem Musa, Mehdi Ebadi Manaa2, and Ghassan Abdul-Majeed Predicting Autism Spectrum Disorder (ASD) for Toddlers and
Children Using Data Mining Techniques, Journal of Physics: Conference Series, IOP publishing,
doi:10.1088/1742-6596/1804/1/012089, 1804 (2021).
[36] Antoine Frigaux ,Joëlle Lighezzolo-Alnot , Jean-Claude Maleval and Renaud Evrard, Differential diagnosis on the Autism Spectrum:
Theorizing an Ordinary Autism, L’evolution ´ psychiatrique, Elsevier, 86(2021) e1–e24.
359