Autism Report-Grp
A Project Report On
submitted by
Rakshitha V [1DB18CS104]
Sirisha M N [1DB18CS138]
Sneha H R [1DB18CS140]
Tejaswini D S [1DB18CS149]
CERTIFICATE
This is to certify that the project Phase-II report entitled “Detection Of Autism Spectrum
Disorder In Early Stage Using Machine Learning Model” is a bonafide report of work carried out
by Rakshitha V [1DB18CS104], Sirisha M N [1DB18CS138], Sneha H R [1DB18CS140]
and Tejaswini D S [1DB18CS149], students of Don Bosco Institute of Technology,
Bangalore, in partial fulfillment for the award of the degree of Bachelor of Engineering in
Computer Science and Engineering of Visvesvaraya Technological University, Belagavi,
during the academic year 2021-22. The project Phase-II report has been approved as it
satisfies the academic requirements in respect of the Project Phase-II prescribed for the
Bachelor of Engineering Degree.
External Viva
Name of Examiner Signature with date
VISVESVARAYA TECHNOLOGICAL UNIVERSITY
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
DON BOSCO INSTITUTE OF TECHNOLOGY
Mysore Road, Kumbalgodu, Bengaluru-560074
DECLARATION
The satisfaction and euphoria that accompany the successful completion of any project would be
incomplete without the mention of the people who made it possible, whose constant guidance and
encouragement made our effort fruitful.
First and foremost, we ought to pay our due regards to Don Bosco Institute of Technology,
which provided us a platform and gave an opportunity to display our skills through the
medium of project work. We express our heartfelt thanks to our beloved principal Prof. B S
UMASHANKAR, DBIT, Bangalore for his encouragement all through our graduation life
and providing us with the infrastructure.
We express our deep sense of gratitude and thanks to Dr. K B SHIVA KUMAR, Head of
the Department, Computer Science and Engineering for extending his valuable insight
and suggestions offered during the course.
It is our utmost pleasure to acknowledge the kind help extended by our guide Mr.
Mohammed Kaleem, Assistant Professor, Department of Computer Science and
Engineering, for guidance and assistance which consequently resulted in getting the project
phase II work completed successfully.
Last but not least, we would like to thank the teaching and non-teaching staff for the
cooperation they extended during the completion of the project Phase-II.
Rakshitha V [1DB18CS104]
Sirisha M N [1DB18CS138]
Sneha H R [1DB18CS140]
Tejaswini D S [1DB18CS149]
ABSTRACT
Early diagnosis of ASD helps a person to have a better quality of life by enabling thorough care and therapy.
However, in many developed countries, diagnosis takes too much time. Besides, trained medical
experts or specialists are required to identify autism spectrum disorder, as there are no direct medical
tests. At present, many techniques are used to evaluate autism in general, without identifying
unique or specific characteristics. In this project work, Machine Learning algorithms such as the
Naïve Bayes classifier and the SVM (Support Vector Machine) classifier have been utilized for
detecting autism. We achieved the highest accuracy of 100% with the SVM classifier and 95%
accuracy with the Naïve Bayes classifier. Therefore, a prediction model that can reliably detect ASD is
proposed.
TABLE OF CONTENTS
Acknowledgement
Abstract
Table of Contents
List of Figures
1.4 Objectives
1.5 Problem Statement
2. LITERATURE SURVEY
3. DESIGN AND IMPLEMENTATION
5. CONCLUSION
REFERENCES
LIST OF FIGURES
DETECTION OF AUTISM SPECTRUM
DISORDER IN EARLY STAGE USING
MACHINE LEARNING MODEL
DETECTION OF AUTISM SPECTRUM DISORDER 18CSP83
CHAPTER 1
INTRODUCTION
1.1 OVERVIEW
Autism spectrum disorder (ASD) is a neurodevelopmental disorder that affects a person’s interaction,
communication and learning skills. Although autism can be diagnosed at any age, its
symptoms generally appear in the first two years of life and develop over time. People with autism
face different types of challenges, such as difficulties with concentration, learning disabilities,
mental health problems such as anxiety and depression, motor difficulties, sensory problems and
many others.
The prevalence of autism around the world is high and is increasing rapidly.
According to WHO, about 1 out of every 160 children has ASD. Some people with this
disorder can live independently, while others require life-long care and support.
Diagnosis of autism requires a significant amount of time and cost. Earlier detection of autism can
be of great help, since patients can be prescribed proper medication at an early stage. It can prevent
the patient’s condition from deteriorating further and helps to reduce the long-term costs
associated with delayed diagnosis. Thus, a time-efficient, accurate and easy-to-use screening tool is
very much required, one that predicts autism traits in an individual and identifies whether or not
they require a comprehensive autism assessment.
The objective of this work is to propose an autism prediction model using ML techniques and to
develop a mobile application that could effectively predict autism traits of an individual of any age.
In other words, this work focuses on developing an autism screening application for predicting the
ASD traits among people of age groups 4-17 years and for people of age 18 and more.
Chatter Bot: A chatbot is a software application used to conduct an online chat conversation via text
or text-to-speech, in lieu of providing direct contact with a live human agent. Designed to
convincingly simulate the way a human would behave as a conversational partner, chatbot systems
typically require continuous tuning and testing, and many in production remain unable to
adequately converse or pass the industry standard Turing test.
1.4 OBJECTIVES
The objective of this work is to propose an autism prediction model using ML techniques that
could effectively predict autism traits of an individual of any age. In other words, this work focuses
on developing an autism screening application for predicting the ASD traits among people of age
groups 4-11 years, 12-17 years and for people of age 18 and more.
To propose an autism prediction model using ML Techniques, that could effectively predict
autism traits of an individual of any age group.
Learning the characteristics of children with autism.
Early identification of the stages of autism.
Diagnosis using improvement of autism child.
Autism spectrum disorder is a neurodevelopmental disorder that affects a person’s interaction,
communication and learning skills. Although autism can be diagnosed at any age, its
symptoms generally appear in the first two years of life and develop over time. People with autism
face different types of challenges, such as difficulties with concentration, learning disabilities,
mental health problems such as anxiety and depression, motor difficulties, sensory problems and
many others. However, in many countries, diagnosis comes too late and involves a lengthy
procedure.
To simplify this procedure, an autism prediction model has been created using Machine Learning
algorithms such as the Naïve Bayes and SVM classifiers.
CHAPTER 2
LITERATURE SURVEY
Guannan Li et al. [2] proposed a volume-based analysis of the amygdala and hippocampal
subfields of infant subjects at risk of ASD at around 24 months of age. Specifically, to
address the challenge of low tissue contrast, they propose a novel deep-learning approach, a
dilated-dense U-Net, to automatically segment the amygdala and hippocampal subfields.
Fitrilia Susanti et al. [3] note that children with autism have a communication disorder that makes it
difficult for them to interact and communicate with their environment, both verbally and
non-verbally. To help with this, visual communication learning is needed, because children with autism
are better at receiving information visually than orally. The visual communication learning application
for children with autism, developed in digital media, uses the Picture Exchange
Communication System (PECS) method.
Leslie Mertz [4] notes that trained clinicians are able to diagnose autism by the time a child is
two years old, and recognized beneficial therapies for the child can then be pursued at once.
Unfortunately, the author points out that the average age of diagnosis of autism in the United States
is 4.3 years. The late diagnosis is due in large part to a lack of qualified professionals to conduct
the qualitative assessment and evaluate children accurately, including those who have milder
forms of ASD.
Osman Altay and Mustafa Ulas [5] applied classification techniques to ASD diagnosis in
children aged 4 to 11. For classification, the K-Nearest Neighbor (KNN) and Linear Discriminant
Analysis (LDA) algorithms are used. Seventy percent of the data set was chosen as training data and
thirty percent as test data to evaluate the algorithms. In this work, the LDA method achieved
90.8 percent accuracy, while the KNN approach achieved 88.5 percent accuracy. The sensitivity and
specificity values for the LDA algorithm are 0.9524 and 0.8667, respectively. For the KNN method,
Elizabeth Stevens et al. [6] performed a cluster analysis on a sample of 2,116 children with
ASD in order to find patterns of challenging behaviours observed in centre and home-based
clinical settings. The results indicate that, while the presence of multiple challenging behaviours is
common, in most cases one behaviour emerges as the leading behaviour.
Airi Tsuji et al. [7] studied and created dynamic interpersonal distance models and real-time
recognition systems for children with autism spectrum disorders (ASD). Specifically, they
quantified the chasing behaviours observed during therapy. Chasing is a highly social activity, since the
child has to anticipate the movement of a partner (the therapist).
Cheol-Hong Min [8] established a framework to detect, record and label different behavioural patterns of
children with autism spectrum disorder (ASD). The framework integrates two different
sensor platforms, one wearable and one static.
Elena Pattini et al. [9] examined the differences in compassion and coping
between mothers of children aged 3 to 11, divided into two distinct groups: an “ASD group” consisting
of mothers of children diagnosed with autism spectrum disorder and a “control group” consisting of
mothers of children with typical development.
Zhi Zheng et al. [10] proposed ASOTS, a closed-loop autonomous computer system for
developing social orienting skills in young children with ASD. The system is designed
to detect and track a child's attention in response to social orienting bids, and to
assist the child in achieving correct social orienting when necessary.
CHAPTER 3
DESIGN AND IMPLEMENTATION
System architecture is a conceptual model that defines the structure and behavior of the system.
The architecture comprises the system components and the relationships between them, describing how
they work together to implement the system as a whole. The proposed system architecture is shown in
Fig.1.
Data synthesis, in the context of information and data, refers to analysing what we've learnt from
our data collection and making a choice, as well as developing a program, strategy, or method
based on the information analysed. To put it another way, data synthesis is the process of
combining ideas from various sources to produce something new that suits the demands of the
community and population we're working with. Any service's function revolves around data
collection and analysis. Good data is information that is accurate, dependable, and reflects what is
truly going on in the service, the subject you're researching, or the community as a whole. For
data to be valuable, we must have faith in its accuracy. Another crucial feature of a good
measurement tool is its validity. Validity refers to the tool's ability to measure what it claims to
measure. The process of validating a data gathering tool or scale is determining the level of
confidence with which we make judgments about persons using the scale.
Dept of CSE, DBIT 2021-2022 Page 8
The training dataset often consists of pairs of an input vector (or scalar) and the corresponding
output vector (or scalar), where the answer key is commonly denoted as the target (or label).
The current model is run on the training dataset and its result is compared with the target for
each input. Based on the result of the comparison and the specific learning algorithm being used, the
parameters of the model are adjusted. The test dataset is a dataset used to provide an unbiased
evaluation of a final model fit on the training dataset. It is independent of the training dataset but
follows the same probability distribution as the training dataset.
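As an illustration of the training and test datasets described above, the following minimal Python sketch divides a labelled dataset into two disjoint subsets. The 70/30 ratio, the fixed seed, the toy records and the function name `train_test_split` are assumptions made for this example; the report does not state how the project's own split was performed.

```python
import random


def train_test_split(samples, labels, test_fraction=0.3, seed=42):
    """Shuffle the (sample, label) pairs and split them into train and test parts."""
    if len(samples) != len(labels):
        raise ValueError("samples and labels must have the same length")
    indices = list(range(len(samples)))
    random.Random(seed).shuffle(indices)  # reproducible shuffle
    n_test = int(round(len(indices) * test_fraction))
    test_idx, train_idx = indices[:n_test], indices[n_test:]
    X_train = [samples[i] for i in train_idx]
    y_train = [labels[i] for i in train_idx]
    X_test = [samples[i] for i in test_idx]
    y_test = [labels[i] for i in test_idx]
    return X_train, X_test, y_train, y_test


# Ten toy records (assumed data): one input value each, with a binary target.
X = [[i] for i in range(10)]
y = [i % 2 for i in range(10)]
X_train, X_test, y_train, y_test = train_test_split(X, y)
```

Because every record lands in exactly one subset, the test data stays independent of the data used to adjust the model parameters.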
Prediction models are intended to aid healthcare professionals and patients in making decisions
about diagnostic tests, therapy initiation and discontinuation, and lifestyle changes. While they
cannot replace clinical experience, they can provide objective information about a person's illness
risk and help to avoid some frequent biases in clinical decision-making. Biases in the way data is
obtained or filtered for use by the model, on the other hand, might add other forms of biases,
therefore the underlying data and cohort selection are critical. Here, we have used two algorithms
in order to develop the prediction model:
Bernoulli Naive Bayes algorithm
SVM algorithm
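The report does not describe the SVM configuration that was used, so the following is only a minimal sketch of the general idea: a linear SVM trained by sub-gradient descent on the regularised hinge loss. The toy data, the hyperparameters (`lam`, `lr`, `epochs`) and the function names are assumptions made for this example and are not taken from the project.

```python
def train_linear_svm(X, y, lam=0.01, lr=0.01, epochs=500):
    """Fit weights w and bias b by sub-gradient descent on lam/2 * |w|^2 + hinge loss.

    Labels in y must be +1 or -1.
    """
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            if margin < 1:
                # Point is misclassified or inside the margin: hinge term is active.
                w = [wj - lr * (lam * wj - yi * xj) for wj, xj in zip(w, xi)]
                b += lr * yi
            else:
                # Only the regularisation term contributes to the gradient.
                w = [wj - lr * lam * wj for wj in w]
    return w, b


def predict_svm(w, b, x):
    """Return +1 or -1 depending on the side of the separating hyperplane."""
    score = sum(wj * xj for wj, xj in zip(w, x)) + b
    return 1 if score >= 0 else -1


# Toy, linearly separable data (assumed): +1 and -1 stand for the two classes.
X = [[4, 4], [5, 4], [4, 5], [0, 0], [1, 0], [0, 1]]
y = [1, 1, 1, -1, -1, -1]
w, b = train_linear_svm(X, y)
predictions = [predict_svm(w, b, x) for x in X]
```

On data that is not linearly separable, a kernel SVM or a library implementation would normally be preferred over this hand-written version.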
The appropriate performance evaluation depends on whether the task is a classification problem or a
regression problem. Different metrics are needed to evaluate the model depending on which of these
two groups the project belongs to.
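For a classification task, common metrics are accuracy, sensitivity and specificity, all of which follow from the confusion-matrix counts. The sketch below shows one way to compute them in Python. The label vectors are made-up illustration data, not results from this project, and the function names are hypothetical.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary classifier."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive and truth == positive:
            tp += 1
        elif pred == positive:
            fp += 1
        elif truth == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, tn, fn


def classification_metrics(y_true, y_pred, positive=1):
    """Return accuracy, sensitivity (recall) and specificity as a dict."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred, positive)
    total = tp + fp + tn + fn
    return {
        "accuracy": (tp + tn) / total,
        "sensitivity": tp / (tp + fn) if (tp + fn) else 0.0,
        "specificity": tn / (tn + fp) if (tn + fp) else 0.0,
    }


# Made-up labels: 1 = ASD traits predicted/present, 0 = not.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 1, 0]
metrics = classification_metrics(y_true, y_pred)
```

For a screening tool, sensitivity is usually watched closely, since a missed case (false negative) means a person who needs assessment is not referred.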
In the high-level design, the proposed functional and non-functional requirements of the
software are depicted. An overall solution architecture is developed which can handle those needs.
Naïve Bayes Classifier: In machine learning, naive Bayes classifiers are a family of simple
"probabilistic classifiers" based on applying Bayes' theorem with strong (naive) independence
assumptions between the features. Naive Bayes has been studied extensively since the 1960s. It
was introduced into the text retrieval community in the early 1960s, and remains a popular
method for text categorization, the problem of judging documents as belonging to one category
or the other with word frequencies as the features. With appropriate pre-processing, it is
competitive in this domain with more advanced methods including support vector machines. It
also finds application in automatic medical diagnosis. Naive Bayes classifiers are highly scalable,
requiring a number of parameters linear in the number of variables (features/predictors) in a
learning problem. Maximum-likelihood training can be done by evaluating a closed-form
expression, which takes linear time, rather than by expensive iterative approximation as used for
many other types of classifiers.
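The closed-form training mentioned above can be sketched for the Bernoulli variant of Naive Bayes, where every feature is a yes/no (1/0) value. Training is a single counting pass over the data. The toy answers, the Laplace smoothing constant `alpha` and the function names are assumptions made for this example; the sketch is not the project's actual implementation.

```python
import math


def fit_bernoulli_nb(X, y, alpha=1.0):
    """Estimate class priors and per-feature Bernoulli parameters by counting."""
    model = {}
    n_features = len(X[0])
    for c in set(y):
        rows = [x for x, label in zip(X, y) if label == c]
        prior = len(rows) / len(X)
        # Laplace-smoothed probability that feature j equals 1 within class c.
        probs = [
            (sum(r[j] for r in rows) + alpha) / (len(rows) + 2 * alpha)
            for j in range(n_features)
        ]
        model[c] = (prior, probs)
    return model


def predict_bernoulli_nb(model, x):
    """Return the class with the highest log-posterior for the binary vector x."""
    best_class, best_score = None, -math.inf
    for c, (prior, probs) in model.items():
        score = math.log(prior)
        for xj, pj in zip(x, probs):
            # Independence assumption: per-feature log-likelihoods simply add up.
            score += math.log(pj if xj == 1 else 1.0 - pj)
        if score > best_score:
            best_class, best_score = c, score
    return best_class


# Toy data (assumed): each row holds four yes/no screening answers.
X = [[1, 1, 1, 0], [1, 1, 0, 1], [1, 0, 1, 1],
     [0, 0, 0, 1], [0, 1, 0, 0], [0, 0, 1, 0]]
y = [1, 1, 1, 0, 0, 0]
model = fit_bernoulli_nb(X, y)
```

The number of stored parameters is one prior plus one probability per feature for each class, which is the linear growth in the number of features described in the text.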
A data flow diagram (DFD) shows what kind of information will be input to and output from the system,
how the data will advance through the system, and where the data will be stored.
I. DFD – 0
II. DFD – 1
III. DFD – 2
While a use case itself might drill into a lot of detail about every possibility, a use case diagram
can help provide a higher-level view of the system. It has been said before that "Use case
diagrams are the blueprints for the system". They provide the simplified and graphical
representation of what the system must actually do.
Figure 3.4 Flowchart of the Use-case diagram for Autism prediction model
A class diagram is a type of static structure diagram that describes the structure of a system by
showing the system's classes, their attributes, operations (or methods), and the relationships
among objects. The class diagram is the main building block of object-oriented modeling. It is
used for general conceptual modeling of the structure of the application, and for detailed
modeling translating the models into programming code. Class diagrams can also be used for
data modeling.
CHAPTER 4
SYSTEM TESTING
4.1 TESTING
The goal of testing is to find mistakes. Testing is a technique for determining how well a software
product works. The process of attempting to find every possible flaw or weakness in a work
product is known as software testing.
The first test was performed on the first module, Data Preprocessing, to confirm that the
dataset was free of missing or unknown values. Data cleansing completed successfully using
the original CSV file as input.
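A check of this kind can be automated. The sketch below scans CSV text and reports every cell that is empty or holds a placeholder for an unknown value. The column names, the sample rows and the set of placeholder markers are hypothetical, since the report does not list the dataset's columns or how unknown values are encoded.

```python
import csv
import io

# Assumed placeholders for missing or unknown values (compared in lower case).
MISSING_MARKERS = {"", "?", "na", "nan", "unknown"}


def find_missing_cells(csv_text):
    """Return (row_number, column_name) for every missing or unknown cell."""
    problems = []
    reader = csv.DictReader(io.StringIO(csv_text))
    for row_number, row in enumerate(reader, start=2):  # row 1 is the header
        for column, value in row.items():
            if value is None or value.strip().lower() in MISSING_MARKERS:
                problems.append((row_number, column))
    return problems


# Hypothetical raw and cleaned inputs.
raw = "A1_Score,age,ethnicity\n1,6,Asian\n0,?,unknown\n1,5,\n"
clean = "A1_Score,age,ethnicity\n1,6,Asian\n0,7,Others\n"

raw_problems = find_missing_cells(raw)
clean_problems = find_missing_cells(clean)
```

The preprocessing test passes when the list returned for the cleaned file is empty.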
The second and third tests are performed in the second module, Feature Extraction, to lower the
dataset's dimensionality. To obtain the reduced feature dataset, the preprocessed CSV file is used,
and PCA and Random Forest are successfully applied individually.
Before any new application can be automated, it must first be manually tested. Manual software
testing takes more time and effort, but it is required to determine whether automation is possible.
Manual testing concepts do not necessitate familiarity with any testing tool. "100% Automation is
not achievable," says one of the Software Testing Fundamentals. This necessitates manual testing.
Before software is launched into production, automated testing is used to ensure that it is working
properly and fulfilling requirements. This type of software testing employs predefined sequences
that are carried out by testing tools. Automated testing tools conduct software tests, report the
results, and compare the findings to previous test runs.
Automated tests can be used for a variety of scenarios, including unit, API, and regression testing.
The fundamental advantage of automated software testing is that it reduces as much manual effort
as possible into a series of scripts.
If unit testing, for example, takes a significant portion of a quality assurance (QA) team's
resources, this procedure should be considered for automation.
Automated tests can be conducted multiple times throughout the day. This method is consistent
with continuous testing, continuous integration (CI), and continuous delivery (CD) software
development approaches, all of which strive to get code changes into production without
requiring manual intervention.
and output.
The module dependency graph is a key feature that influences the integration strategy. The order
in which various modules call one another is represented by the module dependency graph. A
module dependency graph is represented by a structure chart. By looking at the structure chart,
an integration plan can be created using any of the following methods:
1. Big-bang approach
2. Top-down approach
3. Bottom-up approach
4. Mixed approach
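To show how a module dependency graph can drive an integration plan, the sketch below derives a bottom-up and a top-down order from a small call graph using Python's standard `graphlib` module (Python 3.9 or later). The module names and the call relationships are hypothetical; they are not the project's actual structure chart.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical dependency graph: each module maps to the modules it calls.
calls = {
    "main": {"preprocessing", "feature_extraction", "prediction"},
    "prediction": {"feature_extraction"},
    "feature_extraction": {"preprocessing"},
    "preprocessing": set(),
}


def bottom_up_order(dependency_graph):
    """Order modules so that each one appears after all the modules it calls."""
    return list(TopologicalSorter(dependency_graph).static_order())


def top_down_order(dependency_graph):
    """Order modules so that each caller appears before the modules it calls."""
    return list(reversed(bottom_up_order(dependency_graph)))


order = bottom_up_order(calls)
```

In the bottom-up order the lowest-level modules are integrated and tested first, so no stubs are needed for the modules they depend on. The top-down order starts from the main module instead.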
The fundamental goal of each subsystem's testing is to check the interfaces between the numerous
modules that make up the subsystem. Both the control and data interfaces are put to the test.
Bottom-up integration testing has the advantage of allowing multiple disconnected subsystems to
be tested at the same time. The complexity that arises when a system is made up of a large
number of little subsystems is a disadvantage of bottom-up testing.
We tested all of the individual programs first in the Main module, and after receiving successful
results in the individual program testing, we went on to the integration.
We integrated a few programs and then put them to the test, with positive results.
4.2 TEST CASES
CHAPTER 4
RESULT AND SNAPSHOTS
The Naïve Bayes algorithm gives 95% accuracy in the detection of autism from the dataset, whereas the
Support Vector Machine gives the highest accuracy of 100%. Once the user enters the data by answering the
AQ-10 questions and providing other individual characteristics, the model predicts whether the person has
autism or not using the better-performing algorithm and displays the result to the user.
A suggestion page is included along with the prediction page to give the user a better understanding
of the prediction made. A comparison of the performance of the two algorithms is shown in Fig.5.
It shows that the SVM classifier gives the higher accuracy of the two
classifier algorithms.
SYSTEM SNAPSHOTS
Figure 4.3 shows the home page of the application for the user.
Figure 4.4 shows the registration page, which allows a new user to register on the portal and get
a diagnosis based on the details they provide.
Figure 4.6 shows the predict page, which determines whether the user has autism or not based on the
inputs provided.
Figure 4.7 shows the Suggestion page, which helps the user follow the course of treatment or the
activities stated and clarifies the information provided.
Figure 4.8 shows the About page, which gives an idea of what autism is and how to deal with
it.
Figure 4.9 shows the compose section of the Chatbot page, which allows registered users to
compose and send text to the chatter bot.
CONCLUSION
Autism is a neurological developmental disability that hampers normal brain development, affecting
communication, social interaction, cognition and behavior. Autism is known as a spectrum disorder
because its symptoms and characteristics appear in a variety of combinations, and the children
affected by them face severe challenges.
A portal is developed for the prediction of autism based on the input provided. Algorithms such as
Naive Bayes, Support Vector Machine and Random Forest are used to predict the presence of autism
along with its stages.
A Chatbot is built to assess the condition of the patient based on the input provided. An interactive
session is involved in order to analyze the patient’s condition and recommend suitable medicines.
In future, this work can be extended to work with other types of ASD data: voice recognition
methods and sensors can be used to understand patients' actions, and recorded
speech therapy can be added to this project.
REFERENCES
[1] Base paper: Tania Akter, Md. Shahriare Satu, Md. Imran Khan, Mohammad Hanif Ali, Shahadat Uddin, Pietro Lió,
Julian M.W. Quinn, and Mohammad Ali Moni, “Machine Learning-Based Models for Early Stage Detection of Autism
Spectrum Disorders”, received October 10, 2019, accepted October 30, 2019, date of publication
November 11, 2019, date of current version November 27, 2019, IEEE.
[2] Guannan Li, Meng-Hsiang Chen, Gang Li, Di Wu, Quansen Sun, Dinggang Shen, Li Wang, “A preliminary
volumetric MRI study of Amygdala and Hippocampal subfields in Autism during infancy”, 16th International
Symposium on Biomedical Imaging, 2019 IEEE.
[3] Fitrilia Susanti, Danang Junaedi, Veronikha Effendy, “Communication Learning User Interface Model for Children
with Autism with the Goal-Directed Design Method”, 7th International Conference on Information and Communication
Technology, 2019 IEEE.
[4] Leslie Mertz, “New Quantitative Approach to Autism Diagnosis”, International Conference on Medical and Health
Informative 2019 IEEE.
[5] Osman Altay, Mustafa Ulas, “Prediction of the Autism Spectrum Disorder Diagnosis with Linear Discriminant
Analysis Classifier and K-Nearest Neighbor in Children”, 6th International Symposium on Digital Forensic and
Security, 2018 IEEE.
[6] Elizabeth Stevens, Abigail Atchison, Laura Stevens, Esther Hong, Doreen Granpeesheh, Dennis Dixon, Erik Linstead,
“A Cluster Analysis of Challenging Behaviors in Autism Spectrum Disorder”, 16th IEEE International Conference on
Machine Learning and Applications, 2017 IEEE.
[7] Airi Tsuji, Satoru Sekine, Takuya Enomoto, Soichiro Matsuda, Junichi Yamamoto, Kenji Suzuki, “Modeling of the
Chasing Behaviors for Developmental Program of Children with Autism Spectrum Disorders”, 16th International
Conference on Cognitive Informatics and Cognitive Computing, 2017 IEEE.
[8] Cheol-Hong Min, “Automatic Detection and Labeling of Self-Stimulatory Behavioral Patterns in Children with
Autism Spectrum Disorder”, 16th International Conference on Data Mining Workshop, Barcelona, Spain, Dec. 2016
IEEE.
[9] Elena Pattini, Dolores Rollo, “Response to stress in the parents of children with autism spectrum disorder”,
Instrumentation and Measurement Society prior to the acceptance and publication, 2016 IEEE.
[10] Zhi Zheng, Qiang Fu, Huan Zhao, Amy R. Swanson, Amy S. Weitlauf, Zachary E. Warren, and Nilanjan Sarkar,
“Design of an Autonomous Social Orienting Training System (ASOTS) for Young Children with Autism”, IEEE
Transactions on Neural Systems and Rehabilitation Engineering, 2016 IEEE.
BATCH – 2018-2022
NAME: Rakshitha V Sirisha M N Sneha H R Tejaswini D S