(2ND BEST MODEL) Decision Tree Prediction Model.
(2ND BEST MODEL) Decision Tree Prediction Model.
Regression Algorithms
I. Introduction
Decision Tree
Learning Modality (LM) was used in public and private schools in the Philippines during the
Covid19 outbreak . For their clients, the Department of Education (DepEd) has established
Learning Delivery Modalities (LDM). The synchronous and asynchronous modes of learning are
also provided to parents and students as part of this[1]. Learners and teachers have been able to
gain knowledge in a modular format thanks to the LDM deployment. The majority of the
learning resources are gathered using Cloud Computing (CC), which is a model of information
management[2]. The learning materials are still assessed to see if the modules effectively
measure students' grasp of a subject. Furthermore, the success of learners at all grade levels and
across domains must be evaluated.
The LDM created a solution to continue the learners' educations in the presence of the COVID-
19. The delivery of modules to the learners that must be completed each week was implemented
by the elementary, junior high, and senior high school students. Other schools have also prepared
for synchronous and asynchronous learning models, depending on the type of system chosen at
the time of enrollment. The program's execution has two big challenges: no computer or
smartphone, and some people are having trouble connecting to the internet. For the
implementation of learning modules; an excessive or insufficient number of printed learning
modules, broken modules as a result of wear and tear, and so on.
The process of distributing learning resources and the resources required by the various schools
in DepEd Region 4A[3], which consists of 21 Divisions, will be evaluated using a model. The
study will focus on prediction for the elementary, junior, and senior high school education
departments, with coverage ranging from Kindergarten to Grade 12 , from the 2016-2017
academic year to the 2019-2020 academic year. To assess each institution's performance and
success rate with cloud-based learning materials[4], a Decision Tree prediction was applied.
Furthermore, determining the acceptability of cloud-based learning by studying the trend of
various data acquired from various sources.
Different prediction models, such as Naïve Bayes , Gradient Boosted Tree, Random Forest, and
others[5], have been utilized in various research; however, these models may not be helpful in
predicting primary school enrolment in Region 4A. It will be more difficult to estimate the
enrollment pattern of this dilemma due to various trends and pandemics[6]. By defining the
various parameters and the best-fit predictive algorithm, the research will be more accurate.
One of the leading predictive algorithms is Decision Tree[7]; This method has been utilized in a
variety of applications, including medical, statistical, and environmental forecasting, as well as
enrollment analysis[8][9]. The Decision Tree approach has also been shown to suit
multidimensional datasets[10]. Using this method in this scenario would allow for a bigger scope
and more accuracy[11]. Decision Tree is algorithm that can perform both classification and
regression tasks. They are very powerful algorithms, capable of fitting complex datasets.
Besides, decision trees are fundamental components of random forests, which are among the
most potent Machine Learning algorithms available today.
The use of decision trees is extremely common. Because of its simplicity and transparency, it is
widely used in data mining. Decision trees are frequently graphically depicted as a hierarchical
structure, which makes them easier to understand than alternative strategies[12]. This structure
primarily consists of a root node and a series of branches (conditions) that lead to additional
nodes until we reach the leaf node, which holds the route's final conclusion. Because of its
straightforward form, the decision tree is a self-explanatory model. Each internal node tests an
attribute, and each branch corresponds to the value of that attribute (or range of values). At the
end of the process, each lead assigns a classification.
This paper is organized into five sections. In section 2, Data Preparation is presented. Section 3
provides brief details about commonly used methods for classification model evaluation. In
section 4, experimental results are presented and analyzed with respect to model results and
discussion. Finally recommendation is presented in Section 5
The data were gathered in the different parts of the Philippines, Antipolo,
Binan, Cavite, Dasmarinas, Quezon City, Rizal, and Tanauan. The gathered data
will be use to predict next academic year by using rapid miner.
Methodology
Data cleaning increased the dataset's quality by utilizing correlation to find the relevant
attributes, eliminating duplicate entries, and organizing data. The end result will be a dataset
ready for further training and testing.
For training and testing datasets with random values, the data was separated into 70
percent and 30 percent coefficients. The table below shows the outcome of the forecast with
the average number of values. The average values indicate the dataset's most accurate
forecast.
The range of values for the total and prediction are closely connected, but there is a
substantial difference in the range of values for the minimum and maximum values. The exact
figures comprise a prognosis that differs from the previous three academic years; as a result, the
prediction follows the pattern of the entering kindergarten's assumption based on the trend.
Gender is one of the most important factors to consider when making a prediction. The
statistical value of each parameter determines the viability of prediction; consequently, the
dataset's strength will reveal the link between each predictive value shown above.
V. Conclusion
With the data we gathered after we run it in data miner we proved that
Decision tree is the second best option to use in predicting data such as this.
Decision tree achieved being the fastest total and fastest scoring model out of the
other models there is in data miner.
References:
M. M. Shahabadi and M. Uplane, “Synchronous and Asynchronous e-learning Styles and
Academic Performance of e-learners,” Procedia - Social and Behavioral Sciences, vol. 176,
pp. 129–138, 2015, doi: 10.1016/j.sbspro.2015.01.453.
M. Chamilco, A. Pacheco, C. Peñaranda, E. Felix, and M. Ruiz, “Materials and methods on
digital enrollment system for educational institutions,” Materials Today: Proceedings, no.
xxxx, pp. 2–6, 2021, doi: 10.1016/j.matpr.2021.04.213.
E. Jimenez and Y. Sawada, “Public for private: The relationship between public and private
school enrollment in the Philippines,” Economics of Education Review, vol. 20, no. 4, pp.
389–399, 2001, doi: 10.1016/S0272-7757(00)00061-3.
P. Singh and Y. P. Huang, “A new hybrid time series forecasting model based on the
neutrosophic set and quantum optimization algorithm,” Computers in Industry, vol. 111, pp.
121–139, 2019, doi: 10.1016/j.compind.2019.06.004.
M. D. Hernandez, A. C. Fajardo, and R. P. Medina, “A Hybrid Convolutional Neural
Network-Gradient Boosted Classifier for Vehicle Classification,” IJRTE Journal, no. 2, pp.
213–216, 2019, doi: 10.35940/ijrte.B1016.078219.
R. Bozick, D. M. Anderson, and L. Daugherty, “Patterns and predictors of postsecondary re-
enrollment in the acquisition of stackable credentials,” Social Science Research, vol. 98, no.
April, p. 102573, 2021, doi: 10.1016/j.ssresearch.2021.102573.
Merritt, Stephen; Francomano, Anne; and Garcia, Martin (2020) "Optimizing the Enrollment
Funnel with Decision Trees and Rule Based List," SMU Data Science Review: Vol. 3 : No. 1
, Article 3.
V. Vamitha, “A different approach on fuzzy time series forecasting model,” Materials
Today: Proceedings, vol. 37, no. Part 2, pp. 125–128, 2020, doi:
10.1016/j.matpr.2020.04.579.
M. dela Cruz, “of State Universities and Colleges in Central Luzon Philippines :,” 2019.
A. Bender et al., “Dataset for multidimensional assessment to incentivise decentralised
energy investments in Sub-Saharan Africa,” Data in Brief, vol. 37, p. 107265, 2021, doi:
10.1016/j.dib.2021.107265.
M. D. Hernandez, A. C. Fajardo, R. P. Medina, J. T. Hernandez, and R. M. Dellosa,
“Implementation of data augmentation in convolutional neural network and gradient boosted
classifier for vehicle classification,” International Journal of Scientific and Technology
Research, vol. 8, no. 12, pp. 185–189, 2019, [Online]. Available: http://www.ijstr.org/final-
print/dec2019/Implementation-Of-Data-Augmentation-In-Convolutional-Neural-Network-
And-Gradient-Boosted-Classifier-For-Vehicle-Classification.pdf
Abdul Fattah Mashat, Mohammed M. Fouad, Philip S. Yu, Tarek F. Gharib, “A Decision
Tree Classification Model for University Admission System”, (IJACSA) International
Journal of Advanced Computer Science and Applications, Vol. 3, No. 10, 2012