SLRG
Soohyun Yang
College of Engineering
Department of Civil and Environmental Engineering
Types of ML techniques – All learning is learning!
Our scope: Classification and Regression
• “Presence of labels”
  Tasks: Classification, Regression
  Examples: Advertisement popularity, Spam classification, Face recognition
• “Absence of labels”
  Tasks: Clustering
  Examples: Recommender systems (YT), Buying habits (group customers), Grouping user logs
• “Behavior-driven: feedback loop”
  Examples: Learning to play games (AlphaGo), Industrial simulation, Resource management
https://towardsdatascience.com/what-are-the-types-of-machine-learning-e2b9e5d1756f
Regression
A statistical method for determining the relationship between
a dependent variable (target) and one or more independent
variables (features), used to predict a target value on a continuous scale
for new data.
KNN algorithm – Regression problem (cont’d)
2. Separate the data into the training and test sets.
• random_state [integer]: the seed for the random number generator, so that the split is reproducible.
• Stratification is NOT needed for a regression problem.
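The split in step 2 can be sketched as follows. This is a minimal illustration, assuming scikit-learn's train_test_split is the function behind the random_state parameter on the slide; the data (X, y) and the seed value 42 are made up for the example.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Hypothetical data: 20 samples, one feature, continuous target.
rng = np.random.default_rng(0)
X = rng.uniform(0, 50, size=(20, 1))
y = 15.0 * X[:, 0] + rng.normal(0, 5, size=20)

# Default test_size is 0.25. No `stratify` argument is passed,
# because the target is continuous (regression).
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Reusing the same random_state reproduces exactly the same split.
X_train2, X_test2, y_train2, y_test2 = train_test_split(X, y, random_state=42)
same_split = np.array_equal(X_train, X_train2)

print(X_train.shape, X_test.shape, same_split)
```

Fixing random_state matters mainly for reproducibility: the score reported later depends on which samples land in the test set.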
4. Fit the regression model using the training set (fit method).
=> This stores the training set so that neighbors can be computed during prediction.
5. Make predictions on the test data (predict method).
6. Evaluate the model’s performance (score method)
=> via the coefficient of determination, R².
• R² ≤ 1: A higher value => better performance in predicting the test set’s outcomes. R² = 1 is a perfect fit; on a test set, R² can be negative when the model predicts worse than always returning the mean target.
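Steps 4 to 6 can be sketched as below. This assumes scikit-learn's KNeighborsRegressor is the model in use; the data and n_neighbors=3 are hypothetical choices for illustration only. The last lines recompute R² by hand to show what the score method reports.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsRegressor
from sklearn.metrics import r2_score

# Hypothetical data: one feature, continuous target.
rng = np.random.default_rng(0)
X = rng.uniform(0, 50, size=(40, 1))
y = 15.0 * X[:, 0] + rng.normal(0, 5, size=40)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

knr = KNeighborsRegressor(n_neighbors=3)  # k chosen only for illustration
knr.fit(X_train, y_train)                 # step 4: stores the training set
y_pred = knr.predict(X_test)              # step 5: mean target of the k nearest neighbors
r2 = knr.score(X_test, y_test)            # step 6: coefficient of determination

# R^2 by hand: 1 - (residual sum of squares) / (total sum of squares).
ss_res = np.sum((y_test - y_pred) ** 2)
ss_tot = np.sum((y_test - y_test.mean()) ** 2)
r2_manual = 1.0 - ss_res / ss_tot

print(round(r2, 4), round(r2_manual, 4))
```

Because ss_res can exceed ss_tot, the value is bounded above by 1 but not below by 0.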
KNN algorithm – Regression problem (cont’d)
83.000
=> On average, the predicted targets differ from the real ones by ~83 vehicles/hr.
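The slide does not name the metric behind this number. The reading "on average, the predictions differ by" matches the mean absolute error, which is assumed in the sketch below. The observed and predicted values are hypothetical and are not the lecture's data.

```python
# Hypothetical observed and predicted traffic volumes (vehicles/hr).
y_true = [800.0, 950.0, 700.0, 1020.0]
y_pred = [860.0, 900.0, 790.0, 1000.0]

# Mean absolute error: the average size of the prediction error,
# expressed in the same unit as the target.
abs_errors = [abs(t - p) for t, p in zip(y_true, y_pred)]
mae = sum(abs_errors) / len(abs_errors)

print(f"{mae:.3f}")  # prints 55.000
```

Unlike R², this value carries the unit of the target, so it can be read directly as "vehicles/hr".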
847.8
Applying the trained model to new data 2
Let’s predict a target value for another new data point with [feature1 = 100].
The predicted outcome is 847.8, which is identical to the prediction for new data 1.
=> Does it make sense? Why did it happen? How can we resolve it?
=> Does it make sense? Why did it happen? How can we resolve it?
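One plausible explanation, assuming both new feature values lie beyond the largest feature value in the training set: KNN regression averages the targets of the k nearest training samples, so every query past the edge of the training range picks the same neighbors and receives the same prediction. The sketch below uses a hand-written helper (knn_predict, hypothetical) and made-up training data to show the effect.

```python
def knn_predict(train_x, train_y, query, k=3):
    """Average the targets of the k training points closest to `query`."""
    order = sorted(range(len(train_x)), key=lambda i: abs(train_x[i] - query))
    nearest = order[:k]
    return sum(train_y[i] for i in nearest) / k

# Hypothetical training set; the largest feature value is 40.
train_x = [5.0, 10.0, 20.0, 30.0, 35.0, 40.0]
train_y = [120.0, 210.0, 400.0, 610.0, 700.0, 820.0]

# Both queries lie beyond the training range, so both pick the same
# three neighbors (x = 40, 35, 30) and get the same prediction.
pred_60 = knn_predict(train_x, train_y, 60.0)
pred_100 = knn_predict(train_x, train_y, 100.0)

print(pred_60, pred_100)
```

Under this reading, the model cannot extrapolate. Possible remedies include collecting training samples that cover the new range or using a model that can extrapolate a trend, such as linear regression.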
Take-home points (THPs)
-
-
-
…