FRM Part 1 Quants 2023 ML

1. Linear regression and logistic regression are commonly used supervised learning algorithms for predicting continuous and categorical variables respectively.
2. Decision trees can be used for both classification and regression problems. They work by recursively splitting the data into purer child nodes based on entropy or Gini impurity measures.
3. Principal component analysis (PCA) is an unsupervised technique that reduces the dimensionality of highly correlated data by transforming it into a smaller number of uncorrelated principal components.

A. SUPERVISED LEARNING

LINEAR REGRESSION: PREDICTION ALGORITHM


Explores the linear relationship between the Independent Variables (X) and the Dependent
Variable (Y), where Y is a continuous variable, i.e., lies between -∞ and ∞,
such that [ y = b0 + b1·x1 + b2·x2 ]. Ex: Sales = 11.75 + 3.75 (Television) + 2.23 (Magazine)
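A minimal sketch in Python evaluating the fitted equation from the example above; the ad-spend values are hypothetical:

```python
# Minimal sketch: evaluating the fitted linear regression from the example above.
# Coefficients come from the notes; the ad-spend values are made up for illustration.
b0, b1, b2 = 11.75, 3.75, 2.23

television, magazine = 10.0, 5.0                 # hypothetical spend in each channel
sales = b0 + b1 * television + b2 * magazine
print(f"Predicted sales: {sales:.2f}")           # 11.75 + 37.50 + 11.15 = 60.40
```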
THINGS TO CHECK IN A MODEL

Model Accuracy & Adequacy: measure MSE
Significance: hypothesis testing
Generalization: divide the data into
  - Training Set (train the model on this data)
  - Validation Set (validate & tune the model)
  - Test Set (test the model on out-of-sample data)
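A minimal sketch of this three-way split, assuming scikit-learn; the 60/20/20 proportions and the synthetic data are illustrative assumptions, not from the notes:

```python
# Minimal sketch: splitting data into training / validation / test sets.
import numpy as np
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 5))                   # 1000 observations, 5 features (synthetic)
y = X @ np.array([3.75, 2.23, 0.0, 0.0, 0.0]) + rng.normal(size=1000)

# First carve off 20% as the out-of-sample test set...
X_rest, X_test, y_rest, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
# ...then split the remainder into training (60% overall) and validation (20% overall).
X_train, X_val, y_train, y_val = train_test_split(X_rest, y_rest, test_size=0.25, random_state=0)
```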

TESTING DATA & ERRORS

1. MSE on the Training Set
2. MSE (Validation) > MSE (Training) → the BIAS-VARIANCE TRADE-OFF

BIAS ERROR (problem of underfitting). Solution: add more features.
VARIANCE ERROR (problem of overfitting). Solution: penalized regression.

LASSO          Penalty Term: [ λ × Σ|βᵢ| ]
RIDGE          Penalty Term: [ λ × Σ(βᵢ)² ]
ELASTIC NETS   [ RIDGE + LASSO ]

Penalized regression includes a constraint such that the regression
coefficients are chosen to minimize {SSE + Penalty Term}. A feature must
make a sufficient contribution to model fit to offset the penalty from including it.

How to choose λ? Use K-FOLD CROSS-VALIDATION.
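A minimal sketch of choosing λ by k-fold cross-validation with scikit-learn's LassoCV (scikit-learn calls λ `alpha`; the candidate grid is an arbitrary assumption, and the sketch reuses the split from above):

```python
# Minimal sketch: choosing lambda (called `alpha` in scikit-learn) by 5-fold CV.
from sklearn.linear_model import LassoCV

lasso = LassoCV(alphas=[0.01, 0.1, 1.0, 10.0], cv=5).fit(X_train, y_train)
print("chosen lambda:", lasso.alpha_)
print("coefficients:", lasso.coef_)   # weak features are shrunk exactly to zero
```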


LOGISTIC REGRESSION

CLASSIFICATION ALGORITHM
The Target Variable (y) is a category/class. Ex: y = Defaulter / Non-defaulter, i.e., 1 or 0
x1 = Gender, x2 = Income, x3 = Age

Why not Linear Regression?

Linear regression can give Ŷ any value between [-∞ to ∞], but we want Y as 1
or 0. In logistic regression we map Ŷ between 0 and 1, like probabilities,
using a link function, and estimate the coefficients by maximizing the
likelihood (MLE).

LINK FUNCTION

The logistic (sigmoid) function is the standard link: P(y = 1) = 1 / (1 + e^-(b0 + b1x1 + b2x2 + ...)),
which maps the linear score onto a probability between 0 and 1.
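A minimal sketch, assuming scikit-learn and synthetic defaulter data:

```python
# Minimal sketch: logistic regression on synthetic defaulter data.
# The sigmoid link maps the linear score to a probability; fitting maximizes the likelihood.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(1)
X = rng.normal(size=(500, 3))      # columns: gender, income, age (standardized, synthetic)
y = (X @ np.array([0.5, -1.5, 0.8]) + rng.normal(size=500) > 0).astype(int)  # 1 = defaulter

clf = LogisticRegression().fit(X, y)
p_default = clf.predict_proba(X[:1])[0, 1]   # mapped between 0 and 1 by the sigmoid link
print(f"P(defaulter) for first borrower: {p_default:.2f}")
```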


DECISION TREE aka CLASSIFICATION & REGRESSION TREE
A Classification & Regression Tree (CART) is a supervised ML technique that can be
applied to predict either a Categorical Target Variable (y), producing a
CLASSIFICATION TREE, or a Continuous Target Variable, producing a REGRESSION TREE.
To start the decision tree, identify the Root Node: the feature that gives the lowest
misclassification, based on the lowest ENTROPY or GINI measure.

ENTROPY or IMPURITY = [ -P log(P) - (1-P) log(1-P) ]    GINI MEASURE = Σ P × (1-P)
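A minimal sketch of the two impurity measures for a binary node (using base-2 logs, an assumption; any base works up to scaling):

```python
# Minimal sketch: binary-node impurity measures from the formulas above.
import math

def entropy(p: float) -> float:
    """-P log(P) - (1-P) log(1-P); zero at a pure node (P = 0 or 1)."""
    if p in (0.0, 1.0):
        return 0.0
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

def gini(p: float) -> float:
    """Sum of P(1-P) over the two classes: p(1-p) + (1-p)p."""
    return 2 * p * (1 - p)

print(entropy(0.5), gini(0.5))   # maximal impurity: 1.0 and 0.5
print(entropy(0.9), gini(0.9))   # purer node: lower impurity
```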

DECISION TREE & PARTITIONING OF FEATURES

[Figure: example tree. Root Node: Travel Cost, splitting into Standard / Expensive / Cheap.
Internal Nodes: Gender (Male / Female) and Car Owned (1 / 0).
Terminal or Leaf Nodes: Car, Train, Bus.]

REGRESSION TREE

Same as a Classification Tree, with the difference being that splits are chosen
on SSE reduction instead of an ENTROPY measure.

ADVANTAGE OF CART

We can take the maximum possible leaf nodes & achieve 100% (in-sample) accuracy.

DISADVANTAGE OF CART

Problem of over-fitting, addressed via STOPPING CRITERIA and PRUNING.

STOPPING CRITERIA

1. Maximum Depth: limit the growth of the tree by specifying the number of splits.
2. Minimum Observations: the minimum number of observations specified to grow the tree further.
3. Maximum Decision Nodes: cap the number of decision nodes.

PRUNING

Just like penalized regression, we use a λ parameter to prune the tree,
and λ is chosen using k-fold cross-validation.

RANDOM FOREST

Similar to bagging, with an extension of the idea: take a random set of
observations + a random set of features too.

ENSEMBLE LEARNING & RANDOM FOREST

Ensemble Learning: instead of basing predictions on the result of a single
tree, use a group of trees called an ensemble; the averaged result of all
the trees converges towards a more accurate prediction.
This technique is called Bagging or Bootstrap Aggregation.
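A minimal sketch contrasting a tree grown with stopping criteria against a random forest, assuming scikit-learn; the dataset and hyperparameter values are illustrative assumptions:

```python
# Minimal sketch: CART with stopping criteria vs. a random forest (bagging + random features).
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=1000, n_features=10, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Stopping criteria: maximum depth, minimum observations per split, maximum leaf nodes.
tree = DecisionTreeClassifier(max_depth=3, min_samples_split=20, max_leaf_nodes=8).fit(X_tr, y_tr)

# Random forest: each tree sees a bootstrap sample AND a random subset of features per split.
forest = RandomForestClassifier(n_estimators=200, max_features="sqrt", random_state=0).fit(X_tr, y_tr)

print("tree  :", tree.score(X_te, y_te))     # out-of-sample accuracy of the pruned tree
print("forest:", forest.score(X_te, y_te))   # the ensemble typically generalizes better
```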

K-NEAREST NEIGHBOUR

KNN is based on the intuition that a new observation will be classified
into the class that has the majority among its nearest neighbours.
"Like Neighbours, Like You."
Find the distance of each point to the new observation and decide the
class based on K, where K = the number of Nearest Neighbours.

[Figure: scatter of two classes. The new observation here will be
classified as class 1 based on K = 7.]

ADVANTAGES OF KNN

- Straightforward & powerful
- Non-parametric
- Easy for multi-class classification

CHALLENGES OF KNN

- Choosing K: a large K can dilute the concept of the nearest neighbour,
  while a small K can lead to a high error rate.
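A minimal sketch with K = 7, matching the figure above; scikit-learn and the synthetic blobs are assumptions:

```python
# Minimal sketch: KNN with K = 7 on synthetic two-class data.
from sklearn.datasets import make_blobs
from sklearn.neighbors import KNeighborsClassifier

X, y = make_blobs(n_samples=200, centers=2, random_state=0)
knn = KNeighborsClassifier(n_neighbors=7).fit(X, y)   # K = 7 nearest neighbours

new_obs = [[0.0, 2.0]]                                # a hypothetical new observation
print("predicted class:", knn.predict(new_obs)[0])    # majority vote among the 7 neighbours
```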

SUPPORT VECTOR MACHINE

A Support Vector Machine is a linear classifier that
determines the hyperplane that optimally separates the
observations into 2 sets of data points.

The best hyperplane is the plane that maximizes the margin
width, based on the support vectors (touch points).
"If the support vectors are separated correctly, the rest of the
observations will be classified correctly as well."

If the data is linearly separable → Hard Margin Classification

Real-world data may not be perfectly linearly separable; in that case:

NON-LINEAR SVM    or    SOFT MARGIN SVM

SVM Applications: suited for small-to-medium, complex, high-dimensional data.
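A minimal sketch, assuming scikit-learn; a very large C is used to approximate a hard margin, which is an illustrative choice, not an exact hard-margin solver:

```python
# Minimal sketch: hard-ish vs. soft margin via the C parameter, plus a non-linear kernel.
from sklearn.datasets import make_classification
from sklearn.svm import SVC

X, y = make_classification(n_samples=300, n_features=5, random_state=0)

hard_ish = SVC(kernel="linear", C=1e6).fit(X, y)   # very large C ≈ hard margin (few violations)
soft = SVC(kernel="linear", C=1.0).fit(X, y)       # soft margin tolerates some misclassification
nonlin = SVC(kernel="rbf").fit(X, y)               # non-linear SVM via the RBF kernel

print(len(hard_ish.support_), len(soft.support_))  # number of support vectors in each fit
```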

PRINCIPAL COMPONENT ANALYSIS

PCA is used to reduce highly correlated features of the data into
a few main uncorrelated composite variables
called Principal Components (PCs).

Suppose our data has 8 features. We find PC1
by giving weights to the original features in
such a manner that the variance (information)
explained is maximized.

PC2 will be found again by maximizing the (remaining) variance
explained, subject to the constraint that PC1 & PC2 are
uncorrelated.

Eigenvectors: the weights of the new, mutually uncorrelated variables.
Eigenvalues: the variance explained by each Eigenvector (PC).

The trade-off between complexity & accuracy is visualized by a SCREE PLOT.

[Figure: scree plot. Here, keeping 3 PCs, 78% of the variance is
explained while complexity is reduced from 8 to 3 features.]
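A minimal sketch, assuming scikit-learn; the synthetic data is built to be highly correlated (8 features driven by 3 underlying factors), so the exact variance shares will differ from the 78% in the scree-plot example:

```python
# Minimal sketch: reducing 8 correlated features to 3 principal components.
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
base = rng.normal(size=(500, 3))                                       # 3 hidden factors
X = base @ rng.normal(size=(3, 8)) + 0.1 * rng.normal(size=(500, 8))   # 8 correlated features

pca = PCA(n_components=3).fit(X)
print(pca.explained_variance_ratio_)         # eigenvalue share per component
print(pca.explained_variance_ratio_.sum())   # cumulative variance kept with 3 PCs
X_reduced = pca.transform(X)                 # complexity reduced from 8 to 3 features
```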


CLUSTERING ALGORITHMS
Clustering means sorting observations into groups such that items within the
same cluster are similar, and observations in 2 different clusters are as dissimilar as
possible: a property known as "separation."

CASE 1: K (the number of clusters) is known → K-MEANS CLUSTERING

STEPS (see the sketch below):
1. Decide K (a hyperparameter).
2. Randomly assign each observation to a particular cluster.
3. Find the centroid (average) of each cluster.
4. Calculate the (Euclidean) distance between each observation & each centroid.
5. Based on distance, assign each observation to its closest centroid.
Repeat steps 3 to 5 till observations stop shifting to new groups, i.e.,
the algorithm converges to its solution.
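A minimal sketch of K-means following the steps above, assuming scikit-learn; K = 2 and the synthetic blobs are illustrative:

```python
# Minimal sketch: K-means with K = 2 on synthetic data.
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

X, _ = make_blobs(n_samples=300, centers=2, random_state=0)

# KMeans iterates the assign-to-closest-centroid / recompute-centroid loop until convergence.
km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print(km.cluster_centers_)   # final centroids
print(km.labels_[:10])       # cluster assignment of the first 10 observations
```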

CASE 2: K (the number of clusters) is unknown → HIERARCHICAL CLUSTERING

AGGLOMERATIVE CLUSTERING (BOTTOM-UP): Consider each observation as a single
cluster, then find the 2 closest clusters & combine them into one new, larger
cluster. Repeat this process iteratively till all observations are clumped
into a single cluster.

DIVISIVE CLUSTERING (TOP-DOWN): Consider all observations as belonging to
a single cluster. Divide the observations based on some measure of distance
and continue the partitioning until each cluster contains only one single
observation.
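A minimal sketch of bottom-up clustering, assuming scikit-learn; the distance threshold is an arbitrary illustrative value:

```python
# Minimal sketch: agglomerative (bottom-up) clustering; K need not be fixed in advance.
from sklearn.cluster import AgglomerativeClustering
from sklearn.datasets import make_blobs

X, _ = make_blobs(n_samples=300, centers=3, random_state=0)

# Cutting the merge tree at a distance threshold lets the data choose the number of clusters.
agg = AgglomerativeClustering(n_clusters=None, distance_threshold=10.0).fit(X)
print("clusters found:", agg.n_clusters_)
```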


NEURAL NETWORKS
Each node within a hidden layer has 2 functional parts:
1. Summation Operator
2. Activation/Transformation Function
Summation Operator: multiplies each input by a
weight and sums the weighted values to form
the total Net Input.
The total Net Input is then passed to the activation
function, which transforms the input into the final
output of the node.
The output of the hidden layer is transmitted to the next
set of nodes or to the output layer. The output layer
again contains a summation operator & an
activation function.
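A minimal sketch of the two functional parts in NumPy; the sigmoid activation and the random weights are illustrative assumptions:

```python
# Minimal sketch: one hidden-layer forward pass showing the two parts of each node.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

x = np.array([0.5, -1.2, 3.0])                            # hypothetical inputs
W_hidden = np.random.default_rng(0).normal(size=(4, 3))   # 4 hidden nodes, 3 inputs each
W_out = np.random.default_rng(1).normal(size=(1, 4))

net_input = W_hidden @ x               # 1) summation operator: weighted sum per node
hidden_out = sigmoid(net_input)        # 2) activation function transforms the net input
output = sigmoid(W_out @ hidden_out)   # the output layer repeats the same two steps
print(output)
```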

ADVANTAGES OF NEURAL NETWORKS

- Capture complex non-linear interactions among features.

DISADVANTAGES OF NEURAL NETWORKS

- Risk of overfitting
- Black box: not interpretable

DEEP LEARNING & REINFORCEMENT LEARNING

DEEP LEARNING
If the number of hidden layers is very large, the neural network is called a Deep Learning Net (DLN).
REINFORCEMENT LEARNING
An RL algorithm involves an agent that should perform the actions that will maximise its rewards
over time, taking into consideration the constraints of its environment.
Ex: A virtual gamer (Agent) uses console commands (Actions) with the information on the
screen (Environment) to maximise his/her score (Reward).
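A minimal sketch of one way an agent can learn from rewards (tabular Q-learning); this specific algorithm, the toy sizes, and the α/γ values are assumptions beyond the notes:

```python
# Minimal sketch: the tabular Q-learning update an RL agent might use (toy, illustrative).
import numpy as np

n_states, n_actions = 5, 2
Q = np.zeros((n_states, n_actions))   # the agent's value estimate per (state, action)
alpha, gamma = 0.1, 0.9               # learning rate and discount factor (assumed values)

def update(state, action, reward, next_state):
    # Move Q(s, a) toward the reward plus the discounted best value of the next state.
    Q[state, action] += alpha * (reward + gamma * Q[next_state].max() - Q[state, action])

update(state=0, action=1, reward=1.0, next_state=2)
print(Q[0])
```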