0% found this document useful (0 votes)
25 views16 pages

Train Test Split

The document outlines model validation methods, specifically focusing on Train Test Split Evaluation and Cross Validation. It explains the roles of independent and dependent variables in regression analysis, using GPA and entrance scores as examples. Additionally, it mentions the use of the sklearn library for implementing train_test_split to prepare data for model training and testing.

Uploaded by

Lia Nelli
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
25 views16 pages

Train Test Split

The document outlines model validation methods, specifically focusing on Train Test Split Evaluation and Cross Validation. It explains the roles of independent and dependent variables in regression analysis, using GPA and entrance scores as examples. Additionally, it mentions the use of the sklearn library for implementing train_test_split to prepare data for model training and testing.

Uploaded by

Lia Nelli
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 16

Model Validation

Validation Methods
1) Train Test Split Evaluation
2) Cross Validation
• X TRAIN –independent variable
• YTRAIN – dependent variable

• (Refer linear regression video)Teacher Reference


• Answer dump – Y TEST
• GPA- DEPENDENT VARIABLE
• ENTRANCE SCORE – INDEPENDENT VARIABLE
• Entrance exam – X
• GPA- Y
• Entrance exam score – XTRAIN
• GPA- YTRAIN
• Sklearn is a model
• Train_test_split is a function of sklearn model
• train_test_split(x,y)
• X, Y indicates – entire x and y value (ie) xtrain,xtest,ytrain,ytest
• Size is .4 = 40% test data and 60% train data
• Random state=42( any value can be given for random state)
• This Random state helps to run the same coding in different
machine and split up the data properly (no changes happen
even coding run on different machines)
Regression
• Simple linear regression – strainght line(linear line)
• Polynomial regression – curved line (non linear line)

You might also like