Unit 3
Unit 3
UNIT-3
Dr Rashmi Popli
Associate Professor
Department of Computer Engineering
2. Dataset: Collect a dataset that includes information about houses, such as:
• Number of bedrooms
• Square footage
• Neighborhood
• Distance to public amenities
• Previous sale prices (target variable)
Boosting improves accuracy by sequentially training models and giving more importance
to misclassified data.
Dr Rashmi Popli,Associate Professor,Department of CE
Boosting (Ada boost)
• For boosting-we build our first bag of data with select randomly from
training data and train model in a usual way.
• Next is take all our training data and use it to test the model.
• We will discover that some of the points are not well predicted
(significant error).
• For second bag we choose randomly data again but each instance is
weighted according to this error.
• Now we test our system altogether and combine their outputs and
again we measure error across all this data.
• Thus we build next bag and so on..