Indian Institute of Technology, Kanpur: Applied Machine Learning
Indian Institute of Technology, Kanpur: Applied Machine Learning
IME 673
Programming Assignment 1
Linear Regression
Submitted To:
Prof. Veena Bansal
DIME, IIT Kanpur
Submitted By:
Shirsendu Samanta 18125048
Q1. Plotting and understanding the data
a) There are different ways of plotting the data. You could use scatter plot. You could
plot one crop, say coconut. Plot area on the x-axis and production on the y-axis. You
can ignore other dimensions of the data. What inferences do you make from this
plot? Do you need to normalize your data?
Answer a)
Scatter plot is used for visualization of the data. I have taken one crop “rice” for further
analysis.
Scatter plot of Rice –> Area ~ Production
From the plot we can infer that area and the amount of production is having a linear
relationship.
For better analysis I have normalize the data , but we can see that there is not much
difference between the scatterplot before and after normalization.
Parameter/Metric Value
Coefficient 0.861796
Intercept 0.009837285
MAE 0.26
MSE 0.30
R2 0.61