CS3362 DS
CS3362 DS
COURSE OBJECTIVES
To understand the python libraries for data science
To understand the basic Statistical and Probability measures for data science
To learn descriptive analytics on the benchmark data sets
To apply correlation and regression analytics on standard data sets
To present and interpret data using visualization packages in Python.
LIST OF EXPERIMENTS:
1. Download, install and explore the features of NumPy, SciPy, Jupyter, Statsmodels and
Pandas packages.
4. Reading data from text files, Excel and the web and exploring various commands for
doing descriptive analytics on the Iris data
5. Use the diabetes data set from UCI and Pima Indians Diabetes data set for performing
the following:
a. Univariate analysis: Frequency, Mean, Median, Mode, Variance, Standard Deviation,
Skewness and Kurtosis.
b. Bivariate analysis: Linear and logistic regression modeling
c. Multiple Regression analysis
d. Also compare the results of the above analysis for the two data sets.