Introduction To Data Science Syllabus
20CSH402 Honors/Minor Courses
3 1 0 4
INTRODUCTION TO DATA SCIENCE
Course Objectives:
i. The course teaches critical concepts and skills in computer programming and statistical inference, in conjunction with hands-on analysis of real-world datasets, including economic data, document collections, geographical data, and social networks.
ii. It delves into social issues surrounding data analysis, such as privacy and design.
Course Outcomes:
i. Apply dimensionality reduction tools such as principal component analysis.
ii. Evaluate outcomes and make decisions based on data.
iii. Understand how to use exploratory tools such as clustering and visualization tools to analyze data.
iv. Perform basic analysis of network data.
UNIT – I
Introduction: Introduction to Data Science – Evolution of Data Science – Data Science Roles – Stages in a Data Science Project – Applications of Data Science in various fields – Data Security Issues.
UNIT – II
Data Collection and Data Pre-Processing: Data Collection Strategies – Data Pre-Processing Overview – Data Cleaning – Data Integration and Transformation – Data Reduction – Data Discretization.
UNIT – III
Exploratory Data Analytics: Descriptive Statistics – Mean, Standard Deviation, Skewness and Kurtosis – Box Plots – Pivot Table – Heat Map – Correlation Statistics – ANOVA.
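As an illustration of the descriptive statistics listed in this unit, the following minimal sketch computes the mean, sample standard deviation, moment-based skewness, excess kurtosis, and a Pearson correlation using only the Python standard library. The function names and the sample values are hypothetical and are not prescribed by the syllabus.

```python
import math
import statistics


def describe(values):
    """Return mean, sample standard deviation, skewness, and excess kurtosis."""
    n = len(values)
    mean = statistics.fmean(values)
    std = statistics.stdev(values)  # sample standard deviation (divides by n - 1)
    # Population central moments, used for the moment-based shape measures.
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    m4 = sum((x - mean) ** 4 for x in values) / n
    return {
        "mean": mean,
        "std": std,
        "skewness": m3 / m2 ** 1.5,
        "excess_kurtosis": m4 / m2 ** 2 - 3.0,
    }


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    mean_x = statistics.fmean(xs)
    mean_y = statistics.fmean(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical sample data, for illustration only.
scores = [2, 4, 4, 4, 5, 5, 7, 9]
summary = describe(scores)
print(summary)
print(pearson([1, 2, 3, 4], [2, 4, 6, 8]))
```

A positive skewness indicates a longer right tail, and a negative excess kurtosis indicates lighter tails than a normal distribution.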
UNIT – IV
Model Development: Simple and Multiple Regression – Model Evaluation using Visualization – Residual Plot – Distribution Plot – Polynomial Regression and Pipelines – Measures for In-sample Evaluation – Prediction and Decision Making.
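To make the simple regression and in-sample evaluation topics concrete, here is a minimal sketch of a least-squares line fit, its residuals (the quantities a residual plot would show), and R-squared as one in-sample measure. It uses plain Python; the helper names and the data points are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def residuals(xs, ys, slope, intercept):
    """Observed minus predicted values for each data point."""
    return [y - (intercept + slope * x) for x, y in zip(xs, ys)]


def r_squared(xs, ys, slope, intercept):
    """Share of the variance in ys explained by the fitted line."""
    mean_y = sum(ys) / len(ys)
    ss_res = sum(r ** 2 for r in residuals(xs, ys, slope, intercept))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1.0 - ss_res / ss_tot


# Hypothetical data, for illustration only.
xs = [1, 2, 3, 4, 5]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]

slope, intercept = fit_line(xs, ys)
print("slope:", slope, "intercept:", intercept)
print("residuals:", residuals(xs, ys, slope, intercept))
print("R^2:", r_squared(xs, ys, slope, intercept))
print("prediction at x = 6:", intercept + slope * 6)
```

Because R-squared here is computed on the same points used for fitting, it says nothing about how the model behaves on unseen data.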
UNIT – V
Model Evaluation: Generalization Error – Out-of-Sample Evaluation Metrics – Cross Validation – Overfitting – Underfitting and Model Selection – Prediction by using Ridge Regression – Testing Multiple Parameters by using Grid Search.
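The cross-validation, ridge regression, and grid search topics can be sketched together as follows. This is a simplified, single-feature illustration in plain Python, assuming an unpenalized intercept and a deterministic fold assignment by index; the function names, the candidate penalty values, and the data are all hypothetical.

```python
def fit_ridge(xs, ys, alpha):
    """Single-feature ridge fit: the slope is shrunk by alpha, the intercept is not penalized."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / (sxx + alpha)
    intercept = mean_y - slope * mean_x
    return slope, intercept


def mse(xs, ys, slope, intercept):
    """Mean squared prediction error."""
    return sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys)) / len(xs)


def cross_val_mse(xs, ys, alpha, k=5):
    """Average out-of-sample MSE over k folds (point i goes to fold i % k)."""
    fold_errors = []
    for fold in range(k):
        train_x = [x for i, x in enumerate(xs) if i % k != fold]
        train_y = [y for i, y in enumerate(ys) if i % k != fold]
        val_x = [x for i, x in enumerate(xs) if i % k == fold]
        val_y = [y for i, y in enumerate(ys) if i % k == fold]
        slope, intercept = fit_ridge(train_x, train_y, alpha)
        fold_errors.append(mse(val_x, val_y, slope, intercept))
    return sum(fold_errors) / k


def grid_search(xs, ys, alphas, k=5):
    """Return the alpha with the lowest cross-validated MSE, plus all scores."""
    scores = {alpha: cross_val_mse(xs, ys, alpha, k) for alpha in alphas}
    best_alpha = min(scores, key=scores.get)
    return best_alpha, scores


# Hypothetical data: y = 1 + 2x plus small fixed noise, for illustration only.
noise = [0.3, -0.2, 0.1, -0.4, 0.2, 0.0, -0.1, 0.3, -0.3, 0.1]
xs = list(range(10))
ys = [1 + 2 * x + e for x, e in zip(xs, noise)]

alphas = [0.0, 0.1, 1.0, 10.0, 100.0]
best_alpha, scores = grid_search(xs, ys, alphas)
print("cross-validated MSE per alpha:", scores)
print("selected alpha:", best_alpha)
```

Each candidate penalty is scored only on points held out from fitting, which is what makes the comparison an estimate of generalization error rather than in-sample fit.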
Text Books:
1. Data Science for Beginners, by Andrew Park.
2. The Art of Data Science: A Guide for Anyone Who Works with Data, by Roger D. Peng and Elizabeth Matsui.
References:
1. Jojo Moolayil, “Smarter Decisions: The Intersection of IoT and Data Science”, PACKT, 2016.
2. Cathy O’Neil and Rachel Schutt, “Doing Data Science”, O’Reilly, 2015.
3. David Dietrich, Barry Heller, Beibei Yang, “Data Science and Big Data Analytics”, EMC, 2013.
4. Raj, Pethuru, “Handbook of Research on Cloud Infrastructures for Big Data Analytics”, IGI Global.