0% found this document useful (0 votes)
27 views2 pages

CSD 101

CSD

Uploaded by

Harshil Gupta
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
27 views2 pages

CSD 101

CSD

Uploaded by

Harshil Gupta
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Annexure 8

INTRODUCTION TO DATA SCIENCE

Course Code: CSD 101 Credit Units: 03


Total Hours: 45
Course Objective:
To provide a basic understanding of data science field and its implementation in Various Industries.

Course Contents:
Module I: Introduction : (5 Hours)

Introduction to Data Science, Definition and description of Data Science, history and development of Data Science,
terminologies related with Data Science, basic framework and architecture, difference between Data Science and
business analytics, importance of Data Science in today’s business world, primary components of Data Science,
users of Data Science and its hierarchy.

Module II: Data Science Project Management(8 Hours)


Data Science project framework, Stages in a Data Science Project ,execution flow of a Data Science project, various
components of Data Science projects, stakeholders of Data Science project, , challenges and scope of Data Science
project management, process evaluation model, comparison of Data Science project methods, improvement in
success of Data Science project models.

Module III: Mathematics behind Data Science: (12 Hours)


Role of mathematics in Data Science, importance of probability and statistics in Data Science, important types of
statistical measures in Data Science : Descriptive, Predictive and prescriptive statistics, introduction to statistical
inference and its usage in Data Science, application of statistical techniques in Data Science, Basics of probability,
permutation and combination, introduction to linear Regression model, mean, mode, median, Outliers, Leverage
points, Business Logics, Feature Engineering, bad data identification and correction.

Module IV: Computers in Data Science(9 Hours)


Role of computer science in Data Science, various components of computer science being used for Data Science,
role of relation data base systems in Data Science: SQL, NoSQL, role of data warehousing in Data Science, terms
related with data warehousing techniques, importance of operating concepts and memory management, various
freely available software tools used in Data Science : R, Python, important proprietary software tools, different
business intelligence tools and its crucial role in Data Science project presentation.

Module V: Applications of Data Science: (8 Hours)


Applications of Data Science in various fields. industry use cases of Data Science implementation General use
cases of data science in Finance-defaulter detection, E-Commerce-Recommendation Systems, Banking Industry-
Loan credibility System, Real Estate, and GIS Systems- optimal route founding (Olla, Uber)

Course Outcome:
Student are well acquainted with knowledge about Data Science and can do EDA Projects

Examination Scheme:

Components A CT S/V/Q/HA ESE


Weightage (%) 5 15 10 70

A: Attendance, CT: Class Test,:, S/V/Q/HA: Seminar/Viva/Quiz/ Home Assignment, EE: End Semester
Examination

Text & References:


Texts:

• Think Python by Allen B Downey


• Cathy O’Neil and Rachel Schutt. Doing Data Science, Straight Talk From The Frontline. O’Reilly. 2014.
• Avrim Blum, John Hopcroft and Ravindran Kannan. Foundations of Data Science.
Annexure 8

References:
• Jure Leskovek, Anand Rajaraman and Jeffrey Ullman. Mining of Massive Datasets. v2.1, Cambridge
University Press. 2014. (free online)
• Kevin P. Murphy. Machine Learning: A Probabilistic Perspective. ISBN 0262018020. 2013.
• Foster Provost and Tom Fawcett. Data Science for Business: What You Need to Know about Data Mining
and Data-analytic Thinking. ISBN 1449361323. 2013.
• Trevor Hastie, Robert Tibshirani and Jerome Friedman. Elements of Statistical Learning, Second Edition.
ISBN 0387952845. 2009. (free online)

You might also like