IDS Syllabus
IDS Syllabus
Data Science is an interdisciplinary, problem-solving oriented subject that is used to apply scientific techniques
to practical problems. The course orients on preparation of datasets and programming of data analysis tasks. This
course covers the topics: Set Theory, Probability theory, Tools for data science, ML algorithms and demonstration
of experiments either by using MS-Excel/Python/R.
After the completion of the course, the student will be able to:
CO6 Use statistical principles to solve mean and standard deviations for given data. 1 to 4, 12 1,2, 3
Bloom’s Level
CO# Apply Analyze Evaluate Create
Remember(L1) Understand(L2)
(L3) (L4) (L5) (L6)
CO1
CO2
CO3
CO4
CO5
CO6
COURSE ARTICULATIONMATRIX:
PO10
PO11
PO12
CO#/
PSO1
PSO2
PSO3
PO1
PO2
PO3
PO4
PO5
PO6
PO7
PO8
PO9
POs
CO1 3 2 2 2 2 3 1 1
CO2 2 3 2 1 2 2 2 3 2 2
CO3 2 3 3 2 2 3 3 3
CO4 3 3 3 2 2 2 2 2 3 3 3
CO5 2 3 2 2 2 3 3 3
CO6 3 3 2 2 2 3 3 3
Note: 1-Low, 2-Medium, 3-High
COURSE CONTENT
THEORY
Contents
UNIT –
1
Introduction to Microsoft Excel:
History and importance of Microsoft Excel, Creating Excel tables, understand how to Add, Subtract, Multiply,
Divide in Excel. Excel Data Validation, Sorting, Filtering, Grouping, Ungrouping and Subtotal. Introduction to
formulas and functions in Excel. Logical functions (operators) and conditions. Visualizing data using charts
in Excel. Import XML Data into Excel, How to Import CSV Data (Text) into Excel, How to Import MS Access
Data into Excel, Working with Multiple Worksheets.
UNIT – 2
Introduction to Data Science:
What is Data Science? Applications of Data Science, Data science life cycle, Tools for data science, definition of
AI, types of machine learning (ML), list of ML algorithms for classification, clustering, and feature selection.
Probability theory, bayes theorem, bayes probability; Cartesian plane, equations of lines, graphs; exponents.
Introduction to SQL: SQL Commands experimental demonstrations-DDL, DML, DCL, TCL, DQL. Import SQL
Database Data into Excel.
UNIT – 3
D Data Relationship Methods:
Introduction to Correlation, Description of linear regression and Logistic Regression, Introducing the Gaussian,
Introduction to Standardization, Standard Normal Probability Distribution in Excel, Calculating Probabilities
from Z-scores, Central Limit Theorem, Algebra with Gaussians, Markowitz Portfolio Optimization,
Standardizing x and y Coordinates for Linear Regression, Standardization Simplifies Linear Regression,
Modeling Error in Linear Regression, Information Gain from Linear Regression.
.
UNIT – 4
Data visualization using scatter plots, charts, graphs, histograms, and maps: Statistical Analysis:
Descriptive statistics- Mean, Standard Deviation for Continuous Data, Frequency, Percentage for
Categorical Data.
Introduction to Python: Python basics, Strings, Lists, Tuples, Sets, Dictionaries. Introduction to python libraries
- Numpy, Matplotlib, Pandas, Scikit-Learn, Implementation of ML.
TEXT BOOKS:
REFERENCE BOOKS:
1. B.V. Ramana, “Higher Engineering Mathematics”, 19th edition, Tata McGraw Hill Publications, 2013.
2. ErwinKreyszig, “Advanced Engineering Mathematics”, 9th edition, Wiley Publications, 2013.
3. Seymour Lipschutz, John J. Schiller, “Schaum's Outline of Introduction to Probability and Statistics”,
McGraw Hill Professional, 1998.
JOURNALS/MAGAZINES:
1. https://www.journals.elsevier.com/computational-statistics-and-data-analysis
2. https://www.springer.com/journal/41060International Journal on Data Science and Analytics
3. https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=8254253IEEE Magazine on Big data and
Analytics
SWAYAMNPTEL/MOOCs
SELF-LEARNINGEXERCISES:
1. Relational database management system.
2. Advanced MS-Excel