Fundementalsof Data Science
Fundementalsof Data Science
Course Description:
Pre-requisites : None
Alternate Exposure : None
This course is designed as a window to the field of Data Science. Students will get a bird’s
eye view of the technology and process involved in using Data Science for meaningful
inferences. They can also gain enough knowledge to understand the machine learning
process for data science and complete a hands-on project using standard data sets.
Course Objectives:
● Provide a basic foundation for data science and application areas related to it.
● Understand the underlying core concepts and emerging technologies in data science.
● Explore the concepts of data preprocessing, model development and evaluation and
tuning.
What is Data Science?, Fundamentals of Data Science, The many paths to Data Science,
Data Science Topics and Algorithms, Cloud for Data Science; Foundations of Big Data,
What is Hadoop?, How Big Data is driving Digital Transformation, Data Science Skills and
Big Data, Neural Networks and Deep Learning, Applications of Machine Learning; How
should Companies Get Started in Data Science?, Applications of Data Science, How can
someone become a Data Scientist?, Recruiting for Data Science.
Learning Outcomes:
Understanding Data, Python Packages for Data Science, Importing and Exporting Data in
Python, Analyzing Data with Python, Accessing Databases with Python.
Learning Outcomes:
Python for - Pre-processing Data, Dealing with Missing Values, Data Formatting, Data
Normalization, Binning, Turning categorical variables into quantitative variables;
Exploratory Data Analysis, Descriptive Statistics, Groupby in Python, Correlation,
Correlation - Statistics, Association between two categorical variables: Chi-Square.
Learning Outcomes:
Model Development, Linear Regression and Multiple Linear Regression, Model Evaluation
Learning Outcomes:
Model Evaluation and Refinement, Overfitting, Underfitting and Model Selection, Ridge
Regression Introduction, Grid Search
Learning Outcomes:
Textbooks:
1. Introducing Data Science, Davy Cielen, Arno D. B. Meysman and Mohamed Ali,
Manning Publications,2016.
2. Think Like a Data Scientist, Brian Godsey, Manning Publications, 2017.
References:
1. https://www.coursera.org/learn/what-is-datascience#about
2. https://www.coursera.org/learn/data-analysis-with-python/home/info
3. Data Science from Scratch: First Principles with Python, Joel Grus, O’Reilly, 1st
edition, 2015.
4. Doing Data Science, Straight Talk from the Frontline, Cathy O'Neil, Rachel Schutt,
O’ Reilly, 1st edition, 2013
Course Outcomes:
Upon successful completion of the course, students will be able to:
● Understand the fundamental concepts of data science.
● Evaluate the data analysis techniques for applications handling large data
● Experiment with the data science process.
● Apply the concept of machine learning in the data science process.
● Visualize and present model inference using various tools
APPROVED IN:
SDG Justification: