IU Master
IU Master
DATA SCIENCE
TABLE OF
CONTENTS
02. About IU 2
Online education is a fundamental disruption that will have a far-reaching impact. upGrad was founded
taking this into consideration. upGrad is an online education platform to help individuals develop their
professional potential in the most engaging learning environment.
Since its inception, upGrad has delivered over 20 million hours of learning, delivering programs by
collaborating with universities across the world including LJMU, IIT Madras, IIIT Bangalore and Deakin
Business School among others. And it doesn’t end there.
upGrad, in collaboration with IIIT Bangalore, a renowned university and International University of
Applied Sciences, Germany offering programs specialising in Data Science, Machine Learning and
Artificial Intelligence, is excited to offer a one-of-its-kind, academically rigorous and industrially relevant
Master's Degree in Data Science.
The faculty includes an average of 15+ years of experience. The faculty covers the conceptual depths of
topics such as Data Science, Machine Learning and AI, and Big Data Analytics. These will be
complemented by industry-relevant case studies from major industry verticals by industry leaders with
8+ years of experience from upGrad’s industry network.
Ronnie Screwvala
Co-founder & Executive Chairman
1
ABOUT
IU
>6,000
Business Cooperations
In today’s globalised world a simple degree is not enough. IU recognised this fact long ago, and therefore,
we have always striven to offer our students much more than a simple degree. With our innovative,
international, English-based Bachelor’s and Master’s programmes, our goal is to redefine the standards of
what it takes to be a manager. Through the in-depth, subject-specific knowledge taught by IU’s highly
experienced professors, and the practical experience they are exposed to during their studies, our
students gain a cutting edge advantage over public university students. With a network of over 6,000
reputable partner companies and organisations around the world, IU can make your career in a
competitive, global environment possible.
About us
2
PROGRAM
HIGHLIGHTS
Dual Accreditation and Alumni Status
Get certified by IIITB and International University of Applied Sciences (IU), Germany and
gain dual alumni status on successful completion of the program.
Customisable Curriculum
Choose from 6 specialisations at IIITB and 10 electives at IU on the basis of your
background and career aspirations and get the learning you want.
3
FACULTY AND
INDUSTRY EXPERTS
Chandrashekar Ramanathan
Dean Academics, IIITB
Tricha Anjali
Associate Dean, IIITB
Prof. S. Sadagopan
Director, IIITB
Anshuman Gupta
Director - Data Science, Pitney Bowes
He has a PhD (Dual) from Penn State University as well as a BTech Degree
from IIT Bombay.
4
Ujjyaini Mitra
Head of Analytics, Zee5
An alumna of McKinsey and Co., Flipkart, and Bharti Airtel with over 11
years of experience.
Ankit Jain
Sr. Research Scientist, Uber AI Labs
An alumnus of IIT Bombay, UCB and Harvard Business School with over 9
years of experience.
Sajan Kedia
Lead Data Scientist (Pricing), Myntra
5
Prof. G. Srinivasaraghavan
Professor, IIITB
S. Anand
CEO, Gramener
6
UPGRAD
LEARNING EXPERIENCE
Coaching Format
Dedicated Student Success Manager Online format with weekly live sessions from industry
experts to help with topic walk-throughs, doubt
Weekly real-time doubt clearing sessions resolution and personalised project feedback.
Mentorship
60+ live interactive sessions with
industry experts, fortnightly
personalised group (1:8) mentorship
sessions and a dedicated student
mentor for proactive mentoring.
7
NEW
ADDITIONS
8
INDUSTRY
PROJECTS
9
LEARNING
PATH
6
Choose any of the 6 Specialisations
25 weeks (with 6 weeks of Capstone)
Data Science Natural Language Deep Learning Business Analytics Business Intelligence/ Data Engineering
Generalist Processing Data Analyics
Tools: Python, Tools: Python, Tools: Python, Excel, Tools: Python, Tools: Python, Power Tools: Hadoop,
Tableau, SQL Excel TensorFlow mySQL, Excel BI, Excel, mySQL, HBase, Sqoop,
MongoDB, Shiny, Hive, Flume,
PySpark, Spark,
Airflow
Journey in
Description
Taught by one of the most renowned data scientists in the country (S.Anand, CEO, Gramener), this
module takes you from a beginner level Excel user to an almost professional user.
Description
This module covers concepts of the CRISP-DM framework for business problem-solving.
Module
• INTRODUCTION TO PYTHON - I
Description
Build a foundation for the most in-demand programming language of the 21st century.
• INTRODUCTION TO PYTHON - II
Description
Learn to apply some of the commonly used paradigms of functional programming in Python.
• PROGRAMMING IN PYTHON
Description
Learn how to approach and solve logical problems using programming.
Description
Data in companies is definitely not stored in excel sheets! Learn the fundamentals of database and
extract information from RDBMS using the structured query language.
11
• PYTHON FOR DATA SCIENCE
Description
Learn how to manipulate datasets in Python using Pandas which is the most powerful library for
data preparation and analysis.
• VISUALISATION IN PYTHON
Description
Humans are visual learners and hence no task related to data is complete without visualisation.
Learn to plot and interpret various graphs in Python and observe how they make data analysis
and drawing insights easier.
Description
Learn how to find and analyse the patterns in the data to draw actionable insights.
Description
Reinforce the concepts learnt in data science through this rigorous assignment involving the past.
Description
Build the mathematical foundation required for understanding the Machine Learning Algorithms.
• INFERENTIAL STATISTICS
Description
Build a strong statistical foundation and learn how to ‘infer’ insights from a huge population using a
small sample.
• HYPOTHESIS TESTING
Description
Understand how to formulate and validate hypotheses for a population to solve real-life business
problems.
• ADVANCED SQL
Description
Apply advanced SQL concepts like windowing and procedures to derive insights from data and
answer pertinent business questions.
Description
Solve a real industry problem through the concepts learnt in exploratory data analysis.
12
MACHINE LEARNING 3 ASSIGNMENTS
Module
Description
Venture into the machine learning community by learning how one variable can be predicted using
several other variables through a housing dataset where you will predict the prices of houses based
on various factors.
Description
Build a model to understand the factors on which the demand for bike sharing systems vary on and
help a company optimise its revenue.
• LOGISTIC REGRESSION
Description
Learn your first binary classification technique by determining which customers of a telecom
operator are likely to churn versus who are not to help the business retain customers.
Description
Learn how to group elements into different clusters when you don’t have any pre-defined labels to
segregate them through K-means clustering, hierarchical clustering, and more.
Description
Learn how to approach open ended real world problems using data as a lever to draw actionable
insights.
Description
Apply the machine learning concepts learnt to help an internation NGO cluster countries to
determine their overall development and plan for lagging countries.
Description
Help the Sales team of your company identify which leads are worth pursuing through this
classification case study.
13
SPECIALISATION 1: DATA SCIENCE GENERALIST
Module 3 ASSIGNMENTS
Description
Learn how the human decision making process can be replicated using a decision tree and other
powerful ensemble algorithms.
Description
Learn the pros and cons of simple and complex models and the different methods for quantifying
model complexity, along with general machine learning techniques like feature engineering,
model evaluation,and many more.
• ML LAB I: CLASSIFICATION
Description
Understand important concepts related to dimensionality reduction, the basic idea and the
learning algorithm of PCA, and its practical applications on supervised and unsupervised
problems.
• ADVANCED REGRESSION - I
Description
In this module, take a more advanced look at regression models and learn the concepts related
to regularisation.
Description
An introduction to the world of NLP and basic text processing skills. Learn how to build a
classification engine that works on (unstructured) textual data.
Description
Learn advanced visualisation techniques using the most in-demand visualisation tool in the industry.
14
• DATA STORYTELLING
Description
Learn how to effectively strategise, communicate, and fine grain your data analysis projects and
understand how to optimally present your findings to technical and non-technical stakeholders
and upgrade your storytelling skills.
Module 3 ASSIGNMENTS
• DATA MODELLING
Description
In this module, you will learn and use data modelling on a dataset to solve a business problem.
Description
Apply advanced SQL concepts like windowing and procedures to derive insights from data and
answer pertinent business questions
Description
Learn how to assess the efficiency your code using algorithm analysis techniques and learn to write
recursive algorithms
Description
Learn most fundamental searching and sorting algorithms and design techniques
Description
Learn user defined data structures -Stack, Queue, Trees in Python that help in advanced data
manipulation
• PYTHON - OOPS
Description
Learn OOP concepts such as Class, Object, Method, Inheritance, Polymorphism, Data Abstraction
and Encapsulation.
15
• PYTHON WEEKLONG LAB
Module 3 ASSIGNMENTS
• CAPSTONE PROJECT
Description
Solve an end-to-end real-life industry problem from a wide variety of domains. Make a video
presentation of your working demo to showcase in your portfolio.
Module
3 ASSIGNMENTS
• TREE MODELS
Description
Learn how the human decision making process can be replicated using a decision tree and other
powerful ensemble algorithms.
Description
Learn the pros and cons of simple and complex models and the different methods for quantifying
model complexity, alongwith general machine learning techniques like feature engineering,
model evaluation, and many more.
Description
Understand important concepts related to dimensionality reduction, the basic idea and the
learning algorithm of PCA, and its practical applications on supervised and unsupervised
problems.
• ADVANCED REGRESSION
Description
In this module, take a more advanced look at regression models and learn the concepts related to
regularisation.
Description
Build a regularised regression model to understand the most important variables to predict the
house prices in Australia.
16
• BAGGING AND BOOSTING
Description
Learn about ensemble modelling through bagging and boosting and understand how weak
algorithms can be transformed into stronger ones.
Description
In this module, you will learn how to analyse and forecast a series that varies with time.
Description
Solve the most crucial business problem for a leading telecom operator in India and southeast
Asia - predicting customer churn.
Description
Do you get annoyed by the constant spams in yor mail box? Wouldn’t it be nice if we had a
program to check your spellings? In this module learn how to build a spell checker & spam
detector using techniques like phonetic hashing,bag-of-words, TF-IDF, etc.
Description
This module will help you in understanding how to extract meaningful features from the processed
text data. Using these features you will be able to extract entities, classify POS tags, generating
similarity score between two question strings.
Description
To perform a sentiment analysis on product reviews from Amazon using NLP & Machine learning.
This assignemnt will be focused to give you a business understanding on how to do product
optimisation using NLP.
• INTRO TO DL
Description
Learn the most sophisticated and cutting-edge technique in machine learning - Artificial Neural
Networks & how to apply Deep learning for NLP.
17
• NLP INDUSTRIAL APPLICATIONS
Description
Learn how to use NLP with Neural networks for different industrial applications like text
classification, Question pair similarity, Text generation, Topic modelling.
Description
Imagine if you could make a restaurant booking without opening Zomato. Build your own restaurant
search chatbot with the help of RASA - an open source framework and deploy it on Slack.
CAPSTONE
Module
• DEPLOYMENT
2 ASSIGNMENTS
Description
Learn how to productionise your model and deploy it on the server. 3 ASSIGNMENTS
• CAPSTONE
Description
Choose from a range of real-world industry woven projects on advanced topics like
Recommendation Systems, Fraud Detection, Emotion Detection from faces, Social Media
Listening, Speech Recognition among many others.
Module
• TREE MODELS
Description
Learn how the human decision making process can be replicated using a decision tree and other
powerful ensemble algorithms.
Description
Learn the pros and cons of simple and complex models and the different methods for quantifying
model complexity, along with general machine learning techniques like feature engineering,
model evaluation, and many more.
18
• PRINCIPAL COMPONENT ANALYSIS
Description
Understand important concepts related to dimensionality reduction, the basic idea and the learning
algorithm of PCA, and its practical applications on supervised and unsupervised problems.
• ADVANCED REGRESSION
Description
In this module, take a more advanced look at regression models and learn the concepts related to
regularisation.
Description
Build a regularised regression model to understand the most important variables to predict the
house prices in Australia.
2 ASSIGNMENTS
• BAGGING AND BOOSTING
Description 3 ASSIGNMENTS
Learn about ensemble modelling through bagging and boosting and understand how weak
algorithms can be transformed into stronger ones.
Description
In this module, you will learn how to analyse and forecast a series that varies with time.
Description
Solve the most crucial business problem for a leading telecom operator in India and southeast
Asia - predicting customer churn.
Module
Description
Learn the most sophisticated and cutting-edge technique in machine learning - Artificial
Neural Networks or ANNs.
19
• NEURAL NETWORKS ASSIGNMENT
Description
Build a neural network from scratch in Numpy to identify handwritten digits.
Description
Learn the basics of CNN and OpenCV and apply it to Computer Vision tasks like detecting
anomalies in chest X-Ray scans, vehicle detection to count & categorise them to help the
government ascertain the width and strength of the road.
Description
Ever wondered what goes behind machine translation, sentiment analysis, speech recognition?
Learn how RNN helps in these areas having sequential data like text, speech, videos, and a lot
more.
• GESTURE RECOGNITION
Description
Make a Smart TV system which can control the TV with user’s hand gestures as the remote control.
CAPSTONE
Module
• DEPLOYMENT
Description
Learn how to productionise your model and deploy it on the server.
• CAPSTONE
Description
Choose from a range of real-world industry woven projects on advanced topics like
Recommendation Systems, Fraud Detection, Emotion Detection from faces, Social Media
Listening, Speech Recognition among many others.
2 ASSIGNMENTS
20
SPECIALISATION 4: BUSINESS ANALYTICS
Module
• TREE MODELS
Description
Learn how the human decision making process can be replicated using a decision tree and other
powerful ensemble algorithms.
Description
In this module, you will learn how to analyse and forecast a series that varies with time.
Description
Apply the concepts learnt in time series to solve a forecasting problem for a retail giant.
Description
Learn the pros and cons of simple and complex models and the different methods for quantifying
model complexity, along with general machine learning techniques like feature engineering,
model evaluation, and many more.
Description
Learn how to write optimised SQL query that require less memory and execute in lesser amount
of time.
• ADVANCED EXCEL
Description
Learn the advanced concepts in Excel and start to perform data analysis like a pro!
Description
Solve the most crucial business problem for a leading telecom operator in India and southeast
Asia - predicting customer churn.
21
BUSINESS REQUIREMENTS 2 ASSIGNMENTS
Module
Description
Learn how to attack a business problem using various structured frameworks like 5W, 5WHYs, and
SPIN.
Description
Apply your learnings from the course to solve a real-life business problem.
• OPERATIONS RESEARCH
Description
Learn about the world of operations research through linear and integer optimisations.
• DATA STORYTELLING
Description
Learn how to effectively strategise, communicate, and fine grain your data analysis projects and
understand how to optimally present your findings to technical and non-technical stakeholders
and upgrade your storytelling skills.
Description
Understand how a project in the industry is taken up and solved through a comprehensive
business case study.
Module
• CAPSTONE PROJECT
Description
Solve an end-to-end real-life industry problem from a wide variety of domains. Available capstone
project choices -
(i) Stock Analysis & Portfolio Management
(ii) E-Commerce & Marketing
(iii) Heatlhcare
(iv) Supply Chain Optimisation
(v) Credit Card Fraud Detection
22
SPECIALISATION 5: BUSINESS INTELLIGENCE / DATA ANALYTICS
Module
• DATA MODELLING
Description
In this module, you will learn and use data modelling on a dataset to solve a business problem.
Description
Learn how to write optimised SQL query that require less memory and execute in lesser amount
of time.
Description
In this assignment, you will work on a movies dataset using SQL to extract exciting insights.
• ADVANCED EXCEL
Description
Learn the advanced concepts in Excel and start to perform data analysis like a pro!
Description
Take your knowledge of query languages a step further by learning about MongoDB - a NoSQL
database which is becoming more and more popular in the industry.
Description
Understand the basics of big data and cloud and learn to work with an EMR cluster on a cloud
based service.
Description
In this module, you learn about the architecture and features of the Hive Query Language.
Description
Understand how a project in the industry is taken up and solved through a comprehensive
business case study.
23
STORYTELLING WITH ADVANCED VISUALISATIONS 2 ASSIGNMENTS
Module
Description
Learn advanced visualisation techniques using the most in-demand visualisation tool in the industry.
Description
Apply the new found Excel and Tableau skills to solve an exciting business assignment.
Description
Take your visualisation game a step forward by understanding how to operate PowerBI.
Description
Get a brief introduction to another popular open-sourced visualisation library in Python and
learn to code and create powerful, pretty, and interactive visualisations.
• DATA STORYTELLING
Description
Learn how to effectively strategise, communicate, and fine grain your data analysis projects and
understand how to optimally present your findings to technical and non-technical stakeholders
and upgrade your storytelling skills.
Description
Understand how a project in the industry is taken up and solved through a comprehensive
business case study.
Module
24
• CAPSTONE PROJECT
Description
Solve an end-to-end real-life industry problem from a wide variety of domains. Available capstone
project choices -
(i) Web & Social Media Analytics
(ii) Finance and Risk Analytics
(iii) Marketing and Retail Analytics
(iv) Supply Chain Analytics
(v) Fraud Analytics
Module
Description
This module you will learn what big data is, its various characteristics, and its determining factors.
You will also get an idea of the various sources of big data and the wide range of big data
applications in different industries such as retail, healthcare, and finance.
Description
Understand what is cloud and setup your AWS account which will be required during the program.
Description
Understand the world of distributed data processing and storage with Hadoop. Learn to write
MapReduce jobs in Python.
Description
Practise MapReduce Programming on a Big Dataset.
Description
Understand the concepts of Data Management and learn to model data from a Relational
Database.
25
• NOSQL DATABASES AND APACHE HBASE NOSQL DATABASES AND MONGODB
(OPTIONAL)
Description
Learn the concepts of NoSQL databases. Understand the working of Apache HBase.
Description
Understand the intricacies behind designing a data warehouse and a data lake for use case/s.
Description
Get familiar with the challenges involed in data ingestion. Use Sqoop and Flume to ingest
structured and unstructured data into Hadoop.
Description
Manage and query a data warehouse with Apache Hive. Learn to write optimised HQL for large
scale data analysis.
Description
Use HQL to analyse a Big Dataset
• AMAZON REDSHIFT
Description
Learn to deploy a Redshift cluster and use it for querying data.
Description
Get introduced to Apache Spark, a lightning fast big data processing engine.
Description
Practise Apache Spark and its core libraries on the NYC Parking Ticket dataset.
Description
Make use of Sqoop, Redshift & Spark to design an ETL data pipeline.
26
DATA ENGINEERING - II 3 ASSIGNMENTS (1 MANDATORY, 2 OPTIONAL)
Module
Description
Use PySpark to create large scale data processing applications.
• APACHE FLINK(OPTIONAL)
Description
Get Introduced to Apache Flink and learn query batch data. Use DataStream API to create a stream
processing application.
Description
Understand the producer-consumer architecture of Apache Kafka. Learn to set up a Kafka cluster
for managing real-time data.
Description
Learn about the real-time data processing architecture of Apache Spark. Build Spark Streaming
applications to process data in real-time.
Description
This assignment revolves around building Spark structured streaming application to processing
stock data in real-time.
Description
Automate Data Pipelines with Airflow.
Description
Use PySpark to do EDA and Predictive Analysis using Spark’s ML library.
Description
An assignment related to a classification based problem statement.
27
• PROJECT: REAL TIME DATA PROCESSING
Description
Build an end-to-end real-time data processing application using Spark Streaming and Kafka.
Module
• CAPSTONE PROJECT
Description
The capstone project will stitch all the components of data engineering together.
Module ECTS
• Electives B 10
Module ECTS
ELECTIVE B OPTIONS
Module
• Management
28
• Consumer Behavior and Research
• Corporate Finance
• Cognitive Computing
29
MEET
THE CLASS
5% Healthcare
5% E-Commerce
1% Manufacturing
1% Telecom
1% Education
15% Other
3% Retail
1% Consulting
WORK EXPERIENCE
30
HEAR FROM
OUR LEARNERS
31
ADMISSION
PROCESS
SELECTION PROCESS
STEP 1: Selection Test STEP 2: Review and Shortlisting of STEP 3: Enrollment for Access
Suitable Candidates to Prep Content
Fill out an application and take a Our faculty will review all applications, Make a quick block payment
short 17-minute online test with considering the educational and with assistance from our loan
few questions. professional background of an partners where required,
applicant and review the test scores receive immediate access to
where applicable. Following this, the prep content and begin
your upGrad journey.
assured a great peer group to learn
and network with.