BDDA - Course Outline

Uploaded by

Aru Ranjan

FORE School of Management, New Delhi

Programme: PGDM (FMG-30, IMG-15, FM-04 & BDA-02)

Course Name: Big Data and Data Analytics for Managers (Using Python)
Credit: 3.0
Term: 4
Academic Year: 2022-2023
Faculty: Prof. Ashok Kumar Harnal; Mr. Anuj Saini (20 hrs, for BDA-02)
Office Contact No.: 8750893093
Email: [email protected]

Introduction

This course has two objectives: first, to build up students’ project profiles on Kaggle/GitHub using important
techniques, and second, to analyze big data on Spark, a unified platform for data analytics. We begin by covering
two important machine learning techniques that are often used in the data analytics community. Learning to
optimize hyperparameters, especially when there are many of them, is essential in any model-building
exercise. We briefly cover Hadoop, a big-data storage platform, and then move on to analyzing data on this
platform using Spark. We also cover streaming analytics, that is, analyzing data in motion. Streaming analytics has
numerous applications (for example, in social media analytics), and a number of business models (for example,
those of Uber or of smart cities) are built entirely on streaming technologies. This course assumes basic prior
working knowledge of two Python libraries, pandas and NumPy. This is a project-oriented Big Data course with
Python as the primary language.
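
To give a flavour of what "analyzing data in motion" means, here is a minimal sketch in plain Python. It is not Spark or Kafka code and is not taken from the course material. The function name `tumbling_window_counts`, the `(timestamp, key)` event format, and the 10-second window are all assumptions made for illustration. The idea shown is that events are summarized window by window as they arrive, rather than being stored first and analyzed later.

```python
from collections import Counter
from typing import Hashable, Iterable, Iterator, Tuple


def tumbling_window_counts(
    events: Iterable[Tuple[int, Hashable]], window_seconds: int
) -> Iterator[Tuple[int, Counter]]:
    """Count keys per fixed-size (tumbling) time window.

    Hypothetical helper: `events` is assumed to be an iterable of
    (timestamp_in_seconds, key) pairs arriving in time order.
    Windows that receive no events are skipped.
    """
    current_start = None
    counts: Counter = Counter()
    for timestamp, key in events:
        # Align the timestamp to the start of its window.
        start = timestamp - (timestamp % window_seconds)
        if current_start is None:
            current_start = start
        if start != current_start:
            # The previous window is complete: emit it and start a new one.
            yield current_start, counts
            current_start, counts = start, Counter()
        counts[key] += 1
    if current_start is not None:
        # Emit the final, possibly partial, window.
        yield current_start, counts


if __name__ == "__main__":
    # Made-up sample events: (seconds since start, hashtag).
    sample = [(0, "a"), (3, "b"), (9, "a"), (10, "a"), (25, "b")]
    for window_start, window_counts in tumbling_window_counts(sample, 10):
        print(window_start, dict(window_counts))
```

Because the function is a generator, each window's summary becomes available as soon as the window closes. A streaming engine offers the same behaviour at scale, along with fault tolerance and handling of late data.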

Students are expected to have laptops with a minimum of 8 GB of RAM and are strongly advised to upgrade to 16 GB.
Windows 10, macOS, or Ubuntu is acceptable as the operating system.

Objectives: Broadly speaking, the course’s objectives are two-fold:

1. Executing real-world projects on Kaggle to build up students’ project profiles.

2. Learning techniques to handle Big Data and streaming data.

Text Book:

1. Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd Edition (2019), by Aurélien
Géron

Reference Book:

1. Feature Engineering for Machine Learning: Principles and Techniques for Data Scientists, by Alice
Zheng and Amanda Casari
2. Spark: The Definitive Guide, by Bill Chambers and Matei Zaharia
3. Hadoop: The Definitive Guide, by Tom White

Course Pedagogy: This is a project-based, lab-oriented course. For every topic there is a project. Students are
first exposed to a problem; they then study the data and learn the techniques and tools needed to solve it; finally,
a model is built and the solution presented. For projects on Big Data and streaming analytics, we will
use virtual machines.

Evaluation Components:

Team Project: 20%
Class Participation: 10%
Quiz: 10%
Mid Term: 20%
End Term: 40%
TOTAL: 100 marks

Session Plan: (Each session is 90 minutes long unless otherwise specified)

Session(s)  Topic/Session Theme              Project(s)                        Learning outcomes
                                             (see note on data sources below)

1           Data pipelining                  Practical                         Learn smooth processing of data through
                                                                               pipelines

2-3         Using pipes in modeling,         Otto project from Kaggle          Learn to use pipelines in any predictive
            stacking classifiers                                               modeling project

3-4         Structure in data: t-SNE and     Otto project from Kaggle          Learn how to discover whether data has
            UMAP                                                               some structure or is mostly random

5-6         Bayesian hyperparameter          Kaggle project: Satellite images  Learn how to tune hyperparameters for
            optimization                     discrimination                    best possible predictive modeling

            15-minute online quiz

7-8         CatBoost: Experiments with       Kaggle project: BNP Paribas       Learn to model data with a large number
                                             Cardif Claims Management          of categorical features

9           Learning Hadoop                  Simple experiments on Hadoop      Learn the challenges in Big Data storage

10          Spark basics                     Experiment with Spark DataFrames  Understand Spark as a unified platform
                                                                               for predictive modeling

11-12       Machine learning with Spark (I)  Classification                    -do-

            15-minute online quiz

13-14       Machine learning with Spark (II) Bike-sharing dataset:             -do-
                                             regression problem

15          Spark streaming                  Analyzing streaming data

16-17       Kafka: streaming analytics       Simple experiments with Kafka     Learn the basics of streaming analytics

18-19       Spark-Kafka data pipeline        Analyzing streaming data over a   Develop a simple pipeline for streaming
                                             pipeline                          data

19-20       Students’ project work in class

For official use:
As benchmarked with course content in previous year, the contents of this course: (Please mark the
right option below)
(a) Is totally new
(b) Has not changed at all
(c) Has undergone less than/equal to 20% change
(d) Has undergone more than 20% change

Faculty: Prof. Ashok Harnal
Area Chair: Prof. Shilpi Jain

Manager (Academics-1)

Dean (Academics)
