0% found this document useful (0 votes)

286 views13 pages

Big Data Analytics

This document outlines a 6-month course plan for teaching Big Data Analytics, with the objective of providing employable skills in areas like machine learning, Apache Spark, and scaling techniques. The course will be taught over 26 weeks for 25 hours per week, with a focus on hands-on learning. Upon completion, students will gain abilities relevant to jobs in the high-demand big data field, such as data architect or engineer roles.

Uploaded by

star

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

286 views13 pages

Big Data Analytics

Uploaded by

star

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

Government of Pakistan

National Vocational and Technical Training Commission

Prime Minister Hunarmand Pakistan Program,

"Skills for All"

Course Contents/ Lesson Plan

Course Title: Big Data Analytics
Duration: 6 Months

Plot no. 38, Kirthar Road, H-9 Islamabad

051-9044250
Trainer Name Dr. Asif Jamshed

Course Title Big Data Analytics

Objective of Course Employable skills and hands on practice for Web Development, Graphic
Designing and Mobile App Development

The main goal of this course is to help students learn, understand, and
practice big data analytics and machine learning approaches, which
include the study of modern computing big data technologies and scaling
up machine learning techniques focusing on industry applications. Mainly
the course objectives are: conceptualization and summarization of big
data and machine learning, trivial data versus big data, big data
computing technologies, machine learning techniques, and scaling up
machine learning approaches.
The students learning outcomes are designed to specify what the
Learning Outcome of the students will be able to perform after completion of the course:
Course  Ability to identify the characteristics of datasets and compare the
trivial data and big data for various applications.
 Ability to select and implement machine learning techniques and
computing environment that are suitable for the applications
under consideration.
 Ability to solve problems associated with batch learning and
online learning, and the big data characteristics such as high
dimensionality, dynamically growing data and in particular
scalability issues.
 Ability to understand and apply scaling up machine learning
techniques and associated computing techniques and
technologies.
 Ability to recognize and implement various ways of selecting
suitable model parameters for different machine learning
techniques.
 Ability to integrate machine learning libraries and mathematical
and statistical tools with modern technologies like Apache Spark.
Course Execution Plan Total Duration of Course: 6 Months (26 Weeks)
Class Hours: 5 Hours per day
Theory: 20% Practical: 80%
Weekly Hours: 25 Hours Per week
Total Contact Hours: 650 Hours

Plot no. 38, Kirthar Road, H-9 Islamabad

051-9044250
1. Upwork
Companies Offering Jobs in 2. Freelancer
the respective trade 3. Fiverr
4. Government Institutes
5. Software Houses
6. Companies all over the world are offering its jobs as they want to
know the trends of market
Upskilling in Big Data and Analytics field is a smart career decision.
Job Opportunities According to Allied Market Research, the globalmarket of only
Hadoop/Spark will reach $84.6 Billion by 2021 and there is a shortage of
1.4-1.9 million Hadoop/Spark data analysts in the U.S. alone. Here is
selection of specialist opportunities in your area:
 Big Data Architect (Average Salary: 124000$ / Annum)
 Big Data Engineer (Average Salary: 117000$ / Annum)
 Big Data Developer (Average Salary: 88500$ / Annum)

No of Students 25

Learning Place Classroom / Lab

Instructional Resources Development Platform:

 https://github.com/ ,
 https://spark.apache.org/,
 https://www.edureka.co/apache-spark-scala-certification-
training,
 https://www.youtube.com/watch?v=iP1wOSsKjW8&list=PLS1Qul
Wo1RIahlYDqHWZb81qsKgEvPiHn,
 https://stackoverflow.com/

Learning Material:
 https://spark.apache.org/docs/latest/api/python/index.htmlhttps
://www.youtube.com/watch?v=9mELEARcxJo&list=PL9ooVrP1hQ
OGyFc60sExNX1qBWJyV5IMb
 https://www.youtube.com/watch?v=Uct_EbThV1E&list=PLZ7s-
Z1aAtmIbaEj_PtUqkqdmI1k7libK
 https://www.edureka.co/apache-spark-scala-certification-training
 https://www.youtube.com/watch?v=wjfeGxqAQOY&list=PLrjkTql
3jnm-CLxHftqLgkrZbM8fUt0vn
 https://www.youtube.com/watch?v=iP1wOSsKjW8&list=PLS1Qul
Wo1RIahlYDqHWZb81qsKgEvPiHn

Plot no. 38, Kirthar Road, H-9 Islamabad

051-9044250
Scheduled Module Title Learning Units Remarks
Week
Week 1  Introduction  Motivational Lecture
 Course Introduction
 Success stories
 Job market
 Course Applications
 Institute/work ethics
 Discussion on Python and its market
position.
 Motivation regarding learning aspects
of this course
 Setting up environment for Python.
 Installation of Anaconda
 What is Big Data?
 Characteristics of Big Data
 The Impact of Big Data
 Big Data - Beyond the Hype, Big Data
Examples, Sources of Big Data
 Big Data Adoption, The Big Data and
Data Science
 The Big Data Platform, Big Data and
Data Science. Skills for Data Scientists
Week 2 Module -1  Overview of DBMS
 Components of DBMS
Chapter 1.1-  Database Architecture
 Types of Database Model
 ER Model: Basic Concepts
 ER Model: Creating ER Diagram
 The Extended ER Model
 Codd's 12 rule of RDBMS
 Basic Concepts of RDBMS
 Types of Database key
 Introduction to Normalization

Basic SQL

 SQL Introduction
 Create query
 Alter query
 Truncate, Drop and Rename query
 All DML command
 All TCL Command
 All DCL Command
 WHERE clause
 SELECT query
 LIKE clause

Plot no. 38, Kirthar Road, H-9 Islamabad

051-9044250
 ORDER BY clause
 Group BY clause
 Having clause
 DISTINCT keyword
 AND & OR operator
 DIVISION operator

Advanced SQL

 SQL Constraints
 SQL function
 SQL Join
 SQL Alias
 SQL SET operation
 SQL Sequences
 SQL Views

Week 3 Chapter 1.2-  Types of IDE(s) and IDE that will be

used in the duration of this course. e.g.
Spyder, Jupyteretc
 Hello World Program “Print Command”
 Keyword Types
 Expressions and Variables
 Input Method
 Conditions and Branching
 Loops

Week 4 Chapter 2.1  String Operations

 Lists and Tuples
 Sets
 Dictionaries
 Reading and Writing files
 Functions
 Objects and Classes

Week 5 Chapter 2.2  Introduction with Numpy

 Numpy one dimensional Array
 Numpytwo-dimensional Array
 Numpy Array Operations

Week 6 Chapter 3.1  Descriptive Statistics

 Data Manipulation
 Data Wrangling

Week 7 Chapter 3.2  Working with Pandas

 Descriptive Statistics with Pandas
 Group by with Python

Plot no. 38, Kirthar Road, H-9 Islamabad

051-9044250
 Data Manipulation with Pandas

Week 8 Chapter 4  Data Wrangling with Pandas

 Discussion regarding exam

Week 9 Chapter 5.1  Introduction to Matplotlib

 Basic Plotting with Matplotlib
 Line Plots
 Area Plots
 Histograms
Week 10 Chapter 5.2  Bar Charts
 Pie Charts
 Box Plots
 Scatter Plots
 Word Cloud
Week 11 Chapter 6.1  What is Spark and what is its purpose?
 Components of the Spark unified stack
 Resilient Distributed Dataset (RDD)
 Scala and Python overview
Week 12 Chapter 6.2  Understand how to create parallelized
collections and external datasets
 Work with Resilient Distributed
Dataset (RDD) operations
 Utilize shared variables and key-value
pairs
Week 13 Chapter 6.3  Describe and run some Spark examples
 Pass functions to Spark
 Create and run a Spark standalone
application
Week 14 Chapter 6.4  Understand and use the various Spark
libraries

Week 15
Mid-Term Assignment

Week 16 Chapter 7  Apache Spark Next-Generation Big Data

Apache Shark Next- Framework
Generation Big Data  History of Spark
Framework  Why we should prefer spark?
 Introduction to Apache Spark
 Components of Spark
 Application of In-memory Processing
 Hadoop Ecosystem vs Spark
 Advantages of Spark
 Spark Architecture

Plot no. 38, Kirthar Road, H-9 Islamabad

051-9044250
 Spark Cluster in Real World
 Demo: Running a Scala Programs in Spark
Shell
 Demo: Setting Up Execution Environment
in IDE
 Demo: Spark Web UI
 Key Takeaways
 Knowledge Check
 Practice Project: Apache Spark Next-
Generation Big Data Framework
Week 17 Chapter 8  Introduction to Spark RDD
 RDD in Spark
Spark Core Processing  Creating Spark RDD
RDD
 Pair RDD
 RDD Operations
 Demo: Spark Transformation Detailed
Exploration Using Scala Examples
 Demo: Spark Action Detailed
Exploration Using Scala
 Caching and Persistence
 Storage Levels
 Lineage and DAG
 Need for DAG
 Debugging in Spark
 Partitioning in Spark
 Scheduling in Spark
 Shuffling in Spark
 Sort Shuffle
 Aggregating Data with Paired RDD
 Demo: Spark Application with Data
Written Back to HDFS and Spark UI
 Demo: Changing Spark Application
Parameters
 Demo: Handling Different File Formats
 Demo: Spark RDD with Real-world
Application
 Demo: Optimizing Spark Jobs
 Key Takeaways
 Knowledge Check
 Practice Project: Spark Core Processing
RDD
Week 18 Chapter 9  Spark SQL Processing DataFrames
 Spark SQL Introduction
Spark SQL Processing  Spark SQL Architecture
DataFrames
 Dataframes
 Demo: Handling Various Data Formats

Plot no. 38, Kirthar Road, H-9 Islamabad

051-9044250
 Demo: Implement Various Dataframe
Operations
 Demo: UDF and UDAF
 Interoperating With RDDs
 Demo: Process Dataframe Using SQL
Query
 RDD vs Dataframe vs Dataset
 Practice Project: Processing
Dataframes
 Key Takeaways
 Knowledge Check
 Practice Project: Spark SQL - Processing
Dataframes
Week 19 Chapter 10.1 ● Spark Mlib Modeling Big Data With
Spark
Part 1 ● Role of Data Scientist and Data Analyst
in Big Data
Spark Mlib Modelling
● Analytics in Spark
BigData with Spark
● Machine Learning
● Supervised Learning
● Demo: Classification of Linear SVM
● Demo: Linear Regression With Real
World Case Studies
● Unsupervised Learning Demo:
Unsupervised Clustering K-means
Week 20 Chapter 10.2 ● Reinforcement Learning
● Semi-supervised Learning
Part 2 ● Overview of Mlib
● Mlib Pipelines
Spark Mlib Modelling
● Key Takeaways
BigData with Spark
● Knowledge Check
● Practice Project: Spark Mlib -
Modelling Big data With Spark
Week 21 Employable ● Guidelines to the Trainees for selection
Project/Assignment (6 of students employable project like
weeks i.e. 21-26) in final year project (FYP)
addition of regular ● Assign Independent project to each
classes. Trainee
OR ● A project based on trainee’s aptitude
On job training ( 2 and acquired skills.
weeks) ● Designed by keeping in view the
emerging trends in the local market as
well as across the globe.
● The project idea may be based on
Entrepreneur.
● Leading to the successful employment.
● The duration of the project will be 6

Plot no. 38, Kirthar Road, H-9 Islamabad

051-9044250
weeks
● Ideas may be generated via different
sites such as:
https://1000projects.org/
https://nevonprojects.com/
https://www.freestudentprojects.com/
https://technofizi.net/best-computer-
science-and-engineering-cse-project-
topics-ideas-for-students/
 Final viva/assessment will be
conducted on project assignments.
 At the end of session the project will
be presented in skills competition
 The skill competition will be conducted
on zonal, regional and National level.
 The project will be presented in front
of Industrialists for commercialization
 The best business idea will be placed in
NAVTTC business incubation center for
commercialization.
---------------------------------------------------------
OR
On job training for 2 weeks:
 Aims to provide 2 weeks industrial
training to the Trainees as part of
overall training program
 Ideal for the manufacturing trades
 As an alternate to the projects that
involve expensive equipment
 Focuses on increasing Trainee’s
motivation, productivity, efficiency and
quick learning approach.

Week 22 Chapter 11.1 ● Streaming Overview

● Real-time Processing of Big Data
Part 1 ● Data Processing Architectures
● Demo: Real-time Data Processing
Stream Processing
● Spark Streaming
Frameworks and Spark
● Demo: Writing Spark Streaming
Streaming
Application
● Introduction to DStreams
● Transformations on DStreams
● Design Patterns for Using Foreachrdd
● State Operations
● Windowing Operations
● Join Operations Stream-dataset Join
● Demo: Windowing of Real-time Data

Plot no. 38, Kirthar Road, H-9 Islamabad

051-9044250
Processing
● Streaming Sources Demo: Processing
Twitter Streaming Data
● Structured Spark Streaming
● Use Case Banking Transactions
● Structured Streaming Architecture
Model and Its Components
● Output Sinks
Week 23 Chapter 11.2 ● Structured Streaming APIs
● Constructing Columns in Structured
Part 2 Streaming
● Windowed Operations on Event-time
Stream Processing
● Use Cases
Frameworks and Spark
● Demo: Streaming Pipeline
Streaming
● Practice Project: Spark Streaming
● Key Takeaways
● Knowledge Check
● Practice Project: Stream Processing
Frameworks and Spark Streaming
Week 24 Chapter 12.1  Spark GraphX
 Introduction to Graph
Part 1  GraphX in Spark
Spark GraphX  GraphX Operators
 Join Operators
 GraphX Parallel System
 Algorithms in Spark
Week 25 Chapter 12.2 ● Pregel API
● Use Case of GraphX
Part 2 ● Demo: GraphX Vertex Predicate
● Demo: Page Rank Algorithm
Spark GraphX
● Key Takeaways
● Knowledge Check
● Practice Project: Spark GraphX Project
Assistance
● Final Project Assessment
Week 26 Entrepreneurship and  Job Market Searching
Final Assessment in  Self-employment
project  Freelancing sites
 Introduction
 Fundamentals of Business Development
 Entrepreneurship
 Startup Funding
 Business Incubation and Acceleration
 Business Value Statement
 Business Model Canvas
 Sales and Marketing Strategies
 How to Reach Customers and Engage CxOs

Plot no. 38, Kirthar Road, H-9 Islamabad

051-9044250
 Stakeholders Power Grid
 RACI Model, SWOT Analysis, PEST Analysis
 SMART Objectives
 OKRs
 Cost Management (OPEX, CAPEX, ROCE
etc.)
 Final Assessment

List of Machinery / Equipment

Quantity physically available at

Sr. No Name of item as per curriculum
the training location
1 Computers Minimum Corei5 25
 LCD Display 17” with built in speakers

2 DSL Internet Connection (Minimum 1 MB) Available on every PC

3 25 each
Accessories/Devices

 Connectors
 Multimedia
 Printer (NW printer)
 Audio/visual aid
 White Board
 Pin Board
 Flip Chart Board
 Hard copy of Training Material
 Mobile Phones

For every PC
Wires, data cables, power plugs, power
4 supply

Available
5 UPS

Available
6 Generator / Solar Backup

Available
7 Air Conditioner (2 Tons)

Plot no. 38, Kirthar Road, H-9 Islamabad

051-9044250
1. Software List

Sr. No Software Name

1. MS Office 2016 (Installed on each PC)

2. Operating System (Windows, Linux or other Operating Systems)
3. Programming Languages including NetBeans, Android studio (Licensed
4. Web Servers including IIS, Apache (Licensed software installed on each PC)

5. Databases including MySQL, ERWIN (Licensed software installed on each PC)

FTP Client including FileZilla, File Manager (Licensed software installed on
6.
each PC)
7. Web hosting manager/control panel
Web browser including Internet Explorer, Google Chrome, Mozilla Firefox,
8.
Netscape, Opera (installed on each PC)
9. Firewall (each PC)
Security scanning tools including Antivirus (each PC)
Networking
10.

Required Software’s:
 Anaconda Jupyter
 MySQL Database
11.
 MS Office
 MS Visio
 MySQL

2. Minimum Qualification of Teachers / Instructor

The qualification of teachers / instructor of this course should be minimum of bachelors in
Computer science with minimum 3 years of development experience in relevant trade.
 Bachelors of Computers Science / Networks (Hons)

3. Supportive Notes

Teaching Learning Material

Books Name Author

Python Crash Course Eric Matthes

Plot no. 38, Kirthar Road, H-9 Islamabad

051-9044250
Big Data Analysis with Python Ankit Shukla,Ivan Marin
and Sarang VK

Big Data Course ( Edureka Online Course)

Plot no. 38, Kirthar Road, H-9 Islamabad

051-9044250

(Ebook PDF) Introduction To Data Mining 2nd Edition by Pang-Ning Tanpdf Download
100% (8)
(Ebook PDF) Introduction To Data Mining 2nd Edition by Pang-Ning Tanpdf Download
51 pages
Data Mining Notes
No ratings yet
Data Mining Notes
1,231 pages
" E-Commerce Website": Final Project Report
0% (1)
" E-Commerce Website": Final Project Report
37 pages
20IT503 - Big Data Analytics - Unit1
No ratings yet
20IT503 - Big Data Analytics - Unit1
59 pages
The Power of Data Storytelling by Sejal Vora 2019 9789353282905 9789353282912 Compress
No ratings yet
The Power of Data Storytelling by Sejal Vora 2019 9789353282905 9789353282912 Compress
249 pages
ST2195 Complete
No ratings yet
ST2195 Complete
430 pages
Chapter1 Describing Financial Series
No ratings yet
Chapter1 Describing Financial Series
136 pages
Business Intelligence
No ratings yet
Business Intelligence
141 pages
Computer Hardware Maintanance Repair1
No ratings yet
Computer Hardware Maintanance Repair1
32 pages
Azure CSP Documntation
No ratings yet
Azure CSP Documntation
376 pages
In My Case: Master - Slave Replication Step by Step
100% (1)
In My Case: Master - Slave Replication Step by Step
4 pages
2.1. Research Topic 2.2. Literature Review 2.3. Research Problem in Research 2.4. Writing Research Questions, Objectives and Research Hypothesis
No ratings yet
2.1. Research Topic 2.2. Literature Review 2.3. Research Problem in Research 2.4. Writing Research Questions, Objectives and Research Hypothesis
39 pages
Internal PPT - Applications and Trends in Data Mining
No ratings yet
Internal PPT - Applications and Trends in Data Mining
17 pages
05 ANN Artificial Neural Networks
No ratings yet
05 ANN Artificial Neural Networks
221 pages
Cognitive Psychology Sem 1
No ratings yet
Cognitive Psychology Sem 1
136 pages
What's Big Data? How Does It Relate To Data Science?
No ratings yet
What's Big Data? How Does It Relate To Data Science?
19 pages
CCW331 Business Analytics Lecture Notes 1
No ratings yet
CCW331 Business Analytics Lecture Notes 1
286 pages
Statistical Inference
No ratings yet
Statistical Inference
113 pages
AdvancesInKnowledgeDicoveryAndDataMining 2012 Part1
100% (1)
AdvancesInKnowledgeDicoveryAndDataMining 2012 Part1
642 pages
Linked List
No ratings yet
Linked List
97 pages
Sample CoreJava For The Imaptient
No ratings yet
Sample CoreJava For The Imaptient
120 pages
Nonlinear Programming Solution A
No ratings yet
Nonlinear Programming Solution A
65 pages
Writing A Research Proposal
No ratings yet
Writing A Research Proposal
8 pages
01 ASAP TimeSeriesForcasting Day1 2 Introduction
No ratings yet
01 ASAP TimeSeriesForcasting Day1 2 Introduction
62 pages
Lecture 4 - Density Based Methods
No ratings yet
Lecture 4 - Density Based Methods
16 pages
Session 3 - Logistic Regression
50% (2)
Session 3 - Logistic Regression
28 pages
Microbiology An Evolving Science 4Th Edition PDF, Epub, Ebook
No ratings yet
Microbiology An Evolving Science 4Th Edition PDF, Epub, Ebook
4 pages
Foundations of Data Science
No ratings yet
Foundations of Data Science
4 pages
A Novel Deep Learning Framework: Prediction and Analysis of Financial Time Series Using CEEMD and LSTM
No ratings yet
A Novel Deep Learning Framework: Prediction and Analysis of Financial Time Series Using CEEMD and LSTM
21 pages
Lectures Machine Learning
No ratings yet
Lectures Machine Learning
205 pages
Data Science and Ethical Issues
No ratings yet
Data Science and Ethical Issues
42 pages
001-2023-0921 DLMDSBDT01 Course Book
No ratings yet
001-2023-0921 DLMDSBDT01 Course Book
124 pages
Unit 1
No ratings yet
Unit 1
70 pages
Lecture 3 Data Mining
No ratings yet
Lecture 3 Data Mining
30 pages
HTML Deep Dive Notes by Yadnyesh
No ratings yet
HTML Deep Dive Notes by Yadnyesh
124 pages
Dlmdmdql01 Course Book
No ratings yet
Dlmdmdql01 Course Book
104 pages
Website: VCE To PDF Converter: Facebook: Twitter:: Number: 1z0-148 Passing Score: 800 Time Limit: 120 Min
No ratings yet
Website: VCE To PDF Converter: Facebook: Twitter:: Number: 1z0-148 Passing Score: 800 Time Limit: 120 Min
54 pages
Diabetes Prediction Using Data Mining
No ratings yet
Diabetes Prediction Using Data Mining
17 pages
Course Pack - OOP
No ratings yet
Course Pack - OOP
3 pages
Deep Neural Network
No ratings yet
Deep Neural Network
12 pages
UNIT-1 Introduction: Dr. C.Nagaraju Head of Cse Ysrec of YVU Proddatur
100% (1)
UNIT-1 Introduction: Dr. C.Nagaraju Head of Cse Ysrec of YVU Proddatur
86 pages
(Fall 2011) CS-402 Data Mining - Final Exam-SUB - v03
No ratings yet
(Fall 2011) CS-402 Data Mining - Final Exam-SUB - v03
6 pages
Dbms Unit 5 Final
No ratings yet
Dbms Unit 5 Final
16 pages
OCPI 2.2 d2
No ratings yet
OCPI 2.2 d2
186 pages
Types of Data (Qualitative and Quantitative)
No ratings yet
Types of Data (Qualitative and Quantitative)
89 pages
Full Syllabus of Calicut University (2004) Information Technology (IT)
No ratings yet
Full Syllabus of Calicut University (2004) Information Technology (IT)
191 pages
Research Data Analysis With Power BI: Vijay Krishnan S Bharanidharan G Krishnamoorthy
No ratings yet
Research Data Analysis With Power BI: Vijay Krishnan S Bharanidharan G Krishnamoorthy
8 pages
A Comparison of Classification Techniques On Prediction of Student Performance
No ratings yet
A Comparison of Classification Techniques On Prediction of Student Performance
6 pages
P1 - Single Layer Feed Forward Networks
No ratings yet
P1 - Single Layer Feed Forward Networks
52 pages
Oop # 1
No ratings yet
Oop # 1
11 pages
Ejercicois Eviews
100% (1)
Ejercicois Eviews
10 pages
Data Analytics & Data Science Job Ready Program
No ratings yet
Data Analytics & Data Science Job Ready Program
4 pages
Seminar Report Machine Learning
No ratings yet
Seminar Report Machine Learning
20 pages
Fourth Edition: Descriptive Analytics I: Nature of Data, Statistical Modeling, and Visualization
No ratings yet
Fourth Edition: Descriptive Analytics I: Nature of Data, Statistical Modeling, and Visualization
66 pages
Lecture 3: Fields, Getters and Setters, Constructors, Testing
No ratings yet
Lecture 3: Fields, Getters and Setters, Constructors, Testing
21 pages
Course Pack BDA
No ratings yet
Course Pack BDA
6 pages
Data Mining: Concepts and Techniques: - Introduction
No ratings yet
Data Mining: Concepts and Techniques: - Introduction
44 pages
K-Means Clustering Algorithm
No ratings yet
K-Means Clustering Algorithm
13 pages
Assignment 1&2
No ratings yet
Assignment 1&2
4 pages
Deep Learning and CNNFYTGS5101-Guoyangxie
No ratings yet
Deep Learning and CNNFYTGS5101-Guoyangxie
42 pages
Syllabus
No ratings yet
Syllabus
3 pages
DBMS Course Outline
No ratings yet
DBMS Course Outline
14 pages
ST2195 Programming For Data Science
No ratings yet
ST2195 Programming For Data Science
11 pages
Modeling Web Application
No ratings yet
Modeling Web Application
25 pages
Big Data Training in Chennai - Big Data Course in Chennai
No ratings yet
Big Data Training in Chennai - Big Data Course in Chennai
1 page
Gradient Descent Algorithm
No ratings yet
Gradient Descent Algorithm
5 pages
08250771
No ratings yet
08250771
8 pages
Motion Detection
No ratings yet
Motion Detection
33 pages
X-Mabini's TLE-CSS Reviewer
No ratings yet
X-Mabini's TLE-CSS Reviewer
5 pages
Technical Analysis Course Syllabus
No ratings yet
Technical Analysis Course Syllabus
1 page
Fiona
No ratings yet
Fiona
83 pages
Lex Scop Closures Funs Term Curry
No ratings yet
Lex Scop Closures Funs Term Curry
23 pages
Computer Science
No ratings yet
Computer Science
23 pages
Jaya Stqa
No ratings yet
Jaya Stqa
53 pages
4D Ajax Framework Developer Guide
No ratings yet
4D Ajax Framework Developer Guide
28 pages
Adadamin Steps
No ratings yet
Adadamin Steps
9 pages
Components of A Computer System
No ratings yet
Components of A Computer System
28 pages
Lecture 3
No ratings yet
Lecture 3
30 pages
Computer Project Documentation - 25
No ratings yet
Computer Project Documentation - 25
23 pages
Why Java Is Robust
No ratings yet
Why Java Is Robust
16 pages
Python Class 4
No ratings yet
Python Class 4
13 pages
Program Blocks: Main (OB1)
No ratings yet
Program Blocks: Main (OB1)
9 pages
Android Documentation With Firebase
No ratings yet
Android Documentation With Firebase
5 pages
Cloud Computing Practical File
No ratings yet
Cloud Computing Practical File
16 pages
Meet Kaur CV
No ratings yet
Meet Kaur CV
4 pages
Data Structures Assignment 2: (Backtracking Using Stack)
No ratings yet
Data Structures Assignment 2: (Backtracking Using Stack)
4 pages
An Advanced Signal Processing Toolkit For JAVA App
No ratings yet
An Advanced Signal Processing Toolkit For JAVA App
7 pages
Snakes and Ladders - The Quickest Way Up - Lab 7 FODSA Question - Contests - HackerRank
No ratings yet
Snakes and Ladders - The Quickest Way Up - Lab 7 FODSA Question - Contests - HackerRank
5 pages
Password 3
No ratings yet
Password 3
1 page
CV - Manuel Antonio Gomez Merino
No ratings yet
CV - Manuel Antonio Gomez Merino
2 pages

Big Data Analytics

Uploaded by

Big Data Analytics

Uploaded by

Government of Pakistan

National Vocational and Technical Training Commission

Prime Minister Hunarmand Pakistan Program,

Course Contents/ Lesson Plan

Plot no. 38, Kirthar Road, H-9 Islamabad

Course Title Big Data Analytics

Plot no. 38, Kirthar Road, H-9 Islamabad

Learning Place Classroom / Lab

Instructional Resources Development Platform:

Plot no. 38, Kirthar Road, H-9 Islamabad

Plot no. 38, Kirthar Road, H-9 Islamabad

Week 3 Chapter 1.2-  Types of IDE(s) and IDE that will be

Week 4 Chapter 2.1  String Operations

Week 5 Chapter 2.2  Introduction with Numpy

Week 6 Chapter 3.1  Descriptive Statistics

Week 7 Chapter 3.2  Working with Pandas

Plot no. 38, Kirthar Road, H-9 Islamabad

Week 8 Chapter 4  Data Wrangling with Pandas

Week 9 Chapter 5.1  Introduction to Matplotlib

Week 16 Chapter 7  Apache Spark Next-Generation Big Data

Plot no. 38, Kirthar Road, H-9 Islamabad

Plot no. 38, Kirthar Road, H-9 Islamabad

Plot no. 38, Kirthar Road, H-9 Islamabad

Week 22 Chapter 11.1 ● Streaming Overview

Plot no. 38, Kirthar Road, H-9 Islamabad

Plot no. 38, Kirthar Road, H-9 Islamabad

List of Machinery / Equipment

Quantity physically available at

2 DSL Internet Connection (Minimum 1 MB) Available on every PC

Plot no. 38, Kirthar Road, H-9 Islamabad

Sr. No Software Name

1. MS Office 2016 (Installed on each PC)

5. Databases including MySQL, ERWIN (Licensed software installed on each PC)

2. Minimum Qualification of Teachers / Instructor

Teaching Learning Material

Books Name Author

Python Crash Course Eric Matthes

Plot no. 38, Kirthar Road, H-9 Islamabad

Big Data Course ( Edureka Online Course)

Plot no. 38, Kirthar Road, H-9 Islamabad

You might also like