Big Data Analytics Course Outline (Fall 2020) : Dr. Tariq Mahmood 830 Am - 11 Am (Monday) Scope
Big Data Analytics Course Outline (Fall 2020) : Dr. Tariq Mahmood 830 Am - 11 Am (Monday) Scope
____________________________________________________________________________________
Weekly Lecture Breakdown (6 hours/Week):
• Week 1:
o History of Analytics, Definition of Big Data, Big Data Characteristics, Use Cases, 10 Vs of
Big Data, Why Big Data Matters
o Types of Big Data and Data Lakes
o Big Data Landscapes
o Definition and categorization of Big Data Analytics
o Introduction and Overview of NoSQL databases
o Introduction and Overview of Apache Hadoop Ecosystem
• Week 2:
o Hadoop 2, YARN, HDFS, and setting up Hadoop clusters
o Hands-on Activity
• Week 3:
o MapReduce: Theory and Hands-on
• Week 4:
o Apache Spark with Apache Kafka
o Hands-on Activity
• Week 5:
o Apache Hive, Apache HBase and Apache Cassandra
o Hands-on Activity
• Week 6:
o Apache Presto and Apache Drill
o Hands-on Activity
• Week 7:
o Document NoSQL with MongoDB
o Hands-on Activity
• Week 8:
o Graph NoSQL with Neo4J
o Hands-on Activity
• Week 9:
o Key Value Stores with Redis
o Hands-on Activity
• Week 10:
o Lambda and Kappa Big Data Architectures
o Potential Hands-on
• Week 11:
o Zeta Big data architecture
o Potential Hands-on
• Week 12:
o Spark’s ML-Lib
o Potential Hands-on
• Week 13:
o Project Demos
_____________________________________________________________________________________
Potential Lab Tools: MS SQL Server (SSIS), Oracle Online, ETL on Python (Anaconda).
_____________________________________________________________________________________
Text Books: