Data Engineering Bootcamp
Data Engineering Bootcamp
● Python Fundamentals
● Why Python and how it is different from the R programming language
● Variables, Identifiers, and Keywords in Python
● Data Structures in Python
● Strings, Array, Lists, Tuples, Set and Dictionaries
● Python Conditionals and Loops
● If, Nested If, Indentations
● Loops in Python
● Basic Operation and Operator in Python
● Operators in Python
● OOPs Concept
● Python Functions and Classes
● Functions and their types
● Classes in Python
● Type Conversion
● Lambda Functions
● Data Wrangling using Numpy
● Numpy for Data Engineers
● Data Wrangling using Pandas
● Pandas for Data Engineers
● Python for Visualisation
● Matplotlib
● Seaborne
● Introduction to SQL
● Introduction to Databases
● What is Database
● Introduction to MySQL and NoSQL
● DDL v/s DML v/s DCL v/s TCL
● Datatypes in SQL
● Basics of SQL
● Basic SQL statements (SELECT, DELETE and UPDATE)
● How to convert data into tables
● COMMIT and ROLLBACK statements
● Filtering Data using SQL
● Filter Data using the WHERE and ORDER BY Clause
● Usage of Filtering Operators – IN, NOT IN, IS NULL, BETWEEN
● Regular Expression for Filtering
● Functions in Database
● Basics of Function
● Boolean Expressions and Concatenation
● String Function
● Grouping Function
● Introduction to SQL
● Grouping Data and Computing Aggregates
● Introduction to Grouping
● Using GROUP BY & HAVING
● Subqueries and Nested queries in SQL
● Single-Row, Multiple-Row Subqueries
● Subqueries with ANY and ALL Operators
● Conditional Expressions using CASE Clause
● Correlated Subqueries
● Windows Functions in SQL
● Intro to window functions
● Basic windowing syntax
● The usual suspects: SUM, COUNT, and AVG ROW_NUMBER() RANK() and
DENSE_RANK() NTILE LAG and LEAD
● Defining a window alias
● Advanced windowing techniques
● Displaying Data from Multiple tables
● Introduction to Joins and its types
● Using UNION, UNION ALL, and EXPERT Clause Views, Sequences, and Indexes in SQL
MODULE - 3 BIG DATA WITH HADOOP AND SPARK