III B.Tech II Semester
SQL for Data Analysis
L T P C
3 0 2 3
COURSE OBJECTIVES:
1. Learn the basics of preparing data for analysis
2. Learn the basics of Time series analysis
3. Learn the basics of Cohort analysis
4. Learn how to perform Text Analysis
5. Learn how to handle Anomalies
6. Learn how to perform Experiment Analysis on complex data sets
COURSE OUTCOMES:
1. Student will be able to prepare data for analysis using PostgreSQL
2. Student will be able to perform Time series analysis and Time Manipulations using
SQL
3. Student will be able to perform Cohort and cross-section analysis using SQL
4. Student will be able to perform Text Analysis using SQL
5. Student will be able to detect and handle Anomalies using SQL
6. Student will be able to perform Experiment Analysis on complex data sets using SQL
UNIT-I
Analysis with SQL: What Is Data Analysis? Why SQL? Database Types, Preparing Data for
Analysis: Types of Data, SQL Query Structure, Profiling: Distributions and data quality,
Preparing: Data Cleaning and Shaping Data
UNIT-II
Time Series Analysis: Date, Datetime, and Time Manipulations; The Retail Sales Data Set,
Trending the Data, Rolling Time Windows, Analysing with Seasonality
UNIT-III
Cohort Analysis: A Useful Analysis Framework, The Legislators Data Set, Retention,
Related Cohort Analyses, Cross-Section Analysis, Through a Cohort Lens
UNIT-IV
Text Analysis: Why Text Analysis with SQL? The UFO Sightings Data Set, Text
Characteristics, Text Parsing, Text Transformations, Finding Elements Within Larger Blocks
of Text, Constructing and Reshaping Text: Concatenation, Reshaping Text
UNIT-V
Anomaly Detection: Capabilities and Limits of SQL for Anomaly Detection, The Data Set,
Detecting Outliers, Forms of Anomalies, Handling Anomalies
UNIT-VI
Experiment Analysis and Complex Data Sets: Strengths and Limits of Experiment Analysis
with SQL, Data Set, Types of Experiments, Challenges with Experiments and Options for
Rescuing Flawed Experiments, Alternative Analyses, SQL for Complex Data Sets, Code
Organization, Organizing Computations
TEXT BOOKS:
1. SQL for Data Analysis: Advanced Techniques for Transforming Data into Insights,
Cathy Tanimura, O'Reilly, 2021
2. SQL for Data Analytics: Perform fast and efficient data analysis with the power of SQL,
Upom Malik, Matt Goldwasser and Benjamin Johnston, Packt Publishing