0% found this document useful (0 votes)

30 views19 pages

Module 1 - SQL For Analytics Introduction

SQL

Uploaded by

priyanshudeshwal287

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

30 views19 pages

Module 1 - SQL For Analytics Introduction

SQL

Uploaded by

priyanshudeshwal287

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 19

SQL

for Analytics
Start Learning
SQL For Analytics
Learn SQL by Application! Realistic ends to end case
studies, examples and challenges to teach you the way it is
meant to be used.

Preface
SQL was initially created to be the language for generating,
manipulating, and retrieving data from relational databases, which
have been around for more than 40 years. Over the past decade or
so, however, other data platforms such as Hadoop, Spark, and
NoSQL have gained a great deal of traction, eating away at the
relational database market. As will be discussed in the last few
chapters of this book, however, the SQL language has been evolving
to facilitate the retrieval of data from various platforms, regardless
of whether the data is stored in tables, documents, or flat files.

The easiest, as well as an essential skill that every data science

aspirant should acquire, is SQL. This course is designed for all the
users who, maybe experienced with data analysis but new to SQL,
or perhaps experienced with SQL but new to data analysis. Or you
may be new to both topics entirely. We learn SQL only for the
purpose of data analysis and will exclude the concepts which might
relate to data engineering and deep database management studies.
MODULE 01

A Little Background
• Introduction to Database
• Relational Database, Primary Key & Foreign Key
• SQL as Part of the Data Analysis Workflow
• Database Data Types
Contents
Introduction to Database ..................................................... 2
1.1. Data Infrastructure .................................................... 2
1.2. Relational Database Systems ...................................... 3
1.3. SQL Constraints: ....................................................... 5
PRIMARY KEY Constraint ............................................. 5
FOREIGN KEY Constraint............................................. 6
Referencing Columns in Another Table ......................... 6
1.4. Database Structure .................................................... 8
1.5. Four Sublanguages of SQL .......................................... 8
SQL for Analytics ............................................................... 10
2.1. What Is Data Analysis? ............................................. 10
2.2. SQL as Part of the Data Analysis Workflow ................. 10
Database Data Types ......................................................... 13
3.1. Types of Data ........................................................... 13
1. Structured Versus Unstructured ................................ 13
2. Quantitative Versus Qualitative Data ......................... 14
3. Sparse Data ............................................................ 14
3.2. Database Data Types ................................................ 14
Introduction to Database | Module 1

SECTION 1

Introduction to Database
A database is nothing more than a set of related information. A
telephone book, for example, is a database of the names, phone
numbers, and addresses of all people living in a particular
region. While a telephone book is certainly a universal and
frequently used database, it suffers from the following:

• Finding a person’s telephone number can be time

consuming.
• A telephone book is indexed only by last/first names, so
finding the names of the people living at a particular
address, is not a practical.
• From the moment the telephone book is printed, the
information becomes less and less accurate.

The same drawbacks attributed to telephone books can be

applied to any manual data storage system. Because a
computerised database system stores data electronically, it is
able to retrieve data more quickly, index data in multiple ways,
and deliver up-to-the-minute information.

1.1. Data Infrastructure

A database is a set of data stored in a computer. This data is
usually structured in a way that makes the data easily
accessible. Databases aren’t the only way data can be stored,
and there is an increasing variety of options for storing data
needed for analysis and powering applications. File storage
systems, NoSQL databases and search-based data stores are
alternative data storage systems that offer low latency for
application development and searching log files. Although not
typically part of the analysis process, they are increasingly part

2
Module 1 | Introduction to Database

of organizations’ data infrastructure. NoSQL is a technology that

allows for data modelling that is not strictly relational. It allows
for very low latency storage and retrieval, critical in many online
applications. Examples of these data stores that you might hear
about in your organization are Cassandra, Couchbase,
DynamoDB, Memcached, Giraph, and Neo4j.

1.2. Relational Database Systems

A relational database uses a structure that allows us to identify
and access data in relation to another piece of data in the
database. Often, data in a relational database is organized into
tables. Each row in the table is considered as a record. Every
record is broken down into fields that represent single items of
data describing a specific thing. For example, you can store
information about a collection of book data inside a database.
Information pertaining to the books themselves can be stored in
a table called Books. Each book record can be stored in one table
row with each specific piece of data such as book title, author,
or price, stored into a separate field.

A Database contains
one or more tables.

A table contains a
number of records.

Field 1 Field 2 Field 3 Field 4

A record contains
one or more fields

3
Introduction to Database | Module 1

Databases are usually associated with software that allows for

the data to be updated and queried. The software that manages
the database is called a Relational Database Management
System (RDBMS). These systems make storing data and
returning results easier and more efficient by allowing different
questions and commands to be posed to the database. Popular
RDBMS software includes

When working with databases we will participate in the design,

maintenance and administration of the database that supplies
data to our website or application. In order to do this, however,
we will need to access that data and also automate the process
to allow other users to retrieve and perhaps even modify data
without technical knowledge. To achieve this, we will need to
communicate with the database in a language it can interpret.
Structured Query Language (SQL) will allow us to directly
communicate with databases and is thus the subject of this
course. SQL is composed of commands that enable users to
create database and table structures, perform various types of
data manipulation and data administration and query the
database in order to extract useful information.

🗒️ Is SQL a Programming Language

SQL isn’t a general-purpose language in the way that C or
Python are. SQL without a database and data in tables is just a
text file. SQL can’t build a website, but it is powerful for
working with data in databases. On a practical level, what
matters most is that SQL can help you get the job of data
analysis done.

4
Module 1 | Introduction to Database

1.3. SQL Constraints:

In a database table, we can add rules to a column known
as constraints. These rules control the data that can be stored
in a column. For example, if a column has NOT NULL constraint,
it means the column cannot store NULL values. The constraints
used in SQL are:

Constraint Description
NOT NULL values cannot be null.
UNIQUE values cannot match any older value.
PRIMARY KEY used to uniquely identify a row.
FOREIGN KEY references a row in another table.
CHECK validates condition for new value.
DEFAULT set default value if not passed.
CREATE INDEX used to speed up the read process.

PRIMARY KEY Constraint

In SQL, the PRIMARY KEY constraint is used to uniquely identify

rows. It is a combination of NOT NULL and UNIQUE constraints
i.e. it cannot contain duplicate or NULL values.

-- create Colleges table with primary key

college_id

CREATE TABLE Colleges (

college_id INT,
college_code VARCHAR(20) NOT NULL,
college_name VARCHAR(50),
CONSTRAINT CollegePK PRIMARY KEY (college_id)
);

Here, the college_id column is the PRIMARY KEY . This means

that the values of this column must be unique, and it cannot
contain NULL values.

5
Introduction to Database | Module 1

FOREIGN KEY Constraint

The FOREIGN KEY constraint is used to create a relationship

between two tables. A foreign key is defined using the FOREIGN
KEY and REFERENCES keywords.

-- this table doesn’t contain foreign keys

CREATE TABLE Customers (

id INTEGER PRIMARY KEY,
name VARCHAR(100),
age INTEGER
);

-- create another table named Prodcuts

-- add foreign key to customer_id column
-- the foreign key references the id column of
the customers table

CREATE TABLE Products (

customer_id INTEGER ,
name VARCHAR(100),
FOREIGN KEY (customer_id)
REFERENCES Customers(id)
);

id column in the Products table references the id column in

the Customers table.

Referencing Columns in Another Table

The FOREIGN KEY constraint in SQL establishes a relationship

between two tables by linking columns in one table to those in
another. For example,

6
Module 1 | Introduction to Database

Here, the customer_id field in the Orders table is a FOREIGN

KEY that refers to the customer_id field in the Customers table.
This means that the value of the customer_id (of Orders table)
must be a value from the customer_id column (of
Customers table).

🗒️Note: The foreign key can be referenced to any column in

the parent table. However, it is a general practice to reference
the foreign key to the primary key of the parent table.

7
Introduction to Database | Module 1

1.4. Database Structure

SQL is used to access, manipulate, and
retrieve data from objects in a database.
Databases can have one or more
schemas, which provide the
organization and structure and contain
other objects. Within a schema, the
objects most commonly used in data
analysis are tables, views, and
functions. Tables contain fields, which
hold the data. Tables may have one or
more indexes; an index is a special kind
of data structure that allows data to be
retrieved more efficiently.

1.5. Four Sublanguages of SQL

To communicate with databases, SQL has four sublanguages for
tackling different jobs, and these are mostly standard across
database types.

1. DQL, or data query language, is what this course is mainly

about. It’s used for querying data, which you can think of
as using code to ask questions of a database. DQL
commands include SELECT , FROM , WHERE , JOINS , etc.
SQL queries can access a single table (or view), can
combine data from multiple tables through the use of
joins, and can also query across multiple schemas in the
same database.
2. DDL, or data definition language, is used to create and
modify tables, views, users, and other objects in the
database. It affects the structure but not the contents.
There are three common commands: CREATE , ALTER , and
DROP . CREATE is used to make new objects. ALTER

8
Module 1 | Introduction to Database

changes the structure of an object, such as by adding a

column to a table. DROP deletes the entire object and its
structure.
3. DCL, or data control language, is used for access control.
Commands include GRANT and REVOKE , which give
permission and remove permission, respectively. In an
analysis context, GRANT might be needed to allow a
colleague to query a table you created. You might also
encounter such a command when someone has told you
a table exists in the database but you can’t see it—
permissions might need to be GRANTed to your user.
4. DML, or data manipulation language, is used to act on the
data itself. The commands are INSERT , UPDATE , and
DELETE . INSERT adds new records and is essentially the
“load” step in extract, transform, load (ETL). UPDATE
changes values in a field, and DELETE removes rows.

9
SQL for Analytics | Module 1

SECTION 2

SQL for Analytics

Before actually starting talking with the database, we’ll start
with a discussion of what data analysis is and then move on
to a discussion of SQL: what is SQL, why it’s so popular, and
how SQL fits into data analysis .

2.1. What Is Data Analysis?

Data analysis is part data discovery, part data interpretation, and
part data communication. Very often the purpose of data
analysis is to improve decision-making, by humans and
increasingly by machines through automation.

Mining historical data helps us understand the characteristics

and behaviour of customers, suppliers, and processes.
Historical data can help us develop informed estimates and
predicted ranges of outcomes, which will sometimes be wrong
but quite often will be right. Past data can point out gaps,
weaknesses, and opportunities. It allows organizations to
optimize, save money, and reduce risk and fraud. It can also help
organizations find opportunity and it can become the building
blocks of new products that delight customers.

2.2. SQL as Part of the Data Analysis

Workflow
Analysis workflow refers to the series of steps that an analyst
follows to achieve the desired outcome. It always starts with a
question, and ends with a presentation/ visual dashboard to
present the outcome of the analysis to stakeholders.

10
Module 1 | SQL for Analytics

1. First step of analysis workflow is ‘Framing the Question’

which may be about how many new customers have been
acquired, how sales are trending, or why some users stick
around for a long time while others try a service and never
return.
2. Once the question is framed, we consider where the data
originated. Data is generated by ‘Source Systems’, a
term that includes any human or machine process that
generates data of interest. Data can be generated by
people by hand, such as when someone fills out a form or
takes notes during a doctor’s visit. Data can also be
machine-generated, such as when an application
database records a purchase, an event-streaming system
records a website click or a marketing management tool
records an email open.
3. The next step is moving the data and storing it in a
database for analysis. I will use the terms ‘Data
Warehouse’, which is a database that consolidates data
from across an organization into a central repository, and
data store, which refers to any type of data storage
system that can be queried.
Usually, a person or team is responsible for getting data
into the data warehouse. This process is called ETL
(Extract, Transform, and Load). Extract pulls the data
from the source system. Transform optionally changes
the structure of the data, performs data quality cleaning,
or aggregates the data. Load puts the data into the

11
SQL for Analytics | Module 1

database. You might also hear the terms source and

target in the context of ETL. The source is where the data
comes from, and the target is the destination, i.e., the
database and the tables within it.
4. Once the data is in a database, the next step is
‘Performing Queries and Analysis’. In this step, SQL is
applied to explore, profile, clean, shape, and analyze the
data. Exploring the data involves becoming familiar with
the topic, where the data was generated, and the
database tables in which it is stored. Profiling involves
checking the unique values and distribution of records in
the data set. Cleaning involves fixing incorrect or
incomplete data, adding categorization and flags, and
handling null values. Shaping is the process of arranging
the data into the rows and columns needed in the result
set. Finally, analysing the data involves reviewing the
output for trends, conclusions, and insights.
5. ‘Presentation of the Data’ into a final output form is the
last step in the overall workflow. Businesspeople won’t
appreciate receiving a file of SQL code; they expect you to
present graphs, charts, and insights. Communication is
key to having an impact with analysis, and for that, we
need a way to share the results with other people.

12
Module 1 | Database Data Types

SECTION 3

Database Data Types

Data scientists spend 60% of their time cleaning and organizing
data in order to prepare it for analysis or modelling work.
Preparing data is such a common task that terms have sprung up
to describe it, such as data munging, data wrangling, and data
prep. Data preparation is easier when a data set has a data
dictionary, a document or repository that has clear descriptions
of the fields, possible values, how the data was collected, and
how it relates to other data. Unfortunately, this is frequently not
the case. Documentation often isn’t prioritized, even by people
who see its value, or it becomes out-of-date as new fields and
tables are added or the way data is populated changes. Even
when a data dictionary exists, you will still likely need to do data
prep work as part of the analysis.

3.1. Types of Data

Data is the foundation of analysis, and all data has a database
data type and also belongs to one or more categories of data.
Having a firm grasp of the many forms data can take will help you
be a more effective data analyst.

1. Structured Versus Unstructured

Data is often described as structured or unstructured. Most
databases were designed to handle structured data, where each
attribute is stored in a column, and instances of each entity are
represented as rows. For example, an address table might have
fields for street address, city, state, and postal code. Each row
would hold a particular customer’s address. Each field has a
data type and allows only data of that type to be entered.
Structured data is easy to query with SQL.

13
Database Data Types | Module 1

Unstructured data is the opposite of structured data. There is

no predetermined structure, data model, or data type.
Unstructured data is often the “everything else” that isn’t
database data. Documents, emails, and web pages are
unstructured. They don’t fit into the traditional data types, and
thus they are more difficult for relational databases to store
efficiently and for SQL to query

2. Quantitative Versus Qualitative Data

Quantitative data is numeric. It comes with numeric information

such as price, quantity, or visit duration. Counts, sums,
averages, or other numeric functions are applied to the data.
Qualitative data is usually text-based. Temperature and
humidity levels are quantitative, while descriptors like “hot and
humid” are qualitative. The price a customer paid for a product
is quantitative; whether they like or dislike it is qualitative.

3. Sparse Data

Sparse data occurs when there is a small amount of information

within a larger set of empty or unimportant information. Sparse
data might show up as many nulls and only a few values in a
particular column. JSON is one approach that has been
developed to deal with sparse data from a writing and storage
perspective, as it stores only the data that is present and omits
the rest. This is in contrast to a row-store database, which has to
hold memory for a field even if there is no value in it.

3.2. Database Data Types

Fields in database tables all have defined data types. You don’t
necessarily need to be an expert on the nuances of data types to
be good at analysis, but later in the course, we’ll encounter
situations in which considering the data type is important, so this

14
Module 1 | Database Data Types

section will cover the basics. These are based on Postgres but
are similar across most major database types.

String data types are the most versatile. These can hold letters,
numbers, and special characters, including unprintable
characters like tabs and newlines. String fields can be defined to
hold a fixed or variable number of characters. A CHAR field could
be defined to allow only two characters to hold, for example, US
state abbreviation. Whereas a field storing the full names of
states would need to be a VARCHAR to allow a variable number
of characters.

Numeric data types are all the ones that store numbers, both
positive and negative. Mathematical functions and operators
can be applied to numeric fields. Numeric data types include the
INT types as well as FLOAT, DOUBLE, and DECIMAL types that
allow decimal places. Integer data types are often implemented
because they use less memory than their decimal counterparts.

15
Database Data Types | Module 1

The logical data type is called BOOLEAN. It has values of TRUE

and FALSE and is an efficient way to store information where
these options are appropriate. Operations that compare two
fields return a BOOLEAN value as a result. This data type is often
used to create flags, and fields that summarize the presence or
absence of a property in the data.

The datetime types include DATE, TIMESTAMP, and TIME. Date

and time data should be stored in a field of one of these database
types whenever possible since SQL has a number of useful
functions that operate on them. Timestamps and dates are very
common in databases and are critical to many types of analysis,
particularly time series analysis and cohort analysis.

Other data types, such as JSON and geographical types, are

supported by some but not all databases.

Dolomite DolomiteFluikaPumpsDatasheet PDF
No ratings yet
Dolomite DolomiteFluikaPumpsDatasheet PDF
13 pages
EEE Job Preparation Syllabus
100% (2)
EEE Job Preparation Syllabus
3 pages
The Internet Gaming Disorder Test (IGD-20 Test) (Pontes Et Al., 2014)
67% (3)
The Internet Gaming Disorder Test (IGD-20 Test) (Pontes Et Al., 2014)
2 pages
िदली िविवालय University of Delhi: Fee Receipt
No ratings yet
िदली िविवालय University of Delhi: Fee Receipt
1 page
Lecture 5 Slides
No ratings yet
Lecture 5 Slides
132 pages
SQL Course
No ratings yet
SQL Course
88 pages
Concept of Database
No ratings yet
Concept of Database
16 pages
(SQL Notes) - TheTestingAcademy - Pramod - Google Drive
No ratings yet
(SQL Notes) - TheTestingAcademy - Pramod - Google Drive
20 pages
Database Concepts - Mysql
No ratings yet
Database Concepts - Mysql
61 pages
Database Course
No ratings yet
Database Course
33 pages
SQL Lec 01 03
No ratings yet
SQL Lec 01 03
14 pages
DBMS Session1
No ratings yet
DBMS Session1
24 pages
Unit - 3 RDBMS
No ratings yet
Unit - 3 RDBMS
51 pages
Relational Database Management Systems (Basic)
No ratings yet
Relational Database Management Systems (Basic)
18 pages
Week 1SQL
No ratings yet
Week 1SQL
10 pages
Unit - 3 RDBMS
No ratings yet
Unit - 3 RDBMS
51 pages
Intro - To - DBMS 1
No ratings yet
Intro - To - DBMS 1
95 pages
DBMS Unit1
No ratings yet
DBMS Unit1
10 pages
Unit-III DMBS
No ratings yet
Unit-III DMBS
13 pages
Database Management
No ratings yet
Database Management
122 pages
M1 - Intro
No ratings yet
M1 - Intro
56 pages
DBMS Aryan
No ratings yet
DBMS Aryan
33 pages
Red Hat Openstack Administration I (Cl110) - Datasheet
No ratings yet
Red Hat Openstack Administration I (Cl110) - Datasheet
44 pages
ITFPlusEBook (FC0 U61) Module2 - Unit4
No ratings yet
ITFPlusEBook (FC0 U61) Module2 - Unit4
11 pages
12thInformationPractices (StudyMaterial)
No ratings yet
12thInformationPractices (StudyMaterial)
9 pages
Lecture # 13 - Database
No ratings yet
Lecture # 13 - Database
13 pages
Unit 4 Data Models
No ratings yet
Unit 4 Data Models
34 pages
SQL Material PDF
100% (1)
SQL Material PDF
55 pages
CH 7 Database Concepts 1
No ratings yet
CH 7 Database Concepts 1
26 pages
DBMS Notes 1730956881
No ratings yet
DBMS Notes 1730956881
8 pages
Database and SQL Concepts
No ratings yet
Database and SQL Concepts
14 pages
Cts SQL
No ratings yet
Cts SQL
33 pages
DBMS Lec 1 & 2
No ratings yet
DBMS Lec 1 & 2
102 pages
Database
No ratings yet
Database
5 pages
SQL Command and Constant
No ratings yet
SQL Command and Constant
6 pages
SQL Interview Success From Beginner To Pro
From Everand
SQL Interview Success From Beginner To Pro
Shana
No ratings yet
Database Essay
No ratings yet
Database Essay
4 pages
Intro To Databases and SQL
No ratings yet
Intro To Databases and SQL
22 pages
Database Concept
No ratings yet
Database Concept
6 pages
SQLnew
No ratings yet
SQLnew
40 pages
Chapter 1
No ratings yet
Chapter 1
103 pages
DBMS Concepts and Relational Data Model
No ratings yet
DBMS Concepts and Relational Data Model
9 pages
Lec 14 Database
No ratings yet
Lec 14 Database
45 pages
DBMS - NOTES - (Selected Topics)
No ratings yet
DBMS - NOTES - (Selected Topics)
36 pages
As-5 Data & Databases
No ratings yet
As-5 Data & Databases
10 pages
Ty Bcom Sem 5 DBMS Full
No ratings yet
Ty Bcom Sem 5 DBMS Full
21 pages
Database and Database Management System
No ratings yet
Database and Database Management System
8 pages
Database Concepts
No ratings yet
Database Concepts
3 pages
Ln. 3 - Relational Database Management System Grade 10 CBSE
No ratings yet
Ln. 3 - Relational Database Management System Grade 10 CBSE
19 pages
Lecture 9
No ratings yet
Lecture 9
26 pages
DBMS Unit1
No ratings yet
DBMS Unit1
48 pages
My SQL
No ratings yet
My SQL
15 pages
Ch-8 Introduction To DBMS
No ratings yet
Ch-8 Introduction To DBMS
8 pages
(Structured Query Language) : DBA Lounge
No ratings yet
(Structured Query Language) : DBA Lounge
55 pages
POST MID TERMDATABASE MANAGEMENT SYSTEM-Notes
No ratings yet
POST MID TERMDATABASE MANAGEMENT SYSTEM-Notes
13 pages
Unit III - Database Management 20-21-1
No ratings yet
Unit III - Database Management 20-21-1
46 pages
Introduction To SQL - Manipulating Data Sets
No ratings yet
Introduction To SQL - Manipulating Data Sets
12 pages
Grade 11 Chapter 9 Computer Science
No ratings yet
Grade 11 Chapter 9 Computer Science
8 pages
Class 10 Unit - 8 (DBMS) Notes
No ratings yet
Class 10 Unit - 8 (DBMS) Notes
7 pages
Reviewer ITEC48
No ratings yet
Reviewer ITEC48
13 pages
SQL For Data Science
No ratings yet
SQL For Data Science
32 pages
1-Database Basics and Structured Query Language
No ratings yet
1-Database Basics and Structured Query Language
5 pages
Learning SQL: Master SQL Fundamentals
From Everand
Learning SQL: Master SQL Fundamentals
Kiet Huynh
No ratings yet
Learn SQL: Database Management Basics
From Everand
Learn SQL: Database Management Basics
Kiet Huynh
No ratings yet
Sdrsharp, To Make Black and White Listeners See Colours..
No ratings yet
Sdrsharp, To Make Black and White Listeners See Colours..
47 pages
Karthik V.: Sr. Azure Devops Engineer
No ratings yet
Karthik V.: Sr. Azure Devops Engineer
9 pages
220V AC Powered White Led Lamp
No ratings yet
220V AC Powered White Led Lamp
16 pages
BSA Clipper
No ratings yet
BSA Clipper
14 pages
Ebooks Archi
No ratings yet
Ebooks Archi
72 pages
Sop Maf
No ratings yet
Sop Maf
3 pages
Autonomous Navigation and 2D Mapping: Shubass A/L Rames (192061041)
No ratings yet
Autonomous Navigation and 2D Mapping: Shubass A/L Rames (192061041)
57 pages
Error Details
No ratings yet
Error Details
2 pages
1) 1 Mobile - Computing - Mtu (Essay1)
No ratings yet
1) 1 Mobile - Computing - Mtu (Essay1)
30 pages
SM 30
No ratings yet
SM 30
10 pages
Link OS v6.4ReleaseNotes
No ratings yet
Link OS v6.4ReleaseNotes
193 pages
Param Win PDF
No ratings yet
Param Win PDF
183 pages
A Concise Introduction To Software Engineering: Pankaj Jalote
No ratings yet
A Concise Introduction To Software Engineering: Pankaj Jalote
233 pages
Home Automation System Project 1-3
No ratings yet
Home Automation System Project 1-3
62 pages
Kushwaha Profile
No ratings yet
Kushwaha Profile
56 pages
PEARLVINE 2021 Presentation Complete Motivational & Informative Plan - PearlvineGuide
100% (4)
PEARLVINE 2021 Presentation Complete Motivational & Informative Plan - PearlvineGuide
36 pages
CL Cheat Sheet
No ratings yet
CL Cheat Sheet
10 pages
Airo Wizard
No ratings yet
Airo Wizard
3 pages
NV Install Guide
No ratings yet
NV Install Guide
36 pages
MVC Interview Questions and Answers PDF
78% (18)
MVC Interview Questions and Answers PDF
18 pages
Gcu 104 PDF
No ratings yet
Gcu 104 PDF
2 pages
ADC and DAC
No ratings yet
ADC and DAC
4 pages
Avalanche
No ratings yet
Avalanche
74 pages
Anesthesiologists Manual of Surgical Procedures 5th Edition Unlocked Test Bank
No ratings yet
Anesthesiologists Manual of Surgical Procedures 5th Edition Unlocked Test Bank
317 pages
Lab Encryption
No ratings yet
Lab Encryption
3 pages
Cgaxis Models Volume 79 PDF
No ratings yet
Cgaxis Models Volume 79 PDF
102 pages

Module 1 - SQL For Analytics Introduction

Uploaded by

Module 1 - SQL For Analytics Introduction

Uploaded by

SQL

The easiest, as well as an essential skill that every data science

• Finding a person’s telephone number can be time

The same drawbacks attributed to telephone books can be

1.1. Data Infrastructure

of organizations’ data infrastructure. NoSQL is a technology that

1.2. Relational Database Systems

Field 1 Field 2 Field 3 Field 4

Databases are usually associated with software that allows for

When working with databases we will participate in the design,

🗒️ Is SQL a Programming Language

1.3. SQL Constraints:

PRIMARY KEY Constraint

In SQL, the PRIMARY KEY constraint is used to uniquely identify

-- create Colleges table with primary key

CREATE TABLE Colleges (

Here, the college_id column is the PRIMARY KEY . This means

FOREIGN KEY Constraint

The FOREIGN KEY constraint is used to create a relationship

-- this table doesn’t contain foreign keys

CREATE TABLE Customers (

-- create another table named Prodcuts

CREATE TABLE Products (

id column in the Products table references the id column in

Referencing Columns in Another Table

The FOREIGN KEY constraint in SQL establishes a relationship

Here, the customer_id field in the Orders table is a FOREIGN

🗒️Note: The foreign key can be referenced to any column in

1.4. Database Structure

1.5. Four Sublanguages of SQL

1. DQL, or data query language, is what this course is mainly

changes the structure of an object, such as by adding a

SQL for Analytics

2.1. What Is Data Analysis?

Mining historical data helps us understand the characteristics

2.2. SQL as Part of the Data Analysis

1. First step of analysis workflow is ‘Framing the Question’

database. You might also hear the terms source and

Database Data Types

3.1. Types of Data

1. Structured Versus Unstructured

Unstructured data is the opposite of structured data. There is

2. Quantitative Versus Qualitative Data

Quantitative data is numeric. It comes with numeric information

Sparse data occurs when there is a small amount of information

3.2. Database Data Types

The logical data type is called BOOLEAN. It has values of TRUE

The datetime types include DATE, TIMESTAMP, and TIME. Date

Other data types, such as JSON and geographical types, are

You might also like