SQL Material

Uploaded by

thesantastor

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views47 pages

SQL Material

Uploaded by

thesantastor

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 47

Structured Query

Language
The way of storing relational data
Data Engineering
How the data will be used in the future so that the format
you use will make sense. Here are some of the questions
you might want to consider
• How do I store multimodal data, e.g., a sample that
might contain both images and texts?
• Where do I store my data so that it’s cheap and still fast
to access?
Data Engineering
Row-Major and Column Major
• Row-major formats are better when you have to do a lot of writes, whereas column-major ones are
better when you have to do a lot of column-based reads.
Row-Major and Column Major
• Consider that you want to store the
number 1000000. If you store it in a text
file, it’ll require 7 characters, and if each
character is 1 byte, it’ll require 7 bytes. If
you store it in a binary file as int32, it’ll
take only 32 bits or 4 bytes.
Data Models
• Data models describe how data is represented. Consider cars in the real world. In a
database, a car can be described using its make, its model, its year, its color, and its
price
• Alternatively, you can also describe a car using its owner, its license plate, and its history
of registered addresses. This is another data model for cars.
• Two types of Data Models: Relational models and NoSQL models.
Data Models
• Data models describe how data is represented. Consider cars in the real world. In a
database, a car can be described using its make, its model, its year, its color, and its
price
• Alternatively, you can also describe a car using its owner, its license plate, and its history
of registered addresses. This is another data model for cars.
• Two types of Data Models: Relational models and NoSQL models.
Data Models
• Data models describe how data is represented. Consider cars in the real world. In a
database, a car can be described using its make, its model, its year, its color, and its
price
• Alternatively, you can also describe a car using its owner, its license plate, and its history
of registered addresses. This is another data model for cars.
• Two types of Data Models: Relational models and NoSQL models.
Relational Data Model
NoSQL Data Model
• All documents in a document database are assumed to be encoded in the same format.
• Each document has a unique key that represents that document, which can be used to retrieve it.
• A document is often a single continuous string, encoded as JSON, XML, or a binary format like BSON
(Binary JSON)
Graph Data Model
• The graph model is built around the concept of a “graph.”
• A graph consists of nodes and edges, where the edges represent the relationships between the
nodes.
• A database that uses graph structures to store its data is called a graph database.
Structured and Unstructured
Declarative & Imperative
• In the declarative paradigm, you specify the outputs you want, and the computer
figures out the steps needed to get you the queried outputs.
• In the imperative paradigm, you specify the steps needed for an action and the
computer executes these steps to return the outputs
OLTP vs OLAP
• OLTP: Online Transaction Processing
• OLTP systems are designed to support everyday transaction-
oriented applications in industries such as banking, retail, logistics,
etc.
• Prioritizes fast query processing and maintaining data integrity in
multi-access environments.
• Data is often current, not historical.
• Examples: A bank's system where customers withdraw or deposit
money; a retailer's system where customers make purchases.
OLTP vs OLAP
• OLAP: Online Analytical Processing
• OLAP systems are designed to support complex queries and offer
business insights. They facilitate multi-dimensional analytical
queries, providing a platform for business intelligence and data
mining.
• Simple relationships with fewer joins.
• Aggregated data.
• Commonly uses schemas like star and snowflake.
• Examples: An e-commerce company analysing sales trends over
the past year; a system providing business performance metrics.
What is Relational Database
• A relational database is a collection of information that
organizes data in predefined relationships where data
is stored in one or more tables (or "relations") of
columns and rows
Different SQL Tools
• MySQL: An open-source relational database management
system, owned by Oracle Corporation. One of the most popular
databases for web-based applications
• Microsoft SQL: A relational database management system
developed by Microsoft. Used for a variety of applications ranging
from small applications to large scale enterprise applications. SQL
Server uses T-SQL as its primary querying language
• PostgreSQL: It's an open-source relational database
management system (RDBMS). Known for its extensibility and SQL
compliance. It's not just an SQL processing tool but also offers
"NoSQL" capabilities.
Different SQL Tools
• PL/SQL(Procedural Language for SQL): Predominantly used in
Oracle Databases for writing stored procedures, functions, and
triggers.
• SQLite: A C-language library that offers a lightweight, disk-based
database, which doesn’t require a separate server process. It's
serverless, self-contained, and zero-configuration.
4 Stages of DBMS
• Database Management Systems are having 4 Important
Characteristics.
• Data Definition – Define the data being tracked
• Data Manipulation – Add, Update & Remove the Data
• Data Retrieval - Extract and Report the data available in
database
• Administration - defining users on the system, security,
monitoring, system administration
Database Tables
• A database table is a lot like a spreadsheet. • Data is kept in
Columns and Rows.
• Each Column is assigned:
• A Unique Name, identifying a human readable name of the
column. (ie FIRST_NAME, LAST_NAME)
• A Data Type (ie - String, Date, Time, Number, etc)
• Optionally, constraints (ie - Is a value required?, Length of String,
etc) • Each Row is a distinct database Record.
Primary Key & Surrogate Key
• A Primary Key is an optional special database column or columns
used to identify a database record.
A Surrogate Key is a type of Primary Key which used a unique
generated value.
• Should have no business value, and should never change.
Data Relationships
• One to One - Record in Table A matches exactly one record in Table B
• One to Many - Record in Table A matches many in Table B, but Table B matches
only one record
in Table A. (Think - An Order with multiple items)
• Many to Many - Record in Table A matches many in Table B, and Table B
matches many records in Table A.
Data Relationships
• One to One - Record in Table A matches exactly one record in Table B
• One to Many - Record in Table A matches many in Table B, but Table B matches
only one record
in Table A. (Think - An Order with multiple items)
• Many to Many - Record in Table A matches many in Table B, and Table B
matches many records in Table A.
Data Relationships
Data Relationships
DDL
• DDL - Data Definition Language (ie CREATE TABLE...) is used to
define the relational model
• Under the covers, the RDBMS will store data about your tables in
catalog tables
• The software is used to enforce data being stored conforms to the
rules you’ve defined for the data.
DML
• DML - Data Manipulation Language
Allows you to add (INSERT), change (UPDATE), or remove (DELETE)
data.
The RDBMS enforces data manipulation adheres to the rules of the
Data Definition.
The RDBMS allows set up ‘rules’ for multi-user systems.
These rules manage what happens in competing conditions. (what
happens when two users want to update the same data, at the
same time)
Retrieval
• Data Retrieval is the act of pulling data out of the database
• The RDBMS determines the optimal way to retrieve data out of the
database. • Multi-table joins can become very complex.
• Consider tables with billions and billions of rows.
• Reports can go from seconds, to hours when the retrieval strategy
is wrong.
• The RDBMS also considers what happens when updates occur
while your report is running.
Character Set
• Computers are driven off of binary information - ie 1’s and zeros. •
A ‘bit’ is binary one or zero.
• A byte is a collection of eight bits (10000111) = 70
• ASCII - American Standard Code for Information Interchange
• One of the first ‘character’ sets
• Limited to 128 characters (mostly letters, numbers, common
punctuation)
• UTF-8 is highly popular used for email / web. 1 - 4 bytes long.
• Up to 1,112,064 characters
Data Normalization
Database Normalization is the most important factor in Database
design or Data modeling. Database Normalization is the process to
eliminate data redundancies and store the data logically to make
data management easier
• First Normal Form (1NF)
• Second Normal Form (2NF)
• Third Normal Form (3NF)
• Fourth Normal Form
• Fifth Normal Form
• Boyce Codd Normal Form(BCNF)
First Normal Form
In the first normal form, each column must contain only one value.
No table should store repeating groups of related data. The easiest
way to follow the first normal form is to inspect the database table
horizontally.
Second Normal Form
In the second normal form, first, the database must be in the first
normal form and there should not be any partial dependency. If
there are duplicate values in the row, they should be stored in their
own separate tables and linked to the table using foreign keys.
Third Normal Form
In the third normal form, the database is already in the third normal
form, if it is in the second normal form. Every non-key column must
be mutually independent. Identify any columns in the table that are
interdependent and break those columns into their own separate
tables.
Third Normal Form
Functional Dependency: When there is a relationship exists
between the primary key and non-key attribute within a table it is
called functional dependency.
• X -> Y
• Here, X is known as determinant, and Y is known as the
dependent.

Transitive Dependency: When there is an indirect functional

dependency between the attributes it is called Transitive
Dependency. If A -> B and B -> C then A -> C is called Transitive
Dependency. To achieve Third Normal Form (3NF) we have to
eliminate Functional Dependency.
Boyce Codd Normal Form
Boyce-Codd Normal Form or BCNF is an extension to the third
normal form, and is also known as 3.5 Normal Form.

For a table to satisfy the Boyce-Codd Normal Form, it should satisfy

the following two conditions:
1. It should be in the Third Normal Form.
2.And, for any dependency A → B, A should be a super key.
• The second point sounds a bit tricky, right? In simple words, it
means, that for a dependency A → B, A cannot be a non-prime
attribute, if B is a prime attribute.
Boyce Codd Normal Form
And, for any dependency A → B, A should be a super key.
• The second point sounds a bit tricky, right? In simple words, it
means, that for a dependency A → B, A cannot be a non-prime
attribute, if B is a prime attribute.
Fourth Normal Form
For a table to satisfy the Fourth Normal Form, it should satisfy the
following two conditions:
1. It should be in the Boyce-Codd Normal Form.
2.And, the table should not have any Multi-valued Dependency
A table is said to have multi-valued dependency, if the following
conditions are true,
1. For a dependency A → B, if for a single value of A, multiple value of
B exists, then the table may have multi-valued dependency.
2.Also, a table should have at-least 3 columns for it to have a multi-
valued dependency.
Denormalization
It is a database optimization technique where we can add
redundant data to one or more tables and optimize the efficiency
of the database. It is applied after doing normalization. It also
avoids costly joins in a relational database. It is used on the already
normalized database to increase performance. In denormalization,
we are including data from one table to another table to reduce the
number of joins in the query which helps in speeding up the
performance.
Data Integrity & Referential
Data integrity refers to the overall accuracy and consistency of the
data in your database. You want high-quality data. People lose
confidence in your data when they spot problems like a salary
value that contains alpha characters or a percent increase value
over 100 percent.

Referential integrity refers to the quality of

the relationships between the data in your tables. If you have a
complaint in the complaint table for a customer that doesn’t exist
in the customer table, you have a referential integrity problem. By
defining customer_id as a foreign key, you can be assured that
every customer_id in the complaint table refers to
a customer_id that exists in the customer table.
Temporary Table
MySQL allows you to create temporary tables—that is, a temporary
result set that will exist only for your current session and then be
automatically dropped

To create a temporary table based on the results of a query, simply

precede the query with the same create temporary table syntax
Common Table Expressions
Temporary result set that you name and can then select from as if
it were a table. You can use CTEs only for the duration of one query
(versus temporary tables, which can be used for the entire session)

We use the with keyword to give the CTE a name

Recursive CTE
Recursion is a technique that is used when an object references
itself. When I think of recursion

Recursion is useful when your data is organized as a hierarchy or a

series of values where you need to know the previous value to
arrive at the current value.
Subquery
A subquery (or inner query) is a query nested within another query.
A subquery is used to return data that will be used by the main
query. When a query has a subquery, MySQL runs the subquery first,
selects the resulting value from the database, and then passes it
back to the outer query
Views, Functions & Procedures
• Views are useful in situations where you want to simplify a
complex query or hide sensitive or irrelevant data.
• Functions and procedures are programs you can call by name.
Because they’re saved in your MySQL database, they are
sometimes called stored functions and procedures.
• Collectively, they are referred to as stored routines or stored
programs.
• When you write a complex SQL statement or a group of
statements with several steps, you should save it as a function or
procedure so you can easily call it by name later.
• Functions and procedures are saved in the database where you
created them
Views, Functions & Procedures
• Function gets called from a SQL statement and always returns
one value
• A procedure, on the other hand, gets called explicitly using
a call statement.
• While procedures may return no values, one value, or many
values, a function accepts arguments, performs some task, and
returns a single value.
• Procedures are often used to execute business logic by updating,
inserting, and deleting records in tables, and they can also be
used to display a dataset from the database.
• Functions are used for smaller tasks, like getting one piece of data
from the database or formatting a value. Sometimes you can
implement the same functionality as either a procedure or a
function.
Triggers
• Triggers are most often used to track changes made to a table or
to enhance the data’s quality before it’s saved to the database.
• Like functions and procedures, triggers are saved in the database
in which you create them.
• Triggers can be set to fire either before or after rows are changed.
• You can also write triggers that fire before rows are changed in a
table, to change the data that gets written to tables or prevent
rows from being inserted or deleted. This can help improve the
quality of your data before you save it to the database.
Events
• Events can be scheduled to run once or at some interval, like daily,
weekly, or yearly; for example, you might create an event to
perform weekly payroll processing. You can use events to
schedule long-running processing during off-hours, like updating
a billing table based on orders that came in that day
• MySQL has an event scheduler that manages the scheduling and
execution of events.

DBMS UNIT 1 Full Notes
100% (1)
DBMS UNIT 1 Full Notes
28 pages
CATRule
No ratings yet
CATRule
22 pages
Introduction To Data Models 677e35511a823
No ratings yet
Introduction To Data Models 677e35511a823
45 pages
Database Concept
No ratings yet
Database Concept
6 pages
Class 6
No ratings yet
Class 6
29 pages
SQL-1 (Scratch To Advance)
No ratings yet
SQL-1 (Scratch To Advance)
31 pages
M5 Access 2024
No ratings yet
M5 Access 2024
34 pages
Unit 1: Introduction: Dhanashree Huddedar
No ratings yet
Unit 1: Introduction: Dhanashree Huddedar
26 pages
Intro Dbms
No ratings yet
Intro Dbms
44 pages
Reviewer ITEC48
No ratings yet
Reviewer ITEC48
13 pages
Intro 2 DB
No ratings yet
Intro 2 DB
126 pages
DBMS 2nd Semester
No ratings yet
DBMS 2nd Semester
74 pages
Introduction Part 2
No ratings yet
Introduction Part 2
57 pages
DBMS
No ratings yet
DBMS
63 pages
102 Copies Adv Lesson 1
No ratings yet
102 Copies Adv Lesson 1
5 pages
Dbms
No ratings yet
Dbms
44 pages
Lecture 1
No ratings yet
Lecture 1
22 pages
087 Khushboo
No ratings yet
087 Khushboo
40 pages
CHAPTER 1-Introduction To DBMS - Final
No ratings yet
CHAPTER 1-Introduction To DBMS - Final
49 pages
Lecture - 03 - Database Architecture and Data Models
No ratings yet
Lecture - 03 - Database Architecture and Data Models
25 pages
Course Pack - Introduction To Databases
No ratings yet
Course Pack - Introduction To Databases
41 pages
11 TH
No ratings yet
11 TH
11 pages
Top 70+ SQL Interview Questions and Answers (Mostly Asked)
No ratings yet
Top 70+ SQL Interview Questions and Answers (Mostly Asked)
1 page
CACS101 CFA Unit 4
No ratings yet
CACS101 CFA Unit 4
21 pages
Chapter 2
No ratings yet
Chapter 2
77 pages
Week 1
No ratings yet
Week 1
36 pages
Introduction To DBMS
No ratings yet
Introduction To DBMS
17 pages
Ddbmss
No ratings yet
Ddbmss
21 pages
DBMS Unit 2 Tesseract
No ratings yet
DBMS Unit 2 Tesseract
32 pages
Introduction To Database Systems: Database Systems Lecture 1 Natasha Alechina WWW - Cs.nott - Ac.uk/ nza/G51DBS
No ratings yet
Introduction To Database Systems: Database Systems Lecture 1 Natasha Alechina WWW - Cs.nott - Ac.uk/ nza/G51DBS
24 pages
Ln. 3 - Relational Database Management System Grade 10 CBSE
No ratings yet
Ln. 3 - Relational Database Management System Grade 10 CBSE
19 pages
Database Development (Basic)
No ratings yet
Database Development (Basic)
5 pages
DBMS File
No ratings yet
DBMS File
33 pages
Database Design and Development
No ratings yet
Database Design and Development
74 pages
SQL Material
No ratings yet
SQL Material
56 pages
1 Introduction
No ratings yet
1 Introduction
9 pages
DBI202 Library
No ratings yet
DBI202 Library
3 pages
Database Analysis & Design
No ratings yet
Database Analysis & Design
57 pages
ARE 510 5 Databases
No ratings yet
ARE 510 5 Databases
23 pages
Unit 1 Dbms
No ratings yet
Unit 1 Dbms
45 pages
Dbms
No ratings yet
Dbms
34 pages
PC03 DBMS Notes V-2 by Rajeev
No ratings yet
PC03 DBMS Notes V-2 by Rajeev
17 pages
DBMS Unit1
No ratings yet
DBMS Unit1
30 pages
DBMS Unit 1
No ratings yet
DBMS Unit 1
31 pages
Lec Database
No ratings yet
Lec Database
57 pages
Final Dbms
No ratings yet
Final Dbms
32 pages
CS2255 Notes
No ratings yet
CS2255 Notes
45 pages
CS2255 Notes
No ratings yet
CS2255 Notes
45 pages
Chapter 1k
No ratings yet
Chapter 1k
34 pages
DBMS Lab Manual
No ratings yet
DBMS Lab Manual
61 pages
DBMS Aryan
No ratings yet
DBMS Aryan
33 pages
Database Management System
No ratings yet
Database Management System
49 pages
Wa0001.
No ratings yet
Wa0001.
129 pages
UNIT 1
No ratings yet
UNIT 1
104 pages
A Brief History of Database Systems
100% (1)
A Brief History of Database Systems
4 pages
Lect 1-2pdf
No ratings yet
Lect 1-2pdf
55 pages
Data Storage and Relational Databases
No ratings yet
Data Storage and Relational Databases
14 pages
Basics of SQL
No ratings yet
Basics of SQL
16 pages
Lectrure Series 4 - Mid 2 - Data Resources - (Book - CH 5)
No ratings yet
Lectrure Series 4 - Mid 2 - Data Resources - (Book - CH 5)
32 pages
Ampere INS
No ratings yet
Ampere INS
6 pages
JavaScript Final
No ratings yet
JavaScript Final
65 pages
C Previous Year Qps
No ratings yet
C Previous Year Qps
34 pages
cs550 23f Proj2
No ratings yet
cs550 23f Proj2
10 pages
Python Basics Shwetank Singh PDF
No ratings yet
Python Basics Shwetank Singh PDF
32 pages
TRM - NITIW401 - IoT Web Application
No ratings yet
TRM - NITIW401 - IoT Web Application
92 pages
Industrial Training Summer 2024: K. K. Wagh Polytechnic, Nashik
No ratings yet
Industrial Training Summer 2024: K. K. Wagh Polytechnic, Nashik
38 pages
ABAP Interview Questions
No ratings yet
ABAP Interview Questions
11 pages
COMM SA 1 CLass 12 2024
No ratings yet
COMM SA 1 CLass 12 2024
9 pages
Unit 5 PLSQL
No ratings yet
Unit 5 PLSQL
15 pages
Couse
No ratings yet
Couse
2 pages
Gujarat Technological University
No ratings yet
Gujarat Technological University
2 pages
Xii Cs Chapter4 Solutions
No ratings yet
Xii Cs Chapter4 Solutions
30 pages
Interview Preparation Material
No ratings yet
Interview Preparation Material
43 pages
OOP-1 Answers
No ratings yet
OOP-1 Answers
28 pages
Unit 3
No ratings yet
Unit 3
43 pages
Python 101-PCEP Preparation
No ratings yet
Python 101-PCEP Preparation
49 pages
Chapter 4-Introduction To Programming
No ratings yet
Chapter 4-Introduction To Programming
14 pages
5th LCC ICTO Test
No ratings yet
5th LCC ICTO Test
18 pages
Computer Science P2
No ratings yet
Computer Science P2
16 pages
C++ VIVA Question and Answers
No ratings yet
C++ VIVA Question and Answers
6 pages
APznzaYLsRd8Svkd4QP9GWRBjHWl6JF3tP-DIYCGU7pFD1X6qtAsz7ZX2kQOnid64S7jS975QD-L3XYk3YHaX6yh1fleocZkfr_LBd4bz90MlkJYyWn0n6Pl0YDIRgXKoH-xTdDz_mOXiqy_w72yPfSqcur8RAReuxRPEWSd059J9EaSwx0aY9hUllNjWsY5MwrUpt0qybtFmjwI-kbNsr
No ratings yet
APznzaYLsRd8Svkd4QP9GWRBjHWl6JF3tP-DIYCGU7pFD1X6qtAsz7ZX2kQOnid64S7jS975QD-L3XYk3YHaX6yh1fleocZkfr_LBd4bz90MlkJYyWn0n6Pl0YDIRgXKoH-xTdDz_mOXiqy_w72yPfSqcur8RAReuxRPEWSd059J9EaSwx0aY9hUllNjWsY5MwrUpt0qybtFmjwI-kbNsr
4 pages
PRLD - Home Exam
No ratings yet
PRLD - Home Exam
15 pages
K31 Imperativeprogrammierung
No ratings yet
K31 Imperativeprogrammierung
50 pages
Attribute Grammars - PPL
No ratings yet
Attribute Grammars - PPL
9 pages
CC112 Reviewer
No ratings yet
CC112 Reviewer
1 page
rkCD-Chapter 1 - INTRO TO COMPILERS
No ratings yet
rkCD-Chapter 1 - INTRO TO COMPILERS
11 pages
PPS With Diagram
No ratings yet
PPS With Diagram
17 pages
Bvoc CS 1 C Prog Lab
No ratings yet
Bvoc CS 1 C Prog Lab
11 pages

SQL Material

Uploaded by

SQL Material

Uploaded by

Structured Query

Transitive Dependency: When there is an indirect functional

For a table to satisfy the Boyce-Codd Normal Form, it should satisfy

Referential integrity refers to the quality of

To create a temporary table based on the results of a query, simply

We use the with keyword to give the CTE a name

Recursion is useful when your data is organized as a hierarchy or a

You might also like