0% found this document useful (0 votes)

19 views79 pages

Unit-III - SQL & Schema Refinement

Sql

Uploaded by

chinnub2006

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views79 pages

Unit-III - SQL & Schema Refinement

Sql

Uploaded by

chinnub2006

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 79

Unit-III : SQL & Schema

Refinement
Form of basic SQL query
Regular expressions in the SELECT Command
SQL provides support for pattern matching through the LIKE operator, along with the use of the wild-card symbols.

Regular expressions: is a sequence of characters that define a search pattern, mainly for use in pattern matching with strings, or string
matching.

Examples:
Finds Names that start or ends with "a“
Finds names that start with "a" and are at least 3 characters in length.

LIKE: The LIKE operator is used in a 'WHERE' clause to search for a specified pattern in a column

wild-card: There are two primary wildcards used in conjunction with the `LIKE` operator

percent sign (%) Represents zero, one, or multiple characters

underscore sign(_) Represents a single character

UNION operator
The UNION operator is used to combine the result sets of two or more SELECT statements. However, it will
only select distinct values. The UNION operator selects only distinct values by default. If you want to allow
duplicate values, you can use UNION ALL
INTERSECT
INTERSECT
EXCEPT
Nested Queries
● A nested query is a query within another query. Nested query allows for more complex and specific data
retrieval.
● In SQL, a nested query involves a query that is placed within another query.
● Output of the inner query is used by the outer query.
● A nested query has two SELECT statements: one for the inner query and another for the outer query.
Syntax of Nested Queries
Types of Nested Queries in SQL
Independent Nested Queries
In independent nested queries, the execution order is from the innermost query to the outer query. An outer query
won't be executed until its inner query completes its execution. The outer query uses the result of the inner query.
Operators such as IN, NOT IN, ALL, and ANY are used to write independent nested queries.

● The IN operator checks if a column value in the outer query's result is present in the inner query's
result. The final result will have rows that satisfy the IN condition.
● The NOT IN operator checks if a column value in the outer query's result is not present in the inner
query's result. The final result will have rows that satisfy the NOT IN condition.
● The ALL operator compares a value of the outer query's result with all the values of the inner query's
result and returns the row if it matches all the values.
● The ANY operator compares a value of the outer query's result with all the inner query's result
values and returns the row if there is a match with any value.
Co-related Nested Queries
In co-related nested queries, the inner query uses the values from the outer query to execute the inner query
for every row processed by the outer query. The co-related nested queries run slowly because the inner query
is executed for every row of the outer query's result.
Aggregation Operators / Functions

○ SQL aggregation function is used to perform the calculations on multiple rows of a single column of a
table. It returns a single value.
○ It is also used to summarize the data.
Aggregation Operators

● Aggregation operators are used to perform operations on a group of values to return a single
summarizing value. The most common aggregation operators include COUNT, SUM, AVG, MIN, and
MAX.
SELECT COUNT(*)
FROM PRODUCT_MAST;

SELECT COUNT(*)
FROM PRODUCT_MAST;
WHERE RATE>=20;

SELECT COUNT(DISTINCT COMPANY)

FROM PRODUCT_MAST;

SELECT COMPANY, COUNT(*)

FROM PRODUCT_MAST
GROUP BY COMPANY;
Triggers
● A trigger is a predefined action that the database automatically executes in response to certain events on
a particular table or view. Triggers are typically used to maintain the integrity of the data, automate
data-related tasks, and extend the database functionalities.
● There are various types of triggers based on when they are executed:

BEFORE: Trigger is executed before the triggering event.

AFTER: Trigger is executed after the triggering event.
INSTEAD OF: Trigger is used to override the triggering event, primarily for views.

● They can also be categorized by the triggering event:

INSERT: Trigger is executed when a new row is inserted.

UPDATE: Trigger is executed when a row is updated.
DELETE: Trigger is executed when a row is deleted.
the basic syntax for creating a trigger in SQL, using MySQL as an

trigger_name: Name of the trigger.

trigger_time: BEFORE, AFTER, or INSTEAD OF.

trigger_event: INSERT, UPDATE, or DELETE.

table_name: The name of the table associated with the trigger.

trigger_body: The set of SQL statements to be executed.

Key Features of Triggers
1. Automatic Execution: Triggers run automatically in response to data modification events. You don't have to explicitly call them.
2. Event-Driven: They are defined to execute before or after INSERT, UPDATE, and DELETE events.
3. Transitional Access: Triggers can access the "old" (pre-modification) and "new" (post-modification) values of the rows
affected.

Example of a Trigger
Suppose we have an `Employees` table and we want to maintain an `AuditLog` table that keeps a record of
salary changes for employees.
How the Trigger Works

- The trigger is named `AfterSalaryUpdate`.

- It activates ÀFTER` an ÙPDATE` on the Èmployees` table.
- It compares the old and new salary values. If there's a change (ÒLD.Salary != NEW.Salary`), it inserts a new record into the
ÀuditLog` table with the details of the change and the current date and time (`NOW()`).

With this trigger in place, every time an employee's salary is updated in the `Employees` table, an entry is automatically added
to the `AuditLog` table recording the change.
Active Databases

An active database is a database that uses triggers and other event-driven functionalities. The term "active" signifies that the DBMS reacts
automatically to changes in data and predefined events. Triggers are a primary mechanism that makes a database "active."

Key Features of Active Databases

1. Event-Condition-Action (ECA) Rule: This is the foundational concept of active databases. When a specific event occurs, the database
checks a particular condition, and if that condition is met, an action is executed.
2. Reactive Behavior: The database can react to changes without external applications or users having to intervene, thanks to the ECA rules.
3. Flexibility: Active databases provide more flexibility in data management and ensure better data integrity and security.

Why are Active Databases Important?

● Integrity Maintenance: Active databases can enforce more complex business rules that can't be enforced using standard integrity
constraints.
● Automation: They can automate certain tasks, reducing manual interventions.
● Alerts: They can notify users or applications when specific conditions are met.
Relation between Triggers and Active Databases

● Triggers are what give an active database its "active" nature. The ability of the database to
react to events automatically is primarily because of triggers that execute in response to
these events.
● In essence, while "trigger" refers to the specific procedural code blocks that run in response
to events, "active database" refers to the broader capability of a DBMS to support and use
such event-driven functionalities.
Data Redundancy
● Data redundancy means the occurrence of duplicate copies of similar data. It is done intentionally to keep
the same piece of data at different places, or it occurs accidentally.
● In DBMS, when the same data is stored in different tables, it causes data redundancy.
● Sometimes, it is done on purpose for recovery or backup of data, faster access of data, or updating data
easily. Redundant data costs extra money, demands higher storage capacity, and requires extra effort to
keep all the ﬁles up to date.
● Sometimes, unintentional duplicity of data causes a problem for the database to work properly, or it may
become harder for the end user to access data. Redundant data unnecessarily occupy space in the
database to save identical copies, which leads to space constraints, which is one of the major problems.
● In the below example, there is a "Student" table that contains data such as "Student_id", "Name", "Course",
"Session", "Fee", and "Department". As you can see, some data is repeated in the table, which causes
redundancy.
Problems caused by redundancy
Data redundancy in databases refers to the unnecessary duplication of data. It can arise from poor database design or lack of proper normalization.
Redundancy can cause several issues:

Problems Caused by Redundancy

1. Wasted Storage

Storing duplicate data consumes more storage than necessary.

2. Data Anomalies

These are inconsistencies that arise due to redundancy.

● Update Anomalies: When you have the same piece of data stored in multiple places, updating it in one place can lead to inconsistency if it's not updated
everywhere.
● Insertion Anomalies: You might have to insert redundant data in multiple places, leading to inconsistencies.
● Deletion Anomalies: Deleting data in one table might unintentionally remove necessary data that's needed elsewhere.
3. Increased Complexity

Querying and maintaining redundant data can be more complex.

4. Performance Issues

Duplicate data can slow down search, update, and insert operations.

5. Data Integrity Issues

If data is inconsistent across tables, it can lead to data integrity issues.

Example for Problems Caused by Redundancy:
Let's consider a simplistic example. Suppose you have a table called "Orders" with the following structure and data:

| OrderID | CustomerName | Product | CustomerAddress |

|---------|--------------|-----------|-------------------|

| 1 | Madhu | Laptop | Hyderabad |

| 2 | Madhu | Mouse | Hyderabad |

| 3 | Naveen | Keyboard | Bengaluru |

From the table:

Redundancy: The `CustomerName` "Madhu" and his `CustomerAddress` "Hyderabad" are repeated for two orders.
Problems:

1. Update Anomaly: If Madhu moves to a new address, you'd have to update multiple rows. If you forget to update all the rows, it leads to inconsistent data.
2. Insertion Anomaly: To insert a new order for Madhu, you have to re-enter his address, leading to further redundancy.
3. Deletion Anomaly: If you decide to delete the order with the mouse, you might be tempted to delete Madhu's details entirely, but that would remove crucial data associated with the laptop order.

Solution:

Normalizing the database can resolve these problems. In this example, splitting the table into two tables, `Orders` and `Customers`, would be a start:

1. Customers Table:

| CustomerID | CustomerName | CustomerAddress |

|------------|--------------|-------------------|

| 101 | Madhu | Hyderabad |

| 102 | Naveen | Bengaluru |

2. Orders Table:

| OrderID | CustomerID | Product |

|---------|------------|----------|

| 1 | 101 | Laptop |

| 2 | 101 | Mouse |

| 3 | 102 | Keyboard |

This design reduces redundancy and eliminates the anomalies.

Decompositions and its problems

● Decomposition in the context of database design refers to the process of breaking down a single table into multiple tables in order to eliminate
redundancy, reduce data anomalies, and achieve normalization. Decomposition is typically done using rules defined by normalization forms.
● However, while decomposition can be helpful, it is not without challenges. Done incorrectly, decomposition can lead to its own set of problems.

Problems Related to Decomposition

1. Loss of Information
● Non-loss decomposition: When a relation is decomposed into two or more smaller relations, and the original relation can be perfectly reconstructed
by taking the natural join of the decomposed relations, then it is termed as lossless decomposition. If not, it is termed "lossy decomposition."
● Example: Let's consider a table `R(A, B, C)` with a dependency `A → B`. If you decompose it into `R1(A, B)` and `R2(B, C)`, it would be lossy
because you can't recreate the original table using natural joins.
Functional Dependencies and its reasoning
Functional dependencies play a vital role in the normalization process in relational database design. They help in defining the relationships between
attributes in a relation and are used to formalize the properties of the relation and drive the process of decomposition.

Functional Dependencies (FD)

A functional dependency `\( X \rightarrow Y \)` between two sets of attributes X and Y in a relation R is defined as: if two tuples (rows) of R have the
same value for attributes X, then they must also have the same values for attributes Y. In other words, the values of X determine the values of Y.

1. sid functionally determines sname because for a given

student ID, there's only one possible student name
2. zipcode functionally determines cityname, a specific zip code
should determine a unique cityname
3. cityname functionally determines state, A city name could
determine a state.
4. Mathematically, these functional dependencies can be
represented as:
5. \( sid \rightarrow sname \)
\( zipcode \rightarrow cityname \)
Reasoning About Functional Dependencies
Introduction to Normal Forms
● In database management systems (DBMS), the concept of normalization is employed to organize
relational databases efficiently and to eliminate redundant data, ensure data dependency, and ensure
data integrity. The process of normalization is divided into several stages, called "normal forms." Each
normal form has a specific set of rules and criteria that a database schema must meet.
● Normalization often involves trade-offs. While higher normal forms eliminate redundancy and improve
data integrity, they can also result in more complex relational schemas and sometimes require more
joins, which can affect performance. As such, it's essential to understand the data and the specific
application's requirements when deciding the level of normalization suitable for a particular situation.
Sometimes, denormalization (intentionally introducing redundancy) is implemented to improve
performance, especially in read-heavy databases.

Types of Normal Forms

1. First Normal Form (1NF)

2. Second Normal Form (2NF)
3. Third Normal Form (3NF)
4. Boyce-Codd Normal Form (BCNF)
5. Fourth Normal Form (4NF)
6. Fifth Normal Form (5NF or Project-Join Normal Form - PJNF)
7. Sixth Normal Form (6NF)
First Normal Form (1NF) in DBMS
Second Normal Form (2NF) in DBMS
Example for Second Normal Form
Third Normal Form (3NF)
Boyce-Codd Normal Form (BCNF) in DBMS
Fourth Normal Form (4NF) in DBMS
Fifth Normal Form (5NF or PJNF) in DBMS
Now, these decomposed tables eliminate the redundancy caused by the specific constraints and join dependencies of the original
relation. When you take the natural join of these tables, you will get back the original table.
It's worth noting that reaching 5NF can lead to an increased number of tables, which can complicate queries and database
operations. Thus, achieving 5NF should be a conscious decision made based on the specific requirements and constraints of a
given application.

Complex Integrity Constraints in SQL
No ratings yet
Complex Integrity Constraints in SQL
8 pages
SQL Notes
No ratings yet
SQL Notes
66 pages
SQL QuickStart Guide The Simplified Beginner s Guide to Managing Analyzing and Manipulating Data With SQL 1st Edition Shields 2024 scribd download
100% (2)
SQL QuickStart Guide The Simplified Beginner s Guide to Managing Analyzing and Manipulating Data With SQL 1st Edition Shields 2024 scribd download
55 pages
SQL Test 2 - Copy (2)
No ratings yet
SQL Test 2 - Copy (2)
11 pages
SQL For Beginners SQL Made Easy For Data Analysis
No ratings yet
SQL For Beginners SQL Made Easy For Data Analysis
21 pages
RDBMS Assignment1 - Oct 2024
No ratings yet
RDBMS Assignment1 - Oct 2024
5 pages
Get Data Modeling and Database Design 2nd Edition, (Ebook PDF) Free All Chapters
100% (5)
Get Data Modeling and Database Design 2nd Edition, (Ebook PDF) Free All Chapters
37 pages
Triggers and Active Data Bases in DBMS
No ratings yet
Triggers and Active Data Bases in DBMS
4 pages
ITE 302 SemiFinal Exam
No ratings yet
ITE 302 SemiFinal Exam
3 pages
SQL - Quick Guide
No ratings yet
SQL - Quick Guide
11 pages
Dbms Imp Notes
0% (1)
Dbms Imp Notes
5 pages
BEST DBMS
No ratings yet
BEST DBMS
177 pages
CSE 303 Lec 10 DesignTheory
No ratings yet
CSE 303 Lec 10 DesignTheory
63 pages
Pgdca Project by Sumoti Das
No ratings yet
Pgdca Project by Sumoti Das
43 pages
Question Bank For DBMS CIT II 2 Mark Ans-1
No ratings yet
Question Bank For DBMS CIT II 2 Mark Ans-1
2 pages
SQL Cheatshet
100% (1)
SQL Cheatshet
15 pages
What is a Trigger
No ratings yet
What is a Trigger
34 pages
Database Management System
No ratings yet
Database Management System
80 pages
DBMS-1
No ratings yet
DBMS-1
31 pages
Candidate Key
No ratings yet
Candidate Key
8 pages
Assignment_4_NPTEL_DBMS_January_2025
No ratings yet
Assignment_4_NPTEL_DBMS_January_2025
10 pages
SQL Select Null
No ratings yet
SQL Select Null
55 pages
DBMS Unit-2
No ratings yet
DBMS Unit-2
27 pages
SQL More Notes-2
No ratings yet
SQL More Notes-2
23 pages
DBMS Unit 3
No ratings yet
DBMS Unit 3
11 pages
IS273: Database Systems Spring 2020 Complex SQL
No ratings yet
IS273: Database Systems Spring 2020 Complex SQL
54 pages
3 Notes of 3 Unit
No ratings yet
3 Notes of 3 Unit
36 pages
DBMS MID2
No ratings yet
DBMS MID2
59 pages
SQL Concepts
No ratings yet
SQL Concepts
2 pages
DBMS Lec 32 1NF and 2NF
No ratings yet
DBMS Lec 32 1NF and 2NF
20 pages
Relational Design
No ratings yet
Relational Design
50 pages
Databaseexam PDF
No ratings yet
Databaseexam PDF
3 pages
SQL Aggregate Functions
No ratings yet
SQL Aggregate Functions
52 pages
Concepts of Keys in ER Model
No ratings yet
Concepts of Keys in ER Model
5 pages
SQL 1
No ratings yet
SQL 1
14 pages
Constructing OLAP Cubes Based On Queries: Tapio Niemi Jyrki Nummenmaa Peter Than&h
No ratings yet
Constructing OLAP Cubes Based On Queries: Tapio Niemi Jyrki Nummenmaa Peter Than&h
7 pages
DBMS4
No ratings yet
DBMS4
17 pages
A Simple Guide To Five Normal Forms in Relational Database Theory
No ratings yet
A Simple Guide To Five Normal Forms in Relational Database Theory
13 pages
Unit3db
No ratings yet
Unit3db
22 pages
Structured Query Language
No ratings yet
Structured Query Language
29 pages
dbms
No ratings yet
dbms
15 pages
Module3 Dbms
No ratings yet
Module3 Dbms
192 pages
Learn
No ratings yet
Learn
31 pages
2. View Index Stored Procedure and Trigger
No ratings yet
2. View Index Stored Procedure and Trigger
26 pages
functional dependency and Attribute Closure (1)
No ratings yet
functional dependency and Attribute Closure (1)
7 pages
SQL Interview Preparation
No ratings yet
SQL Interview Preparation
15 pages
DBMS M2 Final
No ratings yet
DBMS M2 Final
45 pages
Nested Queries
No ratings yet
Nested Queries
29 pages
Normalization For Online Examination Database
No ratings yet
Normalization For Online Examination Database
48 pages
Cognizant Interview Guide (1)
No ratings yet
Cognizant Interview Guide (1)
8 pages
GATE Questions-DBMS-Functional Dependency
No ratings yet
GATE Questions-DBMS-Functional Dependency
6 pages
Cursors, Triggers
No ratings yet
Cursors, Triggers
30 pages
Unit 5
No ratings yet
Unit 5
75 pages
SQL_part_2__1733732359
No ratings yet
SQL_part_2__1733732359
10 pages
AD3391 - Quest_Bank_DDM_updated
No ratings yet
AD3391 - Quest_Bank_DDM_updated
21 pages
Basic Commands of SQL
No ratings yet
Basic Commands of SQL
63 pages
Functional Dependency
No ratings yet
Functional Dependency
17 pages
SQL Interview
No ratings yet
SQL Interview
9 pages
DBMS LAB
No ratings yet
DBMS LAB
36 pages
DBMS 3
No ratings yet
DBMS 3
56 pages
14 Triggers
No ratings yet
14 Triggers
26 pages
UNIT-3
No ratings yet
UNIT-3
64 pages
SQL Quick Guide PDF
No ratings yet
SQL Quick Guide PDF
7 pages
DBMS
No ratings yet
DBMS
12 pages
Hospital Management System
No ratings yet
Hospital Management System
22 pages
Basdat II - Pertemuan III
No ratings yet
Basdat II - Pertemuan III
27 pages
Database Cheat Sheet
No ratings yet
Database Cheat Sheet
4 pages
Advanced SQL Concepts
No ratings yet
Advanced SQL Concepts
38 pages
SQL Basics 1: Relational Database Management System
No ratings yet
SQL Basics 1: Relational Database Management System
10 pages
What Are The Different Types of Joins? What Is The Difference Between Them? Inner Join
No ratings yet
What Are The Different Types of Joins? What Is The Difference Between Them? Inner Join
9 pages
SQL Operators Functions and Keywords
No ratings yet
SQL Operators Functions and Keywords
9 pages
RDBMS Notes
88% (108)
RDBMS Notes
68 pages
RDBMS and DBMS Concepts
No ratings yet
RDBMS and DBMS Concepts
5 pages
Triggers&Assertion
No ratings yet
Triggers&Assertion
33 pages
DBMS UNIT-3 Notes
100% (3)
DBMS UNIT-3 Notes
45 pages
4.what Is Normalization PDF
No ratings yet
4.what Is Normalization PDF
9 pages
Dbms MCQ 01: Database Administrator
100% (1)
Dbms MCQ 01: Database Administrator
84 pages
Vinayaka - Dbms - M.tech Key
No ratings yet
Vinayaka - Dbms - M.tech Key
15 pages
Active DB
No ratings yet
Active DB
46 pages
Procedural Extension To SQL Using Triggers - Lecture 2: DR Akhtar Ali
No ratings yet
Procedural Extension To SQL Using Triggers - Lecture 2: DR Akhtar Ali
28 pages
UNIT-IV-MCA-305-ADVANCED DBMS
No ratings yet
UNIT-IV-MCA-305-ADVANCED DBMS
15 pages
Week 5
No ratings yet
Week 5
52 pages
1ST and 2ND Quarter Prog F2F Final Version
No ratings yet
1ST and 2ND Quarter Prog F2F Final Version
56 pages
Non-Prime Attribute: Attributes Called Attributes Attribute Not Any Candidate Key Is Called Attribute
No ratings yet
Non-Prime Attribute: Attributes Called Attributes Attribute Not Any Candidate Key Is Called Attribute
5 pages
SQL Alias:: To Make Selected Columns More Readable.
No ratings yet
SQL Alias:: To Make Selected Columns More Readable.
17 pages
Two Marks Questions With Answers
100% (2)
Two Marks Questions With Answers
18 pages
Active Database
No ratings yet
Active Database
30 pages
Chapter 15: Basics of Functional Dependencies and Normalization For Relational Databases
No ratings yet
Chapter 15: Basics of Functional Dependencies and Normalization For Relational Databases
65 pages
SQL SELECT Statement
No ratings yet
SQL SELECT Statement
5 pages
DBMS Lab Manual
From Everand
DBMS Lab Manual
Jitendra Patel
1.5/5 (3)

Unit-III - SQL & Schema Refinement

Uploaded by

Unit-III - SQL & Schema Refinement

Uploaded by

Unit-III : SQL & Schema

percent sign (%) Represents zero, one, or multiple characters

underscore sign(_) Represents a single character

SELECT COUNT(DISTINCT COMPANY)

SELECT COMPANY, COUNT(*)

BEFORE: Trigger is executed before the triggering event.

● They can also be categorized by the triggering event:

INSERT: Trigger is executed when a new row is inserted.

trigger_name: Name of the trigger.

trigger_time: BEFORE, AFTER, or INSTEAD OF.

trigger_event: INSERT, UPDATE, or DELETE.

table_name: The name of the table associated with the trigger.

trigger_body: The set of SQL statements to be executed.

- The trigger is named `AfterSalaryUpdate`.

Key Features of Active Databases

Why are Active Databases Important?

Problems Caused by Redundancy

Storing duplicate data consumes more storage than necessary.

These are inconsistencies that arise due to redundancy.

Querying and maintaining redundant data can be more complex.

5. Data Integrity Issues

If data is inconsistent across tables, it can lead to data integrity issues.

| OrderID | CustomerName | Product | CustomerAddress |

| 1 | Madhu | Laptop | Hyderabad |

| 2 | Madhu | Mouse | Hyderabad |

| 3 | Naveen | Keyboard | Bengaluru |

From the table:

| CustomerID | CustomerName | CustomerAddress |

| 101 | Madhu | Hyderabad |

| 102 | Naveen | Bengaluru |

| OrderID | CustomerID | Product |

This design reduces redundancy and eliminates the anomalies.

Problems Related to Decomposition

Functional Dependencies (FD)

1. sid functionally determines sname because for a given

Types of Normal Forms

1. First Normal Form (1NF)

You might also like