Data Manipulation

An application whose database never ran an INSERT, UPDATE or DELETE query would be barely valuable. Although some applications with only static content exist, they are the exception: you will modify data all the time. While data manipulation seems to be the most uncomplicated part of SQL, it still leaves room for improvement in your applications. Always remember that the number of write operations your disk can perform per second is very limited. If you can reduce the operations per second, your application will be much more performant.

This data manipulation chapter will teach you tricks to update rows based on information in other tables, delete duplicate rows, and make your application faster by removing lock contention. You should study the lock contention tip closely, as I have often found it to be a performance problem.

Prevent Lock Contention For Updates On Hot Rows

-- MySQL
INSERT INTO tweet_statistics (
    tweet_id, fanout, likes_count
) VALUES (
    1475870220422107137, FLOOR(RAND() * 10), 1
) ON DUPLICATE KEY UPDATE
    likes_count = likes_count + VALUES(likes_count);

-- PostgreSQL
INSERT INTO tweet_statistics (
    tweet_id, fanout, likes_count
) VALUES (
    1475870220422107137, FLOOR(RANDOM() * 10), 1
) ON CONFLICT (tweet_id, fanout) DO UPDATE
    SET likes_count = tweet_statistics.likes_count + excluded.likes_count;

In some applications, counters, e.g. the likes of a tweet, are updated constantly. During a traffic spike or for trending content, a counter may be updated countless times within a second. Due to the database's concurrency control, these updates start interfering with each other, as a row can only be locked by one transaction (query) at a time. Every update is executed one after another instead of in parallel, as would happen for independent rows.

Instead of updating a single row, the increments are fanned out to e.g. 100 different rows in a special counter table. Write throughput now scales with the number of additional rows the counter is spread across. Those values are later aggregated into a single value and saved in the original column that would otherwise have suffered lock contention.
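
The later aggregation step could look like the following sketch. The tweets table and its likes_count column are assumed names for illustration; both statements should run in a single transaction so that no increments are lost between the sum and the cleanup:

-- Sketch: collapse the fanout rows back into the real counter.
-- tweets and likes_count are illustrative names, not from the snippet above.
BEGIN;

UPDATE tweets
SET likes_count = likes_count + (
    SELECT COALESCE(SUM(likes_count), 0)
    FROM tweet_statistics
    WHERE tweet_id = 1475870220422107137
)
WHERE id = 1475870220422107137;

DELETE FROM tweet_statistics
WHERE tweet_id = 1475870220422107137;

COMMIT;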

Updates Based On A Select Query

-- MySQL
UPDATE products
JOIN categories USING(category_id)
SET price = price_base - price_base * categories.discount;

-- PostgreSQL
UPDATE products
SET price = price_base - price_base * categories.discount
FROM categories
WHERE products.category_id = categories.category_id;

Tables are often not updated in isolation; the new values are based on information stored in other tables. When discounting all products on Black Friday, for example, a different discount is applied for every product category. Instead of the naive approach of executing one update query per category, you can update the products by joining them to their categories. The manual join in the application is replaced by a more efficient one performed by the database.
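
For contrast, the naive approach would issue one statement per category from application code, e.g. (the 20% discount and the category id are illustrative):

-- Naive alternative: one UPDATE per category, executed in a loop
UPDATE products
SET price = price_base - price_base * 0.20
WHERE category_id = 1;
-- ...repeated for every other category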

Notice: I have written a more extensive text about this topic on my database-focused website SqlForDevs.com: UPDATE from a SELECT

Return The Values Of Modified Rows

-- PostgreSQL
DELETE FROM sessions
WHERE ip = '127.0.0.1'
RETURNING id, user_agent, last_access;

Many maintenance operations are based on finding particular rows, processing them (e.g. sending an email or calculating some statistics) and marking them as processed. Typically, a flag within the row is updated, or the row is deleted because it is not needed anymore. This workflow can be simplified by using the RETURNING feature to do the data manipulation and the selection of the data in one step.

This feature is available for DELETE, INSERT and UPDATE queries and always returns the data after the modification, i.e. the inserted or updated data with all triggers executed and generated values available.
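
The mark-as-processed variant of the workflow works the same way with UPDATE. A minimal sketch, assuming a jobs table with a processed flag (both illustrative names):

-- PostgreSQL: flag unprocessed rows and fetch them in one step
UPDATE jobs
SET processed = true
WHERE processed = false
RETURNING id, payload, created_at;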

Notice: This feature is only available for PostgreSQL.

Delete Duplicate Rows

-- MySQL
WITH duplicates AS (
    SELECT id, ROW_NUMBER() OVER(
        PARTITION BY firstname, lastname, email
        ORDER BY age DESC
    ) AS rownum
    FROM contacts
)
DELETE contacts
FROM contacts
JOIN duplicates USING(id)
WHERE duplicates.rownum > 1;

-- PostgreSQL
WITH duplicates AS (
    SELECT id, ROW_NUMBER() OVER(
        PARTITION BY firstname, lastname, email
        ORDER BY age DESC
    ) AS rownum
    FROM contacts
)
DELETE FROM contacts
USING duplicates
WHERE contacts.id = duplicates.id AND duplicates.rownum > 1;

After some time, most applications accumulate duplicated rows, resulting in a bad user experience, higher storage requirements and worse database performance. The cleaning process is usually implemented in application code with complex chunking behavior, as the data does not fit into memory entirely. With a Common Table Expression (CTE), the duplicate rows can be identified and ranked by which ones are most important to keep. A single delete query can then remove all duplicates except the ones you want to keep. The formerly complex logic is done by one simple SQL query.
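
Before running the delete, the same CTE can be reused as a plain SELECT to preview exactly which rows would be removed:

-- Works on MySQL and PostgreSQL: preview the rows the DELETE would remove
WITH duplicates AS (
    SELECT id, ROW_NUMBER() OVER(
        PARTITION BY firstname, lastname, email
        ORDER BY age DESC
    ) AS rownum
    FROM contacts
)
SELECT id
FROM duplicates
WHERE rownum > 1;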

Notice: I have written a more extensive text about this topic on my database-focused website SqlForDevs.com: Delete Duplicate Rows

Table Maintenance After Bulk Modifications

-- MySQL
ANALYZE TABLE users;

-- PostgreSQL
ANALYZE (SKIP_LOCKED) users;

The database needs up-to-date statistics about your tables, like the approximate number of rows, the data distribution of values and more, to calculate the most efficient way to execute your query. Unlike indexes, which are altered automatically whenever a row affecting them is created, updated or deleted, the statistics are not recalculated on every change. A recalculation is only triggered when a threshold of changes to a table is crossed.

Whenever you change a big part of a table, the number of affected rows may still be below the statistics recalculation threshold but significant enough to make the statistics incorrect. Some queries may become very slow because the database chooses a query plan based on the now-incorrect information about the table. Therefore, you should analyze a table after every significant change to trigger the statistics recalculation and ensure fast queries.
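
In practice, the analyze simply follows the bulk statement. A sketch, with an assumed orders table and an arbitrary cutoff date:

-- PostgreSQL: refresh the statistics right after a bulk modification
DELETE FROM orders WHERE created_at < '2020-01-01';
ANALYZE orders; -- MySQL equivalent: ANALYZE TABLE orders;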
