0% found this document useful (0 votes)

104 views7 pages

Surrogate Key Vs Natural Key Differences and When To Use in SQL Server

The document discusses the differences between surrogate keys and natural keys in SQL Server and considerations for when to use each type. It provides an overview of surrogate keys, which are system-generated values without business meaning, and natural keys, which are columns that already exist in the table and have business meaning. The document then lists pros and cons of each type of key, such as surrogate keys being less prone to changes in business requirements but requiring more storage and joins. It concludes that the best approach depends on one's specific requirements, as each key type has similar numbers of advantages and disadvantages.

Uploaded by

elliottjs1091

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

104 views7 pages

Surrogate Key Vs Natural Key Differences and When To Use in SQL Server

Uploaded by

elliottjs1091

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Surrogate Key vs Natural Key Differences and When to

Use in SQL Server

mssqltips.com/sqlservertip/5431/surrogate-key-vs-natural-key-differences-and-when-to-use-in-sql-server/

By: Ben Snaidero | Updated: 2022-01-31 | Comments (6) | Related: More >
Database Design

Problem
If you polled any number of Microsoft SQL Server database professionals and asked the
question, "Which is better when defining a primary key, having surrogate key or natural
key column(s)?", I'd bet the answer would be very close to a 50/50 split. About the only
definitive answer you will get on the subject is most people agree that when implementing
a data warehouse, you have to use surrogate keys for your dimension and fact tables.
This is because a source OLTP relational database can change at any time due to
business requirements and your data warehouse should be able to handle these changes
without needing any updates. This tip will go through some of the pros and cons of each
type of primary key so that you can make a better decision when deciding which one to
implement in your own environments.

Solution

Before we get into the pros and cons let's first make sure we understand the difference
between a surrogate and natural key.

Surrogate Key Overview

A surrogate key is a system generated (could be GUID, sequence, unique identifier, etc.)
value with no business meaning that is used to uniquely identify a record in a table. The
key itself could be made up of one or multiple columns (i.e. Composite Key). The
following diagram shows an example of a table with a surrogate key (AddressID column)
along with some sample data. Notice the key itself has no business meaning, it's just a
sequential integer serving as a unique key.

1/7
Natural Key Overview
A natural key is a column or set of columns that already exist in the table (e.g. they are
attributes of the entity within the data model) and uniquely identify a record in the table.
Since these columns are attributes of the entity they obviously have business meaning.
The following is an example of a table with a natural key (SSN column) along with some
sample data. Notice that the key for the data in this table has business meaning.

Natural Key vs. Surrogate Key for Database Design

Since this topic has been debated for years with no definitive answer as to which is better,
I thought with this tutorial I would put together a list of all the pros and cons of each type
of key. This list can then be used as a reference when deciding what type of key would
be best suited for your own environment/application. After all, everyone's requirements
are different. What works or performs well in one application might not work so well in
another.

Natural Key Pros

2/7
Key values have business meaning and can be used as a search key when
querying the table
Column(s) and primary key index already exist so no disk extra space is required for
the extra column/index that would be used by a surrogate key column
Fewer table joins since join columns have meaning. For example, this can reduce
disk IO by not having to perform extra reads on a lookup table

Natural Key Cons

May need to change/rework key if business requirements change. For example, if
you used SSN for your employee as in the example above and your company
expands outside of the United States not all employees would have a SSN so you
would have to come up with a new key for your database tables.
More difficult to maintain if key requires multiple columns. It's much easier from the
application side dealing with a key column that is constructed with just a single
column.
Poorer performance since key value is usually larger and/or is made up of multiple
columns. Larger keys will require more IO both when inserting/updating data as
well as when you query.
Can't enter record until key value is known. It's sometimes beneficial for an
application to load a placeholder record in one table then load other tables and then
come back and update the main table.
Can sometimes be difficult to pick a good key. There might be multiple candidate
keys each with their own trade-offs when it comes to design and/or performance.

Surrogate Key Pros

No business logic in key so no changes based on business requirements. For
example, if the Employee table above used a integer surrogate key you could
simply add a separate column for SIN if you added an office in Canada (to be used
in place of a Social Security Number)
Less code if maintaining same key strategy across all entities. For example,
application code can be reused when referencing primary keys if they are all
implemented as a sequential integer.
Better performance since key value is smaller. Less disk IO is required on when
accessing single column indexes from an optimization perspective.
Surrogate key is guaranteed to be unique. For example, when moving data
between test systems you don't have to worry about duplicate keys since new key
will be generated as data is inserted.
If a sequence used then there is little index maintenance required since the value is
ever increasing which leads to less index fragmentation.

Surrogate Key Cons

Extra column(s)/index for surrogate key will require extra disk space

3/7
Extra column(s)/index for surrogate key will require extra IO when insert/update
data
Requires more table joins to child tables since data has no meaning on its own.
Can have duplicate values of natural key in table if there is no other unique
constraint defined on the natural key
Difficult to differentiate between test and production data. For example, since
surrogate key values are just auto-generated values with no business meaning it's
hard to tell if someone took production data and loaded it into a test environment.
Key value has no relation to data so technically design breaks 3NF (i.e.
normalization)
The surrogate key value can't be used as a search key
Different implementations are required based on database platform. For example,
SQL Server identity columns are implemented a little bit different than they are in
Postgres or DB2.

Summary
As mentioned above it's easy to see why this continues to be debated. Each type of key
has a similar number of pros and cons. If you read through them though you can see
how based your requirements some of the cons might not even apply in your
environment. If that's the case then it makes it much easier to decide which type of key is
the best fit for your application.

Next Steps
Read more tips on SQL Server constraints
Read other tips on data warehousing
Read more information auto generated keys in SQL Server

4/7
About the author

Ben Snaidero has been a SQL Server and Oracle DBA for over 10 years and focuses on
performance tuning.

This author pledges the content of this article is based on professional experience and
not AI generated.

View all my tips

5/7
Article Last Updated: 2022-01-31

Comments For This Article

Friday, October 9, 2020 - 10:49:14 AM - Fred smith Back To Top (86625)

I am shocked no one pointed out that you shouldn't be storing clear text SSN. Period.

Wednesday, April 18, 2018 - 8:21:47 PM - Joe Celko Back To Top (75731)

I wish more people would read Codd's original work. His definition of a surrogate key is
that it is hidden from the view of the user, and the engine uses it to build the joins or
other constructs. Think of a hash code or something, it's only used by the engine and
never exposed. Unfortunately, the SQL Server community wants to define it is
something they actually build themselves and expose. Obviously, you have to keep the
"natural" keys for data integrity, and then carry the extra burden of the exposed
surrogates. Given modern hardware and software, it's not that much trouble to use
insanely long natural keys for joins.

Monday, April 16, 2018 - 12:19:57 PM - JRStern Back To Top (75714)

Well, here are a couple more very big factors. First, that most SQL Server pros, most
of the time, do use surrogate keys, most frequently an identity int or bigint, sometimes a
GUID. And that they even use this as the clustered PK more often than not.
And second, that they do this for a good reason, and that's because the CK and PK
have special uses in SQL Server, the nonclustered keys go through them, they are
used to validate FKs, and more. SQL Server does not really separate the logical and
physical implementations that well. This causes surrogates to be much more highly
used in SQL Server than might otherwise be true. I'd say also that the optimizer often
has trouble with multi-field indexes, but that's a whole separate discussion.

Monday, April 16, 2018 - 9:28:36 AM - Adel Yousuf Back To Top (75712)

Good Topic

Monday, April 16, 2018 - 3:52:24 AM - Arno Tolmeijer Back To Top (75707)

6/7
Hi Ben,

Great article, but I miss one point: due to security regulations, such as GDPR,
encryption and data masking may influence to usability of a natural key. Greetings,
Arno Tolmeijer

Monday, April 16, 2018 - 2:48:13 AM - Vinod Arvind Bhilare Back To Top (75706)

Hi ,

It help us alot for me to improve my SQL knowledge

7/7

SQL: For Beginners: Your Guide To Easily Learn SQL Programming in 7 Days
From Everand
SQL: For Beginners: Your Guide To Easily Learn SQL Programming in 7 Days
i Code Academy
5/5 (4)
UUID or GUID As Primary Keys Be Careful
100% (1)
UUID or GUID As Primary Keys Be Careful
33 pages
Databilities 1.0 CC by-NC-ND 4.0
No ratings yet
Databilities 1.0 CC by-NC-ND 4.0
14 pages
Sorogate Key
No ratings yet
Sorogate Key
7 pages
Guid For Primary Key
No ratings yet
Guid For Primary Key
11 pages
Keys (DBMS)
No ratings yet
Keys (DBMS)
42 pages
DBMS Keys: Candidate, Super, Primary, Foreign Key Types With Example
No ratings yet
DBMS Keys: Candidate, Super, Primary, Foreign Key Types With Example
7 pages
SQL query1
No ratings yet
SQL query1
12 pages
DBMS Keys
No ratings yet
DBMS Keys
16 pages
What Are Keys in DBMS
No ratings yet
What Are Keys in DBMS
8 pages
Session 16 - Keys
No ratings yet
Session 16 - Keys
8 pages
Chapter 1 DBMS - Fall23
No ratings yet
Chapter 1 DBMS - Fall23
30 pages
Lecture 12
No ratings yet
Lecture 12
17 pages
Why we need a Key
No ratings yet
Why we need a Key
7 pages
9-Keys in DBMS-05-08-2024
No ratings yet
9-Keys in DBMS-05-08-2024
14 pages
Keys
No ratings yet
Keys
22 pages
7275 Keys
No ratings yet
7275 Keys
4 pages
Practical Database Design
100% (6)
Practical Database Design
13 pages
Entity Relationship Model and Diagram
No ratings yet
Entity Relationship Model and Diagram
8 pages
Keys and Notations
No ratings yet
Keys and Notations
10 pages
Key Constraint
No ratings yet
Key Constraint
6 pages
Keys
No ratings yet
Keys
17 pages
Dot Net Tricks
No ratings yet
Dot Net Tricks
6 pages
SQL Programming & Database Management For Noobee
From Everand
SQL Programming & Database Management For Noobee
Kishor Sarkar X
No ratings yet
5 slide
No ratings yet
5 slide
20 pages
DBMS Keys
No ratings yet
DBMS Keys
8 pages
Keys
No ratings yet
Keys
3 pages
Lecture 6 - DBMS Keys Primary, Candidate, Super, Alternate and Foreign
No ratings yet
Lecture 6 - DBMS Keys Primary, Candidate, Super, Alternate and Foreign
17 pages
What Is RDBMS (Relational Database Management System) ?
No ratings yet
What Is RDBMS (Relational Database Management System) ?
54 pages
BCDE103 Keys Class Notes
No ratings yet
BCDE103 Keys Class Notes
2 pages
Fundamentals of Database Systems: Prepared By: Ms. Roda Flor Andrea B. Teodocio
No ratings yet
Fundamentals of Database Systems: Prepared By: Ms. Roda Flor Andrea B. Teodocio
30 pages
DBMS_KCS501_Notes Unit-1
No ratings yet
DBMS_KCS501_Notes Unit-1
44 pages
DBMS_Keys
No ratings yet
DBMS_Keys
13 pages
DBMS Keys.
No ratings yet
DBMS Keys.
16 pages
UNIT-2pdf
No ratings yet
UNIT-2pdf
25 pages
Normalization Part I
No ratings yet
Normalization Part I
60 pages
Unit 3 DBMS Keys Join Aggregate View Clauses
No ratings yet
Unit 3 DBMS Keys Join Aggregate View Clauses
23 pages
DBMS Proficiency
No ratings yet
DBMS Proficiency
8 pages
Unit 2_DBMS Notes for Students
No ratings yet
Unit 2_DBMS Notes for Students
71 pages
Keys Overview
No ratings yet
Keys Overview
7 pages
DBMS Keys
No ratings yet
DBMS Keys
11 pages
12 RDBMS
No ratings yet
12 RDBMS
8 pages
Unit-3 Relational Data Model
No ratings yet
Unit-3 Relational Data Model
24 pages
Unit2-Relational Model-Part1
No ratings yet
Unit2-Relational Model-Part1
30 pages
Keys in Database: By-Suraj Dewasi Prachi Singh Rathore
No ratings yet
Keys in Database: By-Suraj Dewasi Prachi Singh Rathore
17 pages
Types of Keys in Database Management System: Sos in Computer Science and Application Pgdca 203: Dbms
No ratings yet
Types of Keys in Database Management System: Sos in Computer Science and Application Pgdca 203: Dbms
11 pages
Section 6 - Introduction to Databases
No ratings yet
Section 6 - Introduction to Databases
27 pages
Database Constraints: What Are Keys?
No ratings yet
Database Constraints: What Are Keys?
8 pages
Database
No ratings yet
Database
5 pages
DB بحث
No ratings yet
DB بحث
10 pages
Database Management System (BCAC401)
No ratings yet
Database Management System (BCAC401)
7 pages
SQL1
No ratings yet
SQL1
91 pages
INTools_Administration
No ratings yet
INTools_Administration
24 pages
Keys in DBMS
No ratings yet
Keys in DBMS
5 pages
I-BSC[DBMS] [UNIT - 3 FULL}
No ratings yet
I-BSC[DBMS] [UNIT - 3 FULL}
17 pages
Peformance Tuning Tips
No ratings yet
Peformance Tuning Tips
16 pages
Difference Between Primary Key and Unique Key
No ratings yet
Difference Between Primary Key and Unique Key
4 pages
Database Systems DBMS KEYS 7
No ratings yet
Database Systems DBMS KEYS 7
15 pages
sql Command and constant
No ratings yet
sql Command and constant
6 pages
DBMS Unit-3
No ratings yet
DBMS Unit-3
66 pages
DBMS-Note-3
No ratings yet
DBMS-Note-3
8 pages
267903202
No ratings yet
267903202
27 pages
Healthy SQL Server
No ratings yet
Healthy SQL Server
26 pages
Mongodb: Presented By: Josmi Agnes Jose Roll Number: 20bda27
No ratings yet
Mongodb: Presented By: Josmi Agnes Jose Roll Number: 20bda27
27 pages
Datastage Faq
No ratings yet
Datastage Faq
202 pages
unit 5 - part 1
No ratings yet
unit 5 - part 1
25 pages
CCIDF Syllabus
No ratings yet
CCIDF Syllabus
2 pages
Sat - 100.Pdf - Prediction of Cyber Attacks Using Data Science Technique
No ratings yet
Sat - 100.Pdf - Prediction of Cyber Attacks Using Data Science Technique
11 pages
Iso Iec - 26514 2008
No ratings yet
Iso Iec - 26514 2008
11 pages
PCVL Brgy 1207005
No ratings yet
PCVL Brgy 1207005
13 pages
BG - Product Designer
No ratings yet
BG - Product Designer
1 page
IJRPR13360
No ratings yet
IJRPR13360
8 pages
Chapter 4 Self Test AIS
No ratings yet
Chapter 4 Self Test AIS
5 pages
Zupic Cater 2015 - Bibliometric Methods in Management and Organization
No ratings yet
Zupic Cater 2015 - Bibliometric Methods in Management and Organization
46 pages
Chapter 3 - Global Internet
No ratings yet
Chapter 3 - Global Internet
32 pages
University of Turku Theme Beamer Template Unofficial 1
No ratings yet
University of Turku Theme Beamer Template Unofficial 1
23 pages
Landing Page Handbook
100% (3)
Landing Page Handbook
277 pages
Supervised Learning in R Classification
No ratings yet
Supervised Learning in R Classification
7 pages
SP3D Admin Syllbus For Kagira & Onlinepiping PDF
No ratings yet
SP3D Admin Syllbus For Kagira & Onlinepiping PDF
7 pages
Access Test
No ratings yet
Access Test
8 pages
IT 405 (DBMS) Unit I Notes - 1615635798
No ratings yet
IT 405 (DBMS) Unit I Notes - 1615635798
15 pages
MS Purview Data Governance Roadmap
No ratings yet
MS Purview Data Governance Roadmap
9 pages
What Is Data Analysis
No ratings yet
What Is Data Analysis
6 pages
MSBI Online Training PDF
No ratings yet
MSBI Online Training PDF
8 pages
KDD98-012
No ratings yet
KDD98-012
7 pages
9 Skills Every Business Analytics Professional Needs - Harvard Business Analytics Program
No ratings yet
9 Skills Every Business Analytics Professional Needs - Harvard Business Analytics Program
5 pages
Microsoft SQL Server 2005 Express Edition For Dummies (2006)
100% (1)
Microsoft SQL Server 2005 Express Edition For Dummies (2006)
412 pages
Bitalag Integrated School Alumni Tracer With Sms Notification
No ratings yet
Bitalag Integrated School Alumni Tracer With Sms Notification
4 pages
Sensor Web 2.0 As A Service For Internet of Things: I-Ching Hsu Yu-Ju Su
No ratings yet
Sensor Web 2.0 As A Service For Internet of Things: I-Ching Hsu Yu-Ju Su
4 pages
Practical File IT-402
No ratings yet
Practical File IT-402
40 pages

Surrogate Key Vs Natural Key Differences and When To Use in SQL Server

Uploaded by

Surrogate Key Vs Natural Key Differences and When To Use in SQL Server

Uploaded by

Surrogate Key vs Natural Key Differences and When to

Use in SQL Server

Surrogate Key Overview

Natural Key vs. Surrogate Key for Database Design

Natural Key Pros

Natural Key Cons

Surrogate Key Pros

Surrogate Key Cons

View all my tips

Comments For This Article

Friday, October 9, 2020 - 10:49:14 AM - Fred smith Back To Top (86625)

Monday, April 16, 2018 - 12:19:57 PM - JRStern Back To Top (75714)

It help us alot for me to improve my SQL knowledge

You might also like