SQL Advance: The TOP Clause
SQL Advance: The TOP Clause
The TOP clause can be very useful on large tables with thousands of records. Returning a large number of records can impact on
performance.
MySQL Syntax
SELECT column_name(s)
FROM table_name
LIMIT number
Example
SELECT *
FROM Persons
LIMIT 5
Oracle Syntax
SELECT column_name(s)
FROM table_name
WHERE ROWNUM <= number
Example
SELECT *
FROM Persons
WHERE ROWNUM <=5
Now we want to select only the two first records in the table above.
Now we want to select only 50% of the records in the table above.
The LIKE operator is used in a WHERE clause to search for a specified pattern in a column.
Now we want to select the persons living in a city that starts with "s" from the table above.
The "%" sign can be used to define wildcards (missing letters in the pattern) both before and after the pattern.
Next, we want to select the persons living in a city that ends with an "s" from the "Persons" table.
Next, we want to select the persons living in a city that contains the pattern "tav" from the "Persons" table.
It is also possible to select the persons living in a city that NOT contains the pattern "tav" from the "Persons" table, by using the NOT
keyword.
SQL Wildcards
SQL Wildcards
SQL wildcards can substitute for one or more characters when searching for data in a database.
Wildcard Description
% A substitute for zero or more characters
_ A substitute for exactly one character
[charlist] Any single character in charlist
[^charlist] Any single character not in charlist
or
[!charlist]
Now we want to select the persons living in a city that starts with "sa" from the "Persons" table.
Next, we want to select the persons living in a city that contains the pattern "nes" from the "Persons" table.
Now we want to select the persons with a first name that starts with any character, followed by "la" from the "Persons" table.
Next, we want to select the persons with a last name that starts with "S", followed by any character, followed by "end", followed by
any character, followed by "on" from the "Persons" table.
Now we want to select the persons with a last name that starts with "b" or "s" or "p" from the "Persons" table.
Next, we want to select the persons with a last name that do not start with "b" or "s" or "p" from the "Persons" table.
SQL IN Operator
The IN Operator
SQL IN Syntax
SELECT column_name(s)
FROM table_name
WHERE column_name IN (value1,value2,...)
IN Operator Example
Now we want to select the persons with a last name equal to "Hansen" or "Pettersen" from the table above.
The BETWEEN operator is used in a WHERE clause to select a range of data between two values.
The BETWEEN operator selects a range of data between two values. The values can be numbers, text, or dates.
SELECT column_name(s)
FROM table_name
WHERE column_name
BETWEEN value1 AND value2
Now we want to select the persons with a last name alphabetically between "Hansen" and "Pettersen" from the table above.
In some databases, persons with the LastName of "Hansen" or "Pettersen" will not be listed, because the BETWEEN operator only
selects fields that are between and excluding the test values).
In other databases, persons with the LastName of "Hansen" or "Pettersen" will be listed, because the BETWEEN operator selects
fields that are between and including the test values).
And in other databases, persons with the LastName of "Hansen" will be listed, but "Pettersen" will not be listed (like the example
above), because the BETWEEN operator selects fields between the test values, including the first test value and excluding the last test
value.
Example 2
To display the persons outside the range in the previous example, use NOT BETWEEN:
SQL Alias
SQL Alias
You can give a table or a column another name by using an alias. This can be a good thing to do if you have very long or complex
table names or column names.
SELECT column_name(s)
FROM table_name
AS alias_name
Alias Example
Assume we have a table called "Persons" and another table called "Product_Orders". We will give the table aliases of "p" and "po"
respectively.
Now we want to list all the orders that "Ola Hansen" is responsible for.
As you'll see from the two SELECT statements above; aliases can make queries easier to both write and to read.
SQL Joins
SQL joins are used to query data from two or more tables, based on a relationship between certain columns in these tables.
SQL JOIN
The JOIN keyword is used in an SQL statement to query data from two or more tables, based on a relationship between certain
columns in these tables.
A primary key is a column (or a combination of columns) with a unique value for each row. Each primary key value must be unique
within the table. The purpose is to bind data together, across tables, without repeating all of the data in every table.
Note that the "O_Id" column is the primary key in the "Orders" table and that the "P_Id" column refers to the persons in the "Persons"
table without using their names.
Notice that the relationship between the two tables above is the "P_Id" column.
Before we continue with examples, we will list the types of JOIN you can use, and the differences between them.
JOIN: Return rows when there is at least one match in both tables
LEFT JOIN: Return all rows from the left table, even if there are no matches in the right table
RIGHT JOIN: Return all rows from the right table, even if there are no matches in the left table
FULL JOIN: Return rows when there is a match in one of the tables
The INNER JOIN keyword return rows when there is at least one match in both tables.
SELECT column_name(s)
FROM table_name1
INNER JOIN table_name2
ON table_name1.column_name=table_name2.column_name
The INNER JOIN keyword return rows when there is at least one match in both tables. If there are rows in "Persons" that do not have
matches in "Orders", those rows will NOT be listed.
The LEFT JOIN keyword returns all rows from the left table (table_name1), even if there are no matches in the right table
(table_name2).
SELECT column_name(s)
FROM table_name1
LEFT JOIN table_name2
ON table_name1.column_name=table_name2.column_name
Now we want to list all the persons and their orders - if any, from the tables above.
The LEFT JOIN keyword returns all the rows from the left table (Persons), even if there are no matches in the right table (Orders).
The RIGHT JOIN keyword Return all rows from the right table (table_name2), even if there are no matches in the left table
(table_name1).
SELECT column_name(s)
FROM table_name1
RIGHT JOIN table_name2
ON table_name1.column_name=table_name2.column_name
Now we want to list all the orders with containing persons - if any, from the tables above.
The RIGHT JOIN keyword returns all the rows from the right table (Orders), even if there are no matches in the left table (Persons).
The FULL JOIN keyword return rows when there is a match in one of the tables.
SELECT column_name(s)
FROM table_name1
FULL JOIN table_name2
ON table_name1.column_name=table_name2.column_name
Now we want to list all the persons and their orders, and all the orders with their persons.
The FULL JOIN keyword returns all the rows from the left table (Persons), and all the rows from the right table (Orders). If there are
rows in "Persons" that do not have matches in "Orders", or if there are rows in "Orders" that do not have matches in "Persons", those
rows will be listed as well.
The UNION operator is used to combine the result-set of two or more SELECT statements.
Notice that each SELECT statement within the UNION must have the same number of columns. The columns must also have similar
data types. Also, the columns in each SELECT statement must be in the same order.
Note: The UNION operator selects only distinct values by default. To allow duplicate values, use UNION ALL.
PS: The column names in the result-set of a UNION are always equal to the column names in the first SELECT statement in the
UNION.
"Employees_Norway":
E_ID E_Name
01 Hansen, Ola
02 Svendson, Tove
03 Svendson, Stephen
04 Pettersen, Kari
"Employees_USA":
E_ID E_Name
01 Turner, Sally
02 Kent, Clark
03 Svendson, Stephen
04 Scott, Stephen
Now we want to list all the different employees in Norway and USA.
E_Name
Hansen, Ola
Svendson, Tove
Svendson, Stephen
Pettersen, Kari
Turner, Sally
Kent, Clark
Scott, Stephen
Note: This command cannot be used to list all employees in Norway and USA. In the example above we have two employees with
equal names, and only one of them will be listed. The UNION command selects only distinct values.
Result
E_Name
Hansen, Ola
Svendson, Tove
Svendson, Stephen
Pettersen, Kari
Turner, Sally
Kent, Clark
Svendson, Stephen
Scott, Stephen
The SQL SELECT INTO statement can be used to create backup copies of tables.
The SELECT INTO statement selects data from one table and inserts it into a different table.
The SELECT INTO statement is most often used to create backup copies of tables.
SELECT *
INTO new_table_name [IN externaldatabase]
FROM old_tablename
Or we can select only the columns we want into the new table:
SELECT column_name(s)
INTO new_table_name [IN externaldatabase]
FROM old_tablename
Make a Backup Copy - Now we want to make an exact copy of the data in our "Persons" table.
SELECT *
INTO Persons_Backup
FROM Persons
We can also use the IN clause to copy the table into another database:
SELECT *
INTO Persons_Backup IN 'Backup.mdb'
FROM Persons
We can also copy only a few fields into the new table:
SELECT LastName,FirstName
INTO Persons_Backup
FROM Persons
The following SQL statement creates a "Persons_Backup" table with only the persons who lives in the city "Sandnes":
SELECT LastName,Firstname
INTO Persons_Backup
FROM Persons
WHERE City='Sandnes'
The following example creates a "Persons_Order_Backup" table contains data from the two tables "Persons" and "Orders":
SELECT Persons.LastName,Orders.OrderNo
INTO Persons_Order_Backup
FROM Persons
INNER JOIN Orders
ON Persons.P_Id=Orders.P_Id
The data type specifies what type of data the column can hold. For a complete reference of all the data types available in MS Access,
MySQL, and SQL Server, go to our complete Data Types reference.
Now we want to create a table called "Persons" that contains five columns: P_Id, LastName, FirstName, Address, and City.
The P_Id column is of type int and will hold a number. The LastName, FirstName, Address, and City columns are of type varchar
with a maximum length of 255 characters.
The empty "Persons" table will now look like this:
The empty table can be filled with data with the INSERT INTO statement.
SQL Constraints
SQL Constraints
Constraints are used to limit the type of data that can go into a table.
Constraints can be specified when a table is created (with the CREATE TABLE statement) or after the table is created (with the
ALTER TABLE statement).
NOT NULL
UNIQUE
PRIMARY KEY
FOREIGN KEY
CHECK
DEFAULT
The NOT NULL constraint enforces a column to NOT accept NULL values.
The NOT NULL constraint enforces a field to always contain a value. This means that you cannot insert a new record, or update a
record without adding a value to this field.
The following SQL enforces the "P_Id" column and the "LastName" column to not accept NULL values:
The UNIQUE and PRIMARY KEY constraints both provide a guarantee for uniqueness for a column or set of columns.
Note that you can have many UNIQUE constraints per table, but only one PRIMARY KEY constraint per table.
The following SQL creates a UNIQUE constraint on the "P_Id" column when the "Persons" table is created:
MySQL:
To allow naming of a UNIQUE constraint, and for defining a UNIQUE constraint on multiple columns, use the following SQL syntax:
To create a UNIQUE constraint on the "P_Id" column when the table is already created, use the following SQL:
MySQL:
The PRIMARY KEY constraint uniquely identifies each record in a database table.
Each table should have a primary key, and each table can have only ONE primary key.
The following SQL creates a PRIMARY KEY on the "P_Id" column when the "Persons" table is created:
MySQL:
To allow naming of a PRIMARY KEY constraint, and for defining a PRIMARY KEY constraint on multiple columns, use the
following SQL syntax:
To create a PRIMARY KEY constraint on the "P_Id" column when the table is already created, use the following SQL:
To allow naming of a PRIMARY KEY constraint, and for defining a PRIMARY KEY constraint on multiple columns, use the
following SQL syntax:
Note: If you use the ALTER TABLE statement to add a primary key, the primary key column(s) must already have been declared to
not contain NULL values (when the table was first created).
MySQL:
Let's illustrate the foreign key with an example. Look at the following two tables:
Note that the "P_Id" column in the "Orders" table points to the "P_Id" column in the "Persons" table.
The "P_Id" column in the "Persons" table is the PRIMARY KEY in the "Persons" table.
The "P_Id" column in the "Orders" table is a FOREIGN KEY in the "Orders" table.
The FOREIGN KEY constraint is used to prevent actions that would destroy links between tables.
The FOREIGN KEY constraint also prevents that invalid data form being inserted into the foreign key column, because it has to be
one of the values contained in the table it points to.
The following SQL creates a FOREIGN KEY on the "P_Id" column when the "Orders" table is created:
MySQL:
To allow naming of a FOREIGN KEY constraint, and for defining a FOREIGN KEY constraint on multiple columns, use the
following SQL syntax:
To create a FOREIGN KEY constraint on the "P_Id" column when the "Orders" table is already created, use the following SQL:
To allow naming of a FOREIGN KEY constraint, and for defining a FOREIGN KEY constraint on multiple columns, use the
following SQL syntax:
MySQL:
If you define a CHECK constraint on a single column it allows only certain values for this column.
If you define a CHECK constraint on a table it can limit the values in certain columns based on values in other columns in the row.
The following SQL creates a CHECK constraint on the "P_Id" column when the "Persons" table is created. The CHECK constraint
specifies that the column "P_Id" must only include integers greater than 0.
My SQL:
To allow naming of a CHECK constraint, and for defining a CHECK constraint on multiple columns, use the following SQL syntax:
To create a CHECK constraint on the "P_Id" column when the table is already created, use the following SQL:
The default value will be added to all new records, if no other value is specified.
The following SQL creates a DEFAULT constraint on the "City" column when the "Persons" table is created:
The DEFAULT constraint can also be used to insert system values, by using functions like GETDATE():
To create a DEFAULT constraint on the "City" column when the table is already created, use the following SQL:
MySQL:
MySQL:
Indexes allow the database application to find data fast; without reading the whole table.
Indexes
An index can be created in a table to find data more quickly and efficiently.
The users cannot see the indexes, they are just used to speed up searches/queries.
Note: Updating a table with indexes takes more time than updating a table without (because the indexes also need an update). So you
should only create indexes on columns (and tables) that will be frequently searched against.
Note: The syntax for creating indexes varies amongst different databases. Therefore: Check the syntax for creating indexes in your
database.
The SQL statement below creates an index named "PIndex" on the "LastName" column in the "Persons" table:
If you want to create an index on a combination of columns, you can list the column names within the parentheses, separated by
commas:
Indexes, tables, and databases can easily be deleted/removed with the DROP statement.
What if we only want to delete the data inside the table, and not the table itself?
The ALTER TABLE statement is used to add, delete, or modify columns in an existing table.
To delete a column in a table, use the following syntax (notice that some database systems don't allow deleting a column):
To change the data type of a column in a table, use the following syntax:
Notice that the new column, "DateOfBirth", is of type date and is going to hold a date. The data type specifies what type of data the
column can hold. For a complete reference of all the data types available in MS Access, MySQL, and SQL Server, go to our complete
Data Types reference.
Now we want to change the data type of the column named "DateOfBirth" in the "Persons" table.
Notice that the "DateOfBirth" column is now of type year and is going to hold a year in a two-digit or four-digit format.
Next, we want to delete the column named "DateOfBirth" in the "Persons" table.
Auto-increment allows a unique number to be generated when a new record is inserted into a table.
The following SQL statement defines the "P_Id" column to be an auto-increment primary key field in the "Persons" table:
By default, the starting value for AUTO_INCREMENT is 1, and it will increment by 1 for each new record.
To let the AUTO_INCREMENT sequence start with another value, use the following SQL statement:
To insert a new record into the "Persons" table, we will not have to specify a value for the "P_Id" column (a unique value will be
added automatically):
The SQL statement above would insert a new record into the "Persons" table. The "P_Id" column would be assigned a unique value.
The "FirstName" column would be set to "Lars" and the "LastName" column would be set to "Monsen".
The following SQL statement defines the "P_Id" column to be an auto-increment primary key field in the "Persons" table:
The MS SQL Server uses the IDENTITY keyword to perform an auto-increment feature.
By default, the starting value for IDENTITY is 1, and it will increment by 1 for each new record.
To specify that the "P_Id" column should start at value 10 and increment by 5, change the identity to IDENTITY(10,5).
To insert a new record into the "Persons" table, we will not have to specify a value for the "P_Id" column (a unique value will be
added automatically):
INSERT INTO Persons (FirstName,LastName)
VALUES ('Lars','Monsen')
The SQL statement above would insert a new record into the "Persons" table. The "P_Id" column would be assigned a unique value.
The "FirstName" column would be set to "Lars" and the "LastName" column would be set to "Monsen".
The following SQL statement defines the "P_Id" column to be an auto-increment primary key field in the "Persons" table:
By default, the starting value for AUTOINCREMENT is 1, and it will increment by 1 for each new record.
To specify that the "P_Id" column should start at value 10 and increment by 5, change the autoincrement to
AUTOINCREMENT(10,5).
To insert a new record into the "Persons" table, we will not have to specify a value for the "P_Id" column (a unique value will be
added automatically):
The SQL statement above would insert a new record into the "Persons" table. The "P_Id" column would be assigned a unique value.
The "FirstName" column would be set to "Lars" and the "LastName" column would be set to "Monsen".
You will have to create an auto-increment field with the sequence object (this object generates a number sequence).
The code above creates a sequence object called seq_person, that starts with 1 and will increment by 1. It will also cache up to 10
values for performance. The cache option specifies how many sequence values will be stored in memory for faster access.
To insert a new record into the "Persons" table, we will have to use the nextval function (this function retrieves the next value from
seq_person sequence):
The SQL statement above would insert a new record into the "Persons" table. The "P_Id" column would be assigned the next number
from the seq_person sequence. The "FirstName" column would be set to "Lars" and the "LastName" column would be set to
"Monsen".
SQL Views
A view contains rows and columns, just like a real table. The fields in a view are fields from one or more real tables in the database.
You can add SQL functions, WHERE, and JOIN statements to a view and present the data as if the data were coming from one single
table.
Note: A view always shows up-to-date data! The database engine recreates the data, using the view's SQL statement, every time a user
queries a view.
If you have the Northwind database you can see that it has several views installed by default.
The view "Current Product List" lists all active products (products that are not discontinued) from the "Products" table. The view is
created with the following SQL:
Another view in the Northwind sample database selects every product in the "Products" table with a unit price higher than the average
unit price:
Another view in the Northwind database calculates the total sale for each category in 1997. Note that this view selects its data from
another view called "Product Sales for 1997":
We can also add a condition to the query. Now we want to see the total sale only for the category "Beverages":
Now we want to add the "Category" column to the "Current Product List" view. We will update the view with the following SQL:
SQL Dates
The most difficult part when working with dates is to be sure that the format of the date you are trying to insert, matches the format
of the date column in the database.
As long as your data contains only the date portion, your queries will work as expected. However, if a time portion is involved, it gets
complicated.
Before talking about the complications of querying for dates, we will look at the most important built-in functions for working with
dates.
The following table lists the most important built-in date functions in MySQL:
Function Description
NOW() Returns the current date and time
CURDATE() Returns the current date
CURTIME() Returns the current time
DATE() Extracts the date part of a date or date/time expression
EXTRACT() Returns a single part of a date/time
DATE_ADD() Adds a specified time interval to a date
DATE_SUB() Subtracts a specified time interval from a date
DATEDIFF() Returns the number of days between two dates
DATE_FORMAT() Displays date/time data in different formats
The following table lists the most important built-in date functions in SQL Server:
Function Description
GETDATE() Returns the current date and time
DATEPART() Returns a single part of a date/time
DATEADD() Adds or subtracts a specified time interval from a date
DATEDIFF() Returns the time between two dates
CONVERT() Displays date/time data in different formats
MySQL comes with the following data types for storing a date or a date/time value in the database:
SQL Server comes with the following data types for storing a date or a date/time value in the database:
Note: The date types are chosen for a column when you create a new table in your database!
For an overview of all data types available, go to our complete Data Types reference.
You can compare two dates easily if there is no time component involved!
Now we want to select the records with an OrderDate of "2008-11-11" from the table above.
Now, assume that the "Orders" table looks like this (notice the time component in the
"OrderDate" column):
we will get no result! This is because the query is looking only for dates with no time portion.
Tip: If you want to keep your queries simple and easy to maintain, do not allow time components in your dates!
SQL NULL Values
This chapter will explain the IS NULL and IS NOT NULL operators.
If a column in a table is optional, we can insert a new record or update an existing record without adding a value to this column. This
means that the field will be saved with a NULL value.
Note: It is not possible to compare NULL and 0; they are not equivalent.
Suppose that the "Address" column in the "Persons" table is optional. This means that if we insert a record with no value for the
"Address" column, the "Address" column will be saved with a NULL value.
It is not possible to test for NULL values with comparison operators, such as =, <, or <>.
We will have to use the IS NULL and IS NOT NULL operators instead.
SQL IS NULL
How do we select only the records with NULL values in the "Address" column?
How do we select only the records with no NULL values in the "Address" column?
In the next chapter we will look at the ISNULL(), NVL(), IFNULL() and COALESCE() functions.
Suppose that the "UnitsOnOrder" column is optional, and may contain NULL values.
SELECT ProductName,UnitPrice*(UnitsInStock+UnitsOnOrder)
FROM Products
In the example above, if any of the "UnitsOnOrder" values are NULL, the result is NULL.
Microsoft's ISNULL() function is used to specify how we want to treat NULL values.
The NVL(), IFNULL(), and COALESCE() functions can also be used to achieve the same result.
SELECT ProductName,UnitPrice*(UnitsInStock+ISNULL(UnitsOnOrder,0))
FROM Products
Oracle
Oracle does not have an ISNULL() function. However, we can use the NVL() function to achieve the same result:
SELECT ProductName,UnitPrice*(UnitsInStock+NVL(UnitsOnOrder,0))
FROM Products
MySQL
MySQL does have an ISNULL() function. However, it works a little bit different from Microsoft's ISNULL() function.
SELECT ProductName,UnitPrice*(UnitsInStock+IFNULL(UnitsOnOrder,0))
FROM Products
SELECT ProductName,UnitPrice*(UnitsInStock+COALESCE(UnitsOnOrder,0))
FROM Products
Data types and ranges for Microsoft Access, MySQL and SQL Server.
In MySQL there are three main types : text, number, and Date/Time types.
Text types:
Note: The values are sorted in the order you enter them.
Number types:
*The integer types have an extra option called UNSIGNED. Normally, the integer goes from an negative to positive value. Adding the
UNSIGNED attribute will move that range up so it starts at zero instead of a negative number.
Date types:
Data type Description
DATE() A date. Format: YYYY-MM-DD
Note: The supported range is from '1970-01-01 00:00:01' UTC to '2038-01-09 03:14:07' UTC
TIME() A time. Format: HH:MM:SS
Note: Values allowed in four-digit format: 1901 to 2155. Values allowed in two-digit format: 70 to 69,
representing years from 1970 to 2069
*Even if DATETIME and TIMESTAMP return the same format, they work very differently. In an INSERT or UPDATE query, the
TIMESTAMP automatically set itself to the current date and time. TIMESTAMP also accepts various formats, like
YYYYMMDDHHMMSS, YYMMDDHHMMSS, YYYYMMDD, or YYMMDD.
Character strings:
Unicode strings:
Binary types:
Number types:
Data type Description Storage
tinyint Allows whole numbers from 0 to 255 1 byte
smallint Allows whole numbers between -32,768 and 32,767 2 bytes
int Allows whole numbers between -2,147,483,648 and 2,147,483,647 4 bytes
bigint Allows whole numbers between -9,223,372,036,854,775,808 and 9,223,372,036,854,775,807 8 bytes
decimal(p,s) Fixed precision and scale numbers. 5-17 bytes
The p parameter indicates the maximum total number of digits that can be stored (both to the
left and to the right of the decimal point). p must be a value from 1 to 38. Default is 18.
The s parameter indicates the maximum number of digits stored to the right of the decimal
point. s must be a value from 0 to p. Default value is 0
numeric(p,s) Fixed precision and scale numbers. 5-17 bytes
The p parameter indicates the maximum total number of digits that can be stored (both to the
left and to the right of the decimal point). p must be a value from 1 to 38. Default is 18.
The s parameter indicates the maximum number of digits stored to the right of the decimal
point. s must be a value from 0 to p. Default value is 0
smallmoney Monetary data from -214,748.3648 to 214,748.3647 4 bytes
money Monetary data from -922,337,203,685,477.5808 to 922,337,203,685,477.5807 8 bytes
float(n) Floating precision number data from -1.79E + 308 to 1.79E + 308. 4 or 8 bytes
The n parameter indicates whether the field should hold 4 or 8 bytes. float(24) holds a 4-byte
field and float(53) holds an 8-byte field. Default value of n is 53.
real Floating precision number data from -3.40E + 38 to 3.40E + 38 4 bytes
Date types: