DBMS - Chapter-4
DBMS - Chapter-4
DBMS - Chapter-4
s Basic Structure s Set Operations s Aggregate Functions s Null Values s Nested Subqueries s Derived Relations s Views s Modification of the Database s Joined Relations s Data Definition Language s Embedded SQL, ODBC and JDBC
4.1
4.2
Basic Structure
s SQL is based on set and relational operations with certain
4.3
query
5 corresponds to the projection operation of the relational algebra
s E.g. find the names of all branches in the loan relation
branch-nam e
(loan)
5 Use, e.g., branch_name instead of branch-name in a real implementation. 5 We use - since it looks nicer!
s NOTE: SQL names are case insensitive, i.e. you can use capital
or small letters.
5 You may wish to use upper case where-ever we use bold font.
Database System Concepts 4.4 Silberschatz, Korth and Sudarshan
4.5
s The query:
select loan-number, branch-name, amount 100 from loan would return a relation which is the same as the loan relations, except that the attribute amount is multiplied by 100.
4.6
satisfy
5 corresponds to the selection predicate of the relational algebra.
s To find all loan number for loans made at the Perryridge branch
with loan amounts greater than $1200. select loan-number from loan where branch-name = Perryridge and amount > 1200
s Comparison results can be combined using the logical
4.7
between $90,000 and $100,000 (that is, $90,000 and $100,000) select loan-number from loan where amount between 90000 and 100000
4.8
s Find the name, loan number and loan amount of all customers
having a loan at the Perryridge branch. select customer-name, borrower.loan-number, amount from borrower, loan where borrower.loan-number = loan.loan-number and branch-name = Perryridge
4.9
rename the column name loan-number as loan-id. select customer-name, borrower.loan-number as loan-id, amount from borrower, loan where borrower.loan-number = loan.loan-number
4.10
Tuple Variables
s Tuple variables are defined in the from clause via the use of the
as clause.
s Find the customer names and their loan numbers for all
customers having a loan at some branch. select customer-name, T.loan-number, S.amount from borrower as T, loan as S where T.loan-number = S.loan-number
s
Find the names of all branches that have greater assets than some branch located in Brooklyn. select distinct T.branch-name from branch as T, branch as S where T.assets > S.assets and S.branch-city = Brooklyn
4.11
String Operations
s SQL includes a string-matching operator for comparisons on character
5 concatenation (using ||) 5 converting from upper to lower case (and vice versa) 5 finding string length, extracting substrings, etc.
4.12
in Perryridge branch select distinct customer-name from borrower, loan where borrower loan-number = loan.loan-number and branch-name = Perryridge order by customer-name
s We may specify desc for descending order or asc for ascending
4.13
Duplicates
s In relations with duplicates, SQL can define how many copies of
(r1): If there are c1 copies of tuple t1 in r1, and t1 satisfies selections ,, then there are c1 copies of t1 in (r1).
2. A(r1): For each copy of tuple t1 in r1, there is a copy of tuple A(t1) in A(r1) where A(t1) denotes the projection of the single tuple t1. 3. r1 x r2 : If there are c1 copies of tuple t1 in r1 and c2 copies of tuple t2 in r2, there are c1 x c2 copies of the tuple t1. t2 in r1 x r2
4.14
Duplicates (Cont.)
s Example: Suppose multiset relations r1 (A, B) and r2 (C) are as
select A1,, A2, ..., An from r1, r2, ..., rm where P is equivalent to the multiset version of the expression:
A1 A2 ..., ,, , A n
4.15
Set Operations
s The set operations union, intersect, and except operate on
duplicates; to retain all duplicates use the corresponding multiset versions union all, intersect all and except all. Suppose a tuple occurs m times in r and n times in s, then, it occurs:
5 m + n times in r union all s 5 min(m,n) times in r intersect all s 5 max(0, m n) times in r except all s
4.16
Set Operations
s Find all customers who have a loan, an account, or both:
Find all customers who have both a loan and an account. (select customer-name from depositor) intersect (select customer-name from borrower)
4.17
Aggregate Functions
s These functions operate on the multiset of values of a column of
a relation, and return a value avg: average value min: minimum value max: maximum value sum: sum of values count: number of values
4.18
4.19
select branch-name, count (distinct customer-name) from depositor, account where depositor.account-number = account.account-number group by branch-name Note: Attributes in select clause outside of aggregate functions must appear in group by list
4.20
balance is more than $1,200. select branch-name, avg (balance) from account group by branch-name having avg (balance) > 1200 Note: predicates in the having clause are applied after the formation of groups whereas predicates in the where clause are applied before forming groups
4.21
Null Values
s It is possible for tuples to have a null value, denoted by null, for
5 E.g. Find all loan number which appear in the loan relation with null values for amount.
4.22
or
null = null
5 OR: (unknown or true) = true, (unknown or false) = unknown (unknown or unknown) = unknown 5 AND: (true and unknown) = unknown, (false and unknown) = false, (unknown and unknown) = unknown 5 NOT: (not unknown) = unknown 5 P is unknown evaluates to true if predicate P evaluates to unknown
s Result of where clause predicate is treated as false if it
evaluates to unknown
4.23
4.24
Nested Subqueries
s SQL provides a mechanism for the nesting of subqueries. s A subquery is a select-from-where expression that is nested
4.25
Example Query
s Find all customers who have both an account and a loan at the
bank. select distinct customer-name from borrower where customer-name in (select customer-name from depositor)
s Find all customers who have a loan at the bank but do not
have an account at the bank select distinct customer-name from borrower where customer-name not in (select customer-name from depositor)
4.26
Example Query
s Find all customers who have both an account and a loan at the
Perryridge branch select distinct customer-name from borrower, loan where borrower.loan-number = loan.loan-number and branch-name = Perryridge and (branch-name, customer-name) in (select branch-name, customer-name from depositor, account where depositor.account-number = account.account-number)
s Note: Above query can be written in a much simpler manner. The
formulation above is simply to illustrate SQL features. (Schema used in this example)
Database System Concepts 4.27 Silberschatz, Korth and Sudarshan
Set Comparison
s Find all branches that have greater assets than some branch
located in Brooklyn. select distinct T.branch-name from branch as T, branch as S where T.assets > S.assets and S.branch-city = Brooklyn
s Same query using > some clause
select branch-name from branch where assets > some (select assets from branch where branch-city = Brooklyn)
4.28
0 5 6 0 5 0 5
) = false ) = true
5)
4.29
(5< all
0 5 6 6 10 4 5
) = false
Example Query
s Find the names of all branches that have greater assets than all
branches located in Brooklyn. select branch-name from branch where assets > all (select assets from branch where branch-city = Brooklyn)
4.31
subquery is nonempty.
s exists r r s not exists r r =
4.32
Example Query
s Find all customers who have an account at all branches located
in Brooklyn.
select distinct S.customer-name from depositor as S where not exists ( (select branch-name from branch where branch-city = Brooklyn) except (select R.branch-name from depositor as T, account as R where T.account-number = R.account-number and S.customer-name = T.customer-name))
s (Schema used in this example) s Note that X Y = X Y s Note: Cannot write this query using = all and its variants
Database System Concepts 4.33 Silberschatz, Korth and Sudarshan
branch. select T.customer-name from depositor as T where unique ( select R.customer-name from account, depositor as R where T.customer-name = R.customer-name and R.account-number = account.account-number and account.branch-name = Perryridge)
s (Schema used in this example)
4.34
Example Query
s Find all customers who have at least two accounts at the
Perryridge branch. select distinct T.customer-name from depositor T where not unique ( select R.customer-name from account, depositor as R where T.customer-name = R.customer-name and R.account-number = account.account-number and account.branch-name = Perryridge)
s(Schema used in this example)
4.35
Views
s Provide a mechanism to hide certain data from the view of
certain users. To create a view we use the command: create view v as <query expression> where:
5<query expression> is any legal expression 5The view name is represented by v
4.36
Example Queries
s A view consisting of branches and their customers
create view all-customer as (select branch-name, customer-name from depositor, account where depositor.account-number = account.account-number) union (select branch-name, customer-name from borrower, loan where borrower.loan-number = loan.loan-number)
s Find all customers of the Perryridge branch
4.37
Derived Relations
s Find the average account balance of those branches where the
average account balance is greater than $1200. select branch-name, avg-balance from (select branch-name, avg (balance) from account group by branch-name) as result (branch-name, avg-balance) where avg-balance > 1200 Note that we do not need to use the having clause, since we compute the temporary (view) relation result in the from clause, and the attributes of result can be used directly in the where clause.
4.38
With Clause
s With clause allows views to be defined locally to a query, rather
with max-balance(value) as select max (balance) from account select account-number from account, max-balance where account.balance = max-balance.value
4.39
the average of the total account deposits at all branches. with branch-total (branch-name, value) as select branch-name, sum (balance) from account group by branch-name with branch-total-avg(value) as select avg (value) from branch-total select branch-name from branch-total, branch-total-avg where branch-total.value >= branch-total-avg.value
4.40
delete from account where branch-name in (select branch-name from branch where branch-city = Needham) delete from depositor where account-number in (select account-number from branch, account where branch-city = Needham and branch.branch-name = account.branch-name)
s (Schema used in this example)
4.41
Example Query
s Delete the record of all accounts with balances below the
average at the bank. delete from account where balance < (select avg (balance) from account)
5 Problem: as we delete tuples from deposit, the average balance changes 5 Solution used in SQL:
1. 2. First, compute avg balance and find all tuples to delete Next, delete all tuples found above (without recomputing avg or retesting the tuples)
4.42
insert into account values (A-9732, Perryridge,1200) or equivalently insert into account (branch-name, balance, account-number) values (Perryridge, 1200, A-9732)
s Add a new tuple to account with balance set to null
4.43
$200 savings account. Let the loan number serve as the account number for the new savings account insert into account select loan-number, branch-name, 200 from loan where branch-name = Perryridge insert into depositor select customer-name, loan-number from loan, borrower where branch-name = Perryridge and loan.account-number = borrower.account-number
s The select from where statement is fully evaluated before any of its
results are inserted into the relation (otherwise queries like insert into table1 select * from table1 would cause problems
Database System Concepts 4.44 Silberschatz, Korth and Sudarshan
update account set balance = balance 1.06 where balance > 10000 update account set balance = balance 1.05 where balance 10000
5 The order is important 5 Can be done better using the case statement (next slide)
4.45
$10,000 by 6%, all other accounts receive 5%. update account set balance = case when balance <= 10000 then balance *1.05 else balance * 1.06 end
4.46
Update of a View
s Create a view of all loan data in loan relation, hiding the amount
insert into branch-loan values (Perryridge, L-307) This insertion must be represented by the insertion of the tuple (L-307, Perryridge, null) into the loan relation
s Updates on more complex views are difficult or impossible to translate,
4.47
Transactions
s A transaction is a sequence of queries and update statements executed
as a single unit
5 Transactions are started implicitly and terminated by one of
commit work: makes all updates of the transaction permanent in the database rollback work: undoes all updates performed by the transaction.
s Motivating example
5 If one steps succeeds and the other fails, database is in an inconsistent state 5 Therefore, either both steps should succeed or neither should
s If any step of a transaction fails, all work done by the transaction can be
system failures
4.48
Transactions (Cont.)
s In most database systems, each SQL statement that executes
4.49
Joined Relations
s Join operations take two relations and return as a result another
relation.
s These additional operations are typically used as subquery
any tuple in the other relation (based on the join condition) are treated.
Join Types inner join left outer join right outer join full outer join Join Conditions natural on <predicate> using (A1, A2, ..., An)
4.50
s Relation borrower
customer-name Jones Smith Hayes loan-number L-170 L-230 L-155
s Note: borrower information missing for L-260 and loan information missing for L-155
4.51
loan.loan-number = borrower.loan-number
loan-number L-170 L-230 branch-name Downtown Redwood amount 3000 4000 customer-name Jones Smith loan-number L-170 L-230
4.52
4.53
s Find all customers who have either an account or a loan (but not both) at the bank. select customer-name from (depositor natural full outer join borrower) where account-number is null or loan-number is null
4.54
4.55
4.56
time 09:00:30.75
5 E.g. Interval 1 day 5 Subtracting a date/time/timestamp value from another gives an interval value 5 Interval values can be added to date/time/timestamp values
s Can extract values of individual fields from date/time/timestamp
4.57
create table r (A1 D1, A2 D2, ..., An Dn, (integrity-constraint1), ..., (integrity-constraintk))
5 r is the name of the relation 5 each Ai is an attribute name in the schema of relation r 5 Di is the data type of values in the domain of attribute Ai
s Example:
create table branch (branch-namechar(15) not null, branch-city char(30), assets integer)
4.58
Example: Declare branch-name as the primary key for branch and ensure that the values of assets are nonnegative. create table branch (branch-namechar(15), branch-city char(30) assets integer, primary key (branch-name), check (assets >= 0)) primary key declaration on an attribute automatically ensures not null in SQL-92 onwards, needs to be explicitly stated in SQL-89
Database System Concepts 4.59 Silberschatz, Korth and Sudarshan
existing relation. alter table r add A D where A is the name of the attribute to be added to relation r and D is the domain of A.
5 All tuples in the relation are assigned null as the value for the new attribute.
s The alter table command can also be used to drop
attributes of a relation alter table r drop A where A is the name of an attribute of relation r
5 Dropping of attributes not supported by many databases
4.60
Embedded SQL
s The SQL standard defines embeddings of SQL in a variety of
as a host language, and the SQL structures permitted in the host language comprise embedded SQL.
s The basic form of these languages follows that of the System R
to the preprocessor EXEC SQL <embedded SQL statement > END-EXEC Note: this varies by language. E.g. the Java embedding uses # SQL { . } ;
4.61
Example Query
From within a host language, find the names and cities of customers with more than the variable amount dollars in some account.
s Specify the query in SQL and declare a cursor for it
EXEC SQL declare c cursor for select customer-name, customer-city from depositor, customer, account where depositor.customer-name = customer.customer-name and depositor account-number = account.account-number and account.balance > :amount END-EXEC
4.62
result to be placed on host language variables. EXEC SQL fetch c into :cn, :cc END-EXEC Repeated calls to fetch get successive tuples in the query result
s A variable called SQLSTATE in the SQL communication area
temporary relation that holds the result of the query. EXEC SQL close c END-EXEC Note: above details vary with language. E.g. the Java embedding defines Java iterators to step through result tuples.
4.63
is for update declare c cursor for select * from account where branch-name = Perryridge for update
s To update tuple at the current location of cursor
4.64
Dynamic SQL
s Allows programs to construct and submit SQL queries at run
time.
s Example of the use of dynamic SQL from within a C program.
char * sqlprog = update account set balance = balance * 1.05 where account-number = ? EXEC SQL prepare dynprog from :sqlprog; char account [10] = A-101; EXEC SQL execute dynprog using :account;
s The dynamic SQL program contains a ?, which is a place holder
4.65
ODBC
s Open DataBase Connectivity(ODBC) standard
5 standard for application program to communicate with a database server. 5 application program interface (API) to
open a connection with a database, send queries and updates, get back results. s Applications such as GUI, spreadsheets, etc. can use ODBC
4.66
ODBC (Cont.)
s Each database system supporting ODBC provides a "driver" library that must
communicates with the server to carry out the requested action, and fetch results.
s ODBC program first allocates an SQL environment, then a database
connection handle.
s Opens database connection using SQLConnect(). Parameters for
SQLConnect:
5 connection handle, 5 the server to which to connect 5 the user identifier, 5 password
s Must also specify types of arguments:
4.67
ODBC Code
s int ODBCexample()
RETCODE error; HENV env; /* environment */ HDBC conn; /* database connection */ SQLAllocEnv(&env); SQLAllocConnect(env, &conn); SQLConnect(conn, "aura.bell-labs.com", SQL_NTS, "avi", SQL_NTS, "avipasswd", SQL_NTS); { . Do actual work } SQLDisconnect(conn); SQLFreeConnect(conn); SQLFreeEnv(env);
}
4.68
s Good programming requires checking results of every function call for errors;
4.69
char branchname[80]; float balance; int lenOut1, lenOut2; HSTMT stmt; SQLAllocStmt(conn, &stmt); char * sqlquery = "select branch_name, sum (balance) from account group by branch_name"; error = SQLExecDirect(stmt, sqlquery, SQL_NTS); if (error == SQL_SUCCESS) { SQLBindCol(stmt, 1, SQL_C_CHAR, branchname , 80, &lenOut1); SQLBindCol(stmt, 2, SQL_C_FLOAT, &balance, 0 , &lenOut2); while (SQLFetch(stmt) >= SQL_SUCCESS) { printf (" %s %g\n", branchname, balance); } } SQLFreeStmt(stmt, SQL_DROP);
Database System Concepts 4.70 Silberschatz, Korth and Sudarshan
5 SQL statement prepared: compiled at the database 5 Can have placeholders: E.g. insert into account values(?,?,?) 5 Repeatedly executed with actual values for the placeholders
s Metadata features
5 finding all the relations in the database and 5 finding the names and types of columns of a query result or a relation in the database.
s By default, each SQL statement is treated as a separate transaction
by the standard.
5 Core 5 Level 1 requires support for metadata querying 5 Level 2 requires ability to send and retrieve arrays of parameter values and more detailed catalog information.
s SQL Call Level Interface (CLI) standard similar to ODBC
4.72
JDBC
s JDBC is a Java API for communicating with database systems
supporting SQL
s JDBC supports a variety of features for querying and updating
relations present in the database and the names and types of relation attributes
s Model for communicating with the database:
5 Open a connection 5 Create a statement object 5 Execute queries using the Statement object to send queries and fetch results 5 Exception mechanism to handle errors
4.73
JDBC Code
public static void JDBCexample(String dbid, String userid, String passwd) { try { Class.forName ("oracle.jdbc.driver.OracleDriver"); Connection conn = DriverManager.getConnection( "jdbc:oracle:thin:@aura.belllabs.com:2000:bankdb", userid, passwd); Statement stmt = conn.createStatement(); Do Actual Work . stmt.close(); conn.close(); } catch (SQLException sqle) { System.out.println("SQLException : " + sqle); } }
4.74
try { stmt.executeUpdate( "insert into account values ('A-9732', 'Perryridge', 1200)"); } catch (SQLException sqle) { System.out.println("Could not insert tuple. " + sqle); }
s Execute query and fetch and print results
ResultSet rset = stmt.executeQuery( "select branch_name, avg(balance) from account group by branch_name"); while (rset.next()) { System.out.println( rset.getString("branch_name") + " " + rset.getFloat(2)); }
4.75
5 rs.getString(branchname) and rs.getString(1) equivalent if branchname is the first argument of select result.
s Dealing with Null values
4.76
Prepared Statement
s Prepared statement allows queries to be compiled and executed
multiple times with different arguments PreparedStatement pStmt = conn.prepareStatement( insert into account values(?,?,?)); pStmt.setString(1, "A-9732"); pStmt.setString(2, "Perryridge"); pStmt.setInt(3, 1200); pStmt.executeUpdate(); pStmt.setString(1, "A-9733"); pStmt.executeUpdate();
s Beware: If value to be stored in database contains a single quote or
other special character, prepared statements work fine, but creating a query string and executing it directly would result in a syntax error!
4.77
5 client connects to an SQL server, establishing a session 5 executes a series of statements 5 disconnects the session 5 can commit or rollback the work carried out in the session
s An SQL environment contains several components,
including a user identifier, and a schema, which identifies which of several schemas a session is using.
4.78
5 Database contains multiple catalogs 5 each catalog can contain multiple schemas 5 SQL objects such as relations and views are contained within a schema
s e.g. catalog5.bank-schema.account s Each user has a default catalog and schema, and the
4.79
5 permits definition of procedures in SQL, with if-then-else statements, for and while loops, etc. 5 more in Chapter 9
s Stored Procedures
5 Can store procedures in the database 5 then execute them using the call statement 5 permit external applications to operate on the database without knowing about internal details
s These features are covered in Chapter 9 (Object Relational
Databases)
4.80
Transactions in JDBC
s As with ODBC, each statement gets committed
automatically in JDBC
s To turn off auto commit use
conn.setAutoCommit(false);
s To commit or abort transactions use
conn.commit() or conn.rollback()
s To turn auto commit on again, use
conn.setAutoCommit(true);
4.82
procedures/functions to be invoked. CallableStatement cs1 = conn.prepareCall( {call proc (?,?)} ) ; CallableStatement cs2 = conn.prepareCall( {? = call func (?,?)} );
4.83
ResultSet.
s Provides Functions for getting number of columns, column
name, type, precision, scale, table from which the column is derived etc. ResultSetMetaData rsmd = rs.getMetaData ( ); for ( int i = 1; i <= rsmd.getColumnCount( ); i++ ) { String name = rsmd.getColumnName(i); String typeName = rsmd.getColumnTypeName(i); }
4.84
DatabaseMetaData dbmd = conn.getMetaData( ); ResultSet rs = dbmd.getColumns( null, BANK-DB, account, % ); //Arguments: catalog, schema-pattern, table-pattern, column-pattern // Returns: 1 row for each column, with several attributes such as // COLUMN_NAME, TYPE_NAME, etc. while ( rs.next( ) ) { System.out.println( rs.getString(COLUMN_NAME) , rs.getString(TYPE_NAME); }
s There are also functions for getting information such as
5 Foreign key references in the schema 5 Database limits like maximum row size, maximum no. of connections, etc
4.85
Application Architectures
s Applications can be built using one of two architectures
4.86
Two-tier Model
s E.g. Java code runs at client site and uses JDBC to
5 Security: passwords available at client site, all database operation possible 5 More code shipped to client 5 Not appropriate across organizations, or in large ones like universities
4.87
Application/HTTP Server
Servlets
JDBC
Database Server
Network
Client
Database System Concepts
Client
4.88
Client
Silberschatz, Korth and Sudarshan
server
s Client sends request over http or application-specific protocol s Application or Web server receives request s Request handled by CGI program or servlets s Security handled by application at server
4.89
End of Chapter
4.91
4.92
4.93
4.94
4.95
4.96
4.97
4.98