CSE5003 - DAT ABA Se Syste MS: DES IGN A ND I M PLE Ment Atio N L, T, P, J, C 2,0,2,4,4

Download as pdf or txt
Download as pdf or txt
You are on page 1of 9

CSE5003 - DAT ABA SE SYSTE MS: DES IGN A ND I M PLE MENT ATIO N

L,T,P,J,C

2,0,2,4,4

Objectives:

 To understand the underlying principles of Relational Database Management


System.
 To focus on the modeling and design of databases and usage of advanced data
models.
 To implement and maintain the structured, semi structured and unstructured data in
an efficient database system using emerging trends.
 To identify the different threats to databases

Expected Outcome:
After successfully completing the course the student should be able to

 Design and implement a database depending on the business requirements,


considering various design issues.
 Categorize and design the structured, semi structured and unstructured databases
 Characterize the database threats and its countermeasures.

Student Learning Outcome:

1.Having an ability to apply mathematics and science in engineering applications


2.Having a clear understanding of the subject related concepts and of contemporary
issues
5. Having design thinking capability
7. Having computational thinking (Ability to translate vast data in to abstract concepts
and to understand database reasoning)
11. Having interest in lifelong learning
Module
Topics L Hrs SLO
1 Relational Model

Database System Architecture – EER Modeling- Indexing – 6 1,7


Normalization –Query processing and optimization – Transaction
Processing

2 Parallel Databases

Architecture, Data partitioning strategy, Interquery and Intraquery 4 5,7


Parallelism –Parallel Query Optimization

3 Distributed Databases

Features – Distributed Database Architecture –Fragmentation – 5 5,7


Replication- Distributed Query Processing – Distributed Transactions
Processing

4 Spatial and Mobile Databases

Spatial databases -Type of spatial data– Indexing in spatial databases, 3 5


Mobile Databases – Transaction Model in MDS

5 Semi Structured Databases:

Semi Structured databases – XML –Schema-DTD- XPath- XQuery 4 7


,Semantic Web –RDF –RDFS

6 Database Security

Introduction to Database Security Issues –Security Models – Different 3 7


Threats to databases – Counter measures to deal with these problems

7 Emerging Technologies

Cloud databases – Streaming Databases - Graph Databases-New SQL 3 7


databases
8 Recent Trends
2 11

Total Lecture Hours 30

Reference Books
1. Avi Silberschatz, Hank Korth, and S. Sudarshan ,”Database
System Concepts”, 6th Ed.. McGraw Hill, 2010.

2. Ramez Elmasri & B.Navathe: “Fundamentals of database


s ystems”, 7th edition, Addison Wesley, 2014.

3. S.K.Singh, “Database S ystems: Concepts, Design &


Applications ”, 2 n d edition, Pearson education, 2011.

4. Joe Fawcett, Danny Ayers, Liam R. E. Quin: “Beginning XML”,


Wiley India Private Limited5th Edition, 2012.

5. Thomas M. Connoll y and Carol yn Begg “Database S ystems: A


Practical Approach to Design, Implementation, and
Management ”, 6th edition, Pearso n India, 2015.

List of Lab Experiments

1. Model any given scenario into ER/EER Model using any tool
(ERDPlus,ERWin,OracleSQL developer)
14,17
2. Creating applications with RDBMS

a) Table creation with constraints, alter schema ,insert values, aggregate


functions,
simple and complex queries with joins

b) PLSQL-PROCEDURES,CURSORS,FUNCTIONS,TRIGGERS

3. Partition a given database based on the type of query and compares the
execution speed of the query with/without parallelism.

4. Create an XML document and validate it against an XML Schema/DTD. Use


XQuery to query and view the contents of the database.

5. Consider an application in which the results of football games are to be


represented in XML,DTD and Xquery. For each game, we want to be able to
represent the two teams involved, which one was playing at home, which
players scored goals (some of which may have been penalties) and the time
when each was scored, and which players were shown yellow or red cards.
You might use some attributes. You can check your solutions with the online
demo of the Zorba XQuery engine4.

6. To implement parallel join and parallel sort algorithms to get marks from
different colleges of the university and publish 10 ranks for each discipline.

7. Create a distributed database scenario , insert values, fragment the database


and query the database

8. Consider a schema that contains the following table with the key underlined:
Employee (Eno, Ename,Desg,Dno). Assume that we horizontally fragment
the table as follows:

Employee1(Eno;Ename; Desg;Dno), where 1<= Dno <=10


Employee2(Eno;Ename; Desg;Dno), where 11 <= Dno <=20
Employee3(Eno;Ename; Desg;Dno), where 21 <= Dno <=30

In addition, assume we have 4 sites that contain the following fragments:


Site1 has Employee1
Site2 has Employee2
Site3 has Employee2 and Employee3
Site4 has Employee1

Implement at least 5 suitable queries on Employee fragments. Add relations to


the database as per your requirements.

9. Download a spatial dataset based on any specific theme (containing layer


information) from Quantum GIS and import it into PostgresSQL(PostGIS)
and Query and view the database.

10. Investigation of some spatial analysis techniques using Toxic Release


Inventory (www.epa.gov/triexplorer/) data for Massachusetts from the
Environmental Protection Agency (EPA), which indicate the magnitude of the
releases of toxic core chemicals into land, water and air at a site in the state.
Note that these TRI locations were geocoded from a list of addresses provided
by the EPA

11. Use sample datasets from health care domain ,Visualize and interpret the
results
12. Import the Hubway data into Neo4j and configure Neo4j. Then, answer the
following questions using the Cypher Query Language:

a) List top 10 stations with most outbound trips (Show station name and number of trips)

b) List top 10 stations with most inbound trips (Show station name and number of trips)

c) List top 5 routes with most trips (Show starting station name, ending station name and
number of trips) (4) List the hour number (for example 13 means 1pm -2pm) and number
of trips which start from the station "B.U. Central"

d) List the hour number (for example 13 means 1pm -2pm) and number of trips which
end at the station "B.U. Central"

Sample Projects

1. Developing applications by creating and modeling data into different


databases as given below and creating an interface for Querying/ Viewing the
database for various functionalities.
 Relational database
 Distributed database
 Spatial Database
 XML Database
 Mobile databases

2. Analyzing and Visualizing social networks like facebook , twitter etc using
NoSQL Databases.

3. Using Sample datasets from http://www.rdatamining.com/resources/data,


UCLA Repository, kaggle dataset etc and analyzing them using NoSQL
databases.

Knowledge area Total Hours of


coverage
CS: SF (System Fundamentals) 4
CS:IM :Information Management/ CE:DBS 14
Database Systems
CS:PD: Parallel and Distributed Computing 9
CS: IAS Information Assurance and 3
Security
Body of Knowledge Coverage

KA Knowledge Unit Topics Covered Hours


CS:IM Relational Databases Relational Model
Data Modelling
CE:DBS Database System Architecture –
EER Modeling- Indexing – 6
Normalization –Query processing
and optimization – Transaction
Processing

CS:PD Parallel and Parallel Databases


Distributed
Computing Architecture, Data partitioning
strategy, Interquery and Intraquery
Parallelism –Parallel Query
Optimization

Distributed Databases 9

Features – Distributed Database


Architecture –Fragmentation –
Replication- Distributed Query
Processing – Distributed
Transactions Processing

CS:IM Spatial and Mobile Spatial and Mobile Databases


databases
Spatial databases -Type of spatial
data– Indexing in spatial databases, 3
Mobile Databases – Transaction
Model in MDS

CS:IM Semi Structured Semi Structured Databases:


databases
Semi Structured databases – XML 4
–XPath- XQuery ,Semantic Web,
RDF /RDFs

CS: IAS Information Introduction to Database Security


Assurance and Issues –Security Models – Different 3
Security Threats to databases – Counter
measures to deal with these
problems

CS:IM Emerging Emerging Technologies- Cloud


technologies databases – Streaming Databases -
5
Graph Databases-New SQL
databases and Recent Trends

30

What is covered in the course?

Module I - Relational Model –It introduces the Database System Architecture and also
discusses how to model data using EER Modelling. Different Indexing techniques and the ways
to tune the database using Normalization is also discussed. This module also discusses on how to
process transactions and optimize the Queries using heuristics query processing.

Module II - Parallel Databases – This module discusses about the different types of Parallel
Architecture, Data partitioning strategy, Interquery and Intraquery Parallelism. It also throws
light on Parallel Query Optimization.

Module III - Distributed Databases -This segment discusses about Distributed Database
Architecture, its features and the techniques called Fragmentation and Replication. Distributed
Query Processing is elaborated here to find the optimum transfer cost of the query. Distributed
Transactions Processing covers the techniques on a transaction is handle in that kind of
environment.

Module IV –Spatial and Mobile Databases –This module gives an introduction to Spatial and
Mobile databases. It also discusses the types of spatial data, indexing in spatial databases and
also the Transaction Model in MDS

Module V - Semi Structured Databases – This Segment introduces the concept of Semi
Structured databases, XML, How to validate the the XML file against XML Schema.How to
Query the database using XQuery. It also discusses about the role of semantic web and how to
model data using RDF.

Module VI - Introduction to Database Security Issues – This module discusses the Security
Models, the different Threats to databases and Counter measures to deal with these problems
Module VII - Emerging Technologies- This module discusses about the emerging and current
technologies like Cloud databases, Streaming Databases, Graph Databases, New SQL databases
and Recent Trends and a few case studies.

What is the format of the course?

This Course is designed with 100 minutes of in-classroom sessions per week, 100 minutes of lab
hours per week, as well as 200 minutes of non-contact time spent on implementing course
related project. Generally this course should have the combination of lectures, in-class
discussion, case studies, guest-lectures, mandatory off-class reading material, and assignment.

How are students assessed?

 Students are assessed on a combination of group activities, classroom discussion, projects


and continuous, final assessment tests.
 Additional weightage will be given to students working with projects based on different
databases, and competitions and projects handling with large databases.
 Students can earn additional weightage based on certificate of completion of a related
MOOC course.

Session wise plan

Class Lab Topics Covered Level of Text/Reference Remarks


Hour Hour mastery Book
1 Data base system Architecture Familiarity 1,2
1 2 ER /EER Modelling Usage, 1,2
Assessment
1 Indexing and its types Familiarity 1,2
1 Normalization Familiarity, 1,2
Assessment
1 4 SQL Query Processing and Familiarity, 1,2
Optimization Assessment
1 Transaction Processing Familiarity 1,2
1 Introduction to parallel databases, Familiarity 1,2
Architecture
1 2 Data partitioning Strategy Usage 1,2
1 Interquery and Intraquery Familiarity 1,2
parallelism
1 Introduction to Distributed Familiarity 1,2
databases, Architecture
1 Fragmentation Familiarity 1,2
1 Replication Familiarity 1,2
1 4 Distributed Query Processing Usage, 1,2
Assessment
1 Distributed Transaction Processing Usage, 1,2
Assessment
1 4 Introduction to spatial databases, Usage 1,2
Types of spatial data
1 Indexing in spatial databases Familiarity 1,2
1 2 Mobile databases , transaction in Familiarity 1,2
MDS
1 2 XML –introduction, DTD 1,2
1 Schema and validating XML with Usage, 1,2
it, Xquery Assessment
1 Semantic web, Ontologies Familiarity 1,2
1 2 RDF, RDFs Usage, 1,2
Assessment

1 Introduction to Database Security Familiarity 1,3


Issues
1 Security Models Familiarity 1,3
1 Different threats to databases Familiarity, 1,3
Assessments

1 Counter Measures to deal with Familiarity 1,3


problems
1 Cloud databases Familiarity, 5
Usage

1 Streaming Databases Familiarity 5


1 8 Graph Databases ,New SQL Familiarity, 5
Databases Usage

2 Recent Trends Familiarity


30 Hours 30 Hours

Approved by Academic Council No.:41 Date:17.06.2016

You might also like