Lecture Database Course Introdutcion For Student
Lecture Database Course Introdutcion For Student
• Relational Algebra
• Data Warehouse technical architecture
• ETL tools
• Advanced SQL programing
Course work
▪ Class Participation: 10 %
▪ Mid-term Exam / Group work: 30 %
▪ Final exam: 60%
Textbook and references
Text book
• [1] Phan Tấn Quốc, Nguyễn Thị Uyên Nhi, Giáo trình Cơ sở dữ liệu, NXB
ĐHQG TPHCM, ISBN: 978-604-73-7236-2, 2019
References
• [2] Malik, U., Goldwasser, M. and Johnston, B., 2019. SQL for Data
Analytics. Birmingham: Packt Publishing, Limited.
• [4] Kimball, R., Ross, M., 2013. The Data Warehouse Toolkit. 3rd ed.
Hoboken: John Wiley & Sons.
How to take most out of this course
• Full attendance
• Pay attentions
• Do the homework
Customer
CustomerID CustomerName CustomerPhone
Orders
OrderID CustomerID OrderDate
• Summaries
2020 Transactions
• Trends
• Business information
Data Processing
Data processing is the conversion of raw data to meaningful information through a
process.
System System
Stream Processing: each new piece of Upload Upload
data is processed when it arrives.
Input 1 Input 2
Basic concepts of database
Data warehousing:
Large amounts of data can be imported from multiple sources and structured to enable high-performance
queries
IoT:
Although typically considered for non-relational, the data from IoT devices could be structured and consistent
The characteristics of relational data
Tables
Customers
CustomerID CustomerName CustomerPhone
Data is stored in a table
100 Muisto Linna XXX-XXX-XXXX
101 Noam Maoz XXX-XXX-XXXX
Table consists of rows and columns
102 Vanja Matkovic XXX-XXX-XXXX
103 Qamar Mounir XXX-XXX-XXXX
104 Zhenis Omar XXX-XXX-XXXX All rows have same # of columns
105 Claude Paulet XXX-XXX-XXXX
106 Alex Pettersen XXX-XXX-XXXX
Each column is defined by a datatype
107 Francis Ribeiro XXX-XXX-XXXX
The characteristics of relational data
Entities
Customers
CustomerID CustomerName CustomerPhone
100 Muisto Linna XXX-XXX-XXXX
101 Noam Maoz XXX-XXX-XXXX
102 Vanja Matkovic XXX-XXX-XXXX
103 Qamar Mounir XXX-XXX-XXXX
104 Zhenis Omar XXX-XXX-XXXX
105 Claude Paulet XXX-XXX-XXXX
106 Alex Pettersen XXX-XXX-XXXX
An entity is a representation of an item which can be physical (such as a customer or a product), or virtual (such as an
order).
Entities are connected by relations enabling interaction. For example, a customer can place an order for a product
The characteristics of relational data
Normalization
Customers Orders
CustomerID CustomerName CustomerPhone OrderID CustomerName CustomerPhone
100 Muisto Linna XXX-XXX-XXXX AD100 Noam Maoz XXX-XXX-XXXX
101 Noam Maoz XXX-XXX-XXXX AD101 Noam Maoz XXX-XXX-XXXX
102 Vanja Matkovic XXX-XXX-XXXX AD102 Noam Maoz XXX-XXX-XXXX
103 Qamar Mounir XXX-XXX-XXXX AX103 Qamar Mounir XXX-XXX-XXXX
104 Zhenis Omar XXX-XXX-XXXX AS104 Qamar Mounir XXX-XXX-XXXX
105 Claude Paulet XXX-XXX-XXXX AR105 Claude Paulet XXX-XXX-XXXX
106 Alex Pettersen XXX-XXX-XXXX MK106 Muisto Linna XXX-XXX-XXXX
Examples:
## Customer 1 ID: 1
Name: Mark Hanson
Telephone: [ Home: 1-999-9999999, Business: 1-888-8888888, Cell: 1-777- 7777777 ]
Address: [ Home: 121 Main Street, Some City, NY, 10110,
Business: 87 Big Building, Some City, NY, 10111 ]
## Customer 2 ID: 2
Title: Mr
Name: Jeff Hay
Telephone: [ Home: 0044-1999-333333, Mobile: 0044-17545-444444 ]
Address: [ UK: 86 High Street, Some Town, A County, GL8888, UK,
US: 777 7th Street, Another City, CA, 90111 ]
Multiple entities in the same collection or container Have a different, Are often defined by labeling each field with the
with different fields non-tabular schema name it represents
Identify non-relational database use cases
Gaming:
In-game stats, social media integration, leaderboards, low-latency applications
Frequently used in combination with Machine Learning capabilities to “extract data” by using:
Text Analytics
Sentiment Analysis
Computer Vision
What is NoSQL?
34
Process to design a good database
Data Modeling
35
Process to design a good database
36
Process to design a good database
37
Process to design a good database
38
Process to design a good database
39
Process to design a good database
40
Three levels of Data Modeling
41
THANK YOU !