Chapter 5 ITM100
Chapter 5 ITM100
Basic Concepts
● Entity: Person, place, thing, event about which information is maintained
● Attribute: Description of a particular entity
● Key field: Identifier field used to retrieve, update, sort a record
Enrollment System
● Maintains information about students, courses, schedule, and enrollment information
● Students: name, current major, gender, Student ID, advisor
● Courses: course identifier, department, course name, units
● Faculty: name, employee ID, department, courses taught
● Schedule: courses taught by all professors in all departments
Relational DBMS
● Represent data as two-dimensional tables
● Table: Grid of columns and rows
● Rows (tuples): Records for different entities
● Fields (columns): Represents attribute for entity
● Key field: Field used to uniquely identify each record
● Primary key: Field in table used for key fields
● Foreign key: Primary key used in second table as look-up field to identify
records from original table
Designing Databases
● Conceptual design: Abstract model of database from a business perspective
● Entity-relationship diagram: Methodology for documenting databases illustrating
relationships between database entities
● Normalization: Process of creating small stable data structures from complex groups of
data
● Physical design: Detailed description of how the data will actually be arranged and
stored on physical devices
Blockchain
● Distributed ledgers in a peer-to-peer distributed database
● Maintains a growing list of records and transactions shared by all
● Encryption used to identify participants and transactions
● Used for financial transactions, supply chain, and medical records
● Foundation of Bitcoin, and other cryptocurrencies
Data Mining
● Finds hidden patterns, relationships in datasets
● Example: Customer buying patterns
● Infers rules to predict future behavior
● Types of information obtainable from data mining:
● Associations
● Sequences
● Classification
● Clustering
● Forecasting
Text Mining and Web Mining
● Text mining: Extracts key elements from large unstructured data sets
● Web mining: Discovery and analysis of useful patterns and information from web
● Web content/structure/usage mining
● Sentiment analysis: Mines text comments in email, blog, social media conversation, or
survey to detect favorable and unfavorable opinions about specific subjects