Session 1a-Data and Data Management
Session 1a-Data and Data Management
Database
• Serves many applications by centralizing data and controlling redundant data
Database management system (DBMS)
• Interfaces between applications and physical data files
• Separates logical and physical views of data
• Solves problems of traditional file environment
Controls redundancy
Eliminates inconsistency
Uncouples programs and data
Enables organization to centrally manage data and data security
Human Resources Database with
Multiple Views
Relational DBMS
Big data
• Massive sets of unstructured/semi-structured data from web
traffic, social media, sensors, and so on
Volumes too great for typical DBMS
• Petabytes, exabytes of data
Can reveal more patterns, relationships and anomalies
Requires new tools and technologies to manage and analyze
Contemporary Business Intelligence
Infrastructure
Analytical Tools: Relationships,
Patterns, Trends
Text mining
• Extracts key elements from large unstructured data sets
• Sentiment analysis software
Web mining
• Discovery and analysis of useful patterns and information from
web
• Web content mining
• Web structure mining
• Web usage mining
Databases and the Web
Many companies use the web to make some internal databases available
to customers or partners
Typical configuration includes:
• Web server
• Application server/middleware/C G I scripts
• Database server (hosting D B M S)
Advantages of using the web for database access:
• Ease of use of browser software
• Web interface requires few or no changes to database
• Inexpensive to add web interface to system
Linking Internal Databases to the Web
Establishing an Information Policy