1) Database systems improve on traditional file systems by providing data independence, better security, and easier integration of data from multiple sources through a centralized database managed by a DBMS.
2) Well-designed databases facilitate data management by identifying expected use and generating accurate information, while poorly designed databases can cause errors and inconsistencies.
3) Traditional file systems had problems including extensive programming needs, inability to perform ad hoc queries, difficulty changing structures, and inadequate security features due to structural and data dependence issues.
1) Database systems improve on traditional file systems by providing data independence, better security, and easier integration of data from multiple sources through a centralized database managed by a DBMS.
2) Well-designed databases facilitate data management by identifying expected use and generating accurate information, while poorly designed databases can cause errors and inconsistencies.
3) Traditional file systems had problems including extensive programming needs, inability to perform ad hoc queries, difficulty changing structures, and inadequate security features due to structural and data dependence issues.
Database Systems: – Complements and expands value of data
Design, Implementation, and Management • Database management system (DBMS):
Eighth Edition collection of programs – Manage structure and control access to data Chapter 1 Database Systems Role and Advantages of the DBMS • DBMS is the intermediary between the user and Data vs. Information the database • Data are raw facts • Database structure stored as file collection • Information is the result of processing raw data • Access database through the DBMS to reveal meaning • DBMS enables data to be shared • Information requires context to reveal meaning • DBMS integrates many users’ views of the data • Raw data must be formatted for storage, processing, and presentation • Advantages of a DBMS: • Data are the foundation of information, which is – Improved data sharing the bedrock of knowledge – Improved data security • Data: building blocks of information – Better data integration • Information produced by processing data – Minimized data inconsistency • Information used to reveal meaning in data – Improved data access • Accurate, relevant, timely information is the key – Improved decision making to good decision making – Increased end-user productivity • Good decision making is the key to organizational survival Types of Databases • Databases can be classified according to: Introducing the Database and the DBMS – Number of users • Database: shared, integrated computer structure – Database location(s) that stores a collection of data – Expected type and extent of use – End-user data: raw facts of interest to the • Single-user database supports only one user at end user a time – Metadata: data about data – Desktop database – single-user, runs on • Metadata provides description of data PC characteristics and relationships in data • Multiuser database supports multiple users at – Generates accurate and valuable the same time information – Workgroup database supports a small • Poorly designed database: number – Causes difficult-to-trace errors – Enterprise database supports a large number Historical Roots: Files and File Systems • Centralized database: data located at a single • Reasons for studying file systems: site – Complexity of database design easier to • Distributed database: data distributed across understand several different sites – Understanding file system problems helps to • Operational database: supports a company’s avoid problems with DBMS systems day-to-day operations – Knowledge of file system useful for – Transactional or production database converting file system to database system • Data warehouse: stores data used for tactical or • File systems typically composed of collection of strategic decisions file folders, each tagged and kept in cabinet • Unstructured data exist in their original state – Organized by expected use • Structured data result from formatting • Contents of each file folder logically related – Structure applied based on type of • Manual system served as a data repository for processing to be performed small data collections • Semistructured data have been processed to – Cumbersome for large collections some extent • Data processing (DP) specialist converted • Extensible Markup Language (XML) computer file structure from manual system represents data elements in textual format – Wrote software that managed the data • XML database supports semistructured XML data – Designed the application programs • Initially, computer files systems resembled Why Database Design is Important? manual systems • Database design focuses on design of database • As number of files increased, file systems evolved structure used for end-user data – Each file used its own application program – Designer must identify database’s expected to store, retrieve, modify data use – Each file owned by individual or department • Well-designed database: that commissioned its creation – Facilitates data management • Data processing (DP) manager supervised the • Structural independence: change file structure DP department without affecting data access • DP department’s primary activity remained • Data dependence: data access changes when programming data storage characteristics change • Data independence: data storage Problems with File System Data Management characteristics do not affect data access • File system an improvement over manual system • Practical significance of data dependence is – File systems used for more than two difference between logical and physical format decades • Logical data format: how human views the data – Understanding the shortcomings of file • Physical data format: how computer must work systems aids in development of modern with data databases • Each program must contain: – Many problems not unique to file systems – Lines specifying opening of specific file type • Even simple file system retrieval task required – Record specification extensive programming – Field definitions – Ad hoc queries impossible – Changing existing structure difficult Field Definitions and Naming Conventions • Security features difficult to program • Storing customer name as single field is a liability – Often omitted in file system environments – Better record definition breaks fields into • Summary of file system limitations: component parts – Requires extensive programming • Selecting proper field names important; field – Can not perform ad hoc queries names are descriptive – System administration complex and difficult – With proper naming conventions, file – Difficult to make changes to existing structure becomes self-documenting structures – Some software places restrictions on length – Security features likely to be inadequate of field names • Each record should have unique identifier Structural and Data Dependence • Structural dependence: access to a file Data Redundancy dependent on its own structure • File system structure makes it difficult to combine – All file system programs must be modified data from multiple sources to conform to a new file structure – Vulnerable to security breaches • Organizational structure promotes storage of – People same data in different locations – Procedures – Islands of information – Data • Data stored in different locations unlikely to be • Hardware: all the system’s physical devices updated consistently • Software: three types of software required: • Data redundancy: same data stored – Operating system software unnecessarily in different places – DBMS software • Data inconsistency: different and conflicting – Application programs and utility software versions of same data occur at different places • People: all users of the database system: • Data anomalies: abnormalities when all changes – System and database administrators in redundant data not made correctly – Database designers – Update anomalies – Systems analysts and programmers – Insertion anomalies – End users – Deletion anomalies • Procedures: instructions and rules that govern the design and use of the database system Database Systems • Data: the collection of facts stored in the • Database system consists of logically related data database stored in a single logical data repository • Database systems created and managed at – May be physically distributed among different levels of complexity multiple storage facilities • Database solutions must be cost-effective as well • DBMS eliminates most of file system’s problems as tactically and strategically effective • Current generation stores data structures, • Database technology already in use affects relationships between structures, access paths selection of a database system – Takes care of defining, storing, managing all access paths and components DBMS Functions • Most functions transparent to end users The Database System Environment – Can only be achieved through the DBMS • Database system: defines and regulates the • Data dictionary management collection, storage, management, use of data – DBMS stores definitions of data elements • Five major parts of a database system: and relationships (metadata) in a data – Hardware dictionary – Software – DBMS looks up required data component – DBMS provides backup and data recovery to structures and relationships ensure data safety and integrity – Changes automatically recorded in the – Recovery management deals with recovery dictionary of database after a failure – DBMS provides data abstraction, removes • Critical to preserving database’s structural and data dependency integrity • Data storage management • Data integrity management – DBMS creates and manages complex – DBMS promotes and enforces integrity rules structures required for data storage • Minimizes redundancy – Also stores related data entry forms, screen • Maximizes consistency definitions, report definitions, etc. – Data relationships stored in data dictionary – Performance tuning: activities that make used to enforce data integrity the database perform more efficiently – Integrity especially important in – DBMS stores the database in multiple transaction-oriented database systems physical data files • Database access languages and application • Data transformation and presentation programming interfaces – DBMS transforms data entered to conform – DBMS provides access through a query to required data structures language – DBMS transforms physically retrieved data – Query language is a nonprocedural to conform to user’s logical expectations language • Security management – Structured Query Language (SQL) is the – DBMS creates a security system that de facto query language enforces user security and data privacy • Standard supported by majority of – Security rules determine which users can DBMS vendors access the database, which items can be • Database communication interfaces accessed, etc. – Current DBMSs accept end-user requests • Multiuser access control via multiple different network environments – DBMS uses sophisticated algorithms to – Communications accomplished in several ensure concurrent access does not affect ways: integrity • End users generate answers to • Backup and recovery management queries by filling in screen forms through Web browser • DBMS automatically publishes • Database design defines the database structure predefined reports on a Web site • Well-designed database facilitates data • DBMS connects to third-party systems management, generates valuable to distribute information via e-mail information • Poorly-designed database leads to bad Managing the Database System: decision making, organizational failure A Shift in Focus • Databases evolved from manual and • Database system provides a framework in which computerized file systems strict procedures and standards enforced • In a file system, data stored in independent – Role of human changes from programming files to managing organization’s resources • Each requires its own management • Database system enables more sophisticated use program of the data • Some limitations of file system data • Data structures created within the database and management: their relationships determine effectiveness • Requires extensive programming • Disadvantages of database systems: • System administration complex and difficult – Increased costs • Changing existing structures difficult – Management complexity • Security features likely inadequate – Maintaining currency • Independent files tend to contain redundant – Vendor dependence data – Frequent upgrade/replacement cycles • Structural and data dependency problems SUMMARY • Data are raw facts • Information is the result of processing data to reveal its meaning • Accurate, relevant, timely information is the key to good decision making • Data usually stored in a database • DBMS implements a database and manages its contents • Metadata is data about data