Additional Terminology
Additional Terminology
The database administrator (DBA) is the individual who has overall responsibility for developing and maintaining the database and for
establishing controls to protect its integrity. Thus, only the DBA should be able to update data dictionaries. In small systems, the DBA may
perform some functions of a DBMS. In larger applications, the DBA uses a DBMS as a primary tool.
● The responsibility for creating, maintaining, securing, restricting access to, and redefining and restructuring the database belongs
to the DBA.
The data dictionary is a file that describes both the physical and logical characteristics of every data element in a database.
● The data dictionary includes, for example, the name of the data element (e.g., employee name, part number), the amount of disk
space required to store the data element (in bytes), and what kind of data is allowed in the data element (e.g., alphabetic,
numeric).
● The data dictionary contains the size, format, usage, meaning, and ownership of every data element. This greatly simplifies the
programming process.
Data from a relational database can be displayed in graphs and reports, changed, and otherwise controlled using a program called
Query Management Facility (QMF).
The schema is a description of the overall logical structure of the database using data-definition language, which is the connection
between the logical and physical structures of the database.
● A subschema describes a particular user’s (application’s) view of a part of the database using data definition language.
The database mapping facility is software that is used to evaluate and document the structure of the database.
An object-oriented database is a response to the need to store not only numbers and characters but also graphics and multimedia
applications.
● Translating these data into tables and rows is difficult. However, in an object-oriented database, they can be stored, along with the
procedures acting on them, within an object.
Advanced database systems provide for online analytical processing (OLAP), also called multidimensional data analysis, which is the
ability to analyze large amounts of data from numerous perspectives. OLAP is an integral part of the data warehouse concept.
● Using OLAP, users can compare data in many dimensions, such as sales by product, sales by geography, and sales by
salesperson.
■ In-memory analytics is a process that queries data stored directly in the computer’s memory (i.e., RAM) as opposed to querying
data located on the computer’s internal or external storage devices (e.g., hard drives).
■ Search engine technology stores data at a document/transaction level, and data is not pre aggregated like it would be when
contained in an OLAP or in-memory technology application. Users are able to have full access to their raw data and create the
aggregations themselves.
A data warehouse contains not only current operating data but also historical information from throughout the organization. Thus,
data from all operational systems are integrated, consolidated, and standardized in an organization-wide database into which data are
copied periodically. These data are maintained on one platform and can be read but not changed.
Data cleansing cleans up data in a database that is incorrect, incomplete, or duplicated before loading it into the database. It
improves the quality of data. The need for data cleansing increases when multiple data sources are integrated.
Data mining, which is covered in Study Unit 20, Subunit 2, is the process of analyzing data from different perspectives and summarizing
it into useful information. Data mining is facilitated by a data warehouse and ordinarily uses data mining software.
● For example, data mining software can help find abnormal patterns and unforeseen correlations among the data.
A data mart is a subset of the data warehouse that can be tailored to user data requirements. Because data marts are developed for
specific groups of users, they are often created for individual business lines or departments.