db lecture 1 (1) (2) (2)
db lecture 1 (1) (2) (2)
Database
• A Database is an organized collection of logically related data
• A database may be of any size and complexity For example
– a salesperson may maintain a small database of customer contacts
consisting of a few megabytes of data on her laptop computer.
– A large corporation may build a large database consisting of several
terabytes of data (a terabyte is a trillion bytes) on a large mainframe
computer that is used for decision support applications
– Very large data warehouses contain more than a petabyte of data. (A
petabyte is a quadrillion bytes.)
Data
• the term data referred to the facts concerning objects and
events that could be recorded and stored on computer media
Structured data
• The most important structured data types are numeric, text,
character, and dates.
• Structured data are stored in tabular form (in tables,
relations, arrays, spreadsheets, etc.) and are most commonly
found in traditional databases and data warehouses.
• For example, in a salesperson’s database, the data would
include facts such as customer name, address, and telephone
number
• This type of data is called structured data
Unstructured data
• Databases today are used to store objects such as documents,
e-mails, tweets, Facebook posts, GPS information, maps,
photographic images, sound, and video segments in addition
to structured data.
– For example, the salesperson’s database might include a photo image
of the customer contact, a sound recording or video clip about the
most recent product.
• This type of data is referred to as unstructured data, or as
multimedia data
Unstructured data
• unstructured data are multimedia data such as documents,
e-mails, maps, images, sound, and video clips
• Unstructured data are commonly found in web server and
web enabled databases
Structured and unstructured data
• Today structured and unstructured data are often combined in
the same database to create a true multimedia environment.
– For example, an automobile repair shop can combine
structured data (describing customers and automobiles)
with multimedia data (photo images of the damaged autos
and scanned images of insurance claim forms).
This is useful information to some users, such as the course instructor and the registrar’s
office. Data security has increased
few organizations still use Social Security numbers as identifiers. Instead, most
organizations use an internally generated number for identification purposes.
Converting data to information
2. Summarized data
This information could be used as a basis for deciding whether to add new courses or to
hire new faculty members.
Examples of Data
• Some of the properties that are typically described include data names,
definitions, length (or size)
• Metadata describing data context include the source of the data, where
the data are stored
Meta data
• ” For example, when looking at one of the values of Year of
Birth in the Students table
– the data itself may be “1992”.
– The metadata about that value would be the field name Year of Birth,
the time it was last updated, and the data type (integer).
• Another example of metadata could be for an MP3 music file,
– information such as the length of the song, the artist, the album, the
file size, and even the album cover art, are classified as metadata.
• When a database is being designed, a “data dictionary” is
created to hold the metadata, defining the fields and structure
of
Meta data
• Metadata enable database designers and users to understand
what data exist, what the data mean, and how to distinguish
between
• Data items that at first glance look similar. Managing
metadata is at least as crucial as managing the associated
data because data without clear meaning can be confusing,
misinterpreted, or erroneous.
• Typically, much of the metadata are stored as part of the
database and may be retrieved using the same approaches
that are used to retrieve data or information.