Database Definition
Database Definition
This
data is often stored electronically in a computer system called a database management system (DBMS).
Databases typically organize data in rows and columns for easy processing and retrieval. Oftentimes,
you’ll need to use a programming language, such as structured query language (SQL), to interact with
your database.
Databases are similar to spreadsheets, but there are several key differences. In general, databases are
much larger than spreadsheets and so can store more data, and they allow for multiple users to access
data at the same time. For these reasons, people who work with data, such as data analysts and data
scientists, often work with databases rather than spreadsheets.
Database Types
There are many different approaches to analyzing the different database types
available. The table below provides a general overview of the various types
currently available:
1
A relational database provides a set of data rows in response to a query. A query
language, most commonly the Structured Query Language or SQL, helps create
these data views.
Relational Database Features
The main features of a relational database are:
ACID compliant. The database retrains integrity while performing transactions.
Range of data types. Provides the capability to store any data as well as carry out
complex queries.
Collaborative. Multiple users are able to access the database and work on the
same project.
Secure. Access is limited or restricted through user permissions.
Stable. Relational databases are well-understood and documented.
What are Relational Databases Used For?
Relational databases are the most implemented database type. There are many use
cases, some of which include:
Online transaction systems. The database supports many users as well as
frequent queries needed in online transactions.
IoT. Relational databases are lightweight and have the processing power needed
for edge computing.
Data warehouses. The critical component of the data warehouse architecture is
storage. Relational databases are easily integrated and optimized for massive
queries from multiple sources.
Most Popular Relational Databases
There are countless commercial as well as open-source databases.
The top ten most popular relational databases are:
1. Oracle 2. MySQL 3. Microsoft SQL Server 4. PostgreSQL 5. IBM Db2 6. SQLite 7.
Microsoft Access 8. MariaDB 9. Hive 10. Microsoft Azure SQL Database
2
A non-relational database, or NoSQL ("Not Only SQL"), is a type of database that
models and stores data differently from relational databases. Instead of tables, non-
relational databases model relationships between data in an alternative way.
3
3. Object Database
An object database similarly represents data to objects in object-oriented
programming.
The critical components of an object-oriented database are:
Objects. The basic building blocks for storing information.
Classes. The schema or blueprint for an object.
Methods. Structured behaviors of a class.
Pointers. Access elements of a database and establish relations between objects.
4
The two groups based on location are:
1. Centralized databases
2. Distributed databases
Centralized Database
A centralized database is stored as well as managed in a single location. The
information is available through a network. The end-user has access through the
network to the centralized computer, where the stored information resides.
Distributed Database
Distributed databases store information across different physical sites. The
database resides on multiple CPUs on a single site or spread out across various
locations. Due to the connections between the distributed databases, the
information appears as a single database to end-users.
5
The most exciting features of a distributed database are:
Location independency. The physical location of the database spreads out
across multiple sites.
Query processing distribution. A complex query splits into multiple sites, which
divides the tasks between different CPUs, reducing bottleneck.
Distributed transactions. Multiple storage locations provide a distributed
recovery method. Commit protocols exist in cases of numerous transactions.
Network linking. The distributed databases interlink through a network where
communication happens between the storages as well as with end-users.
Seamless integration. Although not physically connected, distributed database
parts connect into one logical database.
What Are Distributed Databases Used For?
Distributed databases work best in environments with many sectors where
companies should limit the available information to reduce redundancy. Some
examples include:
Large companies. Most company sectors do not need a complete overview of
data. Distributed databases help reduce the redundancy of data with individual
departments.
Global enterprises. Due to location independence, this database type fits well
with companies with multiple sites.
Analytical Database
Analytical databases provide a unified view of all data available within a business. A
complete overview of information within a database is essential for planning,
6
reporting, and making decisions. The database is known as an online analytical
processing (OLAP) database.
On-premises Database
An on-premises database resides in-house. All the software, infrastructure as well as
administration needed for support is local. With large-scale enterprises, the storage
grows to a local data center.
Cloud Database
A cloud database is a hosting solution given by a third-party provider. The pay-as-go
solution provides the database-as-a-service, avoiding the need to set up a data
center physically. The agile approach minimizes the initial investments needed to
acquire data space while quickly expanding as more resources are required.
7
Cost. With a cloud database, you only pay for what you need. The cost of investing
in technical staff, as well as maintenance, is minimized.
NOTE: None of the database types are a replacement for a different kind of
database, and using multiple systems provides versatility over data management.