Advanced Concepts-Unit-5

Mobile applications

Uploaded by

224aki
Copyright
© © All Rights Reserved

ADVANCED TOPICS:

Introduction to Parallel Databases


Companies need to handle huge amounts of data at high data transfer rates. Client-server
and centralized systems are not efficient enough for this. The need to improve efficiency
gave birth to the concept of parallel databases.

A parallel database system improves data-processing performance by using multiple resources
in parallel, for example multiple CPUs and disks.

It also parallelizes many operations, such as data loading and query processing.
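
The idea of parallel query processing can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `orders` rows and the function names are hypothetical, and threads stand in for the separate CPUs and disks of a real parallel database. The rows are split into partitions, each partition is aggregated by its own worker, and the partial results are combined.

```python
from concurrent.futures import ThreadPoolExecutor


def partition(rows, n):
    """Split rows into n partitions (round-robin), one per worker."""
    return [rows[i::n] for i in range(n)]


def partial_sum(rows):
    """Aggregate a single partition: sum the amount column."""
    return sum(amount for _, amount in rows)


def parallel_total(rows, workers=4):
    """Aggregate every partition in its own worker, then combine the results."""
    parts = partition(rows, workers)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return sum(pool.map(partial_sum, parts))


# Hypothetical relation: (order_id, amount) rows.
orders = [(i, i * 10) for i in range(1, 101)]
total = parallel_total(orders)
```

The result is the same whether one worker or several are used; only the way the work is divided changes.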

Goals of Parallel Databases

Parallel databases were designed with the following goals:

Improve performance:
The performance of the system can be improved by connecting multiple CPUs and disks in
parallel. Many small processors can also be connected in parallel.

Improve availability of data:
Data can be copied to multiple locations to improve its availability.
For example, if a module that holds a relation (a table in the database) becomes unavailable,
the relation should still be available from another module.

Improve reliability:
The reliability of the system improves with the completeness, accuracy and availability of
its data.

Provide distributed access to data:
Companies with branches in multiple cities can access their data with the help of a parallel
database system.

Multimedia databases
Multimedia databases are used to store multimedia data such as images, animation,
audio and video along with text. This data is stored in multiple file types
such as .txt (text), .jpg (images), .swf (video) and .mp3 (audio).
Contents of the Multimedia Database
The multimedia database stores the multimedia data and the information related to it. This is
described in detail as follows:
Media data
This is the multimedia data that is stored in the database, such as images, video, audio
and animation.
Media format data
The media format data contains the formatting information related to the media data, such
as sampling rate, frame rate and encoding scheme.
Media keyword data
This contains the keyword data related to the media in the database. For an image, the
keyword data can be the date and time of the image, a description of the image, etc.
Media feature data
The media feature data describes the features of the media data. For an image, feature data
can be the colours of the image, the textures in the image, etc.
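
The four kinds of content listed above can be pictured as the fields of a single record. The following Python sketch is illustrative only: the class name `MediaRecord`, its field names and the sample values are assumptions, not part of any particular multimedia database product.

```python
from dataclasses import dataclass, field


@dataclass
class MediaRecord:
    media_data: bytes    # the raw image, audio or video content
    format_data: dict    # e.g. encoding scheme, frame rate, sampling rate
    keyword_data: dict   # e.g. description, date and time
    feature_data: dict = field(default_factory=dict)  # e.g. colours, textures


def find_by_keyword(records, key, value):
    """Return the records whose keyword data contains the given key/value pair."""
    return [r for r in records if r.keyword_data.get(key) == value]


photo = MediaRecord(
    media_data=b"\xff\xd8\xff",  # placeholder bytes, not a real image
    format_data={"encoding": "JPEG", "width": 640, "height": 480},
    keyword_data={"description": "sunset", "taken_on": "2024-01-15"},
    feature_data={"dominant_colours": ["orange", "purple"]},
)
clip = MediaRecord(
    media_data=b"\x00\x00\x00",  # placeholder bytes, not a real video
    format_data={"encoding": "H.264", "frame_rate": 30},
    keyword_data={"description": "lecture"},
)
matches = find_by_keyword([photo, clip], "description", "sunset")
```

Searching on keyword data avoids having to inspect the (large) media data itself.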
Challenges of Multimedia Database
There are many challenges in implementing a multimedia database. Some of these are:

• Multimedia databases contain data in a wide variety of formats such
as .txt (text), .jpg (images), .swf (video) and .mp3 (audio). It is difficult to convert
data from one format to another.
• A multimedia database requires a large amount of storage, because multimedia data is
large and must be stored reliably in the database.
• Multimedia data takes a long time to process, so multimedia databases can be slow.
Advantages and Disadvantages of MM Database
Advantages:

• They support multiple formats of data (text, audio, video, etc.).

• Data independence: the database and its management are separated from the
application program.

• They provide access and authorization control.

• Query support: multimedia databases can query data represented in different
formats (media data, textual data) in a uniform way, query different media sources
simultaneously, and conduct classical database operations across them.
Disadvantages:

• Multimedia data, such as video, is usually large; therefore, multimedia
data often requires a large amount of storage.

• Multimedia databases consume a lot of processing time as well as bandwidth.


Mobile Databases
Mobile databases are separate from the main database and can easily be transported to
various places. Even though they are not permanently connected to the main database, they
can still communicate with it to share and exchange data.
A mobile database environment includes the following components:
• The main system database, which stores all the data and is linked to the mobile
database.
• The mobile database, which allows users to view information even while on the move.
It shares information with the main database.
• The device that uses the mobile database to access data. This device can be a mobile
phone, laptop, etc.
• A communication link that allows the transfer of data between the mobile database
and the main database.
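
How these components interact can be sketched with Python's built-in `sqlite3` module. This is a simplified illustration, not a real synchronization protocol: the `visits` table, the `synced` flag and the function names are assumptions, and two in-memory databases stand in for the mobile database and the main database.

```python
import sqlite3


def make_db():
    """Create an in-memory database with a hypothetical visits table."""
    db = sqlite3.connect(":memory:")
    db.execute(
        "CREATE TABLE visits (id TEXT PRIMARY KEY, note TEXT, synced INTEGER DEFAULT 0)"
    )
    return db


def record_visit(mobile, visit_id, note):
    """Store a change locally on the device; no connection to the main database is needed."""
    mobile.execute("INSERT INTO visits (id, note) VALUES (?, ?)", (visit_id, note))
    mobile.commit()


def synchronize(mobile, main):
    """When the communication link is available, push unsynced rows to the main database."""
    pending = mobile.execute("SELECT id, note FROM visits WHERE synced = 0").fetchall()
    for visit_id, note in pending:
        main.execute(
            "INSERT OR REPLACE INTO visits (id, note, synced) VALUES (?, ?, 1)",
            (visit_id, note),
        )
    mobile.execute("UPDATE visits SET synced = 1 WHERE synced = 0")
    main.commit()
    mobile.commit()
    return len(pending)


main_db = make_db()
mobile_db = make_db()
record_visit(mobile_db, "v1", "Delivered parcel")
record_visit(mobile_db, "v2", "Customer absent")
pushed = synchronize(mobile_db, main_db)
main_count = main_db.execute("SELECT COUNT(*) FROM visits").fetchone()[0]
```

A real system would also pull changes from the main database and resolve conflicts; the sketch covers only the upload direction.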
Advantages of Mobile Databases
Some advantages of mobile databases are:
• The data in a database can be accessed from anywhere using a mobile database. It
provides wireless database access.
• Mobile databases keep the database systems synchronized, so multiple users
can access the data seamlessly.
• Mobile databases require very little support and maintenance.
• The mobile database can be synchronized with multiple devices such as mobile phones,
desktop computers and laptops.
Disadvantages of Mobile Databases
Some disadvantages of mobile databases are:
• Mobile data is less secure than data stored in a conventional stationary
database. This presents a security hazard.
• The mobile unit that houses a mobile database may frequently lose power because
of its limited battery. This must not lead to loss of data in the database.

Multidimensional Databases

• Multidimensional databases are used mostly for OLAP (online analytical processing)
and data warehousing. They can be used to show multiple dimensions of data to
users.
• A multidimensional database is created from multiple relational databases. While
relational databases allow users to access data in the form of queries,
multidimensional databases allow users to ask analytical questions related to
business or market trends.
• Multidimensional databases use MOLAP (multidimensional online analytical
processing) to access their data. They allow users to get answers to their
requests quickly by generating and analysing the data rapidly.
• The data in multidimensional databases is stored in a data cube format. This means
that data can be seen and understood from many dimensions and perspectives.

Example

• The revenue and costs of a company can be understood and analysed on the basis of
various factors such as the company's products, the geographical locations of the
company's offices, the time taken to develop a product, the promotions done, etc.
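
The data cube idea can be illustrated with a small Python sketch. The fact rows, the three dimensions and the `roll_up` function below are hypothetical; a real MOLAP engine stores and precomputes such aggregates far more efficiently. The sketch only shows how the same revenue figures can be viewed along different dimensions.

```python
from collections import defaultdict

# Hypothetical fact rows: (product, region, quarter, revenue).
DIMENSIONS = ("product", "region", "quarter")
facts = [
    ("laptop", "north", "Q1", 100),
    ("laptop", "south", "Q1", 150),
    ("phone", "north", "Q1", 200),
    ("phone", "north", "Q2", 250),
]


def roll_up(rows, keep):
    """Sum revenue over every dimension that is not listed in keep."""
    positions = [DIMENSIONS.index(name) for name in keep]
    totals = defaultdict(int)
    for row in rows:
        key = tuple(row[i] for i in positions)
        totals[key] += row[3]
    return dict(totals)


by_product = roll_up(facts, ["product"])
by_region_quarter = roll_up(facts, ["region", "quarter"])
grand_total = roll_up(facts, [])
```

Each call answers a different analytical question (revenue per product, revenue per region and quarter, total revenue) from the same underlying facts.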

Advantages of Multidimensional Databases

Some advantages of multidimensional databases are:
• Increased performance: for analytical queries, the performance of multidimensional
databases is typically much better than that of conventional databases such as
relational databases.
• Easy maintenance: a multidimensional database is easy to handle and maintain.
• Better data presentation: the data in a multidimensional database is multi-faceted and
contains many different factors. Hence, the data can be presented far better than in
conventional databases.

Disadvantages of Multidimensional Databases

• A disadvantage of multidimensional databases is that they are quite complex, and it
takes trained professionals to truly understand and analyse the data in the database.
Data Warehouse

A data warehouse is a relational database management system (RDBMS) construct built to meet
requirements that transaction processing systems are not designed for. It can be loosely
described as any centralized data repository that can be queried for business benefit. It is
a database that stores information oriented towards satisfying decision-making requests. It
comprises a group of decision support technologies aimed at enabling knowledge workers
(executives, managers, and analysts) to make better and faster decisions. Data warehousing
thus provides architectures and tools for business executives to systematically organize,
understand and use their information to make strategic decisions.

A data warehouse environment contains an extraction, transformation, and loading (ETL)
solution, an online analytical processing (OLAP) engine, client analysis tools, and other
applications that handle the process of gathering information and delivering it to business
users.
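
The ETL step can be sketched with Python's standard `csv` and `sqlite3` modules. The source data, the `sales` table and the function names are assumptions made for illustration; production ETL tools handle many sources, error cases and much larger volumes.

```python
import csv
import io
import sqlite3

# Hypothetical export from an operational system, with inconsistent formatting.
SOURCE_CSV = """order_id,customer,amount
1, alice ,10.50
2,BOB,20.00
3,alice,5.25
"""


def extract(text):
    """Extract: read the raw rows from the source."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: clean customer names and convert values to proper types."""
    return [
        (int(r["order_id"]), r["customer"].strip().lower(), float(r["amount"]))
        for r in rows
    ]


def load(db, rows):
    """Load: write the cleaned rows into the warehouse table."""
    db.execute(
        "CREATE TABLE IF NOT EXISTS sales (order_id INTEGER, customer TEXT, amount REAL)"
    )
    db.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    db.commit()


warehouse = sqlite3.connect(":memory:")
load(warehouse, transform(extract(SOURCE_CSV)))
totals = warehouse.execute(
    "SELECT customer, SUM(amount) FROM sales GROUP BY customer ORDER BY customer"
).fetchall()
```

After loading, an analytical query such as the total per customer runs against consistent data, regardless of how the source wrote the names.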

What is a Data Warehouse?

A data warehouse (DW) is a relational database that is designed for query and analysis
rather than transaction processing. It includes historical data derived from transaction data
from one or more sources.

A data warehouse provides integrated, enterprise-wide, historical data and focuses on
providing support for decision-makers for data modelling and analysis.

A data warehouse is a collection of data that serves the entire organization, not only a
particular group of users.

It is not used for daily operations and transaction processing but for making decisions.

A Data Warehouse can be viewed as a data system with the following attributes:

o It is a database designed for investigative tasks, using data from various applications.
o It supports a relatively small number of clients with relatively long interactions.
o It includes current and historical data to provide a historical perspective of
information.
o Its usage is read-intensive.
o It contains a few large tables.

"Data Warehouse is a subject-oriented, integrated, and time-variant store of information in
support of management's decisions."

Characteristics of Data Warehouse


Subject-Oriented

A data warehouse focuses on the modelling and analysis of data for decision-makers.
Therefore, data warehouses typically provide a concise and straightforward view around a
particular subject, such as customer, product, or sales, instead of the organization's
ongoing operations as a whole. This is done by excluding data that is not useful for the
subject and including all data needed by the users to understand the subject.

Integrated

A data warehouse integrates various heterogeneous data sources such as RDBMSs, flat files, and
online transaction records. Data cleaning and integration must be performed during data
warehousing to ensure consistency in naming conventions, attribute types, etc., among
the different data sources.
Time-Variant

Historical information is kept in a data warehouse. For example, one can retrieve data from
3 months, 6 months or 12 months ago, or even older data, from a data warehouse. This
contrasts with a transaction system, where often only the most current data is kept.

Non-Volatile

The data warehouse is a physically separate data store, which is transformed from the
source operational RDBMS. Operational updates of data do not occur in the data
warehouse, i.e., update, insert, and delete operations are not performed. It usually requires
only two procedures for data access: initial loading of data and access to data. Therefore,
the DW does not require transaction processing, recovery, and concurrency capabilities,
which allows for a substantial speedup of data retrieval. Non-volatile means that once
data has entered the warehouse, it should not change.
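
The time-variant and non-volatile properties can be illustrated together. In the Python sketch below, the class `Warehouse` and its methods are hypothetical: each load appends dated snapshots and never modifies earlier ones, so a query can ask what a value was at any point in the past.

```python
from datetime import date


class Warehouse:
    """Append-only store of dated snapshots (illustrative, not a real product)."""

    def __init__(self):
        self._snapshots = []  # (load_date, product, price); rows are never updated or deleted

    def load(self, load_date, rows):
        """Load step: append new snapshots, leaving earlier ones untouched."""
        for product, price in rows:
            self._snapshots.append((load_date, product, price))

    def price_as_of(self, product, as_of):
        """Access step: return the most recent price loaded on or before as_of."""
        candidates = [
            (loaded, price)
            for loaded, name, price in self._snapshots
            if name == product and loaded <= as_of
        ]
        if not candidates:
            return None
        return max(candidates, key=lambda c: c[0])[1]


dw = Warehouse()
dw.load(date(2024, 1, 1), [("widget", 9.99)])
dw.load(date(2024, 4, 1), [("widget", 12.49)])
old_price = dw.price_as_of("widget", date(2024, 2, 15))
new_price = dw.price_as_of("widget", date(2024, 6, 1))
```

An operational system would typically overwrite the price; here both values remain available for historical analysis.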
History of Data Warehouse

The idea of data warehousing emerged in the late 1980s, when IBM researchers Barry Devlin
and Paul Murphy developed the "Business Data Warehouse."

In essence, the data warehousing idea was intended to provide an architectural model for
the flow of information from operational systems to decision support environments.
The concept attempted to address the various problems associated with this flow, mainly the
high costs associated with it.

In the absence of a data warehousing architecture, a vast amount of space was required to
support multiple decision support environments. In large corporations, it was common for
various decision support environments to operate independently.

Goals of Data Warehousing

o Support reporting as well as analysis.
o Maintain the organization's historical information.
o Be the foundation for decision making.

Need for Data Warehouse

Data Warehouse is needed for the following reasons:

1. Business users: Business users require a data warehouse to view summarized data
from the past. Since these people are non-technical, the data may be presented to
them in an elementary form.
2. Store historical data: A data warehouse is required to store time-variant data
from the past. This data is then used for various purposes.
3. Make strategic decisions: Some strategies may depend on the data in the
data warehouse. So, the data warehouse contributes to making strategic decisions.
4. Data consistency and quality: By bringing the data from different sources to a
common place, the user can effectively bring uniformity and
consistency to the data.
5. Quick response time: A data warehouse has to be ready for somewhat unexpected
loads and types of queries, which demands a significant degree of flexibility and
quick response time.

Benefits of Data Warehouse

1. Understand business trends and make better forecasting decisions.
2. Data warehouses are designed to perform well with enormous amounts of data.
3. The structure of data warehouses is easier for end-users to navigate,
understand, and query.
4. Queries that would be complex in many normalized databases can be easier to
build and maintain in data warehouses.
5. Data warehousing is an efficient method to manage demand for lots of information
from lots of users.
6. Data warehousing provides the capability to analyze a large amount of historical
data.

Prerequisites

Before learning about Data Warehouse, you must have the fundamental knowledge of basic
database concepts such as schema, ER model, structured query language, etc.

Audience

This tutorial will help computer science students to understand the basic-to-advanced
concepts associated with data warehousing.

Difference between OLTP and OLAP

The table below lists the differences between OLTP and OLAP in a data warehouse:

Parameter | OLTP | OLAP
Process | It is an online transactional system. It manages database modification. | OLAP is an online analysis and data retrieving process.
Characteristic | It is characterized by large numbers of short online transactions. | It is characterized by a large volume of data.
Functionality | OLTP is an online database modifying system. | OLAP is an online database query management system.
Method | OLTP uses a traditional DBMS. | OLAP uses the data warehouse.
Query | Insert, update, and delete information from the database. | Mostly select operations.
Table | Tables in an OLTP database are normalized. | Tables in an OLAP database are not normalized.
Source | OLTP and its transactions are the sources of data. | Different OLTP databases become the source of data for OLAP.
Data integrity | An OLTP database must maintain data integrity constraints. | An OLAP database does not get frequently modified. Hence, data integrity is not an issue.
Response time | Its response time is in milliseconds. | Response time is in seconds to minutes.
Data quality | The data in the OLTP database is always detailed and organized. | The data in the OLAP process might not be organized.
Usefulness | It helps to control and run fundamental business tasks. | It helps with planning, problem-solving, and decision support.
Operation | Allows read/write operations. | Only read and rarely write.
Audience | It is a customer-oriented process. | It is a market-oriented process.
Query type | Queries in this process are standardized and simple. | Complex queries involving aggregations.
Back-up | Complete backup of the data combined with incremental backups. | OLAP only needs a backup from time to time. Backup is not as important as in OLTP.
Design | DB design is application oriented. Example: database design changes with industry, like retail, airline, banking, etc. | DB design is subject oriented. Example: database design changes with subjects like sales, marketing, purchasing, etc.
User type | It is used by data-critical users like clerks, DBAs and database professionals. | It is used by data knowledge users like workers, managers, and CEOs.
Purpose | Designed for real-time business operations. | Designed for analysis of business measures by category and attributes.
Performance metric | Transaction throughput is the performance metric. | Query throughput is the performance metric.
Number of users | This kind of database allows thousands of users. | This kind of database allows only hundreds of users.
Productivity | It helps to increase users' self-service and productivity. | It helps to increase the productivity of business analysts.
Challenge | Data warehouses historically have been a development project which may prove costly to build. | An OLAP cube is not an open SQL server data warehouse. Therefore, technical knowledge and experience are essential to manage the OLAP server.
Process | It provides fast results for daily used data. | It ensures that the response to a query is consistently quicker.
Characteristic | It is easy to create and maintain. | It lets the user create a view with the help of a spreadsheet.
Style | OLTP is designed to have fast response time and low data redundancy, and is normalized. | A data warehouse is created uniquely so that it can integrate different data sources for building a consolidated database.
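
The contrast between the two workloads can be shown with Python's `sqlite3` module. The `accounts` table and the function names are assumptions for illustration, and for brevity both workloads run against the same in-memory database, whereas in practice OLAP queries usually run against a separate data warehouse.

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE accounts (id INTEGER PRIMARY KEY, branch TEXT, balance REAL)")
db.executemany(
    "INSERT INTO accounts VALUES (?, ?, ?)",
    [(1, "north", 100.0), (2, "north", 50.0), (3, "south", 200.0)],
)
db.commit()


def transfer(conn, src, dst, amount):
    """OLTP style: a short transaction that modifies a few rows."""
    with conn:  # commits on success, rolls back if an exception is raised
        conn.execute("UPDATE accounts SET balance = balance - ? WHERE id = ?", (amount, src))
        conn.execute("UPDATE accounts SET balance = balance + ? WHERE id = ?", (amount, dst))


def balance_by_branch(conn):
    """OLAP style: a read-only query that aggregates over many rows."""
    return conn.execute(
        "SELECT branch, SUM(balance) FROM accounts GROUP BY branch ORDER BY branch"
    ).fetchall()


transfer(db, 1, 3, 25.0)
report = balance_by_branch(db)
```

The transfer touches two rows and must be atomic; the report reads every row but changes nothing.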

Introduction to NoSQL
A NoSQL (originally referring to "non-SQL" or "non-relational") database provides a
mechanism for storage and retrieval of data that is modeled by means other than the
tabular relations used in relational databases. Such databases have existed since the
late 1960s, but did not obtain the NoSQL moniker until a surge of popularity in the early
twenty-first century. NoSQL databases are used in real-time web applications and big data,
and their use is increasing over time. NoSQL systems are also sometimes called "Not only
SQL" to emphasize the fact that they may support SQL-like query languages.
The advantages of a NoSQL database include simplicity of design, simpler horizontal scaling
to clusters of machines and finer control over availability. The data structures used by
NoSQL databases are different from those used by default in relational databases, which
makes some operations faster in NoSQL. The suitability of a given NoSQL database depends on
the problem it should solve. Data structures used by NoSQL databases are sometimes also
viewed as more flexible than relational database tables.
Many NoSQL stores compromise consistency in favor of availability, speed and partition
tolerance. Barriers to the greater adoption of NoSQL stores include the use of low-level
query languages, the lack of standardized interfaces, and huge previous investments in
existing relational databases. Most NoSQL stores lack true ACID (Atomicity, Consistency,
Isolation, Durability) transactions, but a few databases, such as MarkLogic, Aerospike,
FairCom c-treeACE, Google Spanner (though technically a NewSQL database), Symas
LMDB, and OrientDB, have made them central to their designs.
Most NoSQL databases offer a concept of eventual consistency, in which database changes
are propagated to all nodes eventually, so queries for data might not return updated data
immediately or might read data that is not accurate, a problem known
as stale reads. Some NoSQL systems may also exhibit lost writes and other forms of data
loss. Some NoSQL systems provide concepts such as write-ahead logging to avoid data
loss. For distributed transaction processing across multiple databases, data consistency is
an even bigger challenge. This is difficult for both NoSQL and relational databases. Even
current relational databases do not allow referential integrity constraints to span
databases. Few systems maintain both X/Open XA standards and ACID
transactions for distributed transaction processing.
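
Eventual consistency and stale reads can be demonstrated with a toy model in Python. The classes below are hypothetical and greatly simplified: writes go to a primary copy and reach the replicas only when `propagate` is called, which stands in for the replication delay of a real system.

```python
class Replica:
    """One copy of the data."""

    def __init__(self):
        self.data = {}


class EventuallyConsistentStore:
    """Toy store: writes hit the primary first and reach replicas later."""

    def __init__(self, n_replicas=2):
        self.primary = Replica()
        self.replicas = [Replica() for _ in range(n_replicas)]
        self._pending = []  # writes that have not been propagated yet

    def write(self, key, value):
        self.primary.data[key] = value
        self._pending.append((key, value))

    def read(self, key, replica=0):
        """Reads are served by a replica, which may lag behind the primary."""
        return self.replicas[replica].data.get(key)

    def propagate(self):
        """Apply all pending writes, in order, to every replica."""
        for key, value in self._pending:
            for r in self.replicas:
                r.data[key] = value
        self._pending.clear()


store = EventuallyConsistentStore()
store.write("stock", 5)
store.propagate()
store.write("stock", 4)
stale = store.read("stock")  # the replica has not yet seen the latest write
store.propagate()
fresh = store.read("stock")
```

Between the second write and the second propagation, a reader sees the old value; once propagation completes, all replicas agree again.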
Advantages of NoSQL:
There are many advantages to working with NoSQL databases such as MongoDB and
Cassandra. The main advantages are high scalability and high availability.

1. High scalability –
NoSQL databases use sharding for horizontal scaling. Sharding means partitioning the data
and placing it on multiple machines in such a way that the order of the data is preserved.
Vertical scaling means adding more resources to the existing machine, whereas
horizontal scaling means adding more machines to handle the data. Vertical scaling is
not that easy to implement, but horizontal scaling is easy to implement. Examples of
horizontally scaling databases are MongoDB, Cassandra, etc. NoSQL can handle huge
amounts of data because of this scalability: as the data grows, NoSQL scales itself to
handle that data efficiently.
2. High availability –
The auto-replication feature of NoSQL databases makes them highly available, because in
case of a failure the data can be restored from a replica to the previous consistent state.
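
Horizontal scaling by sharding can be sketched as follows. This example uses hash-based sharding, which is one common scheme and an assumption here: it spreads keys across shards but, unlike the order-preserving (range-based) partitioning described above, it does not keep keys in order. The class and function names are hypothetical, and plain dictionaries stand in for separate machines.

```python
import hashlib


def shard_for(key, n_shards):
    """Map a key to a shard index using a stable hash of the key."""
    digest = hashlib.sha256(key.encode("utf-8")).hexdigest()
    return int(digest, 16) % n_shards


class ShardedStore:
    """Key-value store whose data is spread over several shards."""

    def __init__(self, n_shards):
        self.shards = [{} for _ in range(n_shards)]  # each dict stands in for one machine

    def put(self, key, value):
        self.shards[shard_for(key, len(self.shards))][key] = value

    def get(self, key):
        return self.shards[shard_for(key, len(self.shards))].get(key)


store = ShardedStore(n_shards=3)
for user_id in ("u1", "u2", "u3", "u4", "u5", "u6"):
    store.put(user_id, {"name": user_id.upper()})
found = store.get("u4")
sizes = [len(shard) for shard in store.shards]
```

Because the shard is computed from the key alone, any node can route a request without a central lookup. With this simple modulo scheme, however, changing the number of shards moves most keys to a different shard.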
Disadvantages of NoSQL:
NoSQL has the following disadvantages.
1. Narrow focus –
NoSQL databases have a very narrow focus: they are mainly designed for storage and
provide very little other functionality. Relational databases are a better choice than
NoSQL in the field of transaction management.
2. Open source –
Many NoSQL databases are open source, and there is no reliable standard for NoSQL yet.
In other words, two NoSQL database systems are likely to differ from each other.
3. Management challenge –
The purpose of big data tools is to make the management of a large amount of data as
simple as possible. But it is not so easy. Data management in NoSQL is much more
complex than in a relational database. NoSQL, in particular, has a reputation for being
challenging to install and even more demanding to manage on a daily basis.
4. GUI is not available –
GUI tools for accessing the database are not widely available in the market.
5. Backup –
Backup is a weak point of some NoSQL databases. MongoDB, for example, has been
criticized for lacking a simple approach to backing up data in a consistent manner.
6. Large document size –
Some database systems, such as MongoDB and CouchDB, store data in JSON format. This
means that documents can be quite large (a concern for big data, network bandwidth and
speed), and descriptive key names actually hurt, since they increase the document size.
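
The effect of key names on document size is easy to measure. The Python sketch below compares two hypothetical documents that hold the same values under descriptive and abbreviated keys; the documents and the helper `encoded_size` are illustrative assumptions.

```python
import json


def encoded_size(doc):
    """Size in bytes of the document when serialized as compact JSON."""
    return len(json.dumps(doc, separators=(",", ":")).encode("utf-8"))


# The same values stored under descriptive and under abbreviated key names.
verbose = {
    "customer_first_name": "Ada",
    "customer_last_name": "Lovelace",
    "customer_loyalty_points": 120,
}
compact = {"fn": "Ada", "ln": "Lovelace", "lp": 120}

verbose_size = encoded_size(verbose)
compact_size = encoded_size(compact)
saved = verbose_size - compact_size
```

Because every document repeats its key names, the saving applies to each stored document, at the cost of less readable data.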
Types of NoSQL database:
The types of NoSQL databases, with examples of database systems that fall into each
category, are listed below. MongoDB, for example, falls in the category of document-based
NoSQL databases.
1. Key-value store: Memcached, Redis, Coherence
2. Tabular: HBase, Bigtable, Accumulo
3. Document-based: MongoDB, CouchDB, Cloudant