Active Databases As Information Systems
Active Databases As Information Systems
Dina Goldin
Computer Science & Engr. Dept.
U. Conn
Storrs, CT, 06269
[email protected]
Srinath Srinivasa
Computer Science Dept.
IIIT
Bangalore, India
[email protected]
Abstract
Database driven Information Systems (IS) have two distinct integrity concerns: static, or data integrity and dynamic integrity. Static integrity addresses situations within
a particular database state, or instance and dynamic integrity is concerned with state sequences instead.
Dynamic integrity issues come to the fore when services
are designed over the database. Information systems are
defined as a collection of services and dynamic integrity
is therefore an essential part of IS design. Consequently,
designers of information systems have been routinely concerned with dynamic integrity.
Active Databases (ADBs) that add an element of dynamics into database systems, have mainly addressed static integrity concerns. While there have been some attempts to incorporate dynamic integrity concerns within the framework
of ADBs, a general solution is still wanting. This paper addresses dynamic integrity concerns in ADBs by changing
the perspective of an ADB from a database system augmented with rules to a database driven IS that offers rulebased services. This change in perspective offers a promising approach for addressing ADB shortcomings, and reveals a roadmap of directions for future ADB research.
1. Introduction
Historically database systems were designed to manage
large collections of static data. Data values in any database
instance are related through certain dependencies and restricted through certain constraints. The focus of database
design was to efficiently maintain these relationships among
data elements whenever the database was updated.
In contrast, the community of information system (IS) researchers addressed issues of managing information within
a larger dynamic system. The general model of an IS is in
the form of a collection of semantic services.
Vijaya Srikanti
Computer Science & Engr. Dept.
U. Conn
Storrs, CT, 06269
[email protected]
Services were initially defined and modeled as sequences of tasks resulting in various flow based representations (task-flow diagrams, workflow diagrams,
activity flowcharts, etc.) However, there is a growing realization that services are best designed using models of
interaction, developed largely in the reactive systems domain [Sri01, EGM98]. This change in perspective comes
from the observation that semantic services are usually interactive in nature, involving intermediate exchanges with
one or more external environments.
Today, most database systems offer features that go beyond management of static data and most information systems are powered by a database. As a result the distinction between the two are blurred and the terms are sometimes used interchangeably. Several issues that have been
addressed in one of the fields are reinvented in the context
of the other.
The distinction between a database and an information
system is best appreciated when we consider their function.
The job of a database is to store data and answer queries.
This entails addressing issues like data models, schema design, handling distributed data, maintaining data consistency, answering queries, etc. Updates to the stored data set
happen in atomic operations, where each update is (logically) isolated from all other update operations. Query answering is a history-less, atomic operation. As long as there
are no updates, a given query will result in the same response, regardless of how many other queries are running
concurrently and how many times the same query has been
asked before. Such a behavior is representative of closed algorithmic computation [GST00].
By contrast, the job of an information system is to provide a service, which are semantic entities entailing considerations that span the life cycle of the larger system. Except in trivially small information systems, services are interactive in nature involving user sessions with one or more
users. An interactive service need not be a closed, atomic
operation. It may involve many intermediate states where
the external environment may influence the flow of the com-
2. Active Databases
Traditionally, database systems have been passive, storing and retrieving data in direct response to user requests
without initiating any operations on their own. As the scale
and complexity of data management increased, interest has
grown in bringing active behavior into databases, allowing
them to respond independently to data-related events. Typically this behavior is described by event-condition-action
(ECA) rules.
ECA rules comprise three components: event E, condition C, and action A. The event describes an external happening to which the rule may be able to respond. The condition examines the context in which the event has taken
place. The action describes the task to be carried out by the
rule if the relevant event has taken place and the condition
has evaluated to true [PD99]. In sum, if the specified event
E occurs and if the condition C is true then the specified action A is executed.
While there is agreement that all ADBs must detect
event occurrences, support rule management, and execute
actions [DGG95], there is no consensus on how the events,
conditions and actions are specified. Rule conditions may
get arbitrarily complex and rule conditions may have to be
monitored in one of many different ways [RS99].
Databases
Nature: Algorithmic
State: User Data
Information Systems
Interactive
User data, logs
and histories, user profiles
Job:
Updates and queries of data Data backed services
to users
Output: Determined completely by Individualized based on
query/update specification. user history/preferences
<
>
<
>
<
On <event E1>
if <condition C1>
do <abort>
On event E1
if condition C1
do send message to user
>
>
>
>
From the above examples, (b) is confluent, but not observably deterministic. Rules in (c) are not confluent. If the
PC chair sends a paper having himself as the first author
and the paper has more than 5000 words, then depending
on the order of execution of the rules, the author may or
may not receive a message about the wordcount in the paper. Also, depending on how rules in (c) are scheduled, the
count of the number of papers submitted by the PC-Chair
may or may not be incremented after submission of the paper. All the above rulesets are terminating since there is no
recursive invocation of rules.
While several strategies like dynamic grouping of rules
have been suggested to ensure termination and confluence,
it has not been found effective for all rule sets. ADBs simply assume that rules are terminating and confluent [RS99],
but this is not always the case and user interaction is sometimes required to ensure their termination. The development
of effective rule analysis systems awaits further work on
communicating the results of analysis to users. This suggests that user should be modeled in a stronger sense in
ADBs to realize the full capabilities of ADBs. For example,
Active Databases as
specialized IS
Interactive
User data, rule-related
logs and histories,
rule-related user profiles
Job:
Updates and queries of data Data backed rule-based
by user as well as rule-driven services to users
Output: Determined completely by
Individualized based on
query/update specification. user history/preferences
services.
7. Conclusions
Active databases have been traditionally considered as
data transformation systems, with research methods that
borrow from traditional database arsenal. This is despite the
fact that one of the design issues in active databases is in
bringing application-level integrity concerns to the database
level. As a result, designing the rule system of ADBs is
a challenge and the mapping between application requirements and the system of rules remains complicated.
We argue for a change in viewpoint, so active databases
are embraced as a special (restricted) type of an information system rather than a special (augmented) type of a
database. As such, they are interactive service-providing
systems rather than mere data transformation engines. This
change in perspective offers a promising approach for addressing ADB shortcomings, and reveals a roadmap of additional features for ADBs.
Note that our argument can also be applied to objectoriented database systems (OODBs) [ABD89]. In contrast
with object-relational databases, a pure OODB is best
viewed as an IS. This change of perspective will yield a
roadmap of desired OODB research directions analogous to
that for ADBs.
References
[ABD89] M. Atkinson, F. Bancilhon, D. DeWitt, K. Dittrich,
D. Maier, S. Zdonik. The Object-Oriented Database System Manifesto. Proc. First Intl Conf. on Deductive and
OODBMS, 1989.
[BM99] A. Borgida, Y. Murata. Tolerating exceptions in workflows: a unified framework for data and processes. In Intl
Joint Conf. on Work Activities, Coordination and Collaboration (WACC99), ACM Press, pp. 59-68, 1999.
[Bry97] F. Bry. Query Answering for Information Systems with
Integrity constraints., Proc. IICIS, 1997.
[CCF01] F. Casati, S. Castano, and M. Fugini. Managing Workflow Authorization Constraints through Active Database
Technology . Information Systems Frontiers 3:3, Sep. 2001.
[CCPP99] F. Casati, S. Ceri, S. Paraboschi, G. Pozzi. Specification and Implementation of Exceptions in Workflow Management Systems. ACM Trans. on Database Systems, 24:3, 1999,
pp. 405-451.
[CT94] J. Chomicki, D. Toman. Implementing temporal Integrity
constraints using an Active DBMS. IEEE Proc. on Data and
Knowledge Eng., 1994.
[DGG95] K.R.Dittrich, S.Giatziu, A.Geppert. The Active
Database System Manifesto: A Rulebase of ADBMS features, In T.Sellis (ed.): Proc. (RIDS), Athens, Greece, September 1995. LNCS, Springer 1995.
[EE91] L.J.B.Essink, W.J.Erhart. Object Modeling and System
Dynamics in the Conceptualization Stages of IS Dev., in
Object Oriented Approach in IS. F.Van Assche, B.Moulin,
C.Rolland (ed.) Elsevier Pub.1991 IFIP.
[EGM98] H.D.Ehrich, U.Goltz, J.Meseguer. Information Systems as Reactive Systems. Schloss Dagstuhl Intl Conf. and Research Center for Computer Science, 16.02.-20.02.98, Seminar No:98071, Report No:200.
[GL96] M.Gertz, U.W.Lipeck. Deriving optimized monitoring
triggers from dynamic integrity constraints. IEEE Proc. of
Data and Knowledge Eng.,Vol. 20(2), pp. 163-193, 1996.
[Gol00] D. Goldin. Persistent Turing Machines as a Model of
Interactive Computation. in: K-D. Schewe and B. Thalheim
(Eds.), First Intl Symposium (FoIKS2000). LNCS, Vol.1762,
Springer-Verlag, Berlin 2000, pp. 116-135.
[GST00] D. Goldin, S. Srinivasa, B. Thalheim. Information Systems=DBS+Interaction: Towards principles of Information
System Design. Proc. ER 2000, Utah, USA, Oct. 2000.
[HW00] W-J.v.d. Heuvel, H. Weigand. Cross-Organizational
Workflow Integration using Contracts. Business Objects and
Component Design and Implementation Workshop. Proc.
ACM OOPSLA 2000, Minneapolis, USA, Oct. 2000.
[LSKM00] Z. Luo, A. Sheth, K. Kochut, J. Miller. Exception handling in workflow systems. Applied Intelligence: the Intl J.
of AI, Neural Networks, and Complex Problem-Solving Technologies 13:2, pp. 125-147, Sep. 2000.
[Mey92] B. Meyer. Applying Design by Contracts. Comm. of
the ACM 25:10, Oct. 1992.
[MPP92] Z. Manna, A. Pnueli. The Temporal Verification of Reactive and Concurrent Systems. Springer-Verlag, 1992.
[PD99] N. W.Paton, Os. Diaz. Active Database Systems, ACM
Computing Surveys, Vol 31, No 1, March 1999.
[RS95] A. Reuter, F. Schwenkreis. Contracts -a low-level mechanism for building general purpose workflow management systems. IEEE Data Eng. Bulletin, 18(1), 1995.
[RS99] T. Risch, M. Skold. Monitoring Complex Rule Conditions. N. W. Paton (Ed.): Active Rules in Database Systems.
Springer, New York 1999, pp. 81-102.
[Sri01] S. Srinivasa. Channel Sensitivity In Reactive Systems.
Proc. Intl Conf. on High Performance Computing (HiPC),
Embedded Systems Workshop, India, Dec. 2001.
[SS03] Y. Saito, M. Shapiro, Optimistic Replication. Microsoft
Research Tech Report MSR-TR-2003-60 , Oct. 2003.
[SW91] H.J.Schneider, A.I.Wasserman. Automated Tools for Information Systems Design, IFIP WG 8.1 Working Conf., Jan.
82.
[SWO95] A.P. Sistla, O. Wolfson. Temporal Conditions and Integrity Constraints in Active Database Systems. Proc. 1995
ACM SIGMOD Intl Conf. on Management of Data, pp. 269
280, 1995.
[Wei99] G. Weiss (Ed). Multiagent Systems: A Modern Approach
to Distributed Artificial Intelligence. MIT Press, 2000.
[WG99] P.Wegner, D.Goldin. Interaction as a Framework for
Modeling. In Chen, et. al. (Eds.) Conceptual Modeling: Current Issues and Future Directions, LNCS 1565, April 1999.
[KW00] L. Kerschberg, D. J.Weishar. Conceptual Models and Architectures for Advanced Information Systems. Applied Intelligence, Vol. 13,pp. 149-164,2000.