Information System Integration: June 2000
Information System Integration: June 2000
Information System Integration: June 2000
discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/220419996
CITATIONS READS
316 1,486
1 author:
Wilhelm Hasselbring
Christian-Albrechts-Universität zu Kiel
301 PUBLICATIONS 2,893 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Wilhelm Hasselbring on 30 May 2014.
INFORMATION SYSTEM
INTEGRATION
F
or information systems, it is increasingly technology, which requires interdisciplinary cooper-
difficult to draw a line around an appli- ation between the application domain and informa-
cation system and say that you own and tion technology.
control it. For example, as value chains The technology architecture layer defines the infor-
extend beyond enterprises, supplier and mation and communication infrastructure. At this
customer systems become part of each layer, IT is challenged to achieve the business
other’s information architectures. Fur- requirements.
thermore, in many application areas, data is distrib- It is important to realize that Figure 1 does not ade-
uted over a multitude of heterogeneous, often quately reflect the reality. In practice, the business
autonomous information systems, and an exchange architectures of the individual organizational units
of data among them is not easy. Figure 1 illustrates cannot be treated in isolation: the business processes
such a vertical fragmentation of organizational units. of cooperating units are highly interrelated and should
Each unit may be structured within three architec- be handled as such. Figure 2 illustrates this situation.
tural layers, as described in the following. Certain kinds of interactions among computer sys-
The business architecture layer defines the organi- tems resemble interactions among people; thus, it is
zational structure and the workflows for business important to consider all levels when integrating those
rules and processes. It is a conceptual level expressed systems. A horizontal integration of the layers is
in terms meaningful to actual users of application required to support the business processes effectively,
systems. as indicated here.
The application architecture layer defines the Interorganizational processes. At this layer, busi-
actual implementation of the business concepts in ness engineering [1] seeks to organize a commercial
terms of enterprise applications. At this layer, it is undertaking in a competitive way, whereby business
the central goal to provide the “glue” between the processes cut horizontally through the traditional
application domain described in the business archi- organization structure. Business process reengineering
tecture and the technical solutions described in the aims at continuously improving those processes.
technology architecture. Research in information To support the intraorganizational business
JAMES GARY
systems aims at filling the gap between business and processes within organizations effectively, the existing
or participatory development involving end When it comes to SI, coordination is required for
F users, applications should be built incrementally
using techniques such as Rapid Application Develop-
managing shared resources and dependencies among
activities in and across systems; it is obvious we can
ment (RAD), Joint Application Design (JAD), and learn from the trade-offs between different disci-
Prototyping. This special section, however, primarily plines involved in SI. Malone and Crowston [2] already
concentrates on the interdisciplinary nature of SI emphasized that the study of coordination is relevant
among the different disciplines in IT, and discusses for such dissimilar disciplines as computer science,
end-user involvement merely as a side issue. The linguistics, and operations research. Resource alloca-
articles in this special section are organized accord- tion, for instance, is widely studied in economics,
ing to specific application domains. A variety of IT organization theory, as well as in IT.
disciplines are studying problems and solutions for Database systems. Problems of coupling and inte-
information SI: grating heterogeneous database and information sys-
tems have been addressed in the database area for
Parallel and distributed systems. Research in oper- some time. While research on parallel and distributed
ating systems, computer networks, and parallel pro- systems emphasizes the integration of computational
gramming systems concentrates, to a great extent, on components, research on database systems is more
managing the coexistence and coordination of multi- concerned with the integration of data. Federated
ple concurrent activities. In these areas, communica- database systems, for instance, approach the inte-
tion among system components and their gration of heterogeneous databases by means of
synchronization are common problems to be solved. schema integration. However, when it comes to man-
With parallel programming, for instance, paral- aging transactions over multiple local systems, for
lelism is used as a coordination mechanism and, instance for executing transactional workflows, the
accordingly, programming is divided into two separate problems to be solved are often very similar to those
activities: a sequential language that can be used to that arise in parallel and distributed systems. These
build single-threaded computations; whereas a coor- similarities sometimes cause the reinvention of tech-
dination language is used to coordinate the activity of niques that have already been elaborated in other
those computations. Thus parallel programming con- disciplines.
sists of putting together components and letting them Software engineering. When it comes to SI, we deal
cooperate. with complex systems of systems. Software engineer-
syntax and the semantics of standardized messages Organizational Unit Organizational Unit
must be defined. Traditional EDI (Electronic Data
Interchange) is often being reexamined to define the
meaning of the transferred data, and XML is
employed as the practical foundation used to structure Enterprise Application Integration also addresses a
this information. semantic level.
Middleware integration. At this layer, the tech- The borderline between Enterprise Application and
niques for building componentized information sys- middleware integration cannot always be pinpointed
tems with state-of-the-art infrastructures such as precisely. For instance, the Object Management Archi-
CORBA, database gateways, and transaction moni- tecture of the OMG defines the Object Request Bro-
tors, are employed. Middleware integration addresses ker, which can be deployed for middleware
the syntactical level (“plumbing” and “wiring”) while integration, and also high-level services (such as busi-
ing is concerned with the systematic development of Artificial intelligence. Mediator and multiagent
such complex systems. Work in this area deals with architectures are developed in this area to achieve the
questions of adequate software architectures and integration of heterogeneous information sources by
design patterns for complex systems, composition of means of (distributed) artificial intelligence tech-
software components, the proper use and extension of niques [6]. Usually, the integration is managed
middleware tools, and methodological approaches for through the mediators and agents by means of logical
the integration process. rules employing artificial intelligence techniques.
With component-based development, it is Computations have to be coordinated, and distributed
expected that software systems may be created and access to data/knowledge bases and ontologies is
maintained at lower costs and with increased stabil- essential for the deployment of mediator and multia-
ity through reuse of approved components in flexible gent architectures.
software architectures [5]. When those components Multimedia systems. Often, the information to be
are information systems, a frequent requirement is integrated for multimedia systems consists of com-
that the systems to be integrated are to remain posite objects comprising different media components
autonomous. Preexisting applications (legacy sys- such as text, video, image, or audio (for example, for
tems) must still be able to use their local data with- digital libraries). Techniques for multimedia systems,
out modification. In this way (financial) investment such as MIME or the XML-based synchronized multime-
can be preserved and a smooth migration toward dia language SMIL (www.w3.org/AudioVideo), are
modern systems can take place. The notion of “feder- required in that context.
ation” is originally a political term: several states join Multimedia systems are often distributed over
together and constitute a federal system in which computer networks. The Web, which utilizes the Inter-
each state retains its autonomy up to a certain net, is an example of a huge, but not well-organized
degree. This idea of federation can be transferred to multimedia system. With respect to SI, content-
the integration of preexisting information systems, descriptive metadata relating to the meaning of the
which could have been developed independently actual multimedia objects plays an important role in
(autonomously) within different departments of an distributed multimedia systems. For realizing multime-
enterprise. Here, considerable overlap with research dia systems, techniques for distributed systems,
on databases and parallel/distributed systems databases, software engineering, and artificial intelli-
exists. gence are essential. c
Dimensions of Information SI
espite the differences among the various dis-
D ciplines involved, the work on information SI
focuses to a great extent on three issues: auton-
Figure 3. Problem dimensions for SI: Autonomy,
heterogeneity, and distribution.The dashed
arrows indicate some general approaches to
omy, heterogeneity, and distribution as illustrated
manage these issues.
in Figure 3: distribution
Usually, SI aims at approaching the origin in this unmarshals (de-linearizes) them, and calls the true
system of coordinates in Figure 3. Typical solutions for local callee. The callee procedure itself, just as the
the respective dimensions are: caller, follows local calling conventions and is
Distribution. Proxy services are an established unaware of being called remotely. The marshaling and
technique for “hiding” distribution. The idea of unmarshaling are responsible for converting data val-
remote procedure calls (RPCs), for instance, is to ues from their local representations to some interme-
replace the local callee’s and the remote caller’s ends diate network format, and vice versa. The stubs can
of the procedure calls by stubs. The caller uses strictly be regarded as proxies for the corresponding local
local calling conventions giving him the illusion to call procedures.
a local callee. In reality, it calls a (generated) stub The Objects Management Group’s CORBA architec-
that marshals (linearizes) the parameters and sends ture, for instance, extends remote procedure calls to
them to the remote end. At that end, another stub remote method calls in an object-oriented setting
(sometimes called skeleton) receives the parameters, (www.omg.org).
Heterogeneity. Due to the independent develop- on the latter semantics issues, partially because
ment and deployment of component systems, hetero- semantic problems are often not noticed until the
geneity occurs at various levels and for various basic access problems are solved.
reasons. On a technical level, heterogeneity comes Autonomy. Autonomy of component systems is a
from different hardware platforms, operating systems, critical issue for SI. Components may be autonomous in
database management systems, and programming their design, meaning their developers chose the cov-
languages. On a conceptual level, heterogeneity comes ered universe of discourse, programming models, nam-
from different programming and data models as well as ing concepts, and so forth. The systems may also be
different understanding and modeling of the same autonomous with respect to communication and exe-
real-world concepts, for example, the use of the same cution, meaning that a component may independently
name to denote different concepts (homonyms) decide how to handle interaction with the outside
and the use of different names for the same concept world.
(synonyms). The feasibility of reducing autonomy by technical
Bridging heterogeneity is one of the most difficult means is highly limited. Usually, autonomy can only be
tasks of SI. Typical techniques for overcoming hetero- reduced in connection with organizational changes.
geneity are the use of common programming and data The implications of, for instance, enforcing a two-
models, and similar structuring of information. phase commit over several local databases by means
Domain-specific standards are useful for defining the of a transaction monitor may be unacceptable to the
meaning of information to be shared among dissimilar corresponding organizational departments due to the
organizations. Wrappers that provide unified interfaces impact on the local system’s execution performance.
are an established technique for integrating legacy As illustrated in Figure 3, SI aims at approaching the
systems. origin in this system of coordinates. However, it is not
The fact that autonomy of a source means not only always possible—and even not always reasonable—
having heterogeneity of access and representation attempting to eliminate autonomy, heterogeneity, or
(computer system, operating system, database sys- distribution entirely. For instance, distribution is a
tem, interface conventions, and so forth), but also matter of fact when connecting systems of dissimilar
content heterogeneity (partial overlap, different orga- organizations. Autonomy allows for flexible architec-
nization, differences in term semantics) should be tures whereby individual subsystems are able to adapt
understood. For instance, the address of a person may themselves to changing requirements. By allowing for
be an attribute of person objects in one system and an heterogeneity, organizational departments may
entity with its own identity in another system. More choose the optimal systems for achieving their individ-
work has been done on the former technical issues than ual business goals. c
Wilhelm Hasselbring
([email protected]) is an assistant
professor at the Infolab in the
Department of Information
Management and Computer Science,
Tilburg University, The Netherlands.
References
1. Jacobson, I., Ericsson, M., and Jacobson,
A. The Object Advantage: Business Process
Reengineering with Object Technology.
Addison-Wesley, Wokingham, England,
1994.
2. Malone, T. and Crowston, K. The inter-
disciplinary study of coordination. ACM
Computing Surveys 26, 1 (1994), 87–119.
3. Michelis, G.D., et al. A three-faceted view
of information systems. Commun. ACM
41, 12 (Dec. 1998), 64–70.
4. Munz, R. Usage scenarios of DBMS.
Keynote Address at VLDB’99, Sept.
1999; www.dcs.napier.ac.uk/~vldb99/
IndustrialSpeakerSlides/.
5. Szyperski, C. Component Software. Addi-
son-Wesley, Harlow, England, 1998.
6. Wiederhold, G., Ed. Intelligent Integra-
tion of Information. Kluwer Academic
Publishers, Boston, 1996.