CPS - Data Intensive Distributed Computing
Application Example
Dhiraj Tawani
Jyoti Tejwani
Namrata Shownkeen
[email protected]
[email protected]
[email protected]
Smt. S. H. Mansukhani Institute of Technology
Ulhasnagar, Dist. Thane, Maharashtra, India
Abstract
Modern scientific computing involves organizing, moving, visualizing, and
analyzing massive amounts of data from around the world, as well as employing large-scale computation. The distributed systems that solve large-scale problems will always
involve aggregating and scheduling many resources. Data must be located and staged,
cache and network capacity must be available at the same time as computing capacity,
etc.
Every aspect of such a system is dynamic: locating and scheduling resources,
adapting running application systems to availability and congestion in the middleware
and infrastructure, responding to human interaction, etc. The technologies, the
middleware services, and the architectures that are used to build useful high-speed, wide-area distributed systems constitute the field of data intensive computing. This paper
explores some of the history and future directions of that field, and describes a specific
medical application example.
Computational Grids enable the sharing of a wide variety of geographically
distributed resources including supercomputers, storage systems, databases, data sources
and specialized devices owned by different organizations in order to create virtual
enterprises and organizations. They allow selection and aggregation of distributed
resources across multiple organizations for solving large-scale computational and data
intensive problems in science, engineering and commerce. The parallel processing of
applications on wide-area distributed systems provides scalable computing power.
This enables exploration of large problems with huge data sets, which is essential for
creating new insights into the problem. Molecular modeling for drug design is one of the
scientific applications that can benefit from the availability of a large computational
capability.
Introduction
The advent of shared, widely available, high-speed networks is providing the
potential for new approaches to the collection, storage, and analysis of large data-objects.
Two examples of large data-object environments that, despite their very different application areas, have much in common are health care imaging information systems
and atomic particle accelerator detector data systems.
Health care information, especially high-volume image data used for diagnostic purposes - e.g. X-ray CT, MRI, and cardio-angiography - is increasingly collected at tertiary (centralized) facilities, and may now be routinely stored and used at locations other than the point of collection. The importance of distributed storage is that a hospital (or any other instrumentation environment) may not be the best environment in which to maintain a large-scale digital storage system, and an affordable, easily accessible, high-bandwidth network can provide location independence for such storage. The importance
of remote end-user access is that the health care professionals at the referring facility
(frequently remote from the tertiary imaging facility) will have ready access not only to the image analyst's reports, but also to the original image data itself.
This general scenario extends to other fields as well. In particular, the same basic
infrastructure is required for remote access to large-scale scientific and analytical
instruments, both for data handling and for direct, remote-user operation. In this paper we
describe and illustrate a set of concepts that are contributing to a generalized, high-performance, distributed information infrastructure, especially as it concerns the types of
large data-objects generated in the scientific and medical environments. We will describe
the general issues, architecture, and some system components that are currently in use to
support distributed large data-objects. We describe a health care information system that
has been built and is in prototype operation.
The concept of a high-speed distributed cache as a common element for all of the
sources and sinks of data involved in high-performance data systems has proven very
successful in several application areas, including the automated processing and
cataloguing of real-time instrument data and the staging of data from a Mass Storage
System (MSS) for high data-rate applications. For the various data sources and sinks, the
cache, which is itself a complex and widely distributed system, provides:
- a standardized approach for high data-rate interfaces;
- a unit of high-speed, on-line storage that is large compared to the available disks of the computing environments, and very large (e.g., hundreds of gigabytes) compared to any single disk.
The model for data intensive computing, shown in Figure 1, includes the following:
- Data sources deposit data in a distributed cache, and consumers take data from the cache, usually writing processed data back to the cache when the consumers are intermediate processing operations;
- A tertiary storage system manager typically migrates data to and from the cache.
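To make this data flow concrete, the following minimal sketch (in Python, with all class and function names invented for illustration, not taken from any real system) models the three roles above: sources depositing blocks into a shared cache, a consumer writing a derived result back, and a tertiary storage manager migrating blocks between the cache and an archive.

# Minimal sketch of the Figure 1 data-flow model: sources deposit data into a
# shared cache, consumers read from it (writing intermediate results back),
# and a tertiary-storage manager migrates blocks between cache and archive.
# All class and method names here are illustrative, not part of any real API.

class DistributedCache:
    """Toy stand-in for the distributed cache: a bounded block store."""
    def __init__(self, capacity_blocks):
        self.capacity = capacity_blocks
        self.blocks = {}          # block_id -> bytes

    def put(self, block_id, data):
        if len(self.blocks) >= self.capacity:
            raise RuntimeError("cache full; tertiary manager must evict first")
        self.blocks[block_id] = data

    def get(self, block_id):
        return self.blocks.get(block_id)   # None => not yet staged


class TertiaryStorageManager:
    """Migrates blocks between the cache and an (abstract) archive."""
    def __init__(self, cache, archive):
        self.cache, self.archive = cache, archive

    def stage_in(self, block_id):
        if self.cache.get(block_id) is None and block_id in self.archive:
            self.cache.put(block_id, self.archive[block_id])

    def migrate_out(self, block_id):
        data = self.cache.blocks.pop(block_id, None)
        if data is not None:
            self.archive[block_id] = data


if __name__ == "__main__":
    archive = {"ds1/blk0": b"raw frame 0"}            # pretend tape archive
    cache = DistributedCache(capacity_blocks=1024)
    mgr = TertiaryStorageManager(cache, archive)

    cache.put("ds1/blk1", b"raw frame 1")             # instrument (data source) deposits
    mgr.stage_in("ds1/blk0")                          # manager stages archived data
    processed = cache.get("ds1/blk0") + b" [processed]"
    cache.put("ds1/blk0.derived", processed)          # consumer writes result back
    mgr.migrate_out("ds1/blk0.derived")               # derived data archived
    print(sorted(archive))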
The cache can thus serve as a moving window on the object/dataset: depending on the size of the cache relative to the objects of interest, only part of the object data may be loaded in the cache, though the full object definition is present; that is, the cache is a moving window onto the off-line object/data set. The native cache access interface is at the logical block level, but client-side libraries implement various access I/O semantics - e.g., Unix I/O (upon request, available data is returned; requests for data in the dataset, but not yet migrated to cache, cause the application-level read to block or be signaled). The cache also isolates the application from tertiary storage systems and instrument data sources. Many large data sets may be logically present in the cache by virtue of the block index maps being loaded even if the data is not yet available. Data blocks are declustered (dispersed in such a way that as many system elements as possible can operate simultaneously to satisfy a given request) across both disks and servers. This strategy allows a large collection of disks to seek in parallel, and all servers to send the resulting data to the application in parallel, as shown in Figure 2. In this way processing can begin as soon as the first data
blocks are generated by an instrument or migrated from tertiary storage.
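The declustering strategy can be illustrated with a short sketch. The block-placement rule and the server and disk counts below are assumptions chosen for the example, not the actual layout used by the cache; the point is simply that a round-robin mapping lets every disk seek, and every server send, for a single large read.

# Illustrative sketch (not real cache code) of block declustering: logical
# blocks are spread round-robin across servers and disks so that many spindles
# can seek, and many servers can send, in parallel for one request (cf. Figure 2).

from concurrent.futures import ThreadPoolExecutor

NUM_SERVERS = 4          # assumed counts for the example
DISKS_PER_SERVER = 8

def place_block(logical_block):
    """Map a logical block number to a (server, disk) pair, round-robin."""
    server = logical_block % NUM_SERVERS
    disk = (logical_block // NUM_SERVERS) % DISKS_PER_SERVER
    return server, disk

def read_block(logical_block):
    """Pretend to fetch one block from its server/disk; returns a fake payload."""
    server, disk = place_block(logical_block)
    return f"block {logical_block} from server {server}, disk {disk}"

def parallel_read(block_list, max_workers=NUM_SERVERS * DISKS_PER_SERVER):
    """Issue all block reads concurrently, as a declustered layout allows."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(read_block, block_list))

if __name__ == "__main__":
    for line in parallel_read(range(8)):
        print(line)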
The Distributed Parallel Storage System (DPSS) [1], which implements this distributed cache, provides several important and unique capabilities for a data intensive computing environment. It provides application-specific interfaces to an extremely large
space of logical blocks; it offers the ability to build large, high-performance storage
systems from inexpensive commodity components; and it offers the ability to increase
performance by increasing the number of parallel disk servers. Various cache
management policies operate on a per-data set basis to provide block aging and
replacement.
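The specific policies are not described here, so the sketch below illustrates just one plausible scheme for per-data-set block aging and replacement: each data set receives its own quota of cache blocks and ages them out least-recently-used first. All names are invented for the example.

# A minimal sketch of per-data-set block aging and replacement. This is one
# plausible scheme (LRU within each dataset's own quota), not the actual DPSS
# policy; class, method, and variable names are invented for the example.

from collections import OrderedDict

class DatasetCachePartition:
    """LRU-managed block cache for a single dataset."""
    def __init__(self, quota_blocks):
        self.quota = quota_blocks
        self.blocks = OrderedDict()       # block_id -> data, oldest first

    def access(self, block_id, loader):
        """Return a block, loading (and possibly evicting) as needed."""
        if block_id in self.blocks:
            self.blocks.move_to_end(block_id)     # refresh age on a hit
            return self.blocks[block_id]
        if len(self.blocks) >= self.quota:
            evicted_id, _ = self.blocks.popitem(last=False)   # age out the oldest block
            print(f"evicting {evicted_id}")
        data = loader(block_id)                   # e.g. migrate from tertiary storage
        self.blocks[block_id] = data
        return data

if __name__ == "__main__":
    part = DatasetCachePartition(quota_blocks=2)
    fake_loader = lambda bid: f"<data for {bid}>"
    for bid in ["b0", "b1", "b0", "b2"]:          # b1 is evicted when b2 arrives
        part.access(bid, fake_loader)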
The Wide Area Large Data Object (WALDO) architecture [8] provides, among other capabilities, automatic cataloguing of the data and the metadata as the data is received (or as close to real time as possible), and the ability to manage the data on mass storage systems. For subsequent use, the data components may
be staged to a local disk and then returned as usual via the Web browser, or, as is the case
for several of our applications, moved to a high-speed cache for access by specialized
applications (e.g., the high-speed video player illustrated in the right-hand part of the
right-hand panel in Figure 3). The location of the data components on tertiary storage, how
to access them, and other descriptive material are all part of the LDO definition. The
creation of object definitions, the inclusion of standardized derived-data-objects as part
of the metadata, and the use of typed links in the object definition are intended to provide
a general framework for dealing with many different types of data, including, for
example, abstract instrument data and multi-component multimedia programs. WALDO
was used in the Kaiser project to build a medical application that automatically manages
the collection, storage, cataloguing, and playback of video-angiography data collected
at a hospital remote from the referring physician.
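As a rough illustration of what such an object definition might carry, the hypothetical data structure below captures the elements named above: component locations on tertiary storage, access information, descriptive metadata, derived-data-objects, and typed links. The field names and example values are invented for this sketch and are not taken from WALDO.

# Hypothetical sketch of a WALDO-style large data-object (LDO) definition,
# based only on the description in the text. Every field name and example
# value is invented for illustration.

from dataclasses import dataclass, field
from typing import Dict, List

@dataclass
class Component:
    name: str                 # e.g. "angiography sequence, run 1"
    storage_uri: str          # where the bytes live on tertiary storage
    access_method: str        # e.g. "stage-to-cache", "http"

@dataclass
class TypedLink:
    relation: str             # e.g. "derived-from", "thumbnail-of"
    target: str               # identifier of another LDO or component

@dataclass
class LargeDataObject:
    object_id: str
    metadata: Dict[str, str] = field(default_factory=dict)
    components: List[Component] = field(default_factory=list)
    derived_objects: List[Component] = field(default_factory=list)
    links: List[TypedLink] = field(default_factory=list)

if __name__ == "__main__":
    ldo = LargeDataObject(
        object_id="angio/1997-04-03/study-001",
        metadata={"modality": "cardio-angiography", "site": "referring hospital"},
    )
    ldo.components.append(
        Component("video sequence 1", "mss://archive/angio/0001", "stage-to-cache"))
    ldo.derived_objects.append(
        Component("preview clip", "http://example.invalid/preview/0001", "http"))
    ldo.links.append(TypedLink("derived-from", "video sequence 1"))
    print(ldo.object_id, len(ldo.components), "component(s)")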
Using a shared, metropolitan area ATM network and a high-speed distributed data
handling system, video sequences are collected from the video-angiography imaging
system, then processed, catalogued, stored, and made available to remote users. This
permits the data to be made available in near-real time to remote clinics (see Figure 3).
The LDO becomes available as soon as the catalogue entry is generated; derived data is added as the processing required to produce it completes. Whether the
storage systems are local or distributed around the network is entirely a function of
optimizing logistics.
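This "catalogue first, derive later" behaviour can be sketched as follows. The sketch is illustrative only (the function names and timing are made up): the catalogue entry is registered as soon as the raw data arrives, so the LDO is immediately visible to remote users, and a background task attaches derived products when their processing finishes.

# Sketch of catalogue-first ingest: the LDO is visible as soon as the raw data
# is catalogued, and derived products are appended asynchronously later.
# Names, sleeps, and the in-memory catalogue are stand-ins, not a real system.

import threading, time

catalogue = {}                      # object_id -> list of available parts
catalogue_lock = threading.Lock()

def ingest(object_id, raw_name):
    """Register the LDO immediately; kick off derivation in the background."""
    with catalogue_lock:
        catalogue[object_id] = [raw_name]         # visible to remote users now
    threading.Thread(target=derive, args=(object_id, raw_name)).start()

def derive(object_id, raw_name):
    time.sleep(0.1)                               # stand-in for real processing
    with catalogue_lock:
        catalogue[object_id].append(raw_name + ".preview")

if __name__ == "__main__":
    ingest("angio/run-42", "video-sequence")
    print("immediately after ingest:", catalogue["angio/run-42"])
    time.sleep(0.2)
    print("after processing:        ", catalogue["angio/run-42"])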
In the Kaiser project, cardio-angiography data was collected directly from a Philips
scanner by a computer system in the San Francisco Kaiser hospital Cardiac
Catheterization Laboratory. This system is, in turn, attached to an ATM network provided
by the NTON and BAGNet testbeds. When the data collection for a patient is complete
(about once every 20-40 minutes), 500-1000 megabytes of digital video data is sent
across the ATM network to LBNL (in Berkeley) and stored first on the DPSS distributed
cache (described above), and then the WALDO object definitions are generated and made
available to physicians in other Kaiser hospitals via BAGNet.
Auxiliary processing and archiving to one or more mass storage systems proceeds
independently. This process goes on 8-10 hours a day.
In a related experiment, described next, a sustained 57 megabytes/sec of data was delivered from datasets in the distributed cache to the memory of a remote application, ready for the analysis algorithms to commence operation. This experiment is an example of our data intensive computing model in operation.
The prototype application was the STAR analysis system that analyzes data from
high energy physics experiments [3]. A four-server DPSS located at LBNL
was used as a prototype front end for a high-speed mass storage system. A 4-CPU Sun E4000 located at SLAC was a prototype for a physics data analysis computing cluster, as
shown in Figure 2. The National Transparent Optical Network testbed (NTON; see [7])
connects LBNL and SLAC and provided a five-switch, 100-km, OC-12 ATM path. All
experiments were application-to-application, using TCP transport.
Multiple instances of the STAR analysis code read data from the DPSS at LBNL
and moved that data into the memory of the STAF application where it was available to
the analysis algorithms. This experiment resulted in a sustained data transfer rate of 57
MBytes/sec from DPSS cache to application memory. This is the equivalent of about 4.5
TeraBytes/day. The goal of the experiment was to demonstrate that high-speed mass
storage systems could use distributed caches to make data available to the systems
running the analysis codes. The experiment was successful, and the next steps will
involve completing the mechanisms for optimizing the MSS staging patterns and
completing the DPSS interface to the bit file movers that interface to the MSS tape
drives.
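The transfer pattern of that experiment - several readers pulling blocks concurrently from different cache servers and handing them to the analysis code - can be sketched as below. The server names, block size, and queue-based hand-off are assumptions made for the example; no real DPSS or STAF interfaces are shown.

# Minimal sketch of the parallel transfer pattern: one reader per cache server
# feeds a shared queue that the analysis consumer drains, so delivered
# bandwidth is roughly the sum of the per-server streams. Server names, block
# size, and block counts are invented; no real DPSS or STAF interfaces appear.

import queue, threading

BLOCK_SIZE = 64 * 1024                      # assumed block size, bytes
blocks_ready = queue.Queue()

def server_reader(server_name, num_blocks):
    """One reader per cache server; in reality this would be a TCP stream."""
    for _ in range(num_blocks):
        blocks_ready.put((server_name, b"\0" * BLOCK_SIZE))
    blocks_ready.put((server_name, None))   # end-of-stream marker

def analysis_consumer(num_servers):
    """Consume blocks as they arrive; stands in for the analysis algorithms."""
    finished, total_bytes = 0, 0
    while finished < num_servers:
        server, data = blocks_ready.get()
        if data is None:
            finished += 1
        else:
            total_bytes += len(data)
    print(f"received {total_bytes / 1e6:.1f} MB from {num_servers} servers")

if __name__ == "__main__":
    servers = ["cache0", "cache1", "cache2", "cache3"]   # four servers, as in the test
    for name in servers:
        threading.Thread(target=server_reader, args=(name, 100)).start()
    analysis_consumer(len(servers))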
Conclusion
We believe this architecture, and its integration with systems like Globus, will enable the
next generation of configurable, distributed, high-performance, data-intensive systems;
computational steering; and integrated instrument and computational simulation. We also
believe that a high-performance network cache system such as the DPSS will be an important component of these computational grids and metasystems.
References
[1] DPSS, The Distributed Parallel Storage System, http://www-didc.lbl.gov/DPSS/
[2] Globus, The Globus Project, http://www.globus.org/
[3] Greiman, W., W. E. Johnston, C. McParland, D. Olson, B. Tierney, C. Tull, High-Speed Distributed Data Handling for HENP, Computing in High Energy Physics, April,
1997. Berlin, Germany. http://www-itg.lbl.gov/STAR/
[4] Grimshaw, A., A. Ferrari, G. Lindahl, K. Holcomb, Metasystems, Communications of the ACM, November, 1998, Vol. 41, No. 11.
[5] Foster, I., C. Kesselman, eds., The Grid: Blueprint for a New Computing
Infrastructure, Morgan Kaufmann, publisher. August, 1998.
[6] Fuller, B., and I. Richer, The MAGIC Project: From Vision to Reality, IEEE Network, May, 1996, Vol. 10, No. 3. http://www.magic.net/
[7] NTON, National Transparent Optical Network Consortium. See http://www.ntonc.org/.
[8] Johnston, W., G. Jin, C. Larsen, J. Lee, G. Hoo, M. Thompson, B. Tierney, J.
Terdiman, Real-Time Generation and Cataloguing of Large Data-Objects in Widely
Distributed Environments, International Journal of Digital Libraries - Special Issue on
Digital Libraries in Medicine. November, 1997. (Available at http://www-itg.lbl.gov/WALDO/)
[9] Thompson, M., W. Johnston, J. Guojun, J. Lee, B. Tierney, and J. F. Terdiman,
Distributed health care imaging information systems, PACS Design and Evaluation:
Engineering and Clinical Issues, SPIE Medical Imaging 1997. (Available at http://www-itg.lbl.gov/Kaiser.IMG)
[10] Tierney, B., W. Johnston, B. Crowley, G. Hoo, C. Brooks, D. Gunter, The NetLogger Methodology for High Performance Distributed Systems Performance Analysis,
Seventh IEEE International Symposium on High Performance Distributed Computing,
Chicago, Ill., July 28-31, 1998. (Available at http://www-itg.lbl.gov/DPSS/papers.html)
[11] Tierney, B., W. Johnston, J. Lee, and G. Hoo, Performance Analysis in High-Speed
Wide Area ATM Networks: Top-to-bottom End-to-end Monitoring, IEEE Network, May 1996. (Available at http://www-itg.lbl.gov/DPSS/papers)