Content Based Image Retrieval
Content Based Image Retrieval
REVIEW
A review of content-based image retrieval systems in medical applicationsclinical benets and future directions
Henning Mller* , Nicolas Michoux, David Bandon, Antoine Geissbuhler
Service of Medical Informatics, University Hospital of Geneva, Rue Micheli-du-Crest 24, 1211 Geneva 14, Switzerland
Received 22 July 2003; accepted 13 November 2003
KEYWORDS
Medical image retrieval; Content-based search; Visual information retrieval; PACS
Summary Content-based visual information retrieval (CBVIR) or content-based image retrieval (CBIR) has been one on the most vivid research areas in the eld of computer vision over the last 10 years. The availability of large and steadily growing amounts of visual and multimedia data, and the development of the Internet underline the need to create thematic access methods that offer more than simple text-based queries or requests based on matching exact database elds. Many programs and tools have been developed to formulate and execute queries based on the visual or audio content and to help browsing large multimedia repositories. Still, no general breakthrough has been achieved with respect to large varied databases with documents of differing sorts and with varying characteristics. Answers to many questions with respect to speed, semantic descriptors or objective image interpretations are still unanswered. In the medical eld, images, and especially digital images, are produced in everincreasing quantities and used for diagnostics and therapy. The Radiology Department of the University Hospital of Geneva alone produced more than 12,000 images a day in 2002. The cardiology is currently the second largest producer of digital images, especially with videos of cardiac catheterization (1800 exams per year containing almost 2000 images each). The total amount of cardiologic image data produced in the Geneva University Hospital was around 1 TB in 2002. Endoscopic videos can equally produce enormous amounts of data. With digital imaging and communications in medicine (DICOM), a standard for image communication has been set and patient information can be stored with the actual image(s), although still a few problems prevail with respect to the standardization. In several articles, content-based access to medical images for supporting clinical decision-making has been proposed that would ease the management of clinical data and scenarios for the integration of content-based access methods into picture archiving and communication systems (PACS) have been created. This article gives an overview of available literature in the eld of content-based access to medical image data and on the technologies used in the eld. Section 1 gives an introduction into generic content-based image retrieval and the technologies used. Section 2 explains the propositions for the use of image retrieval in medical practice and the various approaches. Example systems and application areas are
*Corresponding author. Tel.: +41-22-372-6175; fax: +41-22-372-8680. E-mail address: [email protected] (H. Mller). 1386-5056/$ see front matter 2004 Elsevier Ireland Ltd. All rights reserved. doi:10.1016/j.ijmedinf.2003.11.024
H. Mller et al.
described. Section 3 describes the techniques used in the implemented systems, their datasets and evaluations. Section 4 identies possible clinical benets of image retrieval systems in clinical practice as well as in research and education. New research directions are being dened that can prove to be useful. This article also identies explanations to some of the outlined problems in the eld as it looks like many propositions for systems are made from the medical domain and research prototypes are developed in computer science departments using medical datasets. Still, there are very few systems that seem to be used in clinical practice. It needs to be stated as well that the goal is not, in general, to replace text-based retrieval methods as they exist at the moment but to complement them with visual search tools. 2004 Elsevier Ireland Ltd. All rights reserved.
medical domain and will be a good starting point for research projects on medical image retrieval as useful techniques for certain sorts of images can be isolated and past errors can be avoided.
3 most all systems employ colors. Although most of the images are in the red, green, blue (RGB) color space, this space is only rarely used for indexing and querying as it does not correspond well to the human color perception. It only seems reasonable to be used for images taken under exactly the same conditions each time such as trademark images. Other spaces such as hue, saturation, value (HSV) [24,27,28] or the CIE Lab [15] and Luv [29] spaces are much better with respect to human perception and are more frequently used. This means that differences in the color space are similar to the differences between colors that humans perceive. Much effort has also been spent on creating color spaces that are optimal with respect to lighting conditions or that are invariant to shades and other inuences such as viewing position [30,31]. This allows to identify colors even under varying conditions but on the other hand information about the absolute colors is lost. In specialized elds, namely in the medical domain, absolute color or grey level features are often of very limited expressive power unless exact reference points exist as it is the case for computed tomography images. 1.2.2. Texture Partly due to the imprecise understanding and denition of what exactly visual texture actually is, texture measures have an even larger variety than color measures. Some of the most common measures for capturing the texture of images are wavelets [32,33] and Gabor lters [24,34,35] where the Gabor lters do seem to perform better and correspond well to the properties of the human visual cortex for edge detection [36,37]. These texture measures try to capture the characteristics of the image or image parts with respect to changes in certain directions and the scale of the changes. This is most useful for regions or images with homogeneous texture. Again, invariances with respect to rotations of the image, shifts or scale changes can be included into the feature space but information on the texture can get lost in this process [38]. Other popular texture descriptors contain features derived from co-occurrence matrices [3941], features based on the factors of the Fourier transform [38] and the so-called Wold features [42]. 1.2.3. Local and global features Both, color and texture features can be used on a global image level or on a local level on parts of the image. The easiest way to use regional features is to use blocks of xed size and location, so-called partitioning of the image [7,24] for local feature extraction. These blocks do not take into account
Most of these systems have a very similar architecture for browsing and archiving/indexing images comprising tools for the extraction of visual features, for the storage and efcient retrieval of these features, for distance measurements or similarity calculation and a type of graphical user interface (GUI). This general system setup is shown in Fig. 1. All shown components are described in more detail further on.
4 any semantics of the image itself. When allowing the user to choose image regions (regions of interest (ROI)) [43], to delineate objects in the image [44] or when segmenting the image into areas with similar properties [45], the locally extracted features contain more information about the image objects or underlying structures. 1.2.4. Segmentation and shape features Fully automated segmentation of images into objects itself is an unsolved problem. Even in fairly specialized domains, fully automated segmentation causes many problems and is often not easy to realize. In image retrieval, several systems attempt to perform an automatic segmentation of the images in the collection for feature extraction [21,46]. To have an effective segmentation of images using varied image databases the segmentation process has to be done based on the color and texture properties of the image regions [45]. Much has also been written on medical image segmentation with respect to browsing image repositories [47,48]. After segmentation, the resulting segments can be described by shape features that commonly exist, including those with invariances with respect to shifts, rotations and scaling [49,50]. 1.2.5. Semantics? All these visual features, and even features derived from segmented regions, are still fairly low-level compared to high level concepts that are contained in the images. They do not necessarily correspond to objects in the images or the semantic concepts or structures that a user is interested in. Several articles speak of semantic or cognitive image retrieval [5154] but in the end this has not yet been realized with visual features alone. It often comes down to connecting visual low-level features with textual high level features which has already been proposed in [55] as early as 1996. The annotation of image collections for retrieval or for the combination with visual features for retrieval is another very active research area [26,56]. Many problems such as the subjectiveness of annotations need to be addressed even when working with restricted vocabularies. The users annotations do not only vary between persons, they are also varying in time for the same person and they depend strongly on the users actual search tasks. However, in the medical domain, good annotated atlases of medical images do exist that contain objective knowledge, for example based on the images of the visible human.9 The denition of visual similarity or relevance with
http://www.nlm.nih.gov/research/visible/visible/human. html.
9
H. Mller et al. respect to visual similarity are also philosophical questions that have been discussed for a long time [57].
Content-based image retrieval systems of pattern recognition methods and various comparison techniques is given in a very good review article [187]. This article describes the feature extraction, selection, feature space reduction techniques that are equally important in the image retrieval domain.
5 three-dimensional representation of the similarity space as well as the El Nio system, where the user moves images together into clusters that (s)he thinks are similar. The correlation across various media (text, image, video, audio) should also not be forgotten if these are available. Whenever additional information is available such as annotations of the images, it should be used for the retrieval.
H. Mller et al. tained in the DICOM headers and many imaging devices are DICOM-compliant at this time, there are still some problems. DICOM headers have proven to contain a fairly high rate of errors, for example for the eld anatomical region, error rates of 16% have been reported [105]. This can hinder the correct retrieval of all wanted images. Clinical decision support techniques such as case-based reasoning [106] or evidence-based medicine [107,108] can even produce a stronger need to retrieve images that can be valuable for supporting certain diagnoses. It could even be imagined to have image-based reasoning (IBR) as a new discipline for diagnostic aid. Decision support systems in radiology [109] and computer-aided diagnostics for radiological practice as demonstrated at the Radiological Society of North America (RSNA) [110] are on the rise and create a need for powerful data and meta-data management and retrieval. The general clinical benet of imaging systems has also already been demonstrated in [111]. In [112] an initiative is described to identify important tasks for medical imaging based on their possible clinical benets. It needs to be stated that the purely visual image queries as they are executed in the computer vision domain will most likely not be able to ever replace text-based methods as there will always be queries for all images of a certain patient, but they have the potential to be a very good complement to text-based search based on their characteristics. Still, the problems and advantages of the technology have to be stressed to obtain acceptance and use of visual and text-based access methods up to their full potential. A scenario for hybrid, textual and visual queries is proposed in the CBIR2 system [113]. Besides diagnostics, teaching and research especially are expected to improve through the use of visual access methods as visually interesting images can be chosen and can actually be found in the existing large repositories. The inclusion of visual features into medical studies is another interesting point for several medical research domains. Visual features do not only allow the retrieval of cases with patients having similar diagnoses but also cases with visual similarity but different diagnoses. In teaching, it can help lecturers as well as students to browse educational image repositories and visually inspect the results found. This can be the case for navigating in image atlases.12 It can also be used to cross-correlate visual and textual features of the images.
12
Fig. 2 A screenshot of a typical image retrieval system showing retrieved images similar to an example in a web browser interface.
inclusion into various applications [101103], often without any implementation. Still, for a real medical application of content-based retrieval methods and the integration of these tools into medical practice a very close cooperation between the two elds is necessary for a longer period of time and not simply an exchange of data or a list of the necessary functionality. An interface of a typical content-based retrieval system is shown in Fig. 2. The interface shows the images retrieved with their similarity score to an example image. The user can then mark images as relevant, non-relevant or leave them as neutral, change the parameters for retrieval and start a new query for renement.
http://www.loni.ucla.edu/MAP/index.html.
Fig. 3 The basic position of a PACS within the information system environment in a hospital.
and [114] is another very early article on the architecture of a distributed multimedia database. [127] describes an active index for medical image data management, and in [116] a newer image management environment is described. In [118,119], two frameworks for image management and retrieval are described focusing on technical aspects and stating application areas. One of the few frameworks with at least a partial implementation is the image retrieval in medical applications (IRMA) framework [100,128] that allows for a relatively robust classication of incoming images into anatomical regions, modality and the taken orientation. This project also developed a classication code for medical images based on four axes (modality, body orientations, body region, biological system) to uniquely classify medical images and allow to test and measure the performance of classication [129]. The use of content-based techniques has been proposed several times in a PACS environment. PACS are the main software components to store and access the large amount of visual data used in medical departments. Often, several-layer architectures exist for quick short-term access and slow long-term storage. More information on PACS can be found in [130]. A web-based PACS architecture is proposed in [131]. The general schema of a PACS system within the hospital is shown in Fig. 3. The Integrating the Healthcare Enterprise (IHE)13 standard is aiming at data integration in healthcare including all the systems described in Fig. 3. An indexing of the entire PACS causes problems with respect to the sheer amount of data that needs to be processed to efciently allow access by content to all the images. This issue of the amount of data that needs to be indexed is not discussed in any of the articles. [122] proposes to use content-based image retrieval techniques in a PACS system as a search method but no implementation
13
http://www.rsna.org/IHE/index.shtml.
8 details are given. In [120] an integration into the PACS is described that uses the text attached to the images as content. More on this IDEM project can be found at14 [123] proposes an extension to the database management system for integrating content-based queries based on simple visual features into PACS systems. A classication of systems is given in [121] proposing an integration into the PACS, but no implementation details are stated in the text. A coupling of a PACS and an image classication system is given in [85]. Here, it is possible to search for certain anatomic regions, modalities or views of an image. A simple interface for coupling the PACS and the image retrieval system is stated as well. The identication is based on the DICOM unique identier (UIDs) of the images. Still, there is lack of publications describing the integration of image retrieval into the workow in a medical institution and visual knowledge management in a learning institution has not been the subject of publications either. Besides the use directly within a PACS system or very general image database environment, content-based image retrieval has also been used or proposed in a couple of specialized collections. In [92], CBIR is proposed in the context of a case database containing images and attached case descriptions. [124] describes the use in a medical reference database and [132] the use within a teaching le assistant. An object-oriented approach to store and access medical databases is given in [126]. But it remains unclear what kind of visual features are supposed to be used. In [133] an online pathology atlas uses the search-by-similarity paradigm. Decision support systems are another application of content-based medical image retrieval [134]. In [135] access-control models for content-based retrieval are discussed. It can be seen that the number and sort of applications is large and diverse, and the techniques used or proposed for an implementation contain a variety almost as large as for general image retrieval.
H. Mller et al. A categorization of images from various departments has been described in [54,100]. A classication of dermatologic images is explained in [75,136,137]. Cytological specimens have already been described very early (in 1986, [138]) and also later on [139] whereas the search for 3D cellular structures followed later on [96]. Pathology images have often been proposed for content-based access [43,140] as the color and texture properties can relatively easy be identied. The tasks of a pathologist when searching for reference cases also supports the use of an image retrieval system instead of only reference books. The use with tuberculosis smears is described in [141]. An application with histopathologic images is described in [142] and histologic images are analyzed in [134,143,144]. Within cardiology, CBIR has been used to discover stenosis images [97]. MRIs of the heart have been used in [145]. Within the radiology department, mammographies are one of the most frequent application areas with respect to classication and content-based search [146149]. The negative psychological effects of removing tissue for false positive patients have been described of one of the principal goals to be reduced. Ultrasound images of the breast are used in [41]. Varied ultrasound images are used in [150]. Another active area is the classication of high resolution computed tomography (HRCT) scans of the lung as done by the Assert project [151,152]. A study about the diagnostic quality with and without using the system showed a signicant improvement of the diagnostic quality with using a retrieval system for nding similar cases [99]. A less sophisticated project also using HRCT lung images is described in [125,132]. A justication of use in this area is the hard decision-making task and the strong dependence of the diagnoses from texture properties. Descriptions of HRCT lung images, their visual features and their pathologies are given in [153,154]. The use of thorax radiographies is proposed in [110]. This will be an even harder task as several layers are superposed and many factors other than the pathology can inuence the visual content strongly. Many other articles use medical images to demonstrate their algorithms but a clinical evaluation of their use has rarely been done. In [53,54,155], magnetic resonance images (MRIs) of the brain are used to demonstrate the image search algorithms but the articles do not talk about any medical integration. [115,156] also use MRIs of the head for testing their algorithms. CT brain scans to classify lesions are used in [157]. The search for medical tumors by their shape properties (after segmentation) have
9 Machine learning in medical applications also gets increasingly more important and it is essential to research the various possibilities. Specialized workshops exist for this area [164].
are using these images Images used HRCTs of the lung Functional PET Spine X-rays Pathologic images CTs of the head Mammographies Images from biology Dermatology Breast cancer biopsies Varied images
been described in [147]. Functional photon emission tomography (PET) images for retrieval are used in [158]. Spine X-rays are used in [113,159]. Table 1 shows an overview of several image types and the systems that are used to retrieve these images.
10 an image denition language [126] for the querying of images in image repositories. [167,168] uses text from radiology reports to transform it into concepts in the UMLS metathesaurus to then retrieve the images. The use of text for queries is undeniable efcient but the question is whether this can really be called content-based queries as the text does not necessarily dene the image content. It rather puts the images into the context they have been taken in, so it should maybe called context-based queries as dened in [67]. The combination of textual with visual features or content and context of the images does have the most potential to lead to good results [113]. One can also be used to control the quality of the other or to obtain a better recall of the retrieval results. Besides the free text that is frequently used for retrieval, medical patient records also contain very valuable structured information such as age, sex and profession of the patient. This information is just as important as free text to put the images into a context. 3.1.3. Visual features Unfortunately, most articles that propose contentbased queries do not explain in detail which visual features have been used or are planned to be used. Sometimes, only a very vague description such as general texture and color or grey level features are given as in [54,127,169]. Basically all systems that do give details use color and grey level features, mostly in the form of a histogram [134,143,150,151]. Local and global grey level features are used in [170]. [100,128] use statistical distributions of grey levels for the classication of images and [122] proposes a brightness histogram. As many of the images in the medical domain do not contain colors or are taken under controlled conditions, the color properties are not at all in the center of research and the same holds for invariants to lighting conditions. This can change when using photographs such as in dermatology. Pathologic images will need to be normalized in some way as different staining methods can produce different colors [171]. Within radiology, the normalization of grey levels between different modalities or even for the same modality can cause problems when there is no exact reference point as is for the density of the CT, for example. [172] illustrates the dependency of intensity values of the brain from the used modalities. As color and grey level features are of less importance in medical images than in stock photography, the texture and shape features gain in importance. Basically all of the standard techniques for texture characterization are used from edge detection
H. Mller et al. using Canny operators [141] to Sobel descriptors [151]. [113,139,151] also use Fourier descriptors to characterize shapes, [113,123,139] use invariant moments and [113] also scale-space ltering. Features derived from co-occurrence matrices are also frequently used [96,115,150,151], as well as responses of Gabor lters [134,143,170], wavelets [140,150] and Markov texture characteristics [139]. In mammography, denseness is used for nding small nodules [148]. It would be interesting to have a comparison of several texture descriptors. Many of them model the same information and will most likely deliver very similar results. In connection with segmentation, the shape of the segments can be used as a powerful feature. Again, often the exact nature of the shape features is not described [115] which makes it impossible to dene what exactly had been used. In [145] no segmentation has been done for the acquisition of shape features but computer-assisted outlining. The segmentation of pathologic images is described in [140]. In [156] even shape descriptors for 3D structures using modal modeling are described. Most common shape descriptors are Fourier descriptors [43,132,141] that easily allow to obtain invariant descriptions. The pattern spectrum is proposed in [147] and morphological features are used in [147]. Using segments in the images also allows to use spatial relationships as visual descriptors of the images. This is often proposed [114,116,121,169,173] but rarely any detail is given on how to obtain the objects/segments in the images, which does not permit to judge whether an implementation is possible. Another article not taking into account the problems of automatic segmentation is [116]. The use of Eigenimages for the retrieval of medical images in analogy to Eigenfaces for face recognition is proposed in [74,124]. These features can be used for classication when a number of images for each class exist. Still, the features are purely statistical and it is hard to actually explain the similarity of two images based on these features which can more easily be done for a histogram intersection, for example. In [121], signatures of the manually segmented objects of the images are proposed to reduce the list of resulting images. It is hard to say whether these features can count as visual features as they are not extracted automatically but based on semiautomatic segmentations and marking of the segments. Tissue timeactivity curve (TTAC) curves for the retrieval of PET images are used in [158]. These are not really image features but rather 1D
Content-based image retrieval systems temporal signals that are compared. However, the results seem to be good. Similar to general CBIR, semantic features are proposed for visual similarity queries with medical images [143,144]. But again, it comes down to simple textual labels attached to the images and a mapping between the text and the low-level features. A project for automatically attaching semantic labels to images or regions is described in [134] and in ProjetImage.15
11 more than 25,000 images in [92] is the spectrum of the articles analyzed for this review. Often, the images are pre-processed into sometimes fairly small blocks (18 12, [166], 64 64, [134] and 256 256, [170]) before the visual features are extracted. In some cases, even a reduction to 32 32 pixels has proven not to inuence the quality of the results compared with using the original size [100]. Some systems use pre-processing to remove artifacts from the image or to improve image quality such as the removal of hairs from dermatologic images [75]. Unfortunately, most of the larger databases such as [124] containing 10,000 MRI images, [116,173] containing 13,500 CT and MRI images and [147] using 1,000 tumor shapes only use simulated images. Although these simulated images are easy and cheap to obtain, their use for any qualitative assessments is more than questionable. On the other hand, only [121] uses a very large database containing 22,000 images of a PACS but without any further assessment of image categories and qualities and without an evaluation. [159] uses 17,000 spinal X-ray images as the basis of their research [92] proposes even more images, but here as well, no content-based access mechanisms are implemented as of yet. An interesting approach to obtain a large database is taken in [54], where 2000 images from freely available medical image databases on the web are taken. A database containing more than 8000 varied medical images is available free of charge from the casimage webpage or can be ordered from the author of this article.16 [123] uses a varied set of 4247 medical images. The other, often specialized image collections for content-based retrieval are unfortunately sometimes too small for delivering any statistically signicant measurements: [158] uses 15 PET studies, [149] 41 biopsy slides, [157] 48 brain CTs, [167] 50 varied images with radiology reports, [141] 65 smears for tuberculosis identication, [145] 85 MRI images, [74] 100 axial brain images, [75] 100 dermatologic images, [43] 261 cell images, [41] 263 ultrasound breast images, [132] 266 CT images and [96] 300 cell images. 312 HRCTs of the lung are used in [151,152], 345 liver disorders in [175], 404 biopsy proven mammography masses in [148] and 749 dermatological images in [136]. Almost as interesting as the image database itself is the question of how to choose query topics and then how to assess relevance for the query topics. The subject of relevance alone can ll several books [176,177]. This is relatively easy for simulated images as there is a model plus some added
16
http://perso-iti.enst-bretagne.fr/brunet/Boulot/ ProjetImage/ProjetImage.html.
http://www.casimage.com/.
12 noise and the noise level basically determines the measured quality of retrieval. Simulated images are consequently only usable for showing efciency of an algorithm using large image repositories. Nothing can really be said about retrieval quality when using simulated images. For the future, it is extremely important that image databases are made available free of charge and/or copyright for the comparison and verication of algorithms. Only such reference databases allow to compare systems and to have a reference for the evaluation that is done based on the same images. Some medical image collections are freely available on the Internet.17 18 19 20 An important effort is underway by the European Federation of Medical Informatics (EFMI) in a working group on medical image processing21 to generate reference databases and identify important medical imaging tasks [112].
H. Mller et al. In medical statistics commonly used measurements are sensitivity and specicity dened as follows: sensitivity = specicity = positive items classied as pos. all positive items negative items classied as neg. all negative items (1) (2)
Systems that use sensitivity and specicity include [41,136,141]. These values can also be presented in the form of a ROC curve which contains much more information and is done in [136,157]. As many of the presented systems use classications of images, accuracy is very often used to evaluate the system [96,100,125,141,143,146]. This can be dened as follows: accuracy = items classied correctly all items classied (3)
Still, it has to be kept in mind that content-based retrieval systems are not mainly being employed for classication of the images but for nding similar images or cases. This is often more helpful as the practitioner must still judge the retrieved cases and the reasons for retrieving the images are often clearer whereas classication results are sometimes hard to detail and need to be explained. Only rarely are measurements used that are common to the domains of information retrieval [180] or content-based image retrieval [179] such as precision and recall dened as follows: precision = recall = number of relevant items retrieved (4) number of items retrieved (5)
In [140], for example, the precision after 50 images are retrieved is measured to describe the system performance [123] mentions precision and recall for the evaluation but then, does not use it. [116] uses the precision at ve different cutoff points. These data are incomplete and hard to interpret as little is known about the number of relevant images and thus on the difculty of the query task. Much better is the use of a precision vs. recall graph that puts the two values on the axis of a graph as in [147]. Another rarely mentioned evaluation parameter is the speed of the system which is very important for an interactive system. In [123] it is only mentioned that the speed is reduced from hours to minutes for a set of 4000 images which is completely insufcient for an interactive system where response times should be around one second at a maximum.
Content-based image retrieval systems This list with few in depth evaluations shows that evaluation is very often neglected in medical image retrieval. It is extremely important and crucial for the success of this technology. Measurement parameters need to show the usefulness of an application and the possible impact that an application of the method can have. Such an evaluation does not only contain the validation of a technology which is commonly evaluated with measures such as specicity and sensitivity but also the inclusion of human factors into the process such as usability issues and acceptance of the technology [178], which can be obtained through real user tests. Finally, it will be interesting to evaluate the clinical impact of the application when it is used in real clinical practice. Are these technologies able to reduce the length of stay of patients or do they manage to reduce the use of human resources for the patient care? Studies on clinical effects of image retrieval technologies might still be a distance away but there are several necessities that can be done at the moment such as the denition of standard databases that are freely available, the denition of query topics for these databases including the creation of a gold standard or ground truth for these topics. This can, in the long run, make way for real clinical studies once the general retrieval performance is proven.
13 Another domain not discussed at all for medical images are the user interfaces. Sometimes web-based interfaces are proposed [170,182] but no comparison of interfaces is reported and no real usability studies have been published to the authors knowledge so far. As there are several creative solutions in image retrieval it will be interesting to study the effects of interfaces, ergonomics and usability issues on the acceptance and use of the technology in clinical practice. Performance comparisons for different feature sets have also never been performed and are important to identify well-performing visual features and the applications that they can successfully be used for. This would help a great deal to start new projects in the domain and also to optimize existing systems.
14 available for medical students for educational purposes. Content-based techniques allow browsing databases and then comparisons of diagnoses of visually similar cases. Especially for Internet-based teaching, this can offer new possibilities. As most of the systems are based on Internet technologies this does not cause any implementation problems. Research can also benet from visual retrieval methods. Researchers have more options for the choice of cases to include into research and studies by allowing text-based and visual access. It can also be imagined that by including visual features directly into medical studies, new correlations between the visual nature of a case and its diagnosis or textual description could be found. Visual data can also be mined to nd changes or interesting patterns which can lead to the discovery of new knowledge by combining the various knowledge sources. Finally, diagnostics will be the hardest but most important application for image retrieval. To be used as a diagnostic aid, the algorithms need to prove their performance and they need to be accepted by the clinicians as a useful tool. This also implies an integration of the systems into daily clinical practice which will not be an easy task. It is often hard to change the methods that people are used to, condence needs to be won. For domains such as evidence-based medicine or case-based reasoning it is essential to supply relevant, similar cases for comparison. Such retrieval will need special visual features that model the visual detection of an MD using as much domain knowledge as possible. Images are normally taken for a very specic reason and this needs to be modeled. There are two principal ideas for supporting the clinical decision-making process. The rst one is to supply the medical doctor with cases that offer a similar visual appearance. This can supply a second opinion for the MD and (s)he can perform the reasoning based on the various cases that are supplied by the system and the data that is available on the current patient. Another idea is the creation of databases containing normal (non-pathologic) cases and compare the distance of a new case with the existing cases doing thus dissimilarity retrieval as opposed to similarity retrieval (distance to normality). This is even more natural compared to the normal workow in medicine where the rst requirement is to nd out whether the case is pathologic or not. A tumor or fracture are such differences from normal cases, for example. A dissimilarity could be combined with highlighting regions in the image where the strongest dissimilarity occurred. Such a technique can help to nd cases that might otherwise be
H. Mller et al. missed. A combination of the two approaches is also possible where rstly, the requirement is whether the image contains abnormalities and if it does, a query to nd similar cases is done with another image database containing the pathologic cases. High quality annotation/codication is a problem not only in radiology but also in other medical departments. Good annotation and codication takes time and experience that is unfortunately sometimes not available in medical routine. Much research is done on natural language processing techniques to extract diagnoses from the patient record [183] and many tools exist to ease the coding, for example for the American College of Radiology (ACR) codes22 in radiology.23 When large databases of correctly coded images are available, image retrieval systems can be used for semi-automatic coding by retrieving visually similar cases and proposing the codes of the images from the database. Studies will need to prove the quality of the coding but time can be saved even when a medical doctor only has to control the codes that the system is proposing. Retrieval methods can also be used as simple tools to have a quality control on the DICOM headers. The combination of textual and visual attributes denitely promises the best results. In principle, all image-producing departments can prot from content-based technologies but there are some departments and some sorts of images that seem to stand out as textures and colors do play an important role for the diagnostics. Color and texture features are normally easy to index with current retrieval systems. This includes Pathology where microscopic images are analyzed and the clinical decision-making depends on the color changes and textures within the images. Many books with example images for typical or hard cases exist and it is relatively easy to provide these books in a digital form and search for them not only based on text or hierarchies but also based on the visual content. Care needs to be taken with respect to different staining methods. Images need to be normalized with respect to that [171]. Hematology already contains a large number of tools to automatically count blood cells but an interesting application would be the classication of abnormal white blood cells and the comparison of diagnoses between a new case and cases with similar abnormalities stored in the databases. Dermatology already has classication applications for potential melanoma cases that work fairly
22 23
http://www.acr.org/. http://www.casimage.com/ACR.html.
Content-based image retrieval systems well. Content-based access can help to make understand the decision of an expert system to the practitioner. Within the Radiology department there are a number of possible applications that can deliver good results. For HRCTs of the lung, computer-based tools have already been proven to help in the diagnostics process and diagnostics in this case are fairly difcult. Three-dimensional retrieval can also help to retrieve tumor forms and to classify observed tumors. As a tool for the use in PACS systems, a large number of people can prot from the methods to retrieve similar cases for a number of applications, often without realizing that the results come from a content-based retrieval engine.
15 images and the linked case information are stored to allow for retrieval methods based on structured information, free text and the visual image content. Of course, evaluation of the retrieval quality is an extremely important topic as well. Research will need to focus on the development of open test databases and query topics plus dened gold standards for the images to be retrieved. Retrieval systems need to be compared to identify good techniques. This can advance the eld much more than any single technique developed so far. But evaluation also needs to go one step further and prepare eld studies on the use and the inuence of retrieval techniques on the diagnostic process. So far, only one study on the impact of image retrieval system on the diagnostics of HRCT images of the lung has been published and shows a signicant improvement in diagnostic quality even for senior radiologists [99]. Practitioners need to give their opinion on the usability and applicability of the technologies and acceptance needs to be gained before they can be used in daily practice. Such communication with the system users can also improve the interface and retrieval quality signicantly when good feedback is delivered. User interaction and relevance feedback are two other techniques that need to be integrated more into retrieval systems as this can help to lead to much better results. Image retrieval needs to be interactive and all the interaction needs to be exploited for delivering the best possible results. Multimedia data mining will also be made possible once features of good quality are available to describe the images. This will help to nd new relationships among images and certain diseases or it will simply improve the retrieval quality of medical image search engines. Although rst applications will most likely be on large image archives for teaching and research, a specialization of the retrieval systems for promising domains such as dermatology or pathology will be necessary to include as much domain knowledge as possible into the retrieval. This will be necessary for decision-support systems such as systems for case-based reasoning. Such a specialization can be done in the easiest way with a modular retrieval system based on components where feature sets can be exchanged easily and modules for new retrieval techniques or efcient storage methods can be integrated easily. Fig. 4 shows such a component-based architecture where system parts can be changed and optimized easily. Easy plug-in mechanisms for the different components need to be dened. Besides the use of images, system developments also need to put a focus on higher-dimensional data. Already tomographic images contain three
http://mirc.rsna.org/.
16
H. Mller et al. structure and that use open standards, so data can be exchanged with other applications. It needs to become easy to integrate these new functionalities into other existing applications such as a hospital information system (HIS)/radiology information system (RIS)/PACS or other medical image management or viewing software. In this way, it will become much easier to have prototypes running for a sample of users and to get feedback on the clinical use of systems. To get acceptance, it is important to be integrated into the current applications and with interfaces that the users are familiar with. To win acceptance from the users it is also important to show the performance of the systems and to optimize the performance of systems for certain specialized tasks or people. The development of open toolboxes is another important factor for successful applications. Not only do interfaces for the communication with other applications need to be developed, also within the application it is important to stay modular, so parts and pieces can be exchanged easily. This will help to reduce the number of applications developed and will make it possible to spend more time on the important tasks of integration and development of new methods and system optimizations. It is clear that new tools and methods are needed to manage the increasing amount of visual information that is produced in medical institutions. Content-based access methods have an enormous potential when used in the correct way. It is now the time to create medical applications and use this potential for clinical decision-making, research and teaching.
Fig. 4 ment.
dimensions as do video sequences of endoscopy or ultrasound. Tools for retrieval of videos for example by motion parameters do exist for general videos [185,186] but to our knowledge do not exist specialized for the medical domain. Fast scanners also allow for the registration of 4D-data streams such as tomographic images taken over time. Combinations of modalities such as PET/CT scanners or the use of image fusion techniques also create multi-dimensional data that needs to be analyzed and retrieved. Omitting these high-dimensional informations will result in a signicant lack of knowledge.
5. Conclusion
The large number of research publications in the eld of content-based medical image retrieval especially in recent years shows that it is very active and that it is starting to get more attention. This will hopefully advance the eld as new tools and technologies will be developed and performance will increase. Content-based visual information retrieval denitely has a large potential in the medical domain. The amount of visual data produced in medical departments shows the importance of developing new and alternative access methods to complement text. Content-based methods can be used on a large variety of images and in a wide area of applications. Still, much work needs to be done to produce running applications and not only research prototypes. When looking at most current systems, it becomes clear that few to none of them are actually in routine use. An important factor is to build prototypes that are integrated with a hospital-wide communication
6. Summary
This article gives an overview of the currently available literature on content-based image retrieval in the medical domain. It evaluates after a few years of developments the need for image retrieval and presents concrete scenarios for promising future research directions. The necessity for additional, alternative access methods to the currently-used, text-based methods in medical information retrieval is detailed. This need is mainly due to the large amount of visual data produced and the unused information that these data contain, which could be used for diagnostics, teaching and research. The systems described in the literature and published propositions for image retrieval in medicine are critically reviewed and sorted by medical departments, image categories and technologies used. A short overview of nonmedical image retrieval is given as well.
Content-based image retrieval systems The lack of evaluations of the retrieval quality of systems becomes apparent along with the unavailability of large image databases free of charge with dened query topics and gold standards. However, some databases are available, from the National Institutes of Health (NIH), for example. Ideas for creating such image databases and evaluation methods are proposed. Also, several research directions for improving the retrieval quality based on the experiences from other closely related research elds are given in the paper. Possible clinical benets from the use of content-based access methods are described as well as promising elds of applications.
17
[14] M. Flickner, H. Sawhney, W. Niblack, J. Ashley, Q. Huang, B. Dom, M. Gorkani, J. Hafner, D. Lee, D. Petkovic, D. Steele, P. Yanker, Query by image and video content: the QBIC system, IEEE Comput. 28 (9) (1995) 2332. [15] W. Niblack, R. Barber, W. Equitz, M.D. Flickner, E.H. Glasman, D. Petkovic, P. Yanker, C. Faloutsos, G. Taubin, QBIC project: querying images by content, using color, texture, and shape, in: W. Niblack (Ed.), Proceedings of the of SPIE Conference on Storage and Retrieval for Image and Video Databases, vol. 1908, 1993, pp. 173187. [16] J.R. Bach, C. Fuller, A. Gupta, A. Hampapur, B. Horowitz, R. Humphrey, R. Jain, C.-F. Shu, The Virage image search engine: an open framework for image management, in: I.K. Sethi, R.C. Jain (Eds.), Proceedings of the SPIE Conference on Storage & Retrieval for Image and Video Databases IV, vol. 2670, San Jose, CA, USA, 1996, pp. 7687. [17] A. Hampapur, A. Gupta, B. Horowitz, C.-F. Shu, C. Fuller, J. Bach, M. Gorkani, R. Jain, Virage video engine, in: I.K. Sethi, R.C. Jain (Eds.), Proceedings of the of SPIE Conference on Storage and Retrieval for Image and Video Databases V, vol. 3022, 1997, pp. 352360. [18] P.M. Kelly, M. Cannon, D.R. Hush, Query by image example: the Candid approach, in: W. Niblack, R.C. Jain (Eds.), Proceedings of the of SPIE Conference on Storage and Retrieval for Image and Video Databases III, vol. 2420, 1995, pp. 238248. [19] A. Pentland, R.W. Picard, S. Sclaroff, Photobook: tools for content-based manipulation of image databases, Int. J. Comput. Vis. 18 (3) (1996) 233254. [20] W.Y. Ma, Y. Deng, B.S. Manjunath, Tools for texture- and color-based search of images, in: B.E. Rogowitz, T.N. Pappas (Eds.), Proceedings of the of SPIE Conference on Human Vision and Electronic Imaging II, vol. 3016, San Jose, CA, 1997, pp. 496507. [21] C. Carson, M. Thomas, S. Belongie, J.M. Hellerstein, J. Malik, Blobworld: a system for region-based image indexing and retrieval, in: D.P. Huijsmans, A.W.M. Smeulders (Eds.), Proceedings of the Third International Conference On Visual Information Systems (VISUAL99), no. 1614 in Lecture Notes in Computer Science, Springer-Verlag, Amsterdam, The Netherlands, 1999, pp. 509516. [22] S. Belongie, C. Carson, H. Greenspan, J. Malik, Color- and texture-based image segmentation using EM and its application to content-based image retrieval, in: Proceedings of the International Conference on Computer Vision (ICCV98), Bombay, India, 1998, pp. 675682. [23] I.J. Cox, M.L. Miller, S.M. Omohundro, P.N. Yianilos, Target testing and the PicHunter Bayesian multimedia retrieval system, in: Advances in Digital Libraries (ADL96), Library of Congress, Washington, D.C., 1996, pp. 6675. [24] D.M. Squire, W. Mller, H. Mller, T. Pun, Content-based query of image databases: in-spirations from text retrieval, Pattern Recognition Letters, vol. 21, 2000, pp. 11931198 (Selected papers from the 11th Scandinavian Conference on Image Analysis, SCIA99). [25] D.M. Squire, H. Mller, W. Mller, S. Marchand-Maillet, T. Pun, Design and evaluation of a content-based image retrieval system, in: Design & Management of Multimedia Information Systems: Opportunities & Challenges, IDEA Group Publishing, pp. 125151 (Chapter 7), 2001. [26] T. Pfund, S. Marchand-Maillet, Dynamic multimedia annotation tool, in: G. Beretta, R. Schettini (Eds.), Proceedings of the SPIE Photonics West Conference on Internet Imaging III, vol. 4672, San Jose, CA, USA, 2002, pp. 216224. [27] C. Carson, S. Belongie, H. Greenspan, J. Malik, Region-based image querying, in: Proceedings of the 1997
Acknowledgements
The authors would like to thank the reviewers for their comments that helped to improve the quality of this paper.
References
[1] S.-K. Chang, T. Kunii, Pictorial data-base applications, IEEE Comput. 14 (11) (1981) 1321. [2] P.G.B. Enser, Pictorial information retrieval, J. Document. 51 (2) (1995) 126170. [3] A. Gupta, R. Jain, Visual information retrieval, Commun. ACM 40 (5) (1997) 7079. [4] Y. Rui, T.S. Huang, S.-F. Chang, Image retrieval: past, present and future, in: M. Liao (Ed.), Proceedings of the International Symposium on Multimedia Information Processing, Taipei, Taiwan, 1997. [5] J.P. Eakins, M.E. Graham, Content-based image retrieval, Tech. Rep. JTAP-039, JISC Technology Application Program, Newcastle upon Tyne, 2000. [6] C.C. Venters, M. Cooper, Content-based image retrieval, Tech. Rep. JTAP-054, JISC Technology Application Program, 2000. [7] A.W.M. Smeulders, M. Worring, S. Santini, A. Gupta, R. Jain, Content-based image retrieval at the end of the early years, IEEE Trans. Pattern Anal. Machine Intel. 22 (12) (2000) 13491380. [8] H. Mller, User interaction and performance evaluation in content-based visual information retrieval, Ph.D. thesis, Computer Vision and Multimedia Laboratory, University of Geneva, Geneva, Switzerland, June 2002. [9] J.R. Smith, Integrated spacial and feature image systems: retrieval, compression and analysis, Ph.D. thesis, Graduate School of Arts and Sciences, Columbia University, 2960 Broadway, New York, NY, USA, 1997. [10] A. del Bimbo, Visual Information Retrieval, Academic Press, New York, 1999. [11] S.M. Rahman, Design & Management of Multimedia Information Systems: Opportunities & Challenges, Idea Group Publishing, London, 2001. [12] L.H.Y. Tang, R. Hanka, H.H.S. Ip, A review of intelligent content-based indexing and browsing of medical images, Health Informatics J. 5 (1999) 4049. [13] N.-S. Chang, K.-S. Fu, Query-by-pictorial-example, IEEE Trans. Software Eng. SE 6 (6) (1980) 519524.
18
IEEE Conference on Computer Vision and Pattern Recognition (CVPR97), IEEE Computer Society, San Juan, Puerto Rico, 1997, pp. 4251. J.R. Smith, S.-F. Chang, Visualseek: a fully automated content-based image query system, in: Proceedings of the Fourth ACM International Multimedia Conference and Exhibition, Boston, MA, USA, 1996. S. Sclaroff, L. Taycher, M. La Cascia, ImageRover: a content-based browser for the world wide web, in: IEEE Workshop on Content-Based Access of Image and Video Libraries, San Juan, Puerto Rico, 1997, pp. 29. T. Gevers, A.W.M. Smeulders, A comparative study of several color models for color image invariants retrieval, in: Proceedings of the First International Workshop ID-MMS96, Amsterdam, The Netherlands, 1996, pp. 1726. J.-M. Geusebroek, R. van den Boogaard, A.W.M. Smeulders, H. Geerts, Color invariance, IEEE Trans. Pattern Anal. Machine Intel. 23 (12) (2001) 13381350. M. Ortega, Y. Rui, K. Chakrabarti, K. Porkaew, S. Mehrotra, T.S. Huang, Supporting ranked boolean similarity queries in MARS, IEEE Trans. Knowledge Data Eng. 10 (6) (1998) 905925. J. Ze Wang, G. Wiederhold, O. Firschein, S. Xin Wei, Wavelet-based image indexing techniques with partial sketch retrieval capability, in: Proceedings of the Fourth Forum on Research and Technology Advances in Digital Libraries, Washington D.C., 1997, pp. 1324. W. Ma, B. Manjunath, Texture features and learning similarity, in: Proceedings of the 1996 IEEE Conference on Computer Vision and Pattern Recognition (CVPR96), San Francisco, California, 1996, pp. 425430. S. Santini, R. Jain, Gabor space and the development of preattentive similarity, in: Proceedings of the 13th International Conference on Pattern Recognition (ICPR96), IEEE, Vienna, Austria, 1996, pp. 4044. J.G. Daugman, An information theoretic view of analog representation in striate cortex, Computat. Neurosci. 2 (1990) 918. J.G. Daugman, High condence visual recognition of persons by a test of statistical independence, IEEE Trans. Pattern Anal. Machine Intel. 15 (11) (1993) 11481161. R. Milanese, M. Cherbuliez, A rotation, translation and scale-invariant approach to content-based image retrieval, J. Visual Commun. Image Represent. 10 (1999) 186196. J.S. Weszka, C.R. Dyer, A. Rosenfeld, A comparative study of texture measures for terrain classication, IEEE Trans. Sys. Man Cybernetics 6 (4) (1976) 269285. C.-R. Shyu, C.E. Brodley, A.C. Kak, A. Kosaka, A.M. Aisen, L.S. Broderick, ASSERT: A physician-in-the-loop content-based retrieval system for HRCT image databases, Comput. Vis. Image Understand. 75 (12) (1999) 111132. W.-J. Kuo, R.-F. Chang, C.C. Lee, W.K. Moon, D.-R. Chen, Retrieval technique for the diagnosis of solid breast tumors on sonogram, Ultrasound Med. Biol. 28 (7) (2002) 903909. C.-S. Lu, P.-C. Chung, Wold features for unsupervised texture segmentation, in: Proceedings of the 14th International Conference on Pattern Recognition (ICPR98), IEEE, Brisbane, Australia, 1998, pp. 16891693. D. Comaniciu, P. Meer, D. Foran, A. Medl, Bimodal system for interactive indexing and 18 retrieval of pathology images, in: Proceedings of the Fourth IEEE Workshop on Applications of Computer Vision (WACV98), Princeton, NJ, USA, 1998, pp. 7681.
H. Mller et al.
[44] S.T. Perry, P.H. Lewis, A novel image viewer providing fast object delineation for content based retrieval and navigation, in: I.K. Sethi, R.C. Jain (Eds.), Proceedings of the of SPIE Conference on Storage and Retrieval for Image and Video Databases VI, vol. 3312, 1997. [45] A. Winter, C. Nastar, Differential feature distribution maps for image segmentation and region queries in image databases, in: IEEE Workshop on Content-based Access of Image and Video Libraries (CBAIVL99), Fort Collins, Colorado, USA, 1999, pp. 917. [46] L. Lucchese, S.K. Mitra, Unsupervised segmentation of color images based on -means clustering in the chromaticity plane, in: IEEE Workshop on Content-based Access of Image and Video Libraries (CBAIVL99), Fort Collins, CO, USA, 1999, pp. 7478. [47] S. Ghebreab, Strings and necklaces on learning and browsing medical image segmentations, Ph.D. thesis, Faculty of computer science, University of Amsterdam, Amsterdam, The Netherlands, July 2002. [48] R.J. Lapeer, A.C. Tan, R. Aldridge, A combined approach to 3D medical image segmentation using marker-based watersheds and active contours: The active watershed method, in: T. Dohi, R. Kikin (Eds.), International Conference on Medical Image Computing and Computer-Assisted Interventions (MICCAI 2002), no. 2488 in Lecture Notes in Computer Science, Springer-Verlag, Tokyo, Japan, 2002, pp. 596603. [49] S. Loncaric, A survey of shape analysis techniques, Pattern Recog. 31 (8) (1998) 9831001. [50] R.C. Veltkamp, M. Hagedoorn, State-of-the-art in shape matching, in: Principles of Visual Information Retrieval, Springer, Heidelberg, 2000, pp. 87119. [51] D. Dori, Cognitive image retrieval, in: A. Sanfeliu, J.J. Villanueva, M. Vanrell, R. Alczar, J.-O. Eklundh, Y. Aloimonos (Eds.), Proceedings of the 15th International Conference on Pattern Recognition (ICPR 2000), IEEE, Barcelona, Spain, 2000, pp. 4245. [52] J.Z. Wang, J. Li, G. Wiederhold, Simplicity: semanticssensitive integrated matching for picture libraries, IEEE Trans. Pattern Anal. Machine Intel. 23 (9) (2001) 117. [53] Y. Liu, A. Lazar, W.E. Rothfus, M. Buzoiano, T. Kanade, Classication-driven feature space reduction for semantic-based neuroimage retrieval, in: Proceedings of the International Symposium on Information Retrieval and Exploration in Large Medical Image Collections (VISIM 2001), Utrecht, The Netherlands, 2001. [54] A. Mojsilovis, J. Gomes, Semantic based image categorization, browsing and retrieval in medical image databases, in: IEEE International Conference on Image Processing (ICIP2000), Rochester, NY, USA, 2000. [55] T.P. Minka, R.W. Picard, Interactive learning using a society of models, in: Proceedings of the 1996 IEEE Conference on Computer Vision and Pattern Recognition (CVPR96), San Francisco, California, 1996, pp. 447 452. [56] C. Jrgensen, Retrieving the unretrievable in electronic imaging systems: emotions, themes and stories, in: B. Rogowitz, T.N. Pappas (Eds.), Proceedings of the of SPIE Photonics West Conference on Human Vision and Electronic Imaging IV, vol. 3644, San Jose, California, USA, 1999. [57] K. Koffka, Principles of Gestalt Psychology, Lund Humphries, London, 1935. [58] A.K. Jain, A. Vailaya, Image retrieval using color and shape, Pattern Recog. 29 (8) (1996) 12331244. [59] A. Tversky, Features of similarity, Psychol. Rev. 84 (4) (1977) 327352.
[28]
[29]
[30]
[31] [32]
[33]
[34]
[35]
[39] [40]
[41]
[42]
[43]
19
[75] P. Schmidt-Saugeon, J. Guillod, J.-P. Thiran, Towards a computer-aided diagnosis system for pigmented skin lesions, Comput. Med. Imag. Graphics 27 (2003) 65 78. [76] T. Kmpfe, T.W. Nattkemper, H. Ritter, Combining independent component analysis and self-organizing maps for cell image classication, in: B. Radig, S. Florczyk (Eds.), Pattern Recognition: DAGM Symposium 2001 Lecture Notes in Computer Science 2191, Munich, Germanny, 2001, pp. 262268. [77] J.J. Rocchio, Relevance feedback in information retrieval, in: The SMART Retrieval System, Experiments in Automatic Document Processing, Prentice Hall, Englewood Cliffs, NJ, USA, 1971, pp. 313323. [78] H. Mller, W. Mller, D.M. Squire, S. Marchand-Maillet, T. Pun, Strategies for positive and negative relevance feedback in image retrieval, in: A. Sanfeliu, J.J. Villanueva, M. Vanrell, R. Alczar, J.-O. Eklundh, Y. Aloimonos (Eds.), Proceedings of the 15th International Conference on Pattern Recognition (ICPR 2000), IEEE, Barcelona, Spain, 2000, pp. 10431046. [79] Y. Rui, T.S. Huang, M. Ortega, S. Mehrotra, Relevance feedback: a power tool for interactive content-based image retrieval, IEEE Trans.Circuits Sys. Video Technol. 8 (5) (1998) 644655 (special issue on segmentation, description, and retrieval of video content).. [80] Y. Rui, T.S. Huang, S. Mehrotra, Relevance feedback techniques in interactive content-based image retrieval, in: I.K. Sethi, R.C. Jain (Eds.), Proceedings of the SPIE Conference on Storage and Retrieval for Image and Video Databases VI, vol. 3312, 1997, pp. 2536. [81] M. Worring, A.W.M. Smeulders, S. Santini, Interaction in content-based image retrieval: An evaluation of the state of the art, in: R. Laurini (Ed.), Proceedings of the Fourth International Conference On Visual Information Systems (VISUAL2000), no. 1929 in Lecture Notes in Computer Science, Springer-Verlag, Lyon, France, 2000, pp. 2636. [82] H. Mller, D.M. Squire, T. Pun, Learning from user behavior in image retrieval: application of the market basket analysis, Int. J. Comput. Vis. (special issue on content-based image retrieval) vol. 56 (12), pp. 6577, 2004. [83] M. Nakazato, T.S. Huang, 3D MARS: Immersive virtual reality for content-based image retrieval, in: Proceedings of the Second International Conference on Multimedia and Exposition (ICME2001), IEEE Computer Society, IEEE Computer Society, Tokyo, Japan, 2001, pp. 4548. [84] S. Santini, R. Jain, Direct manipulation of image databases, Tech. rep., Department of Computer Science, University of California San Diego, San Diego, CA, November 1998. [85] T.M. Lehmann, M.O. Gld, C. Thies, B. Fischer, M. Keysers, D. Kohnen, H. Schubert, B.B. Wein, Content-based image retrieval in medical applications for picture archiving and communication systems, in: Proceedings of the SPIE Conference on Medical Imaging, vol. 5033, San Diego, CA, USA, 2003. [86] B. Revet, DICOM Cook Book for Implementations in Madalities, Philips Medical Systems, Eindhoven, Netherlands, 1997. [87] R.A. Greenes, J.F. Brinkley, Imaging systems, in: Medical Informatics: Computer Applications in Healthcare, second ed., Springer, New York, 2000, pp. 485538 (Chapter 14). [88] C. Kulikowski, E. Ammenwerth, A. Bohne, K. Ganser, R. Haux, P. Knaup, C. Maier, A. Michel, R. Singer, A.C. Wolff, Medical imaging informatics and medical informatics: opportunities and constraints, Methods Inform. Med. 41 (2002) 183189.
20
[89] M.W. Vannier, E.V. Staab, L.C. Clarke, Medical image archives present and future, in: H.U. Lemke, M.W. Vannier, K. Inamura, A.G. Farman, J.H.C. Reiber (Eds.), Proceedings of the International Conference on Computer-Assited Radiology and Surgery (CARS 2002), Paris, France, 2002. [90] A.P. Sarvazyan, F.L. Lizzi, P.N.T. Wells, A new philosophy of medical imaging, Med. Hypotheses 36 (1991) 327335. [91] T. Pun, G. Gerig, O. Ratib, Image analysis and computer vision in medicine, Comput. Med. Imag. Graphics 18 (2) (1994) 8596. [92] A. Rosset, O. Ratib, A. Geissbuhler, J.-P. Valle, Integration of a multimedia teaching and reference database in a PACS environment, RadioGraphics 22 (6) (2002) 1567 1577. [93] E. Binet, J.H. Trueblood, K.J. Macura, R.T. Macura, B.D. Morstad, R.V. Finkbeiner, Computer-based radiology information system: from oppy disk to CD-ROM, RadioGraphics 15 (1995) 12031214. [94] K. Maloney, C.T. Hamlet, The clinical display of radiologic information as an interactive multimedia report, J. Digital Imag. 12 (2) (1999) 119121. [95] T. Frankewitsch, U. Prokosch, Navigation in medical internet image databases, Med. Informatics 26 (1) (2001) 115. [96] S. Beretti, A. Del Bimbo, P. Pala, content-based retrieval of 3D cellular structures, in: Proceedings of the Second International Conference on Multimedia and Exposition (ICME2001), IEEE Computer Society, IEEE Computer Society, Tokyo, Japan, 2001, pp. 10961099. [97] M.R. Ogiela, R. Tadeusiewicz, Semantic-oriented syntactic algorithms for content recognition and understanding of images in medical databases, in: Proceedings of the second International Conference on Multimedia and Exposition (ICME2001), IEEE Computer Society, IEEE Computer Society, Tokyo, Japan, 2001, pp. 621624. [98] S.C. Orphanoudakis, C.E. Chronaki, S. Kostomanolakis, I2 Cnet: a system for the indexing. storage and retrieval of medical images by content, Med. Informatics 19 (2) (1994) 109122. [99] A.M. Aisen, L.S. Broderick, H. Winer-Muram, C.E. Brodley, A.C. Kak, C. Pavlopoulou, J. Dy, C.-R. Shyu, A. Marchiori, Automated storage and retrieval of thin-section CT images to assist diagnosis: system description and preliminary assessment, Radiology 228 (2003) 265270. [100] D. Keysers, J. Dahmen, H. Ney, B.B. Wein, T.M. Lehmann, A statistical framework for model-based image retrieval in medical applications, J. Electronic Imag. 12 (1) (2003) 5968. [101] H.D. Tagare, C. Jaffe, J. Duncan, Medical image databases: a content-based retrieval approach, J. Am. Med. Informatics Assoc. 4 (3) (1997) 184198. [102] H.J. Lowe, I. Antipov, W. Hersh, C. Arnott Smith, Towards knowledge-based retrieval of medical images. the role of semantic indexing, image content representation and knowledge-based retrieval, in: Proceedings of the Annual Symposium of the American Society for Medical Informatics (AMIA), Nashville, TN, USA, 1998, pp. 882886. [103] W.D. Bidgood, B. Bray, N. Brown, A.R. Mori, K.A. Spackman, A. Golichowsky, R.H. Jones, L. Korman, B. Dove, L. Hildebrand, M. Berg, Image acquisition context: procedure description attributes for clinically relevant indexing and selective retrieval of biomedical images, J. Am. Med. Informatics Assoc. 6 (1) (1999) 6175. [104] A. Winter, R. Haux, A three-level graph-based model for the management of hospital information systems, Methods Information Med. 34 (1995) 378396.
H. Mller et al.
[105] M.O. Gld, M. Kohnen, D. Keysers, H. Schubert, B.B. Wein, J. Bredno, T.M. Lehmann, Quality of DICOM header information for image categorization, in: Proceedings of the International Symposium on Medical Imaging, vol. 4685, San Diego, CA, USA, 2002, pp. 280287. [106] C. LeBozec, M.-C. Jaulent, E. Zapletal, P. Degoulet, Unied modeling language and design of a case-based retrieval system in medical imaging, in: Proceedings of the Annual Symposium of the American Society for Medical Informatics (AMIA), Nashville, TN, USA, 1998. [107] A.A.T. Bui, R.K. Taira, J.D.N. Dionision, D.R. Aberle, S. El-Saden, H. Kangarloo, Evidence-based radiology, Acad. Radiol. 9 (6) (2002) 662669. [108] J.-P. Boissel, M. Cucherat, E. Amsallem, P. Nony, M. Fardeheb, W. Manzi, M.C. Haugh, Getting evidence to prescribers and patients or how to make EBM a reality, in: Proceedings of the Medical Informatics Europe Conference (MIE 2003), St. Malo, France, 2003. [109] C.E. Kahn, Articial intelligence in radiology: decision support systems, RadioGraphics 14 (1994) 849861. [110] H. Abe, H. MacMahon, R. Engelmann, Q. Li, J. Shiraishi, S. Katsuragawa, M. Aoyama, T. Ishida, K. Ashizawa, C.E. Metz, K. Doi, Computer-aided diagnosis in chest radiography: Results of large-scale observer tests at the 1996-2001 RSNA scientic assemblies, RadioGraphics 23 (1) (2003) 255265. [111] B. Kaplan, H.P. Lundsgaarde, Toward an evaluation of an integrated clinical imaging system: Identifying clinical benets, Methods Inform. Med. 35 (1996) 221229. [112] A. Horsch, R. Thurmayr, How to identify and assess tasks and challenges of medical image processing, in: Proceedings of the Medical Informatics Europe Conference (MIE 2003), St. Malo, France, 2003. [113] S. Antani, L.R. Long, G.R. Thoma, A biomedical information system for combined content-based retrieval of spine X-ray images and associated text information, in: Proceedings of the Third Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP 2002), Ahamdabad, India, 2002. [114] W.W. Chu, A.F. Crdenas, R.K. Taira, KMED: A knowledgebased multimedia distributed database system, Inform. Sys. 19 (4) (1994) 3354. [115] S.C. Orphanoudakis, C.E. Chronaki, D. Vamvaka, I2 Cnet: content-based similarity search in geographically distributed repositories of medical images, Comput. Med. Imag. Graphics 20 (4) (1996) 193207. [116] E.G.M. Petrakis, Content-based retrieval of medical images, Int. J. Comput. Res. 11 (2) (2002) 171182. [117] M. Tsiknakis, D. Katehakis, C. Orphanoudakis, Stelios, intelligent image management in a distributed PACS and telemedicine environment, IEEE Commun. Magazine 34 (7) (1996) 3645. [118] H.J. Lowe, B.G. Buchanan, G.F. Cooper, J.K. Vreis, Building a medical multimedia database system to integrate clinical information: an application of high-performance computing and communications technology, Bull. Med. Library Assoc. 83 (1) (1995) 5764. [119] S.T. Wong, H.K. Huang, Networked multimedia for medical imaging, IEEE Multimedia Magazine AprilJune (1997) 2435. [120] C. Le Bozec, E. Zapletal, M.-C. Jaulent, D. Heudes, P. Degoulet, Towards content-based image retrieval in HIS-integrated PACS, in: Proceedings of the Annual Symposium of the American Society for Medical Informatics (AMIA), Los Angeles, CA, USA, 2000, pp. 477 481.
21
tersttzung der Frherkennung des malignen Melanoms durch digitale Bildverarbeitung, in: Proceedings of the Workshop Bildverarbeitung fr die Medizin, Munich, Germany, 2000. A. Sbober, C. Eccher, E. Blanzieri, P. Bauer, M. Cristifolini, G. Zumiani, S. Forti, A multiple classier system for early melanoma diagnosis, Articial Intel. Med. 27 (2003) 29 44. F. Meyer, Automatic screening of cytological specimens, Comput. Vis. Graphics Image Proces. 35 (1986) 356369. M.E. Mattie, L. Staib, E. Stratmann, H.D. Tagare, J. Duncan, P.L. Miller, PathMaster: Content-based cell image retrieval using automated feature extraction, J. Am. Med Informatics Assoc. 7 (2000) 404415. Pathnder: Region-based searching of Pathology Images using IRM. K. Veropoulos, C. Campbell, G. Learnmonth, Image processing and neural computing used in the diagnosis of tuberculosis, in: Proceedings of the Colloquium on Intelligent Methods in Healthcare and Medical Applications (IMHMA), York, UK, 1998. M.-C. Jaulent, C. Le Bozec, Y. Cao, E. Zapletal, P. Degoulet, A property concept frame representation for exible image content retrieval in histopathology databases, in: Proceedings of the Annual Symposium of the American Society for Medical Informatics (AMIA), Los Angeles, CA, USA, 2000. L.H. Tang, R. Hanka, H.H.S. Ip, R. Lam, Extraction of semantic features of histological images for content-based retrieval of images, in: Proceedings of the IEEE Symposium on Computer-Based Medical Systems (CBMS 2000), Houston, TX, USA, 2000. H.Y. Tang, Lilian, R. Hanka, H.H.S. Ip, K.K.T. Cheung, R. Lam, Semantic query processing and annotation generation for content-based retrieval of histological images, in: International Symposium on Medical Imaging, vol. 3976, San Diego, CA, USA, 2000. G.P. Robinson, H.D. Targare, J.S. Duncan, C.C. Jaffe, Medical image collection indexing: shape-based retrieval using KD-trees, Comput. Vis. Graphics Image Proces. 20 (4) (1996) 209217. A.S. Constantinidis, M.C. Fairhurst, A.F.R. Rahman, A new multi-expert decision combination algorithm and its application to the detection of circumscribed masses in digital mammograms, Pattern Recog. 34 (2001) 1527 1537. P. Korn, N. Sidiropoulos, C. Faloutsos, E. Siegel, Z. Protopapas, Fast and effective retrieval of medical tumor shapes, IEEE Trans. Knowledge Data Eng. 10 (6) (1998) 889904. S. Baeg, N. Kehtarnavaz, Classication of breast mass abnormalities using denseness and architectural distorsion, Electronic Lett. Comput. Vis. Image Anal. 1 (1) (2002) 120. F. Schnorrenberg, C.S. Pattichis, C.N. Schizas, K. Kyriacou, Content-based retrieval of breast cancer biopsy slides, Technol. Health Care 8 (2000) 291297. D.-M. Kwak, B.-S. Kim, O.-K. Yoon, C.-H. Park, J.-U. Won, K.-H. Park, Content-based ultrasound image retrieval using a coarse to ne approach, Annals New York Acad. Sci. 980 (2002) 212224. C. Brodley, A. Kak, C. Shyu, J. Dy, L. Broderick, A.M. Aisen, Content-based retrieval from medical image databases: A synergy of human interaction, machine learning and computer vision, in: Proceedings of the 10th National Conference on Articial Intelligence, Orlando, FL, USA, 1999, pp. 760767.
[137]
[138] [139]
[140] [141]
[142]
[143]
[144]
[145]
[146]
[147]
[148]
[149] [150]
[151]
22
[152] C.-R. Shyu, A. Kak, C. Brodley, L.S. Broderick, Testing for human perceptual categories in a physician-in-the-loop CBIR system for medical imagery, in: Proceedings of the IEEE Workshop on Content-based Access of Image and Video Libraries (CBAIVL99), Fort Collins, CO, USA, 1999, pp. 102108. [153] C. Schaefer-Prokop, M. Prokop, D. Fleischmann, C. Herold, High-resolution CT of diffuse interstitial lung disease: key ndings in common disorders, Europe. Radiol. 11 (2001) 373392. [154] D.M. Hansell, High-resolution CT of diffuse lung disease, Radiol. Clin. North Am. 39 (6) (2001) 10911113. [155] C.Y. Han, H. Chen, L. He, W.G. Wee, A web-based distributed image processing system, in: S. Santini, R. Schettini (Eds.), Proceedings of the SPIE Photonics West Conference on Internet Imaging IV, vol. 5018, San Jose, CA, USA, 2003, pp. 111122. [156] S. Sclaroff, A.P. Pentland, On modal modeling for medical images: Underconstrained shape description and data compression, in: Proceedings of the IEEE Workshop on Biomedical Image Analysis (BIA1994), Seattle, WA, USA, 1994, pp. 7079. [157] Y. Liu, F. Dellaert, Classication-driven medical image retrieval, in: Proceedings of the ARPA Image Understanding Workshop, 1997. [158] W. Cai, D.D. Feng, R. Fulton, Content-based retrieval of dynamic PET functional images, IEEE Trans. Information Technol. Biomed. 4 (2) (2000) 152158. [159] L.R. Long, G.R. Thoma, L.E. Berman, A prototype client/server application for biomedical text/image retrieval on the internet, in: I.K. Sethi, R.C. Jain (Eds.), Proceedings of the Conference on Storage and Retrieval for Image and Video Databases VI, vol. 3312, 1997, pp. 362372. [160] S. Santini, A. Gupta, The role of internet images in the biomedical informatics research network, in: S. Santini, R. Schettini (Eds.), Proceedings of the Conference on Internet Imaging IV, vol. 5018, San Jose, CA, USA, 2003, pp. 99110. [161] J.M. Carazo, H.K. Stelter, The BioImage database project: organizing multidemensional biological images in an object-relational database, J. Struct. Biol. 125 (1999) 97102. [162] Z. Geradts, H. Hardy, A. Poortmann, J. Bijhold, Evaluation of contents based image retrieval methods for a database of logos on drug tablets, in: E.M. Carapezza (Ed.), Proceedings of the Conference on Technologies for Law Enforcement, vol. 4232, Boston, MA, USA, 2000. [163] N. Laitinen, O. Antikainen, J.-P. Mannermaa, J. Yliruusi, Content-based image retrieval: a new promising technique in powder technology, Pharma. Dev. Technol. 5 (2) (2000) 171179. [164] G.D. Magoulas, A. Prentza, Machine learning in medical applications, in: G. Paliouras, V. Karkaletsis, C.D. Spyrpoulos (Eds.), Machine Learning and its Applications, Lecture Notes in Computer Science, Springer-Verlag, Berlin, 2001, pp. 300307. [165] M.J. Egenhofer, Spatial-query-by-sketch, in: Proceedings of the IEEE Symposium on Visual Languages (VL 1996), Boulder, CO, USA, 1996, pp. 667. [166] T. Ikeda, M. Hagiwara, Content-based image retrieval system using neural networks, Int. J. Neural Sys. 10 (5) (2000) 417424. [167] W. Hersh, M. Mailhot, C. Arnott-Smith, H. Lowe, Selective automated indexing of ndings and diagnoses in radiology reports, J. Biomed. Informatics 34 (2001) 262273.
H. Mller et al.
[168] M.M. Wagner, G.F. Cooper, Evaluation of a meta-1-based automatic indexing methods for medical documents, Comput. Biomed Res. 25 (1992) 336350. [169] R. Chbeir, Y. Amghar, A. Flory, MIMS: A prototype for medical image retrieval, in: Recherche dInformations Assiste par Ordinateur (RIAO2000) Computer-Assisted Information Retrieval, vol. 1, Paris, France, 2000. [170] H. Mller, A. Rosset, J.-P. Valle, A. Geissbuhler, Integrating content-based visual access methods into a medical case database, in: Proceedings of the Medical Informatics Europe Conference (MIE 2003), St. Malo, France, 2003, pp. 480485. [171] T. Wringer, J. Stockhausen, D. Meyer-Ebrecht, A. Bcking, Automatic coregistration, segmentation and classication for multimodal cytopathology, in: Proceedings of the Medical Informatics Europe Conference (MIE 2003), St. Malo, France, 2003. [172] A.W. Toga, P. Thompson, An introduction to brain warping, in: Brain Warping, Academic Press, New York, 1998, pp. 128 (Chapter 1). [173] E.G.M. Patrakis, C. Faloutsos, Similarity searching in medical image databases, IEEE Trans. Knowledge Data Eng. 9 (3) (1997) 435447. [174] J.I. Khan, D.Y.Y. Yun, Holographic image archive, Comput. Med. Imag. Graphics 20 (4) (1996) 243257. [175] L. Yu, Kaia nd Ji, X. Zhang, Kernel nearest-neighbor algorithm, Neural Proces. Lett. 15(2) (2002) 147156. [176] T. Saracevis, Relevance: a review of and a framework for the thinking on the notion in information science, J. Am. Soc. Inform. Sci., NovemberDecember (1975) 321343. [177] L. Schamber, M.B. Eisenberg, M.S. Nilan, A re-examination of relevance: toward a dynamic, situational denition, Inform. Proces. Manage. 26 (6) (1990) 755775. [178] T. Brkle, E. Ammenwerth, H.-U. Prokosch, J. Dudeck, Evaluation of clinical information systems. what can be evaluated and what cannot, J. Evaluat. Clin. Practice 7 (4) (2001) 373385. [179] H. Mller, W. Mller, D.M. Squire, S. Marchand-Maillet, T. Pun, Performance evaluation in content-based image retrieval: Overview and proposals, Pattern Recog. Lett. 22 (5) (2001) 593601. [180] G. Salton, The evaluation of computer-based information retrieval systems, in: Proceedings of the 1965 Congress International Federation for Documentation (IFD 1965), Spartan Books Washington, Washington DC, USA, 1965, pp. 125133. [181] J. Nielsen, Usability Engineering, Academic Press, Boston, MA, 1993. [182] T.M. Lehmann, B.B. Wein, H. Greenspan, Integration of content-based image retrieval to picture archiving and communication systems, in: Proceedings of the Medical Informatics Europe Conference (MIE 2003), St. Malo, France, 2003. [183] P. Franz, A. Zaiss, U. Schulz, Stefana nd Hahn, R. Klar, Automated coding of diagnoses three methods compared, in: Proceedings of the Annual Symposium of the American Society for Medical Informatics (AMIA), Los Angeles, CA, USA, 2000. [184] A. Geissbuhler, C. Lovis, A. Lamb, S. Spahni, Experience with an XML/HTTP-based federative approach to develop a hospital-wide clinical information system, in: R. Rogers, R. Haux, V. Patel (Eds.), Proceedings of the International Medical Informatics Conference (Medinfo 2001), London, UK, 2001, pp. 735739. [185] J.R. Smith, VideoZoom: A spatial-temporal video browser for the internet, in: C.-C.J. Kuo, S.-F. Chang, S. Panchanathan (Eds.), Proceedings of the Conference on Mul-
23
Archiving Systems III (VV02), vol. 3527, Boston, MA, USA, 1998, pp. 233244. [187] Anil K. Jain, Robert P.W. Dvin, Jianchang Mao, Statistical Pattern Recognition: A Review, IEEE Transaction on Pattern Analysis and Machin Intelligence, vol. 22 (1), pp. 437, 2000.