ORIGINAL ARTICLE
|
|
Implementation and evaluation of a medical image management system with content-based retrieval support |
|
Autho(rs): Edilson Carlos Caritá, Enzo Seraphim, Marcelo Ossamu Honda, Paulo Mazzoncini de Azevedo-Marques |
|
Keywords: CBIR, PACS, DICOM, Medical images, Health information system |
|
Abstract:
IPhD, Docent at the Center of Exact, Natural and Technological Sciences, Universidade de Ribeirão Preto (Unaerp), Ribeirão Preto, SP, Brazil
INTRODUCTION The "picture archiving and communication systems" (PACS) has already become the preferred technological option for transmission, storage and visualization of images in the field of imaging diagnosis. Most recently, because of the increasing number of images produced by the currently available diagnostic imaging methods, the optimization of data storage and retrieval has stood out as a relevant topic in the study of radiological IT solutions. So, while the baseline requirement for a PACS would be the ability of visualizing related clinical data and images, a great interest in techniques utilizing "content-based image retrieval" (CBIR) has emerged in the last years. Based on data extracted from a determined case, these techniques seek to localize similar cases previously diagnosed and stored in a database. CBIR systems utilize data extracted from images for representing them, with the primary objective of establishing a semantic description for these images(1). In this type of approach(2-5), the images retrieval is based on the similarity of their attributes which are compared with attributes extracted from a reference image. Color, texture, shape, structures and spatial relationship are attributes most frequently utilized and serve as keys for images retrieval based on a previously established approximation by similarity. Content-based retrieval has shown to be a relevant alternative and an essential complement to the traditional text-based query systems(6-8). A PACS with content-based images retrieval (PACS-CBIR) consists in a system that provides a connection with images modalities, visualization, storage, images retrieval and reports by means of alphanumeric or textual information and the studies/test retrieval based on queries by similarity of images characteristics. The present study describes the implementation of an architecture for a medical images management system in a hospital environment integrated with the CBIR function, and the corresponding evaluation covering the functionalities of the modules dedicated to medical images management, content-based images retrieval support and associated images/data retrieval by means of textual data. The images utilized in the present study originate from digital imaging methods such as magnetic resonance imaging (MRI) and computed tomography (CT) in "digital imaging and communication in medicine" (DICOM) format, indexed, stored and linked to the hospital "radiology information system" (RIS)(9), for later visualization with their respective clinical reports. Queries for both images and reports can be performed by means of keywords (textual query) or by means of an images retrieval interface based on pictoric patterns similarity, quantified through texture attributes.
MATERIALS AND METHODS The PACS-CBIR architecture modeling was based on the information and job stream system of the school-hospital where the present study was developed. Figure 1 illustrates the modeling of this architecture.
For the images server implementation the source codes of PACSOne version 3.1.8 were utilized. This software was developed in 2004 by Xiaohui Li in the United States of America and distributed by the company RainbowFish Software(10). PACSOne is a DICOM-compliant server for short- and long-term medical images storage for later retrieval, transmission and visualization. The server configurations and images data are stored in a relational database such as the MySQL version 4.1.16 , the "database management system" (DBMS) utilized in the present study, that is a free, open source, portable software compatible with different programming languages(11). The textual and similarity images retrieval interfaces were developed with network Technologies utilizing "hypertext markup language" (HTML) and "hypertext preprocessor" (PHP) version 4.0.4. For visualization of DICOM-compatible images on the browser, an application was developed utilizing the Java programming language(12), including functions for brightness and contrast adjustment. The CBIR system was created for extracting the features of images stored in the PACS server, indexing them for later similarity retrieval. The application responsible by the images features was developed in C++ programming language and is executed through a command line. Upon initialization, the application searches for the studies/tests stored in the PACS server, extracts the images features utilizing texture descriptors developed by Haralick et al.(13) and stores them as feature vectors in the database of the CBIR module. The metric structure on which the images features indexation is based is the slim-tree proposed by Traina Jr et al.(14). Performance, stability, main functionalities and DICOM-compatibility were taken into consideration in the evaluation of the implemented system. A total of 120 studies including 60,401 MRI and CT images were utilized for this evaluation. The textual retrieval system underwent two evaluation processes: a quantitative evaluation aimed at validating and analyzing the DICOM functionalities for guaranteeing the availability of all the images stored in the PACS server and linked to the RIS at the moment of retrieval in conjunction with the demographic data of the patient and the clinical study data, besides the measurement of the response time for each study; and a qualitative evaluation considering the user satisfaction in relation to the system. Both evaluations were performed by four physicians and a computation scientist. Chart 1 presents the observer's education and the experience in years. The system interface response time for studies, response time for images, images quality, browsing facility, and image settings were evaluated. Quality levels considered in the present evaluation were the following: 1 (very poor quality - inappropriate performance); 2 (poor quality - insufficient performance); 3 (satisfactory quality - it can be utilized although with some significant limitation); 4 (good quality - it can be utilized, although with some non-significant limitation); 5 (very good quality - appropriate performance).
Only MRI studies were considered in the evaluation of the system of query by similarity, including 61 cases and 17,099 images (39 cases with 12,939 brain images; 8 cases with 1,456 abdominal images; 10 cases with 2,448 spine images; and 4 cases with 256 pharynx images). Initially, the user selected a imaging diagnosis modality, defined a reference image and informed the number of similar images which should be retrieved by the system (Figure 2). Following the process of images retrieval, an image was considered as correct provided it corresponded to the same region of interest or diagnostic condition (normal or abnormal) of the reference image. For studies/tests with abnormal reports, the diagnostic descriptions compatibility was evaluated.
RESULTS The system for textual retrieval has been implemented to allow the integration between the imaging server, the RIS server and the images viewer, assuming that the authorized user enters a query for patients' data, clinical studies/tests and respective images. Chart 2 presents the textual query options available in the system. Figure 3 presents the screen showing the complex query option. Based on the patient's data and the list of studies/tests performed, one can access and visualize the images (Figure 4). Results of the qualitative evaluation of the textual retrieval system are shown on Table 1. Table 2 shows the results of queries performed to evaluate the similarity between images retrieved through texture descriptors, taking the region of interest into consideration. The column "Average" presents mean percentages of hits in queries for images for each region of interest. Overall, the mean rate of hits reached 72% with 0.84 standards deviation.
Table 3 presents results validating the clinical report conclusion (normal or abnormal) and the compatibility of the clinical reports description for abnormal cases, for each region of interest. Overall, the mean rate of hits reached 81% with 1.47 standards deviation for the clinical report conclusion, and 64% with 1.13 standard deviation for compatibility of clinical reports description.
DISCUSSION In the qualitative evaluation of the textual retrieval system, one can observe that four observers attributed grade 5 and one, grade 4 for "system interface"; so this item can be considered as very good. In the implementation of the PACS-CBIR project interfaces (PACS interface and textual retrieval and content-based retrieval interfaces) the global strategy involving the development of Internet-based systems for a safe, practical and fast provision of information was adopted. As a result, the users do not need to have the system installed in their own computers, considering that it is easily accessible from any place. Four observers attributed grade 5 and one, grade 3 for "response time for studies/tests"; most of them considered this response time as very good. It is important to note that this time has not exceeded five seconds, but variations in the transfer rate may occur as a function of the traffic in the local area network of the hospital. In the evaluation of "response time for images", two observers attributes grade 5, two grade 4, and one, grade 3, so this item may be considered as good; however, the mean response time has always ranged between 10 and 15 seconds. Although this aspect has not been quantified in the present study, it is likely that the highest waiting time is associated to the process of image loading and displaying on the screen rather than to the process of query through the CBIR module, in compliance with the results reported by Harrison(15), who has been responsible for the implantation of a PACS-Web in the University of Mississippi Medical Center in Jackson, MS, USA. According to the author, the query is fast, and what is computationally expensive is retrieving the image from the database and loading and displaying the imaging on the screen. In the topic "visualization quality", grade 4 was given by all the observers, corresponding to good, and the main limitation being associated to the windowing for images displaying. This limitation is confirmed in the evaluation of the topic "images settings" that was given grade 3 by all the observers, with the suggestion of adding some tools such as zoom, segmentation and binarization. As regards "browsing facility", four observers attributed grade 5, and one, grade 4. The majority of the observers considered this topic as very good, a result compatible with other studies approaching the utilization of Internet technologies in PACS. According to Harrison(15), Peer et al.(16) and Cao et al.(17), the main advantages of the utilization of these technologies are costs reduction and studies/tests decentralization, allowing the radiologists to easily access images through hospital local network or Internet. The results regarding the performance of the system for images retrieval by similarity utilizing texture attributes, with an overall mean rate of hits reaching 72% are compatible with results found by other studies in the literature(1,18,19). Pereira Jr et al.(18) have developed a study demonstrating that texture attributes can be useful in the automatic differentiation between normal regions and regions with nodules or microcalcifications on digitized mammographic images, with more than 90% of hits. However, these attributes sensitivity for differentiating malignant from benign lesions decreases to as low as 50%. In a study involving the analysis of texture for images retrieval by similarity, Oliveira et al.(19) have found results for accuracy of about 54% for sagittal images of knee, and 40% for axial images of head. Kinoshita et al.(1) have presented results ranging between 78% and 83% of hits in a CBIR system based on texture and artificial neural networks developed to retrieve mammographic images by tissue density similarity.
CONCLUSIONS The architecture implemented was based on non-proprietary, open-source Technologies and the features of the information system and the flow of studies/tests in the hospital where the present project was implemented. However, its structure can be adapted for operating in conjunction with any DICOM-compatible management system and imaging diagnosis modality, basically by the addition of a content-based images retrieval module. The implementation of the similarity retrieval algorithm required texture descriptors which provide measurements of regions properties such as mildness, roughness and smoothness(20). However, the system performance can be enhanced with the inclusion of further descriptors dedicated to the characterization of shape, for example. As far as its application is concerned, the system implemented can be utilized in the clinical routine as tool for aiding in the decision-making process, as described in the study developed by Huang et al.(21), Who have described the implementation and evaluation of a diagnosis support tool based on similar images retrieval in the Children Hospital Los Angeles. Their study has involved 2500 consecutive MRI studies of children's brains, with results ranging between 10% and 60% of hits in queries by similarity, respectively for arachnoid cyst and neurofibromatosis. It is evident that the utilization of a CBIR system as an ancillary diagnostic tool will negatively affect the work flow and, consequently, the unit productivity. But the benefits of this system as a tool for supporting Radiology teaching and research are quite evident, as described in a review article published by Müller et al.(8). Finally, it is important to note that there are still few reports in the literature approaching the integration of content-based retrieval tools into images management systems, reinforcing the innovative character of the present study.
REFERENCES 1. Kinoshita SK, Azevedo-Marques PM, Pereira RR Jr, et al. Content-based retrieval of mammograms using visual features related to breast density patterns. J Digit Imaging. 2007;20:172-90. [ ] 2. Song BC, Kim MJ, Ra JB. A fast multiresolution feature matching algorithm for exhaustive search in large image databases. IEEE Trans Circuits Syst Video Technol. 2001;11:673-8. [ ] 3. Cai W, Feng DD, Fulton R. Content-based retrieval of dynamic PET functional images. IEEE Trans Inf Technol Biomed. 2000;4:152-8. [ ] 4. Assfalg J, Pala P. Querying by photographs: a VR metaphor for image retrieval. IEEE Multimedia. 2000;7:52-9. [ ] 5. Wang Z, Chi Z, Feng D. Content-based image retrieval using block-constrained fractal coding and nona-tree decomposition. IEEE Proc Vis Image Signal Process. 2000;147:9-15. [ ] 6. Alto H, Rangayyan RM, Desautels JEL. Content-based retrieval and analysis of mammographic masses. J Electron Imaging. 2005;14:1-17. [ ] 7. Vailaya A, Figueiredo MAT, Jain AK, et al. Image classification for content-based indexing. IEEE Trans Image Processing. 2001;10:117-30. [ ] 8. Müller H, Michoux N, Bandon D, et al. A review of content-based image retrieval systems in medical applications - clinical benefits and future directions. Int J Med Inform. 2004;73:1-23. [ ] 9. Azevedo-Marques PM, Caritá EC, Benedicto AA, et al. Integração RIS/PACS no Hospital das Clínicas de Ribeirão Preto: uma solução baseada em "web". Radiol Bras. 2005;38:37-43. [ ] 10. PacsOne. [Acessado em: 10/6/2007]. Disponível em: http://www.pacsone.net [ ] 11. Milani A. MySQL - guia do programador. São Paulo: Novatec; 2006. [ ] 12. Caritá EC, Matos ALM, Azevedo-Marques PM. Ferramentas para visualização de imagens médicas em hospital universitário. Radiol Bras. 2004;37:437-40. [ ] 13. Haralick RM, Shanmugan K, Dinstein I. Textural features of images classification. IEEE Trans Systems Man Cybernetics. 1973;3:610-21. [ ] 14. Traina C Jr, Traina AJM, Faloutsos C, et al. Fast indexing and visualization of metric data sets using slim-trees. IEEE Trans Knowledge Data Engineering. 2002;14:244-60. [ ] 15. Harrison SW. Success with Web-based image access. Radiol Manage. 2003;25:36-8. [ ] 16. Peer S, Vogl R, Peer R, et al. Sophisticated hospital information system/radiology information system/picture archiving and communications system (PACS) integration in a large-scale traumatology PACS. J Digit Imaging. 1999;12:99-102. [ ] 17. Cao F, Huang HK, Zhou XQ. Medical image security in a HIPAA mandated PACS environment. Comput Med Imaging Graph. 2003;27:185-96. [ ] 18. Pereira RR Jr, Azevedo-Marques PM, Honda MO, et al. Usefulness of texture analysis for computerized classification of breast lesions on mammograms. J Digit Imaging. 2007;20:248-55. [ ] 19. Oliveira MC, Azevedo-Marques PM, Cirne Filho WC. Grades computacionais na otimização da recuperação de imagens médicas baseada em conteúdo. Radiol Bras. 2007;40:255-61. [ ] 20. Gonzalez RC, Woods RE. Processamento de imagens digitais. São Paulo: Edgard Blücher; 2000. [ ] 21. Huang HK, Nielsen JF, Nelson MD, et al. Image-matching as a medical diagnostic support tool (DST) for brain diseases in children. Comput Med Imaging Graph. 2005;29:195-202. [ ] Mailing address: Received August 29, 2007. Accepted after revision February 25, 2008. * Study developed at Faculdade de Medicina de Ribeirão Preto da Universidade de São Paulo (FMRPUSP), Ribeirão Preto, SP, Brazil. |