Hosting and managing large sets of virtual microscopy slides on the internet for E-learning and for reference
© Sinn; licensee BioMed Central Ltd. 2013
Published: 30 September 2013
While the imaging technology for virtual microscopy can be considered mature, the techniques used for the organization and presentation of the digital slides are more complex because of the diversity of user requirements for every project and the technical challenges regarding the structuring and management of the slides. Generally, the presentation of virtual microscopy on the intra- and internet requires the availability of tools that act as a bridge between the digital slide repository and a content management system (CMS) for the web presentation. This involves specific programming of the CMS in order to display the virtual slides in a consistent and specific manner and provide for easy expandability. This is true for E-Learning systems, but also for general digital slide repositories on the web. We have created internet platforms for undergraduate and postgraduate pathology teaching and reference and will present our approach for an effective management and presentation of large numbers of digital slides.
Materials and methods
Educational slides were digitized using an Aperio CS scanner and stored in the Aperio svs file format, using JPEG and JPEG2000 compression. The Aperio Spectrum platform was used as the primary image and metadata repository, but because not used to publicly display its content because of its technical limitations and licensing issues. Therefore, all images and metadata information were exported to specialized E-Learning and pathology reference platforms. The implementation of digital pathology slides on these platforms included the following common important aspects:
1. Load balancing: The simultaneous access of up to 70 students in the classroom to few digital slides at one time, or dozens of users to many slides on an open internet platform requires effective load balancing for the image server. This was achieved by setting up a reverse proxy load balancer on Apache that distributes the load to several image server processes which are running on subordinated machines.
3. For the postgraduate digital slide pathology repository, a general purpose content management system (CMS) was chosen (Textpattern), that builds upon the open source LAMP (Linux, Apache, MySQL, PHP) architecture. All metadata, and including the links to the virtual slides could be seamlessly transferred from the Spectrum database into Textpattern.
4. The Textpattern CMS was programmed to provide for a hierarchical structure that reflects the pathological disease taxonomy in a hierarchical fashion. It uses a tree structure with unlimited depth, and is easily expandable. Digital slides are assigned to one or more categories in this system.
Results and discussion
The user of a pathology reference site expects a systematic layout of the internet site, allowing for easy access of its content based on the medical disease categories, and to be able to browse related entities more easily than in a medical textbook. Because of the many synonyms and related terms of medical terminology, it is not sufficient to rely on a built-in search engine to access the digital slides in question. For an effective approach, a nosological hierarchy of the digital slides must be built that is governed by the pathological diagnosis and organ sites in focus. This requires the building of hierarchical and associative relationships for the diagnostic terminology as well as the use of dictionaries and thesauri. In this fashion, the content management system is able to focus on the pathological slides searched for, and at the same time provide links to related entities in the database. An associative and disease-oriented browsing becomes possible.
Both pathology reference sites and E-Learning sites which present digital slides must have sufficient server capacity to be able to deal with multiple concurrent users who are accessing the same or different digital slides simultaneously. A typical example is a class of more than 50 medical students that use virtual microscopy in the classroom. The user experience of quickly navigating the digital slides is dependent on an effective load balancing solution and an image server technology that supports this. Today’s advanced servers with multiprocessor and multicore architectures allow for the upscaling of digital pathology proportional to the number of cores in one machine, and thus limit the need of running multiple servers in parallel for the handling of the image data.
The building and management of a digital pathology internet site for the purpose of pathology teaching and reference requires special attention to the database design, the handling of metadata, the user interface, and load balancing.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.