Automated Selection of Hotspots (ASH): enhanced automated segmentation and adaptive step finding for Ki67 hotspot detection in adrenal cortical cancer
© Lu et al.; licensee BioMed Central Ltd. 2014
Received: 17 May 2014
Accepted: 26 October 2014
Published: 25 November 2014
In prognosis and therapeutics of adrenal cortical carcinoma (ACC), the selection of the most active areas in proliferative rate (hotspots) within a slide and objective quantification of immunohistochemical Ki67 Labelling Index (LI) are of critical importance. In addition to intratumoral heterogeneity in proliferative rate i.e. levels of Ki67 expression within a given ACC, lack of uniformity and reproducibility in the method of quantification of Ki67 LI may confound an accurate assessment of Ki67 LI.
We have implemented an open source toolset, Automated Selection of Hotspots (ASH), for automated hotspot detection and quantification of Ki67 LI. ASH uses the NanoZoomer Digital Pathology Image (NDPI) splitter to convert digital slides scanned on the Hamamatsu instrument from the proprietary NDPI format into conventional TIFF or JPEG images, which are then processed by automated segmentation and an adaptive step finding hotspot detection algorithm. Quantitative hotspot ranking is provided by the open source application ImmunoRatio as part of the ASH protocol. The output is a ranked set of hotspots with accompanying quantitative values based on whole slide ranking.
We have implemented an open source tool for automated detection and quantitative ranking of hotspots to support histopathologists in selecting the 'hottest' hotspot areas in adrenocortical carcinoma. To give the wider community easy access to ASH, we implemented a Galaxy virtual machine (VM) of ASH, which is available from http://bioinformatics.erasmusmc.nl/wiki/Automated_Selection_of_Hotspots.
The virtual slide(s) for this article can be found here: http://www.diagnosticpathology.diagnomx.eu/vs/13000_2014_216
Keywords: Adrenal Cortical Carcinoma (ACC); Gastroenteropancreatic Neuroendocrine Tumor (GEP-NET); Automated Selection of Hotspots (ASH); Adaptive step finding; Ki67 LI; ImmunoRatio; Galaxy; Virtual machine (VM)
Adrenal cortical carcinoma (ACC) is a rare type of endocrine malignancy with an estimated incidence of 0.7-2.0 cases per million population per year and a poor overall prognosis. According to recent evidence from the European Network for the Study of Adrenal Tumors (ENS@T) ACC study group, the resection status and the Ki67 labelling index (LI) constitute the most relevant prognostic parameters in both localized and advanced ACC. In this regard, it has been suggested that the histopathology report should include Ki67 LI along with confirmation of the adrenocortical origin on immunohistochemical grounds, Weiss score and resection status. Importantly, Ki67 LI has been integrated in treatment flow charts for ACC patients with either tumor amenable to radical resection or advanced disease.
Taken together, the production of accurate and reproducible Ki67 LIs remains a key issue and a main responsibility of pathologists. It should be recognized that various factors (pre-analytical, analytical, interpretation, scoring, and data analysis) might affect Ki67 LI. Given the biological heterogeneity of Ki67 immunostaining across tumor specimens, the area of the slide to be read for Ki67 LI assessment has been controversial, e.g. in breast cancer. According to the European Neuroendocrine Tumor Society (ENETS), the mitotic count and the Ki67 LI should be assessed in areas with the highest proliferating activity (hotspots) in order to determine the proliferation grade in gastroenteropancreatic neuroendocrine tumors (GEP-NETs). As far as ACCs are concerned, there is not only a lack of studies addressing the potential biological heterogeneity of Ki67 staining and inter-observer variation, but there are also different methods of objective quantification of the Ki67 proliferative index.
In routine diagnostic practice, representative areas of slides are manually selected by histopathologists using visual examination of whole mount Ki67-immunostained slides at a low magnification. Of note, this process might lack reproducibility and affect the Ki67 LI. Since digitized immunohistochemical (IHC) stained tissue sections have become amenable to computerized image analysis, two independent groups have developed either a hybrid clustering approach for the detection of Ki67 hotspots in whole tumor slide images or a simplified computerized method for hotspot detection in digitized IHC slides. In this context, we developed Automated Selection of Hotspots (ASH) to provide clinical labs with the ability to determine the most active areas in proliferative rate within a slide and subsequently quantitate Ki67 LI using a desktop PC without requiring extensive bioinformatics support. ASH uses Galaxy both as a simple graphical user interface and to join the components of ASH into an analytical workflow for hotspot detection. Galaxy is contained in a VMware virtual machine (VM), which ensures that the system is platform independent. The use of VM technology has been highlighted by Nocq et al. as a way to improve the usability of next generation sequencing software by simply sharing entire installations.
We believe that this is the first time that a Galaxy-VM has been used to deliver hotspot detection software either to a single user (on a personal computer) or to multiple users (on a server), with the same easy access via the Galaxy graphical user interface (GUI).
ASH provides an end-to-end workflow for hotspot detection, using the functionality of the Galaxy GUI to provide the user with simple data upload and an HTML-style reporting environment.
The application is for digital images obtained on the Hamamatsu NanoZoomer Digital Pathology (NDP) System (Hamamatsu Photonics K.K. Japan), in their proprietary NDP Image (NDPI) file format. NDPI images are segmented with NDPI-splitter and the segmented blocks are quantitated with ImmunoRatio; both tools are freely available (download locations are given in the reference list). For image processing, analysis, and visualization, we adopted OpenCV. The ASH software tool was developed on the Ubuntu 12.04 Linux operating system as a Galaxy application and is distributed as a VMware virtual machine for Windows users.
The detection of hotspots uses an adaptive step finding methodology, which has been utilized in engineering for many years and has been extensively evaluated and validated. Experimental evaluation has demonstrated the effectiveness of the adaptive step size, and the adaptive step finding method applied in ASH has the same functionality. The selection of the step size is critical from the point of view of both computational efficiency and detection performance.
To simplify the use of ASH, we have implemented Galaxy within the same virtual machine (VM) to provide a standardized graphical user interface (GUI) for accessing, running and visualizing ASH. Galaxy is an open, web-based platform, for which we developed tools to upload image files, to analyse the files with ASH in batch mode, and to deliver an HTML report of the selected image with the quantitative ranking of the hotspots displayed in that image. All components and dependencies were built into a VMware virtual machine (VM), an environment that is used like any physical computer but can also be shared by download. The entire virtual machine is usually contained in a few files on the host computer (the physical machine that the virtual machine is running on). This means that, instead of installing all the dependencies required by ASH (including NDPI splitter, ImmunoRatio, OpenCV and Galaxy), the user only needs to have VMware installed.
Automated selection of hotspots
The whole scanned image is segmented with NDPI splitter, as shown from the left upper image to the right upper image in Figure 4.
Based on the split images, we shift them by 1/4 of the side length, as shown from the right upper image to the bottom image in Figure 4. After the successful creation of JPEG images from the NDPI files, we adopt ImmunoRatio to calculate the IR% per block of the image, and rank the top 10 IR% image blocks.
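The block shifting and ranking described above can be sketched as follows. This is a minimal illustration in Python, not the ASH implementation: the function names `shifted_blocks` and `rank_hotspots`, the pixel-based block layout, and the `score` callback (a stand-in for the IR% that ImmunoRatio would return for a block) are all assumptions made for the example.

```python
from typing import Callable, List, Tuple

# (x, y, width, height) of a block in pixels; a hypothetical layout for this sketch.
Block = Tuple[int, int, int, int]


def shifted_blocks(img_w: int, img_h: int, side: int, step_fraction: float) -> List[Block]:
    """List square blocks whose origins advance by side * step_fraction.

    A step_fraction of 1.0 gives non-overlapping blocks; 0.25 shifts each
    block by a quarter of the side length, so neighbouring blocks overlap.
    """
    if img_w < side or img_h < side:
        return []  # the image is too small to hold a single block
    step = max(1, int(side * step_fraction))
    blocks = []
    for y in range(0, img_h - side + 1, step):
        for x in range(0, img_w - side + 1, step):
            blocks.append((x, y, side, side))
    return blocks


def rank_hotspots(
    blocks: List[Block],
    score: Callable[[Block], float],
    top_n: int = 10,
) -> List[Tuple[float, Block]]:
    """Score every block and return the top_n (score, block) pairs, highest first.

    `score` is a placeholder for the per-block IR% measurement.
    """
    scored = [(score(block), block) for block in blocks]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_n]
```

With a 400 x 400 pixel image, a block side of 200 pixels and a shift of 1/4 of the side length, `shifted_blocks` yields 25 overlapping blocks, compared with 4 when the blocks do not overlap.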
Adaptive step finding
Visualization and reporting
Optimization of adaptive step selection
Optimization of ASH: The effect of step size on the performance of ASH was determined as the average % ImmunoRatio (IR%), the number of blocks (# blocks) and the time in seconds to completion (Time) at decreasing step size intervals (Step size)
Validation of quantitative hotspot detection
Adaptive step finding has been utilized in engineering for many years and has been extensively evaluated and validated. Experimental evaluation in earlier work demonstrated the effectiveness of the adaptive step size, and the adaptive step finding method applied in ASH has the same functionality. We have tested ASH on a set of >60 whole-slide digitally scanned ACC images; in comparison with manual labelling index assessment, it achieved a strong correlation (rho >0.8, p = 0), as calculated with the Spearman rank order metric (publication in progress).
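For readers who wish to run the same kind of comparison on their own slides, the Spearman rank order correlation between two sets of labelling indices can be computed as in the sketch below. It is written in plain Python for illustration only and is not the analysis code used for the validation; the function names are hypothetical, and tied values receive their average rank.

```python
from math import sqrt
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """Return 1-based ranks, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman correlation: the Pearson correlation of the two rank vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    rx, ry = average_ranks(x), average_ranks(y)
    mean_x = sum(rx) / len(rx)
    mean_y = sum(ry) / len(ry)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("rho is undefined when one input is constant")
    return cov / sqrt(var_x * var_y)
```

Here `x` would hold the manually assessed LI per slide and `y` the automated LI for the same slides, in the same order.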
There are many commercial image analysis products, such as AQUA, Genie (Aperio), TissueStudio (Definiens) and InForm (PerkinElmer), which are capable of high quality image processing and Ki67 quantitation and are cited in other studies, but they are not freely available for comparative testing. There are also several open source image analysis tools (e.g. ImageJ, ImmunoRatio) and multiple custom built in-house applications (e.g. Seedlink). Our requirements were that the application be open source and that it provide hotspot detection and quantitative Ki67 scoring in a desktop application. Thus, we developed ASH, an open source, open access application using Galaxy-VM technology, to support histopathologists in determining the most active areas in proliferative rate within a slide based on Ki67 LI staining. Additionally, since ASH was developed in a Galaxy environment, the current segmentation and quantitation methods can be easily supplemented or replaced, in the central ASH application (by the authors) or by a user (in their local ASH instance), with improved methods developed by other research teams.
We implemented an overlapping block creation method, Step Shifting, because NDPI splitter is only capable of splitting an image and cannot generate overlapping blocks. Step Shifting also supports our Adaptive Step Finding method, an approach which has been utilised in multiple engineering projects over many years.
When we shift the image block by different steps, the averaged ImmunoRatio increases as the step decreases. Whilst the accuracy of the ImmunoRatio % per image block improves, the calculation time also increases. Therefore, we developed an adaptive step finding technique to balance hotspot detection resolution against processing time. The optimal ratio of calculation time to accuracy occurs at a 1/16 step size with ~1000 blocks, given that one block takes 1.0069 s to calculate on a single core of an Intel Xeon X5650 processor.
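The trade-off between step size and processing time can be made concrete with a small cost estimate. The sketch below is an illustration only and not the adaptive step finding algorithm used in ASH: it assumes that the run time scales linearly with the number of blocks at the 1.0069 s per block quoted above, and the time-budget rule in the hypothetical `choose_step` function is our own simplification.

```python
from typing import Optional, Sequence, Tuple

SECONDS_PER_BLOCK = 1.0069  # single-core timing per block quoted in the text


def blocks_per_axis(length: int, side: int, step: int) -> int:
    """Number of block origins that fit along one image axis."""
    if length < side:
        return 0
    return (length - side) // step + 1


def estimate_cost(img_w: int, img_h: int, side: int, step_fraction: float) -> Tuple[int, float]:
    """Return (number of blocks, estimated seconds) for a given step fraction."""
    step = max(1, int(side * step_fraction))
    n_blocks = blocks_per_axis(img_w, side, step) * blocks_per_axis(img_h, side, step)
    return n_blocks, n_blocks * SECONDS_PER_BLOCK


def choose_step(
    img_w: int,
    img_h: int,
    side: int,
    time_budget_s: float,
    fractions: Sequence[float] = (1 / 2, 1 / 4, 1 / 8, 1 / 16),
) -> Optional[float]:
    """Pick the smallest step fraction whose estimated time fits the budget.

    Fractions are tried from coarse to fine; the search stops at the first
    fraction that exceeds the budget. Returns None if even the coarsest
    fraction is too slow.
    """
    chosen = None
    for fraction in sorted(fractions, reverse=True):
        _, seconds = estimate_cost(img_w, img_h, side, fraction)
        if seconds > time_budget_s:
            break
        chosen = fraction
    return chosen
```

Because halving the step roughly quadruples the number of blocks in a two-dimensional image, the estimated time grows quickly as the step shrinks, which is why a stopping rule of some kind is needed.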
Seedlink, a hybrid clustering method that provides users with automatic identification of hotspots, is comparable to ASH with respect to usability and output. Seedlink requires a post-processing step to separate true hotspots from false positive hotspots to ensure accurate determination of Ki67, whilst ASH provides a ranked set of regions from which the user can include or reject regions as part of the quantitation of Ki67. Thus ASH simplifies the decision making process by integrating the visualization of the detected hotspots with their quantitation as a single output in the Galaxy-VM GUI.
Since various coloured contaminants and colour interferences sometimes hamper hotspot detection, the use of Adobe Photoshop or an alternative program that enables pathologists to delete parts of the scanned image, i.e. artifacts created during slide production, will improve the accuracy of the hotspot detection. Whilst we have tested ASH in a training set, it is clear that there are 'inactive' areas with an apparently 'low' Ki67 labelling index. Hence it is more prudent to compare automatically selected hotspot areas with hotspot areas selected by pathologists, and further studies are warranted to confirm our findings in a larger cohort.
Galaxy provides the user with a simple GUI to apply ASH using only a standard web browser (see Background and the Galaxy reference). Galaxy also provides remote access to ASH, so users can benefit from higher processing speed and larger storage space than a local computer offers. To ensure that ASH is available to individual researchers and/or pathologists as well as to those who are supported by a bioinformatics team, we have implemented this Galaxy as a VMware VM. The combination of Galaxy and a VM provides a multi-user environment in which users can analyse their images in a password protected, user specific space, with the additional functionality of Galaxy and the capability to share any of the data, analyses and results. The current Galaxy-VM has been implemented to run using 1 CPU, but it can be scaled up by reconfiguring the VM once installed to use more CPUs (see the project website for help documentation).
We have developed ASH, an open source Galaxy virtual machine application designed to support Ki67 LI hotspot detection, aimed at both individual and large diagnostic laboratories that have little bioinformatics experience or support. ASH is designed to assist pathologists, accelerate the time-consuming Ki67 hotspot selection procedure, enhance the detection resolution and eventually lead to more reproducible Ki67 LI reporting. Prior to image processing, pathologists should use an interface tool to exclude various artifacts, such as tissue folds, intrinsic/extrinsic pigmentation (deposit artifacts), necrotic areas, etc. ASH delivers a ranked list of hotspots as a combination of images and quantitative values for each hotspot detected, based on the adaptive step finding algorithm developed as part of ASH. The selection of the step size is critical from the point of view of both computational efficiency and detection performance. Although we have successfully tested ASH in a training set of whole-slide digitally scanned ACC images, further studies are warranted to confirm its efficiency with a larger ACC set.
Availability and requirements
Project name: Automated Selection of Hotspots (ASH)
Project home page: http://bioinformatics.erasmusmc.nl/wiki/index.php/Automated_Selection_of_Hotspots which has a Galaxy VM instance of ASH.
Operating system(s): Windows, Linux (Ubuntu 12.04).
Programming language: C++, Bash, Java.
Requirements: VMware Player, Hamamatsu SDK, JAI 1.1.3, JAI Image IO 1.1, Ant, Deep Zoom.
License: GNU GPL version 3.
These ACCs were assessed anonymously according to the Proper Secondary Use of Human Tissue code established by the Dutch Federation of Medical Scientific Societies (http://www.federa.org) and the Medical Ethical Committee of the Erasmus MC, Rotterdam, The Netherlands, approved the study.
TP, HL and AS conceived the project. HL developed the adaptive step finding and reporting protocol. HL, DvZ, IP and AS developed ASH and the Galaxy virtual machine. TP, WD and RdK are responsible for the test case samples used in ASH. HL, TP, PvdS and AS wrote the paper and all authors read and approved the final manuscript.
This work was supported by the Seventh Framework Programme (FP7/2007-2013) under grant agreement number 259735 (ENS@T-Cancer).
- Fassnacht M, Libé R, Kroiss M, Allolio B: Adrenocortical carcinoma: a clinician’s update. Nat Rev Endocrinol. 2011, 7 (6): 323-335. 10.1038/nrendo.2010.235.
- Beuschlein F, Obracay J, Saeger W, Kroiss M, Quinkler M, Lichtenauer UD, Deutschbein T, Ronchi CL, Willenberg H, Reisch N, Reincke M, Libe R, Baudin E, Bertherat JY, Haak H, Feelders RA, de Krijger R, Loli P, Terzolo M, Allolio B, Mueller H-H, Fassnacht M: Prognostic value of histological markers in localized adrenocortical carcinoma after complete resection. Endocr Rev. 2013, 34: OR29-23.
- Libé R, Borget I, Ronchi CL, Terzolo M, Haaf M, Laino F, Kerkhofs T, Corsini E, Tabarin A, Chabre O, de la Fouchardière C, Niccoli P, Caron P, Mannelli M, Haak H, Beuschlein F, Bertherat J, Berruti A, Fassnacht M, Baudin E: Prognostic factors of advanced unresectable by stage III and IV ENS@T adrenocortical carcinomas (ACC). Endocr Abstr. 2013, 32: OC4.3.
- Fassnacht M, Kroiss M, Allolio B: Update in adrenocortical carcinoma. J Clin Endocrinol Metab. 2013, 98 (12): 4551-4564. 10.1210/jc.2013-3020.
- Dowsett M, Nielsen TO, A’Hern R, Bartlett J, Coombes RC, Cuzick J, Ellis M, Henry NL, Hugh JC, Lively T, McShane L, Paik S, Penault-Llorca F, Prudkin L, Regan M, Salter J, Sotiriou C, Smith IE, Viale G, Zujewski JA, Hayes DF: Assessment of Ki67 in breast cancer: recommendations from the International Ki67 in breast cancer working group. J Natl Cancer Inst. 2011, 103 (22): 1656-1664. 10.1093/jnci/djr393.
- Adsay V: Ki67 labeling index in neuroendocrine tumors of the gastrointestinal and pancreatobiliary tract: to count or not to count is not the question, but rather how to count. Am J Surg Pathol. 2012, 36 (12): 1743-1746. 10.1097/PAS.0b013e318272ff77.
- Mikami Y, Ueno T, Yoshimura K, Tsuda H, Kurosumi M, Masuda S, Horii R, Toi M, Sasano H: Interobserver concordance of Ki67 labeling index in breast cancer. Japan breast cancer research group Ki67 ring study. Cancer Sci. 2013, doi: 10.1111/cas.12245.
- Rindi G, Bordi C, La Rosa S, Solcia E, Delle Fave G: Gastroenteropancreatic (neuro)endocrine neoplasms: the histology report. Gruppo Italiano Patologi Apparato Digerente (GIPAD); Società Italiana di Anatomia Patologica e Citopatologia Diagnostica/International Academy of Pathology, Italian division (SIAPEC/IAP). Dig Liver Dis. 2011, 43 (Suppl 4): 356-360. 10.1016/S1590-8658(11)60591-4.
- Lopez XM, Debeir O, Maris C, Rorive S, Roland I, Saerens M, Salmon I, Decaestecker C: Clustering methods applied in the detection of Ki67 hot-spots in whole tumor slide images: an efficient way to characterize heterogeneous tissue-based biomarkers. Cytometry. 2012, 81 (9): 765-775. 10.1002/cyto.a.22085.
- Elie N, Plancoulaine B, Signolle JP, Herlin P: A simple way of quantifying immunostained cell nuclei on the whole histologic section. Cytometry A. 2003, 56 (1): 37-45. 10.1002/cyto.a.10075.
- The Galaxy Project: Online bioinformatics analysis for everyone , [http://galaxyproject.org]
- VMware virtual machine , [http://www.vmware.com]
- Nocq J, Celton M, Gendron P, Lemieux S, Wilhelm BT: Harnessing virtual machines to simplify next-generation DNA sequencing analysis. Bioinformatics. 2013, 29 (17): 2075-2083. 10.1093/bioinformatics/btt352.
- Deroulers C, Ameisen D, Badoual M, Gerin C, Granier A, Lartaud M: Analyzing huge pathology images with open source software. Diagn Pathol. 2013, 8: 92. 10.1186/1746-1596-8-92.
- Tuominen VJ, Ruotoistenmaki S, Viitanen A, Jumppanen M, Isola J: ImmunoRatio: a publicly available web application for quantitative image analysis of estrogen receptor (ER), progesterone receptor (PR), and Ki-67. Breast Cancer Res. 2010, 12: R56. 10.1186/bcr2615.
- NDPITools: NDPI split download for Linux x86_64 , [http://www.imnc.in2p3.fr/pagesperso/deroulers/software/ndpitools/download/ndpitools-1.6.5/lin64/ndpisplit]
- ImmunoRatio home page , [jvsmicroscope.uta.fi/immunoratio]
- OpenCV: OpenCV for Linux/Mac , [http://opencv.org]
- Download Ubuntu Desktop , [http://www.ubuntu.com/download/desktop]
- Schumer MA, Steiglitz K: Adaptive step size random search. IEEE Trans Automat Contr. 1968, 13: 270-276. 10.1109/TAC.1968.1098903.
- White L, Day R: An evaluation of adaptive step-size random search. Automat Control, IEEE Trans on. 1971, 16 (5): 475-478. 10.1109/TAC.1971.1099768.
- Mekuz N, Derpanis KG, Tsotsos JK: Adaptive step size window matching for detection. IEEE ICIP. 2006, 2: 259-262.
- Goecks J, Nekrutenko A, Taylor J: Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life science. Genome Biol. 2010, 11 (8): R86. 10.1186/gb-2010-11-8-r86.
- Smith JE, Nair R: The architecture of virtual machines. Comput (IEEE Comput Soc). 2005, 38 (5): 32-38. 10.1109/MC.2005.173.
- Camp RL, Chung GG, Rimm DL: Automated subcellular localization and quantification of protein expression in tissue microarrays. Nat Med. 2002, 8: 1323-1327. 10.1038/nm791.
- Aperio ePathology: Leica Biosystems, Nussloch, GmbH , [http://www.leicabiosystems.com/pathology-imaging/aperio-epathology/]
- Tissue Studio 3.5: Definiens AG, Bernhard-Wicki-Straße 5, 80636 München, Germany , [http://tissuestudio.definiens.com/]
- InForm: PerkinElmer, 940 Winter Street, Waltham, Massachusetts 02451, USA , [http://www.perkinelmer.com/CMSResources/Images/44-144380PRD_inForm.pdf]
- ImageJ: Image Processing and Analysis in Java , [http://imagej.nih.gov/ij/docs/index.html]
- Free Software Foundation: A Quick Guide to GPLv3 - GNU Project - Free Software Foundation (FSF) , [http://www.gnu.org/licenses/quick-guide-gplv3.html]
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.