Automatic measurement of epithelium differentiation and classification of cervical intraneoplasia by computerized image analysis
© Jondet et al; licensee BioMed Central Ltd. 2010
Received: 20 December 2009
Accepted: 22 January 2010
Published: 22 January 2010
The feasibility of evaluating an objective grading of cervical intraneoplasia lesions (CIN) is attempted using an automatic computerized system able to measure several valuable parameters with special reference to epithelium differentiation.
4 groups of 10 images each were selected at random from 68 consensus images coming from 80 archival cervical biopsies, normal (n = 10), CIN 1 (n = 10), CIN 2 (n = 10), CIN 3 (n = 10). Representative images of lesions were captured from the microscopic slides and were analyzed using mathematical morphology, with special reference toVoronoï tessellation and Delaunay triangulation. Epithelium surface, nuclear and cytoplasm area, triangle edge and area, total and upper nuclear index were precisely measured in each lesion, and discriminant coefficients were calculated therewith. A dilation/erosion coefficient was automatically defined using triangle edge length and nuclear radius in order to measure the epithelium ratio of differentiation. A histogram ratio was also automatically established between total nuclei and upper nuclei on top of differentiated epithelium. With the latter two ratios added to the nucleo-cytoplasmic ratio, a cervical score able to classify CIN is proposed.
There is a quasi-linear increase of mean cervical score values between normal epithelium and CIN 3: (27) for normal epithelium, (51) for CIN 1, (78) for CIN 2 and (100) for CIN 3, with significant differences (P < 0.05).
Our results highlight the possibility of applying a cervical score for the automatic grading of CIN lesions and thereby assisting the pathologist for improvement of grading. The automatic measure of epithelium differentiation ratio appears to be a new interesting parameter in computerized image analysis of cervical lesions.
Cervical intraepithelial neoplasia (CIN) has aroused a lot of interest with regard to classification. The world-wide classification, proposed by Richart , is based on several morphologic criteria in order to predict the biological behavior of abnormal epithelium. Among them, the most reliable features are: 1 - degree of epithelium differentiation, 2 - nucleus to cytoplasm ratio, 3 - cellular maturation, 4 - nuclear analysis (maturation and mitotic activity). When scrutinizing the literature, it appears that a strong consensus exists on the diagnosis of normal epithelium and high grade lesions, but problems appears mainly with the classification of CIN 1 and CIN 2 due to poor inter- and intra-observer correlations [2–8]. A new classification introducing low grade (HPV infections + CIN 1) and high grade (CIN 2 to carcinoma in situ) represents a cut-off, pointing out patient treatment in the high grade group only.
Several attempts have already been made to automatically and objectively grade CIN using image analysis. Based on mathematical morphology Keenan et al.  focused on morphology and Guillaud et al.  paid more attention on nuclear features. An automatic stereologic measurement using Ki67 marker has been proposed by Kruse et al . Nevertheless, these techniques still remain of poor applicability due to difficulties with their implementation. Following up these original reports, we have developed a similar procedure taking into account a limited number of parameters and we propose a new concept able to automatically measure the degree of epithelium differentiation.
Weigert-Lead stain (M. Jondet)
1 - Weigert's iron haematoxylin
a - Haematoxylin solution
b - Iron solution
30% Ferric chloride solution in water
Hydrochloric acid 1 M
Mix equal volumes of solution a and b, use within 60 min
2 - Lead citrate
Add sodium hydroxyde 0,4 M untill precipitate dissolves
a - dewax and rehydrate sections
b - Weighert's haematoxylin
c - wash with distilled water
d - saturated picric acid in water
e - wash with distilled water
f - lead citrate
g - wash with water
h - mount in Eukitt after alcohol and xylene clarification
Microscopic slides were observed with a Zeiss Axioskop microscope (Zeiss, Göttingen, Germany) with a ×25 magnification (Zeiss Plan-Neofluar 25/0.8 lens) under standard illumination (same voltage and diaphragm aperture). For each biopsy, at least 1 image was recorded. In some samples, where different grades of lesion were observed on the same biopsy, a microscopic field of each grade was recorded. The images were recorded with a black and white CCD camera (model PRO-110SP, Apro Media, South Korea), they were digitized (472 × 608 pixels, 256 grey level) by an electronic video card (ATI-All in Wonder 128, ATI Technologies Inc., Ontario, Canada) and recorded in bitmap format. Each image had a 286,976 pixel area, corresponding to a 0.066 mm2 real surface on the microscopic slide, so that 1 pixel corresponded to 0.22 × 10-6 μm2.
A total of 86 microscopic fields, corresponding to normal and cervical lesions of different grades, were submitted to 4 gynaeco-pathologists for classification, one of them being the first author [MJ]. Images were submitted to the 4 pathologists independently, and the same images were viewed again by each one 3 months later, independently of the first vision. The pathologists were asked to grade the images in 4 groups (normal, CIN 1, CIN 2 and CIN 3). An intra- and inter-observer consensus was established with 68 images, among which 10 images were selected at random in each group.
a - software
Micromorph 1.4 version image analysis system (Mathematical Morphology Software CMM-ENSMP, Paris, France) was used for image processing. This software was developed for the use of mathematical morphology and contains nearly all the tools of this image analysis procedure . Some features which are lacking in the original software, such as Delaunay triangulation, have been developed by RA . Automatic algorithms for the measurement of epithelium differentiation area and nuclei histogram have been developed by MJ and RA, with specific automation in order to perform calculations on large series of images.
b - Manual and semi-automatic procedures
Each gray image was manually processed with Photoshop 5.5 in order to select the original epithelial tissue, thereby eliminating artifacts and stroma. The basal lamina and the surface of the epithelium were delineated, before running image analysis in the region of interest (ROI), delimited in that way. The cleared image was then treated by the image analysis system for segmentation of nuclei. After increasing light and contrast, the image was submitted to thresholding, followed by proper segmentation. A 1 pixel border was added to the final image and the non-tissue areas were filled by superposition of the original image with the segmented one. With such a procedure, the nuclei cutting the image edge were suppressed, allowing a precise measurement of ROI and average nuclear areas.
c - Automatic procedure
- area of the selected epithelium (A)
- mean area of the nuclei (N)
- mean area of the zone of influence (V)
- epithelial differentiation (D)
- total nuclei histogram (Hst)
- upper nuclei histogram (HstU).
From these 6 parameters, 3 ratios were inferred:
nucleo-cytoplasmic ratio (N/V) = ratio of nucleus area to cytoplasmic area;
epithelial differentiation ratio (DR) = ratio of differentiated epithelium area (D) to total epithelium area (A);
upper nuclei histogram ratio (HstR) = ratio of upper nuclei histogram (HstU) to total nuclei histogram (Hst).
The same calculation was applied for all groups of images.
Distributions have been found normal, except for 2 out of 40 data (10 values in each of the 4 groups under study). Therefore comparisons between 2 successive groups (normal vs CIN1, CIN 1 vs CIN 2, CIN 2 vs CIN 3) have been made by t-test, with Welch correction when unequal variance was assumed. The levels of significance are P < 0.05. GraphPad Prism 5.02 (GraphPad Software Inc., La Jolla, California, USA) was the statistical software used.
Means (± SD) of significant features measured in the 4 diagnostic groups (values are expressed in pixel)
Epithelium area (A)
Nucleus area (N)
Cytoplasm area (V)
Epithelium differentiation (D)
Total nucleus histogram (Hst)
Upper nucleus histogram (HstU)
mean epithelium area (A) decreases without statistical significance;
mean nucleus area (N) increases significantly from normal to CIN 1 and remains further at the same level for high grade lesions;
mean cytoplasm area (V) decreases from normal up to CIN 3;
mean epithelium differentiation (D) increases significantly from normal up to CIN 3;
total nucleus histogram (Hst) increases significantly from normal up to CIN 3;
upper nucleus histogram (Hst U) increases significantly from normal to CIN 1, then without significant difference between CIN1 and CIN 2, and with a significant decrease between CIN 2 and CIN 3 due to a drastic diminution of non differentiated epithelium in CIN 3 (by definition, nuclei maturation should be 100% within CIN 3).
nucleo-cytoplasmic ratio (N/V) increases significantly from normal up to CIN 3;
epithelium differentiation ratio (DR) increases significantly from normal up to CIN 3, and this means that the mathematical procedure is in accordance with pathology. Measurements have been made on the nuclei and did not take into account the cytoplasm, thereby resulting into ratios which do not reach 100%, as expected. This is due to the fact that epithelium differentiation was measured with nuclei area and did not take into account cytoplasm area, although present, but in a small amount on the upper side of the basal part of epithelium;
upper nuclei histogram ratio (Hst R) decreases with increasing epithelium differentiation from normal up to CIN 1, with a drastic fall for CIN 3 due to the low number of nuclei in the upper part.
Finally, the cervical score, which represents a calculation from the preceding ratios, displays a quasi-linear and significant progression from normal up to CIN 3 lesions with no overlapping between the 4 groups.
A vision machine is able to measure a nearly unlimited number of parameters when analyzing an image, but we completely agree with Keenan  who pointed out that the development of a mechanical vision system is complicated.
The purpose of this work was to limit the number of parameters by retaining only those values which are as close as possible to the parameters taken into account when a pathologist analyses and grades a cervical biopsy. With a specific algorithm we were able to record automatically 6 reliable parameters and to establish a cervical score for grading objectively a cervical biopsy. We took into account the original classification proposed by Richart  which is still the gold standard in the field and pointed out the epithelial differentiation parameter.
Image captures were done with a ×25 objective which suits perfectly to our optical system, allowing maximum vision of total epithelium height, especially in the normal group. The application of contrast staining appeared to be a valuable procedure in the segmentation process. Segmentation is a major problem in image analysis, as the nuclei are often superimposed, particularly in high grade lesions and cancer. Up to now there is no valid solution, because the manually segmenting of clusters is time consuming and subject to errors, particularly in high grade lesions and cancer. After several attempts to achieve a fully automatic process, it appeared that a manual or semi-automatic procedure was necessary to reach better segmentation before the automatic measurement of the pertinent parameters. This is a general problem in image analysis and represents an important issue for the standardization of image quality, as pointed out by Kayser [14, 15].
Voronoï tessellation is based on nuclei segmentation and determines on each nucleus the zone of influence, which is a mathematical morphology item that suits well with biological observation, because the nucleus exerts an "influence" on the surrounding cytoplasm. Therefore, we can relate the zone of influence area to cytoplasm area. As proposed by Serra (Personal Communication), the Delaunay triangulation was then drawn from the geodesic center of each zone of influence, this procedure allowing a more accurate triangulation because it takes into account the nearest neighbors and eliminates false relation of 2 cells being far away from each other, when there is convexity in the epithelium for example.
The Delaunay triangulation measures the mean triangle edge length which was used to define the dilation/erosion coefficient. By combining this value with the mean value of nuclei area it became possible to automatically measure differentiated epithelium. An alternative for the measurement the dilation/erosion coefficient consist in the application of the minimum spanning tree technique giving rise to the minimum distance between the nuclei's nearest neighbors . A specific algorithm has to be developed in our vision machine in order to check which is the adequate technique. Our procedure, using automatic dilatation/erosion makes it easier in cases of convexity or irregularities of the basal lamina or papillomatosis, which represents most of the cases encountered in routine pathology. Considering percentage differentiation, correlation with Richart's classification was found, though normal epithelium is made up of limited number of basal cells, not considered as differentiated upon microscopic observation, but displaying a small degree of differentiation with the vision machine.
Cell maturation in the upper part of the differentiated epithelium represents an important factor in the grading, especially in cases of koïlocytosis. Our staining procedure is based on iron-haematoxylin treatment. By increasing the nucleus contrast with our modified technique, although haematoxylin is not stochiometric per se, the intensity of the nucleic staining, which is taken into account in routine microscopy, can bring up an interesting data. When establishing the histogram of all nuclei on one end and the upper nuclei on the other end, the histogram ratio gives an adequate appreciation of nucleic chromaticism, independent of specimen thickness and staining intensity.
Nucleo-cytoplasmic ratio is an important data from the spatial point of view. In normal epithelium near the surface, the cytoplasm is very large and the nucleus becomes smaller during normal maturation. As the histological section is 2.5 μm thick, only a few nuclei are present with their surrounding cytoplasm, leading to a false estimation of nucleo-cytoplasmic ratios. Data are nevertheless comparable because they are objectively done by the machine and are reproducible.
We did not consider mitoses which is an important parameter in the classification. This can be achieved when images are captured with a ×1000 magnification . Other parameters such as DNA ploïdy can be measured by image analysis [17, 18], but they represent a more complicated approach and are time consuming. We also did not take into account the immuno-histochemistry with Ki65 and P16, which can represent an additional parameter . Comparing data from the 4 pathologists, a good correlation was observed with the machine in 79% of the cases, with consensus cases taken into consideration for the selection of the 4 groups submitted to the vision machine. The data produced by automatic calculation display some statistical significance among the 4 groups, but the low number of cases in each group may be a limitation. Twenty six more images were added to the 86 previously analyzed, and were submitted to the 4 pathologists in a confrontation meeting without regard to the previous results. There was 77% correlation between the consensus diagnosis and the vision machine results which confirms the results of Keenan .
The use of mathematical morphology makes our algorithm transposable to any system based on that image analysis system, after proper calibration of the optical system specific to each user. The staining procedure, the microscopic field illumination and the automatic measures with our system were standard and gave reproducible results, these parameters being considered as very important .
The cervical score we proposed presents a quasi-linear progression from normal to high grade lesions and fits closely to Richart's classification. The problem however is to determine an adequate cut-off value for the decision to treat or not to treat a patient.
Though it may not always be comfortable to accept that a machine can be able to grade a CIN, we hope that our contribution to the proposal for a reliable scoring will help pathologists to deliver objective diagnoses, further more in experimental studies where objectivity and quantification are requested. So far, we consider that the vision machine is not able to produce a diagnosis per se, the selection of the optical field to be analyzed remaining operator dependent. Nevertheless it can help the pathologist to accurately establish the most valuable diagnosis. Although some artefacts may occur, precision seems fairly good, which is hardly the case when grading is done by routine observation on microscopic slides, as inter and intra variability is a limiting factor. The present investigation should be considered as a contribution to total morphologic analysis of any cervical lesion by measuring objective parameters, with special reference to epithelial differentiation.
Using Voronoï tessellation and Delaunay triangulation on an image analysis system we were able to demonstrate that it is possible to grade automatically and precisely a cervical intraneoplasia lesion. This was achieved using a modified Weigert dye enhancing nuclear contrast and we were able to propose a cervical scoring. Our contribution to this difficult approach, compared to previous published results, was to add the automatic measurement of the pathologic epithelium differentiation. Our algorithms developed for this specific application can be adapted to any vision machine using mathematical morphology. More information can be made available by immuno-histochemistry combined with image analysis. We are looking forward pursuing further developments to this preliminary work in order to help pathologists to become more and more accurate and objective in their grading of cervical intraneoplasia.
The authors thank Drs S. Akwright, C. Depinay and G. Terrasse, for their help with the interpretation of histopathological images and classification, and Gabrielle Jondet for reviewing the english manuscript.
- Richart RM: Natural history of cervical intraepithelial neoplasia. Clin Obst Gynec. 1967, 748-784. 10.1097/00003081-196712000-00002. 10
- Ismail SM, Colclough AB, Dinnen JS, Eakins D, Evans DM, Gradwell E, O'Sullivan JP, Summerell JM, Newcombe RG: Observer variation in histopathological diagnosis and grading of cervical intraepithelial neoplasia. BMJ. 1989, 707-10. 10.1136/bmj.298.6675.707. 298
- Creagh T, Bridger JE, Kupek E, Fish DE, Martin-Bates E, Wilkins MJ: Pathologist variation in reporting cervical borderline epithelial abnormalities and cervical intraepithelial neoplasia. J Clin Pathol. 1995, 59-60. 10.1136/jcp.48.1.59. 48
- Mccluggage WG, Walsh MY, Thornton CM, Hamilton PW, Date A, Caughley LM, Bharucha H: Inter- and intra-observer variation in the histopathological reporting of cervical squamous intraepithelial lesions using a modified Bethesda grading system. Br J Obstet Gynaecol. 1998, 206-210. 105
- Malpica A, Matisic JP, Niekirk DV, Crum CP, Staerkel GA, Yamal JM, Guillaud MH, Cox DD, Atkinson EN, Adler-Storthz K, Poulin NM, Macaulay CA, Follen M: Kappa statistics to measure interrater and intraraterment for 1790 cervical biopsy specimens among twelve pathologists: qualitative histopathologic analysis and methodologic issues. Gynecol Oncol. 2005, 3 (Suppl 1): S38-52. 10.1016/j.ygyno.2005.07.040.View ArticleGoogle Scholar
- Cai B, Ronnett BM, Stoler M, Ferenczy A, Kurman RJ, Sadow D, Alvarez F, Pearson J, Sings HL, Barr E, Liaw KL: Longitudinal evaluation of interobserver and intraobserver agreement of cervical intraepithelial neoplasia diagnosis among an experienced panel of gynecologic pathologists. Am J Surg Pathol. 2007, 1854-60. 12
- Ceballos KM, Chapman W, Daya D, Julian JA, Lytwyn A, McLachlin CM, Elit L: Reproducibility of the histological diagnosis of cervical dysplasia among pathologists from 4 continents. Int J Gynecol Pathol. 2008, 101-7. 10.1097/pgp.0b013e31814fb1da. 1
- Dalla Palma P, Giorgi Rossi P, Collina G, Buccoliero AM, Ghiringhello B, Gilioli E, Onnis GL, Aldovini D, Galanti G, Casadei G, Aldi M, Gomes VV, Giubilato P, Ronco G, NTCC Pathology Group: The reproducibility of CIN diagnoses among different pathologists: data from histology reviews from a multicenter randomized study. Am J Clin Pathol. 2009, 125-32. 10.1309/AJCPBRK7D1YIUWFP. 132
- Keenan SJ, Diamond J, McCluggage WG, Bharucha H, Thompson D, Bartels PH, Hamilton PW: An automated machine vision system for the histological grading of cervical intraepithelial neoplasia. J Pathol. 2000, 351-362. 10.1002/1096-9896(2000)9999:9999<::AID-PATH708>3.0.CO;2-I. 192
- Guillaud M, Cox D, Adler-Storthz K, Malpica A, Staerkel G, Matisic J, Van Niekerk D, Poulin N, Follen M, Macaulay C: Exploratory analysis of quantitative histopathology of cervical intraepithelial neoplasia: objectivity, reproducibility, malignancy associated changes, and human papillomavirus. Cytometry A. 2004, 81-9. 10.1002/cyto.a.20034. 60
- Kruse AJ, Baak JP, Janssen EA, Kjellevold KH, Fiane B, Lovslett K, Bergh J, Robboy S: Ki67 predicts progression in early CIN: validation of a multivariate progression- risk model. Cell Oncol. 2004, 13-20. 26
- Serra J: Image Analysis and Mathematical Morphology. 1982, Academic Press, LondonGoogle Scholar
- Agoli-Abgo Regis: Adaptation d'un logiciel de morphologie mathématique pour l'étude de la quantification des lésions du col utérin. Rapport de stage de fin d'étude Institut Superieur des Biosciences. Paris. 2006, 1-49.Google Scholar
- Kayser K, Hoshang SA, Metze K, Goldmann T, Vollmer E, Radziszowski D, Kosjerina Z, Mireskandari M, Kayser G: Texture- and object-related automated information analysis in histological still images of various organs. Anal Quant Cytol Histol. 2008, 323-35. 6
- Kayser K, Görtler J, Metze K, Goldmann T, Vollmer E, Mireskandari M, Kosjerina Z, Kayser G: How to measure image quality in tissue-based diagnosis (diagnostic surgical pathology). Diagn Pathol. 2008, 3 (Suppl 1): S11-10.1186/1746-1596-3-S1-S11.PubMed CentralView ArticlePubMedGoogle Scholar
- Kayser K, Stute H: Minimum spanning tree, Voronoi's tesselation and Johnson Mehl diagrams in human lung carcinoma. Pathol Res Pract. 1989, 729-34. 185
- Shirata NK, Sredni ST, Castelo A, Santinelli A, Mendonça B, Montironi R, Filho AL, Zerbini MC: Texture image analysis in differentiating malignant from benign adrenal cortical tumors in children and adults. Anticancer Res. 2009, 3365-8. 8
- Baak JP, Janssen E: DNA ploidy analysis in histopathology. Morphometry and DNA cytometry reproducibility conditions and clinical applications. Histopathology. 2004, 603-14. 10.1111/j.1365-2559.2004.01897.x. 44
- Regauer S, Reich O: CK17 and p16 expression patterns distinguish (atypical) immature squamous metaplasia from high-grade cervical intraepithelial neoplasia (CIN III). Histopathology. 2007, 629-35. 10.1111/j.1365-2559.2007.02652.x. 5
- Kayser K, Schultz H, Goldmann T, Görtler J, Kayser G, Vollmer E: Theory of sampling and its application in tissue based diagnosis. Diagn Pathol. 2009, 4-6. 16
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.