Automated segmentation of blood cells in Giemsa stained digitized thin blood films

Automated separation of blood cells from the background in thin blood film samples is of importance in the process of cell counting and as a first step in image analysis of red and white blood cells morphology.

From 11th European Congress on Telepathology and 5th International Congress on Virtual Microscopy Venice, Italy. 6-9 June 2012 Background Assessment of erythrocytes and leucocytes in thin blood films can be used as an inexpensive diagnostic aid in a series of disease states, e.g. infections, anemia and hematological malignancies. Manual counting of cells is still considered the gold standard for example to establish the level of parasitemia in malaria. However, manual cell counting is time consuming and subject to variability [1]. We here propose an image analysis method that is a combination of adaptive histogram thresholds and morphologic characteristics for the segmentation of red blood cells (RBCs) and white blood cells (WBCs) in digitized thin blood films. The method is implemented on a virtual microscopy platform, the Webmicroscope [2].

Methods
Ten Giemsa stained thin blood films were digitized with a microscopy slide scanner (Axio Imager Z2, Carl Zeiss MicroImaging, Jena controlled by Metafer software, MetaSystems, Altlussheim) using a 63x objective with a numerical aperture (NA) of 1.4 (Plan-Apochmat, Carl Zeiss, Jena) and oil immersion. Image acquisition was performed with a monochrome CCD camera with a 1360×1024 pixel sensor and a pixel size of 6.45 µm (CoolCube 1, MetaSystems, Altlussheim), a 1.0 camera adapter and illumination with an RGB illuminator (MetaLED Z, red 619 nanometer, green 515 nanometer, blue 465 nanometer, MetaSystems, Altlussheim). The pixel size in the digital images was approximately 0.10µm and the original TIFF images were converted into a wavelet format (Enhanced Compression Wavelet, ERDAS/ Intergraph, Norcross, GA) and transferred to a virtual microscopy image server (http://fimm.webmicroscope. net/Research/Momic/tp2012) [3]. Approximately fivehundred (473 -505) fields of view from each blood film sample were captured and stored in the database. Five of the samples were infected with Plasmodium falciparum and five were non-infected control samples.
The described method (Fig. 1.) involves 1) separation of background and foreground, 2) recognition of objects that compose the foreground and 3) cell counting (i.e. RBCs and WBCs).

Image preprocessing
As a preprocessing step for each thin blood film sample, the green channel was selected from the original RGB image [4] and smoothed by applying a median filter 3X3 to reduce the' salt and pepper' noise [5]. The green channel is extracted using a color deconvolution between the original image and a vector [0,1,0].

Adaptive histogram thresholds
Let each pixel of the preprocessed image have intensity levels in [0, 1, 2, …, L-1] with L= 256. The number of pixels with intensity level i is denoted by n i , ∀ i=0, 1, 2,…, is the total amount of pixels.
We defined the histogram distribution as p(i) = n i /N, For any monolayer stained blood film, the histogram is bimodal. A typical histogram shape for a monolayer thin blood film is shown in Fig. 2.
There are two local maxima located at m 1 and m 2 , where m 2 <m 1 and P = p(m 2 ) <p(m 1 ) = Q.
A threshold to differentiate the background from the foreground B, is defined by finding the maximum distance

P Q m m
In a similar manner, a threshold to find heavily stained objects H, was defined by finding the maximum distance between the histogram distribution and the line l 2 described between (0, R) and (m 1 , P), with R= p(0); The image background is given by Eq. (1), while the heavily stained objects are described by Eq. (2).
Heavily Stained Objects where = gray level Image.

RoundCells separation
Using the image histogram and based on its bimodal shape, two important thresholds were extracted (Fig. 2). The first threshold (B) defines a binary separation between the image background and foreground. From the image foreground, all objects with roundness bigger than 0.6 were selected and the area of each of the objects was measured. The mean diameter to be 7.52µm and standard deviation of 0.06µm for the whole set of objects was calculated. Finally, only the subset of objects with an area equal to m+/-s (3848+/-688 pixels) was chosen and defined as RoundCells. From this set of round objects of similar size, the average diameter was calculated and used to define a representative red blood cell, designated as AvgRBC (diameter~7µm) and to establish limit diameters for WBCs (~7-21 µm) and platelets (~2-3µm). The second threshold (H), defines the heavily stained objects in the foreground (i.e. WBC, platelets, artifacts and debris). The heavily stained objects larger than AvgRBC are the FoundWBCs.

Detection of circular shapes by Hough transform
Hough transform is calculated on gray level images that contain only the regions of interest while the remaining is set to zero. The region of interest is composed by foreground without RoundCells, FoundWBCs and debris. The maximization of Hough transform for a radius  is performed, where r = radius (AvgRBC). The result is a set of accumulations of hits (votes). The accumulations are concentrated around the centers of the circular shapes. Hough transform detected cells are filtered by selecting the pixel with the maximum vote and deleting all the pixels with less than 20% of the votes. Thus the selection of nearly circular shapes is ensured. Finally, a morphological opening is performed to discard accumulations with less than 50 pixels. The remaining objects are centers of FoundCells.

FoundCells detection
After subtracting the RoundCells and the heavily stained objects from the original image, to compensate for the holes left from the subtraction of the platelets, debris and parasites, a morphologic filling was performed. By using Hough transform, circular shapes were detected in the grayscale image and designated as FoundCells, resolving the center positions of the nearly circular objects. A second representative red blood cell AvgRBC2 was defined from the area of FoundCells.

ApproxCells detection
After subtracting the RoundCells, the FoundWBCs and the FoundCells, the remaining image contains fragments of RBCs and deformed RBCs which Hough transform was not able to define as circular shapes. The total area covered by these objects, named ApproxCells was divided by the area of AvgRBC2 which is estimation for the number of cells that still remain without being counted #ApproxCells.
Finally, the total number of RBCs in the image is calculated by summing up the partial results;

Results and discussion
RBCs were manually annotated in 30 fields of views per thin blood film and WBCs were annotated in the entire data set ( Table 1). The results from the manual counting and automated counting are shown (Table 1.) Using the annotated fields of view, automated quantification of RBCs and WBCs was compared against the manual annotations and RBCs showed an overall error rate of 0.06%, WBCs counting showed an overall error rate of 0.21%. A test for the automated counting of RBCs and WBCs was performed on whole slides of thin blood films and approximately half a million red blood cells and 477 white blood cells were counted (Table 2.). Previously published studies have addressed the separation and counting blood cells, but fixed thresholds for colors, sizes and intensity values restrict the use to particular data sets [6][7][8][9][10]. Here, we make use of adaptive thresholds for size and intensity values, which converges to a solution.

Conclusions
The segmentation of RBCs and WBCs is an easy task for a human observer. Humans have the ability of distinguishing large number of colors, shades and hues, also estimating shapes and size similarities while referring to prior knowledge, making global and local comparisons simultaneously. However, performing large Results comparing manual and automated counting for red blood cells and white blood cells (WBCs). Red blood cells were annotated in a region equivalent to 30 fields of view per film, while the annotations for WBCs were performed on 500 fields of view per film. The samples I1-I5 are Plasmodium falciparum infected cases, C1-C5 are non-infected controls.
Submit your next manuscript to BioMed Central and take full advantage of: