Centre for Computational and Numerical Studies, Institute of Advanced Study in Science and Technology 2. Dept. of Statistics, Gauhati University

Similar documents
AUTOMATED MALARIA PARASITE DETECTION BASED ON IMAGE PROCESSING PROJECT REFERENCE NO.: 38S1511

Image segmentation applied to cytology

Image Database and Preprocessing

An Evaluation of Automatic License Plate Recognition Vikas Kotagyale, Prof.S.D.Joshi

Estimating malaria parasitaemia in images of thin smear of human blood

Computational approach for diagnosis of malaria through classification of malaria parasite from microscopic image of blood smear.

Automatic Locating the Centromere on Human Chromosome Pictures

Acute Lymphocytic Leukemia Detection and Classification (ALLDC) System

Enhanced Identification of Malarial Infected Objects using Otsu Algorithm from Thin Smear Digital Images

COMPUTERIZED HEMATOLOGY COUNTER

Project: Sudoku solver

NON UNIFORM BACKGROUND REMOVAL FOR PARTICLE ANALYSIS BASED ON MORPHOLOGICAL STRUCTURING ELEMENT:

An Image Processing Approach for Screening of Malaria

Malignancy Detection of Candidate for Basal Cell Carcinoma Using Image Processing and Artificial Neural Network

White Blood Cells Identification and Counting from Microscopic Blood Image

COMPUTER-AIDED DETECTION OF CLUSTERED CALCIFICATION USING IMAGE MORPHOLOGY

Keyword: Morphological operation, template matching, license plate localization, character recognition.

A Hybrid Colour Image Enhancement Technique Based on Contrast Stretching and Peak Based Histogram Equalization

Automatic Morphological Segmentation and Region Growing Method of Diagnosing Medical Images

Comparison of Segmentation Framework on Digital Microscope Images for Acute Lymphoblastic Leukemia Diagnosis using RGB and HSV Color Spaces

License Plate Localisation based on Morphological Operations

Chapter 6. [6]Preprocessing

Segmentation of Microscopic Bone Images

ImageJ: Introduction to Image Analysis 3 May 2012 Jacqui Ross

Number Plate Recognition Using Segmentation

MAV-ID card processing using camera images

Image Extraction using Image Mining Technique

Scanned Image Segmentation and Detection Using MSER Algorithm

A Fast Algorithm of Extracting Rail Profile Base on the Structured Light

Counting Sugar Crystals using Image Processing Techniques

A Study On Preprocessing A Mammogram Image Using Adaptive Median Filter

Scrabble Board Automatic Detector for Third Party Applications

Detection and Counting of Blood Cells in Blood Smear Image

Introduction Approach Work Performed and Results

THERMAL DETECTION OF WATER SATURATION SPOTS FOR LANDSLIDE PREDICTION

FEATURE EXTRACTION AND CLASSIFICATION OF BONE TUMOR USING IMAGE PROCESSING. Mrs M.Menagadevi-Assistance Professor

Differentiation of Malignant and Benign Masses on Mammograms Using Radial Local Ternary Pattern

User Reference Manual

International Journal of Advanced Research in Computer Science and Software Engineering

Automatic Licenses Plate Recognition System

Technical Aspects in Digital Pathology

8.2 IMAGE PROCESSING VERSUS IMAGE ANALYSIS Image processing: The collection of routines and

CHAPTER 4 LOCATING THE CENTER OF THE OPTIC DISC AND MACULA

Study and Analysis of various preprocessing approaches to enhance Offline Handwritten Gujarati Numerals for feature extraction

A Study of Image Processing on Identifying Cucumber Disease

KEYWORDS Cell Segmentation, Image Segmentation, Axons, Image Processing, Adaptive Thresholding, Watershed, Matlab, Morphological

Research on 3-D measurement system based on handheld microscope

THEORY AND APPROACHES TO AUTOMATED IMAGE ANALYSIS IN DIGITAL PATHOLOGY

Introduction to Image Analysis with

Machine Vision for the Life Sciences

Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition

Segmentation and Analysis of Microscopic Osteosarcoma Bone Images

Multispectral Enhancement towards Digital Staining

Brain Tumor Segmentation of MRI Images Using SVM Classifier Abstract: Keywords: INTRODUCTION RELATED WORK A UGC Recommended Journal

IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 05, 2016 ISSN (online):

][ R G [ Q] Y =[ a b c. d e f. g h I

Finger print Recognization. By M R Rahul Raj K Muralidhar A Papi Reddy

Classification Accuracies of Malaria Infected Cells Using Deep Convolutional Neural Networks Based on Decompressed Images

Keywords: - Gaussian Mixture model, Maximum likelihood estimator, Multiresolution analysis

AGRICULTURE, LIVESTOCK and FISHERIES

Urban Feature Classification Technique from RGB Data using Sequential Methods

Automated Detection of Early Lung Cancer and Tuberculosis Based on X- Ray Image Analysis

CHAPTER-4 FRUIT QUALITY GRADATION USING SHAPE, SIZE AND DEFECT ATTRIBUTES

ROBOT VISION. Dr.M.Madhavi, MED, MVSREC

INDIAN VEHICLE LICENSE PLATE EXTRACTION AND SEGMENTATION

ME 6406 MACHINE VISION. Georgia Institute of Technology

A New Framework for Color Image Segmentation Using Watershed Algorithm

Displacement Measurement of Burr Arch-Truss Under Dynamic Loading Based on Image Processing Technology

Malaysian Car Number Plate Detection System Based on Template Matching and Colour Information

International Journal of Advance Engineering and Research Development

COMPARATIVE PERFORMANCE ANALYSIS OF HAND GESTURE RECOGNITION TECHNIQUES

An Efficient Color Image Segmentation using Edge Detection and Thresholding Methods

An Algorithm for Fingerprint Image Postprocessing

Traffic Sign Recognition Senior Project Final Report

A NOVEL APPROACH FOR CHARACTER RECOGNITION OF VEHICLE NUMBER PLATES USING CLASSIFICATION

Image Processing and Particle Analysis for Road Traffic Detection

Region Based Satellite Image Segmentation Using JSEG Algorithm

IDENTIFICATION OF FISSION GAS VOIDS. Ryan Collette

Color Image Segmentation Using K-Means Clustering and Otsu s Adaptive Thresholding

Detection of Microcalcifications in Mammographies Based on Linear Pixel Prediction and Support-Vector Machines

Implementation of License Plate Recognition System in ARM Cortex A8 Board

COLOR LASER PRINTER IDENTIFICATION USING PHOTOGRAPHED HALFTONE IMAGES. Do-Guk Kim, Heung-Kyu Lee

An Efficient Pre-Processing Method to Extract Blood Vessel, Optic Disc and Exudates from Retinal Images

MATHEMATICAL MORPHOLOGY AN APPROACH TO IMAGE PROCESSING AND ANALYSIS

FPGA based Real-time Automatic Number Plate Recognition System for Modern License Plates in Sri Lanka

ECC419 IMAGE PROCESSING

AN EXPANDED-HAAR WAVELET TRANSFORM AND MORPHOLOGICAL DEAL BASED APPROACH FOR VEHICLE LICENSE PLATE LOCALIZATION IN INDIAN CONDITIONS

Filtering for More Accurate Dense Tissue Segmentation in Digitized Mammograms

Studying of Reflected Light Optical Laser Microscope Images Using Image Processing Algorithm

SECTION I - CHAPTER 2 DIGITAL IMAGING PROCESSING CONCEPTS

Maturity Detection of Fruits and Vegetables using K-Means Clustering Technique

Colored Rubber Stamp Removal from Document Images

Contrast enhancement with the noise removal. by a discriminative filtering process

AUTOMATIC DETECTION OF HEDGES AND ORCHARDS USING VERY HIGH SPATIAL RESOLUTION IMAGERY

Raster Based Region Growing

Multi-resolution Cervical Cell Dataset

Stamp detection in scanned documents

An Improved Binarization Method for Degraded Document Seema Pardhi 1, Dr. G. U. Kharat 2

AUTOMATIC IRAQI CARS NUMBER PLATES EXTRACTION

Research of an Algorithm on Face Detection

Transcription:

Cervix Cancer Diagnosis from Pap Smear Images Using Structure Based Segmentation and Shape Analysis 1 Lipi B. Mahanta, 2 Dilip Ch. Nath, 1 Chandan Kr. Nath 1 Centre for Computational and Numerical Studies, Institute of Advanced Study in Science and Technology 2 Dept. of Statistics, Gauhati University ABSTRACT This work presents an approach for analysis of PAP smear images of cervical region based on cell nuclei distribution and shape and size analysis. PAP smear test is an efficient and easy procedure to detect any abnormality in cervical cells. But human observation is not always satisfying and it is a tedious task to manually analyze a large number of PAP smear images. The purpose of this study is to automate the screening process and to provide specific statistical data which will be helpful for detecting abnormalities in cervical region. The proposed approach is implemented in MATLAB, a high level, interactive environment for data visualization/analysis/computation. The MATLAB Image Processing Toolbox was used to segment the digital images and calculate various statistical data. By comparing cell nuclei distribution and taking into account the shape and size features MATLAB can be programmed to distinguish normal cervical cell to questionable ones. Keywords: PAP smear, cervical cancer, image processing, MATLAB 1. INTRODUCTION The automated detection and segmentation of cell nuclei in PAP smear images is one of the most interesting fields in cytological image analysis as observed by Plissiti et. al. [1]. There is a high degree of cell overlap in such images, the presence of more than one nucleus in a cell and the lack of homogeneity in image intensity. These confront any method to overcome the complexity of conventional cervical cell images and to accomplish a correct segmentation. In addition, the nucleus is a very important structure within the cell and it presents significant changes when the cell is affected by a disease and thus the accurate definition of the nucleus boundary is a crucial task. The identification and quantification of these changes in the nucleus morphology and density contribute in the discrimination of normal and abnormal cells in PAP smear images. The segmentation of nuclei in cytological images has been studied by several researchers [2-7]. In this paper we present a method for analysis of PAP smear images based on histogram and structuring element based segmentation and shape and size analysis of the cell nuclei. As mentioned earlier human observation is not always satisfying and according to Marroquin et. al. [8], 61.5% manual screening of available pathological tests remains unclassified. Our method includes segmentation, calculation of cell nuclei distribution and shape and size analysis of the cell nuclei. The first order statistics of an image is its histogram that gives information about the distribution of the gray level value in its dynamic range [0, L-1], where L is the number of gray levels. The histogram gives information of an image as a whole and hence it can be considered as global statistics of the image. Morphological operations are based on shapes. In a morphological operation, the value of each pixel in the output image is based on a comparison of the corresponding pixel in the input image with its neighbours. Dilation and Erosion are the two most common morphological operations. Dilation adds pixels to the object boundary of an image and erosion removes pixels on the object boundary of an image. The number of pixels added or removed from the object boundary of an image depends in the size and shape of the structuring element. For an image, A and a structuring element, B, dilation and erosion are defined as A B and AӨB respectively. Another morphological operation is filling operation which is carried out on binary or grayscale images. For binary images, filling operation changes connected background pixels to foreground pixels. For grayscale images, filling operation brings the intensity values of dark areas that are surrounded by lighter areas up to the same intensity level as surrounding pixels. This work is based on Marroquin et. al. [8] and Hernandez et. al. [9]. Hernandez described a way to segment biopsy images of brain cell and calculate the cell distribution. Marroquin used a fuzzy classifier after extracting features related to texture, shape and size to classify the breast biopsy images. The aim of this study was to provide assistance to pathologists and to enhance the accuracy of statistical data. 2. METHODOLGY The images used in this work are conventional PAP stained cervical cell images, acquired through a CCD camera adapted to an optical microscope. We used a 40 magnification lens and images were stored in JPEG format having size 2048 1536 pixels. With consultation of the pathologists, we selected the following features to be extracted 245

1. Size of the cell nuclei 2. Shape of the cell nuclei elongation occurs when abnormality occurs. The degree of change in shape is a good measure to analyze the shape of the nuclei. We took into account two factors. 3. Cell nuclei distribution The preprocessing is arranged into three steps: simplification and image enhancement, segmentation and feature extraction. I) Simplification and image enhancement The digitized images are coloured in RGB mode. Matrix corresponding to colour image is three dimensional and hence difficult to process. The images in RGB mode have three colour components and it is a tedious task to segment using those colour component. So due to the fact that gray level image are easy to process we converted the images to gray level by using the rgb2gray unction in MATLAB. Then we enhance the contrast of the images using histogram analysis using imadjust function in MATLAB. II) Segmentation The cell nuclei are darker than the surrounding cytoplasm and all the cell nuclei belonging to a class tend to have same gray level. We created the histogram for each image to view he density distribution of the different shades of gray. Based on the fact that cell nuclei are darker, we filtered out the light areas and created a uniform background. Then using structuring element function, we created binary images containing only the cell nuclei. We considered only round shape for segmenting cell nuclei, but all the nuclei are not round. So, the binary images are again processed with dilation and filling function. As a result of applying dilation and erosion, extra parts which were not part of the nuclei were removed and the boundary of the cell nuclei became prominent. Then we apply filling operation to create a uniform intensity level inside the cell nuclei. III) Feature Extraction a) Cell nuclei distribution The number of normal and abnormal cell in a cytological image is good criteria for identifying abnormality. The size of the cell nuclei can be treated as factor for determining whether they are normal or not. We calculated the area of each nucleus. The area was calculated in pixels. We, then, calculated the cell nuclei distribution per images and presented the result in tabular from with one column having the area and the other having the number of nuclei with that area. First one is Compactness. It is a dimensionless shape feature which measure compactness. It is defined by Sheng et. al. [10] as C = P 2 /A where P is the perimeter and A is the area of the cell nuclei. The calculation was done in pixel. Second one is Eccentricity. The cell nuclei are round in shape in normal condition. The roundness is not perfect and the shape can be considered as ellipse. We calculate the semimajor and semiminor axes lengths and the eccentricity for each nucleus using the following formula ε = {(a 2 b 2 )/b 2 } 1/2 where a and b are the semimajor and semiminor axes of the ellipse respectively. The eccentricity value 0 corresponds to a circle and with an increasing value the deviation from circle becomes more significant as observed by Marroquin. 3. RESULTS AND DISCUSSION The pre-processing step excludes all the background and leaves for further processing the parts of the image which contain isolated cells or cell clusters and as such is a fast process which results in the reduction of the region of interest in the image. Our method has been applied in several PAP smear images defined by an expert observer. The step for the detection of the cell nucleus centroid has exposed that the resulted points of the image indicate the area of the nuclei, as it is confirmed by the expert observer. Two images are shown here, one showing cancerous cells in initial stage that is mild dysplasia and the other showing normal cervical cell. The statistics were then obtained and compared to confirm the differences. The results are shown below. 4. SIMPLIFICATION AND IMAGE ENHANCEMENT Figure 1-2 (a), (b) and (c) show the results after applying the techniques mentioned above for simplification and enhancement of two images. b) Shape and size analysis Shape is another important feature for classification of cervical cells in PAP smear images. Generally, the shape of cell nuclei are round and 246

6. FEATURE EXTRACTION 1) Cell nuclei distribution according to area Table 1- Cancerous Cells Number of nuclei Area in pixels 2 45 Figure 1 - (a) Original image showing cancerous cells in initial stage that is mild dysplasia., (b) Image simplified /converted to gray level and (c) Image after enhancement 2 30 1 40 1 42 1 25 Table 2 - Normal Cells Number of nuclei Area in pixels 2 13 2 24 1 21 Figure 2 - (a) Original image showing normal cervical cell (b) Image simplified/converted to gray level and (c) Image after enhancement 1 18 1 26 5. SEGMENTATION: The images were then segmented for the region of interest, that is the cell nuclei. They are shown below: Figure 3,4 - Segmented image showing only the cell nuclei The distribution of cell nuclei is an indicative measure of the number of normal and abnormal nuclei in a single slide. When cells in a tissue begin to show abnormal growth, the cells deviates from the normal morphological characteristics one by one. Size of the cell nuclei is one of the features that increase during abnormal or cancerous growth. We calculated the size of the each cell nucleus in terms of area and presented them in tabular form. The cell nuclei distribution table clearly shows that the abnormal samples tend to have more number of nuclei with higher value of area. The normal samples show a lower range of distribution as in the normal sample in Table 1 which has distribution range from 13 to 26 pixels. Whereas the cancerous sample in Fig.(c) has a range from 25 to 45 pixels. We can say that more the number of bigger nuclei more the extent of abnormality. 247

2) Compactness 3) Eccentricity Table 3 - Cancerous Cells Perimeter Area Compactness 27 45 0.0617 22 42 0.0867 18 30 0.0925 18 30 0.0925 24 45 0.0781 16 25 0.0976 21.4 40 0.0873 Table 5 - cancerous cells Cell nuclei Eccentricity 1 0.6591 2 0.5151 3 0.5528 4 0.5528 5 0.8351 6 0 7 0.5684 Table 4 - Normal Cells Perimeter Area Compactness 11.3137 13 0.1015 11.3137 13 0.1015 14.7279 21 0.0968 14.1421 18 0.0900 16.7990 24 0.0850 16.1421 24 0.0921 16.7279 26 0.0929. Table 6 - Normal Cells Cell nuclei Eccentricity 1 0 2 0 3 0.5111 4 0.6939 5 0.6635 6 0.4167 7 0.5956 Compactness is the dimensionless shape measure of the cell nuclei. A normal nucleus has a well-formed and a compact shape in normal condition. Cells with abnormality gradually deform and the compactness decreases. We found out the compactness of the cell nuclei and found that normal cells have higher value of compactness then that of the abnormal nuclei which is very obvious from the tables showing the compactness of the normal and abnormal cells. Eccentricity is the measure of roundness of the cell nuclei. In general, the eccentricity can be said to be calculated from the width and height of the cell nuclei. The normal nuclei have a minimal proportion between the width and height and thus have greater roundness. Uncontrolled growth of the nuclei does not keep this uniform proportion and as result their eccentricity deviated farther away from zero (0). The eccentricity table of the normal sample shows some of the nuclei having 0 eccentricity value, whereas the abnormal sample deviates away from 0. 7. CONCLUSION: An effective method to identify and classify cervical cancer is becoming increasingly needed due to the fact that early detection and a decision of correct therapy 248

may save the patient. Medical images have various limitations such as low quality, presence of noise and human error in interpretation. Digital image processing can help the pathologists to a great extent. The statistical data can be used as benchmark to flag normal or questionable sample while the pathologist looks at the slide under a microscope which will be highly time saving. Some ideas for future enhancement includes: to design an interactive system where a pathologist can feed his own grayscale threshold or to automate the process by computer using histogram or fuzzy logic. Another enhancement can be to establish cutoff values between normal and abnormal values and to classify the abnormal values according to stage of the cancer. The images processed are magnified and the calculations are done in pixels. So, if a relation among magnification, pixels and actual size is established, the analysis will be more efficient. [2] Bamford P., Lovell B., Unsupervised cell nucleus segmentation with active contours, Signal Processing 71(2), pp. 203-213, 1998. [3] Bamford P., Lovell B., A water immersion algorithm for cytological image segmentation, Proceedings of the APRS Image segmantation workshop, pp. 75-79, University of Technology Sydney, Sydney 1996. [4] Mouroutis T., Roberts S. J., Robust cell nuclei segmentation using statistical modelling, IOP Bioimaging, 6, pp. 79-91, 1998. [5] Garrido A., Perez de la Blanca N., Applying deformable templates for cell image segmentation, Pattern Recognition 33, pp. 821-832, 2000. [6] Lee K.M., Street W.N., Learning shapes for automatic image segmentation, Proc. INFORMS-KORMS Conference, pp. 1461-1468, Seoul, Korea, June 2000. ACKNOWLEDGMENT We would like to offer our gratitude to Dr. J. D. Sharmah, Chief consultant, Pathology department and Dr.(Miss) Banani Sharmah, Pathologist of Dr. B. Baruah Cancer Research Institute for providing the images and helping us to prepare documentation. This paper has the consent of all co-authors and authorities of the institute, where this study has been carried out and there exists no conflict of interest anywhere. REFERENCES [1] Plissiti M.E., Charchanti A., Krikoni O. and Fotiadis D.I., Automated segmentation of cell nuclei in PAP smear images, ITAB Proceedings International Special Topic Conference on Information Technology in Biomedicine, Greece, Ionnia, 26-28, October 2006. [7] Begelman G., Gur E., Rivlin E., Rudzsky M., Zalevsky Z., Cell nuclei segmentation using fuzzy logic engine, International Conference on Image Processing, Vol. 5, pp 2937-2940, October 2004. [8] Marroquin E. Martinez, Vos C., Santamaria E., Jove X., Socoro J.C., Non Linear Image Analysis for Fuzzy Classification of Breast Cancer, IEEE Proceedings of International Conference on Image Processing, vol.2, 943 946, 1996. [9] Hernandez L., Gothreaux P., Shih L., Towards Realtime Biopsy Image Analysis and Cell Segmentation. In Proceedings of IPCV, pp.81-87, 2006. [10] Sheng L., Rangayyan R. M., Desautels L., Application of Shape analysis to Mammographic Calcification. IEEE Trans. On Medical Imaging, Vol. 13, NO. 2, 1994. 249