IMPLEMENTATION OF NAÏVE BAYESIAN DATA MINING ALGORITHM ON DECEASED REGISTRATION DATA


International Journal of Computer Engineering & Technology (IJCET)
Volume 10, Issue 1, January-February 2019, pp. 32-37, Article ID: IJCET_10_01_004
Available online at http://www.iaeme.com/ijcet/issues.asp?jtype=ijcet&vtype=10&itype=1
Journal Impact Factor (2016): 9.3590 (Calculated by GISI) www.jifactor.com
ISSN Print: 0976-6367 and ISSN Online: 0976-6375
IAEME Publication

IMPLEMENTATION OF NAÏVE BAYESIAN DATA MINING ALGORITHM ON DECEASED REGISTRATION DATA

Dr. P. Y. Desai
Associate Professor, Department of ICT, Veer Narmad South Gujarat University, India

ABSTRACT
The use of data mining algorithms in domains such as marketing, finance, retail, cyber security, fraud detection, and medical science is well known. Recently, data mining has also been applied to e-governance data. Normally, e-governance data are used only for Online Transaction Processing (OLTP) systems. However, data mining algorithms can also uncover hidden and new trends in e-governance data. In this paper, the Naïve Bayes data mining algorithm is applied to Deceased Registration data to find hidden trends.

Key words: Deceased Registration Data, Naïve Bayes Algorithm, Data Mining.

Cite this Article: Dr. P.Y. Desai, Implementation of Naïve Bayesian Data Mining Algorithm on Deceased Registration Data. International Journal of Computer Engineering and Technology, 10(1), 2019, pp. 32-37. http://www.iaeme.com/ijcet/issues.asp?jtype=ijcet&vtype=10&itype=1

1. INTRODUCTION
The Naïve Bayes data mining algorithm is used to solve many types of real-world problems. In [1], the Naïve Bayes algorithm is used for text classification. In [2], Naïve Bayes classification is used for loan risk assessment on a data set from a Tunisian commercial bank; the study reported a classification rate of 58.66% [2]. Reference [3] used the Naïve Bayes algorithm to classify a student data set; the experiments showed that the Naïve Bayes classifier had an accuracy of 66.67% [3].
In [4], the Naïve Bayes algorithm was applied to the e-mail spam classification problem; the study reports that Naïve Bayes gives better results than a support vector machine [4]. In [5], the Naïve Bayes algorithm was used for credit card fraud detection, with a reported accuracy of 95% [5]. Reference [6] discusses the use of the Naïve Bayes algorithm in text news classification; the authors were able to classify text news into four categories [6]. In [7], a Naïve Bayes classifier was used to predict cancer risk from symptoms. The study covered symptoms of lung cancer, breast cancer, oral cancer, and ovarian cancer, and concluded that Naïve Bayes classification can be used effectively in early cancer detection [7]. In [8], Naïve Bayes classification is used for opinion mining. The study showed that sentiment analysis can be performed on user reviews, with the Naïve Bayes algorithm classifying each review as positive or negative [8]. Similarly, the Naïve Bayes algorithm can be applied to deceased registration data to find hidden trends and relationships among attributes.

http://www.iaeme.com/ijcet/index.asp editor@iaeme.com

2. RESEARCH METHODOLOGY
In this paper, Microsoft Analysis Services and the Microsoft Naïve Bayes algorithm are used. The proposed research methodology is as follows:

A. Select the Deceased Data Cube
First, the Deceased Data Cube was selected as the source for the Naïve Bayes data mining algorithm. The Data Cube contains various dimensions and measures pertaining to Deceased Registration data.

B. Configure parameters of the Microsoft Naïve Bayes algorithm
The algorithm parameters were set with the Deceased Data Cube in mind. The MAXIMUM_INPUT_ATTRIBUTES parameter was set to seven, as seven input attributes were considered. The MAXIMUM_OUTPUT_ATTRIBUTES parameter was set to one, as only one predictable attribute was considered. The MAXIMUM_STATES parameter, which specifies the maximum number of states of an attribute to consider while building the mining model, was left at its default because the number of possible states was difficult to determine from the data.

C. Select the Input Attributes
In this step, the input attributes Death Place, Age, Deceased Religion, Deceased Sex, Medical Attention Type, Year, and Death Date were selected.

D. Select the Key Attributes
The Death Date attribute (a date and time field) was selected as the Key attribute to uniquely identify each row of the data cube.

E. Select the Output Attributes
The Medical Attention Type attribute was selected as the Predict attribute, in order to find relationships between the input attributes and Medical Attention Type.

F. Generate Mining Model
Following steps A to E, the Naïve Bayes data mining model was generated for the Deceased Registration Data Cube.

The research methodology is depicted in Figure 1, and the results are discussed in the Results section.
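The methodology steps can be illustrated outside Microsoft Analysis Services with a small categorical Naïve Bayes sketch. This is not the Microsoft Naïve Bayes implementation: the records, the attribute names (`religion_id`, `death_place`), the function names, and the Laplace smoothing constant `alpha` are all hypothetical and chosen only to show how input attributes and one predictable attribute combine into class probabilities.

```python
from collections import Counter, defaultdict

# Hypothetical records (not the paper's data): input attributes -> Medical Attention Type.
ROWS = [
    ({"religion_id": 4, "death_place": "home"}, 3),
    ({"religion_id": 4, "death_place": "home"}, 3),
    ({"religion_id": 4, "death_place": "hospital"}, 1),
    ({"religion_id": 6, "death_place": "hospital"}, 1),
    ({"religion_id": 6, "death_place": "hospital"}, 1),
    ({"religion_id": 6, "death_place": "home"}, 3),
    ({"religion_id": 8, "death_place": "clinic"}, 2),
    ({"religion_id": 8, "death_place": "home"}, 2),
]


def train(rows, alpha=1.0):
    """Count class frequencies and per-class attribute-value frequencies."""
    class_counts = Counter(label for _, label in rows)
    value_counts = defaultdict(Counter)  # (attribute, label) -> Counter of values
    states = defaultdict(set)            # attribute -> distinct values seen
    for features, label in rows:
        for attr, value in features.items():
            value_counts[(attr, label)][value] += 1
            states[attr].add(value)
    return class_counts, value_counts, states, alpha


def predict_proba(model, features):
    """Return normalised P(label | features) under the independence assumption."""
    class_counts, value_counts, states, alpha = model
    total = sum(class_counts.values())
    scores = {}
    for label, n in class_counts.items():
        score = n / total  # prior P(label)
        for attr, value in features.items():
            count = value_counts[(attr, label)][value]
            # Laplace-smoothed likelihood P(value | label)
            score *= (count + alpha) / (n + alpha * len(states[attr]))
        scores[label] = score
    norm = sum(scores.values())
    return {label: s / norm for label, s in scores.items()}


model = train(ROWS)
probs = predict_proba(model, {"religion_id": 4, "death_place": "home"})
print(max(probs, key=probs.get), probs)
```

Smoothing is used here so that an attribute value never seen with a class does not force that class's probability to zero; whether and how the Microsoft algorithm smooths its counts is not stated in the paper.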

Figure 1 Proposed Methodology

3. RESULTS
In the Naïve Bayes data mining model, Medical Attention Type was the predictable attribute. It has three possible values: Medical Attention Type 1 indicates institutional medical attention, Medical Attention Type 2 indicates medical attention other than institutional, and Medical Attention Type 3 indicates no medical attention.

Figure 2 Medical Attention Type Output Attribute

The possible states of the Religion ID input attribute are shown in Figure 3.

Figure 3 Religion ID Input Attribute

The results showed an interesting relationship between Religion ID and Medical Attention Type. For Religion ID=6 (Muslim), the most probable state is No medical attention, with a probability of 0.55. This result is shown in Figure 4.

Figure 4 Result for Religion ID=6 input Attribute
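As a reading aid, a probability such as the 0.55 reported for Religion ID=6 can be interpreted as a conditional probability of a target state given one attribute value. The notation below is introduced here for illustration and is not taken from the paper: M denotes Medical Attention Type and R denotes Religion ID. It assumes the value is conditioned on Religion ID alone; the paper does not state how the viewer computes it.

```latex
P(M = m \mid R = r)
  = \frac{P(M = m)\, P(R = r \mid M = m)}
         {\sum_{m' \in \{1, 2, 3\}} P(M = m')\, P(R = r \mid M = m')}
```

With several input attributes, Naïve Bayes multiplies one such likelihood term per attribute, under the assumption that the attributes are conditionally independent given the target state.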

For Religion ID=4 (Hindu), the most probable state is also No medical attention, with a probability of 0.65. This result is shown in Figure 5.

Figure 5 Result for Religion ID=4 input Attribute

Further trends were derived using the attribute discrimination option provided in Microsoft Analysis Services. According to the attribute discrimination results, Religion ID=6 (Muslim) favours Medical Attention Type=1, whereas Religion ID=4 (Hindu) favours Medical Attention Type=2. This result is shown in Figure 6.

Figure 6 Result for Attribute Discrimination Medical Attention Type 1 and 2

In a further comparison, Religion ID=8 (Parsi) favours Medical Attention Type=2, whereas Religion ID=4 (Hindu) favours Medical Attention Type=3. This result is shown in Figure 7.
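The idea of an attribute value "favouring" one target state over another can be sketched as a comparison of how often that value occurs within each state. The code below is only an illustration of that idea: it does not reproduce the scoring used by the attribute discrimination option in Microsoft Analysis Services, and the data, the function name `favours`, and its parameters are hypothetical.

```python
from collections import Counter

# Hypothetical (religion_id, medical_attention_type) pairs, not the paper's data.
PAIRS = [(6, 1), (6, 1), (6, 2), (4, 2), (4, 2), (4, 1), (4, 3), (8, 2)]


def favours(pairs, value, state_a, state_b):
    """Return the state in which `value` is relatively more frequent, or None on a tie."""
    by_state = Counter(state for _, state in pairs)
    if by_state[state_a] == 0 or by_state[state_b] == 0:
        raise ValueError("both states must occur in the data")
    joint = Counter(pairs)
    p_a = joint[(value, state_a)] / by_state[state_a]  # P(value | state_a)
    p_b = joint[(value, state_b)] / by_state[state_b]  # P(value | state_b)
    if p_a == p_b:
        return None
    return state_a if p_a > p_b else state_b


print(favours(PAIRS, 6, 1, 2))
print(favours(PAIRS, 4, 1, 2))
```

Comparing P(value | state) rather than raw counts keeps the comparison from being dominated by whichever state simply has more rows.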

Figure 7 Result for Attribute Discrimination Medical Attention Type 2 and 3

4. CONCLUSIONS
The Naïve Bayes algorithm was implemented on Deceased Registration data, and some important relationships were derived from the implementation. Furthermore, the Naïve Bayes algorithm can also be used for exploration of Deceased Registration data to provide further insight into the data.

REFERENCES
[1] Shuo Xu, Bayesian Naïve Bayes classifiers to text classification, Journal of Information Science, SAGE Journals. First Published November 1, 2016, Vol 44, Issue 1, 2018.
[2] Aida Krichene, Using a naive Bayesian classifier methodology for loan risk assessment. Evidence from a Tunisian commercial bank. Journal of Economics, Finance and Administrative Science, vol. 22, no. 42, 2017, Universidad ESAN.
[3] Rajeswari R.P., Kavitha Juliet, Dr. Aradhana, Text Classification for Student Data Set using Naive Bayes Classifier and KNN Classifier, International Journal of Computer Trends and Technology (IJCTT), Volume 43, Number 1, January 2017.
[4] Priyanka Sao, Pro. Kare Prashanthi, E-mail Spam Classification Using Naïve Bayesian Classifier, International Journal of Advanced Research in Computer Engineering & Technology (IJARCET), Volume 4, Issue 6, June 2015.
[5] Sai Kiran, Jyoti Guru, Rishabh Kumar, Naveen Kumar, Deepak Katariya, and Maheshwar Sharma, Credit card fraud detection using Naïve Bayes model based and KNN classifier, International Journal of Advance Research, Ideas and Innovations in Technology, ISSN: 2454-132X, Volume 4, Issue 3.
[6] Shruti Bajaj Mangal and Dr. Vishal Goyal, Text News Classification System using Naïve Bayes Classifier, Research Cell: An International Journal of Engineering Sciences, Issue December 2014, Vol. 3, ISSN: 2229-6913 (Print), ISSN: 2320-0332 (Online), pp. 209-213.
[7] Pallavi Mirajkar and Dr. G. Prasanna Lakshmi, Prediction of Cancer Risk in Perspective of Symptoms using Naïve Bayes Classifier, International Journal of Engineering Research in Computer Science and Engineering, Vol 4, Issue 9, September 2017, ISSN (Online) 2394-2320.
[8] Vrinda and Dr. Komal Kumar Bhatia, Opinion Mining using Naïve Bayes Classifier, International Journal of Engineering Research & Technology (IJERT), ISSN: 2278-0181, Vol. 6, Issue 04, April 2017.