A rule based stemmer

R. Cynthia Monica Priya; J. G.R. Sathiaseelan

Journal Article, Open Access

International Journal of Engineering and Advanced Technology (2019) 9(1) 2026-2029

DOI: 10.35940/ijeat.A9545.109119

Abstract

Today's digital world generates enormous amounts of data every instant, and mining knowledge from that data effectively has become a pressing need. Sentiment analysis, a part of web content mining, which is itself a subfield of web mining, has gained momentum as a research area. It analyses the opinions of a wide variety of people all over the world. Sentiment analysis encompasses preprocessing, feature selection, classification, and sentiment prediction. Preprocessing is an important stage that involves many techniques; stop word removal, punctuation removal, and conversion of numbers to number names are some of the basic ones. Stemming is another important preprocessing technique, which reduces the different forms of a word to its root. There are three basic types of stemmers: truncating, statistical, and hybrid. This paper proposes a rule based stemmer that is a truncating stemmer. It uses rules for truncation and replacement. The input data passes through a series of rules: if the condition specified by a rule is satisfied, that rule is executed; otherwise the input is checked against the next rule, and the process continues. The result is a set of stemmed words. The performance of the proposed stemmer is compared with two existing stemmers in the same rule based category, Porter and Lancaster, using several evaluation metrics. The observations show that the proposed stemmer outperforms the Porter and Lancaster stemmers in terms of the correctly stemmed words factor, shows a good average conflation factor, and produces fewer over-stemming and under-stemming errors.
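The rule cascade described in the abstract can be illustrated with a minimal Python sketch. The abstract does not list the paper's actual rules, so the suffixes, replacements, and minimum stem lengths below are hypothetical placeholders, as are the names `Rule`, `RULES`, and `stem`. The sketch only shows the general pattern: each rule has a condition, the first rule whose condition holds is executed (truncation plus optional replacement), and otherwise the word is checked against the next rule.

```python
from typing import List, NamedTuple


class Rule(NamedTuple):
    suffix: str        # condition: the word must end with this suffix
    replacement: str   # text appended after the suffix is truncated
    min_stem_len: int  # condition: minimum length of what remains


# Hypothetical rules for illustration only; NOT the rules from the paper.
RULES: List[Rule] = [
    Rule("sses", "ss", 1),  # classes -> class
    Rule("ies", "y", 2),    # studies -> study
    Rule("ss", "ss", 1),    # glass   -> glass (guards the plain "s" rule)
    Rule("ing", "", 3),     # walking -> walk
    Rule("ed", "", 3),      # jumped  -> jump
    Rule("s", "", 3),       # cats    -> cat
]


def stem(word: str, rules: List[Rule] = RULES) -> str:
    """Apply the first rule whose condition is satisfied; else keep the word."""
    word = word.lower()
    for rule in rules:
        if not word.endswith(rule.suffix):
            continue  # condition failed: check the next rule
        remainder = word[: len(word) - len(rule.suffix)]
        if len(remainder) < rule.min_stem_len:
            continue  # remainder too short: check the next rule
        return remainder + rule.replacement  # truncate, then replace
    return word  # no rule applied


if __name__ == "__main__":
    for w in ["classes", "studies", "walking", "jumped", "cats", "sing"]:
        print(w, "->", stem(w))
```

Rule order matters in a cascade like this: the more specific suffixes are placed before the general ones, so that "classes" is handled by the "sses" rule rather than the bare "s" rule. The minimum-length condition is one simple way to limit over-stemming of short words such as "sing".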

Cite

Cynthia Monica Priya, R., & Sathiaseelan, J. G. R. (2019). A rule based stemmer. International Journal of Engineering and Advanced Technology, 9(1), 2026–2029. https://doi.org/10.35940/ijeat.A9545.109119
