Symbolic representation of text documents using multiple kernel FCM

B. S. Harish; M. B. Revanasiddappa; S. V. Aruna Kumar

Conference Proceedings

Symbolic representation of text documents using multiple kernel FCM

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9468 93-102

DOI: 10.1007/978-3-319-26832-3_10

3Citations

3Readers

Get full text

Abstract

In this paper, we proposed a novel method of representing text documents based on clustering of term frequency vector. In order to cluster the term frequency vectors, we make use of Multiple Kernel Fuzzy C-Means (MKFCM). After clustering, term frequency vector of each cluster are used to form a interval valued representation (symbolic representation) by the use of mean and standard deviation. Further, interval value features are stored in knowledge base as a representative of the cluster. To corroborate the efficacy of the proposed model, we conducted extensive experimentation on standard datset like Reuters- 21578 and 20 Newsgroup. We have compared our classification accuracy achieved by the Symbolic classifier with the other existing Naive Bayes classifier, KNN classifier and SVM classifier. The experimental result reveals that the classification accuracy achieved by using symbolic classifier is better than other three classifiers.

Author supplied keywords

Cite

CITATION STYLE

APA

Harish, B. S., Revanasiddappa, M. B., & Aruna Kumar, S. V. (2015). Symbolic representation of text documents using multiple kernel FCM. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9468, pp. 93–102). Springer Verlag. https://doi.org/10.1007/978-3-319-26832-3_10

Symbolic representation of text documents using multiple kernel FCM

Abstract

Author supplied keywords

Cite

Register to see more suggestions