Preprocessing Methods for Unstructured Healthcare Text Data

N/ACitations
Citations of this article
7Readers
Mendeley users who have this article in their library.
Get full text

Abstract

At present, the amount unstructured text data is increasing exponentially from the past periodically. Information retrieval (IR) from these unstructured text data is challenging. As the data users foresee for particular/specific outcomes. Retrieval of the significant outcomes depends on the fashion, how they are associated/indexed. Unstructured text data like clinical data containing more health information requires challenging preprocessing methods, which also help to reduce the size of the dataset so that it will optimize the performance of the IR system. In this paper, we have proposed the pre-processing methods such as Data collection, Data Cleaning, Tokenization, Stemming, Removal of Stop words which will efficiently help the data users to find the specific patterns from the unstructured text data.

Cite

CITATION STYLE

APA

Preprocessing Methods for Unstructured Healthcare Text Data. (2019). International Journal of Innovative Technology and Exploring Engineering, 9(2S), 715–719. https://doi.org/10.35940/ijitee.b1024.1292s19

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free