Prediction of POS Tagging for Unknown Words for Specific Hindi and Marathi Language

Abstract

Part-of-speech (POS) tagging for Indian languages such as Hindi and Marathi remains largely unexplored. Some of the best taggers available for Indian languages use hybrids of machine learning or stochastic techniques and phonetic information. Available corpora for Hindi and Marathi are limited, so Natural Language Processing (NLP) applied to Hindi and Marathi sentences often fails to achieve the desired results. Current POS tagging techniques assign the UNKNOWN (UNK) tag to words that are not present in the corpus. This paper proposes extending a Hidden Markov Model (HMM)-based approach to POS tagging with Naïve Bayes to predict a POS tag for words that would otherwise be tagged UNK.
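The abstract does not give the details of the method, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' implementation. It assumes that, for a word missing from the training corpus, the HMM emission probability is replaced by a Naïve Bayes likelihood over character-suffix features, and that this is combined with the HMM transition probability from the previous tag. The toy corpus, the suffix features, the add-one smoothing, and the names `UnknownWordTagger`, `features`, and `predict_unknown` are all hypothetical.

```python
import math
from collections import Counter, defaultdict

# Toy tagged corpus (transliterated Hindi). Hypothetical data, for illustration only.
TAGGED = [
    [("ladka", "NOUN"), ("khata", "VERB"), ("hai", "AUX")],
    [("ladki", "NOUN"), ("jati", "VERB"), ("hai", "AUX")],
    [("baccha", "NOUN"), ("sota", "VERB"), ("hai", "AUX")],
]


def features(word):
    # Assumed feature set: the last one and two characters of the word.
    return [f"suf1={word[-1:]}", f"suf2={word[-2:]}"]


class UnknownWordTagger:
    """HMM transition counts plus Naive Bayes suffix likelihoods (a sketch)."""

    def __init__(self, sentences):
        self.tag_counts = Counter()
        self.trans_counts = defaultdict(Counter)  # previous tag -> next tag counts
        self.feat_counts = defaultdict(Counter)   # tag -> feature counts
        self.feat_vocab = set()
        self.vocab = set()
        for sent in sentences:
            prev = "<s>"  # sentence-start pseudo tag
            for word, tag in sent:
                self.vocab.add(word)
                self.tag_counts[tag] += 1
                self.trans_counts[prev][tag] += 1
                for f in features(word):
                    self.feat_counts[tag][f] += 1
                    self.feat_vocab.add(f)
                prev = tag

    def log_transition(self, prev, tag):
        # log P(tag | prev) with add-one smoothing over the tag set.
        total = sum(self.trans_counts[prev].values())
        return math.log((self.trans_counts[prev][tag] + 1) / (total + len(self.tag_counts)))

    def log_likelihood(self, word, tag):
        # Naive Bayes: sum of log P(feature | tag), features assumed independent.
        # The extra +1 in the denominator reserves mass for unseen features.
        total = sum(self.feat_counts[tag].values())
        denom = total + len(self.feat_vocab) + 1
        return sum(math.log((self.feat_counts[tag][f] + 1) / denom) for f in features(word))

    def predict_unknown(self, word, prev_tag="<s>"):
        # Score each tag by transition prior times suffix likelihood (in log space).
        scores = {
            tag: self.log_transition(prev_tag, tag) + self.log_likelihood(word, tag)
            for tag in self.tag_counts
        }
        return max(scores, key=scores.get)


tagger = UnknownWordTagger(TAGGED)
# "peeta" is not in the toy corpus; it follows a noun in this hypothetical sentence.
print(tagger.predict_unknown("peeta", prev_tag="NOUN"))
# "kamra" is also unseen and appears at the start of a sentence.
print(tagger.predict_unknown("kamra"))
```

In a full tagger this score would stand in for the emission term inside Viterbi decoding whenever a word is outside the vocabulary, rather than being applied one word at a time as shown here.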

Cite

Chiplunkar, K., Kharche, M., Chaudhari, T., Shaligram, S., & Limkar, S. (2021). Prediction of POS tagging for unknown words for specific Hindi and Marathi language. In Advances in Intelligent Systems and Computing (Vol. 1177, pp. 133–143). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-15-5679-1_13