Part-of-speech (POS) tagging for Indian languages such as Hindi and Marathi remains largely unexplored. Some of the best taggers available for Indian languages combine machine learning or stochastic techniques with phonetic information. The corpora available for Hindi and Marathi are limited, so Natural Language Processing (NLP) applied to Hindi and Marathi sentences often fails to achieve the desired results. In particular, current POS tagging techniques assign the UNKNOWN (UNK) tag to words that are not present in the corpus. This paper proposes how a Hidden Markov Model (HMM)-based approach to POS tagging can be extended using Naïve Bayes to predict a POS tag for words that would otherwise be tagged UNK.
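The abstract does not spell out how the two models are combined, so the following is only a minimal sketch of one plausible reading: HMM transition counts supply a prior over tags given the previous tag, and a Naïve Bayes model over word-final character n-grams scores the unknown word. The toy corpus (transliterated, invented for illustration), the suffix features, the add-one smoothing, and the names `suffix_features`, `train`, and `predict_unknown` are all assumptions, not the authors' implementation.

```python
from collections import Counter, defaultdict
import math

# Toy tagged corpus: invented, transliterated examples for illustration only.
TRAIN = [
    [("ladka", "NOUN"), ("khata", "VERB"), ("hai", "AUX")],
    [("ladki", "NOUN"), ("peeta", "VERB"), ("hai", "AUX")],
    [("baccha", "NOUN"), ("sota", "VERB"), ("hai", "AUX")],
]


def suffix_features(word, max_len=2):
    """Return word-final character n-grams (assumed Naive Bayes features)."""
    return [word[-n:] for n in range(1, max_len + 1) if len(word) >= n]


def train(sentences):
    """Collect tag counts, HMM-style transition counts, and suffix counts."""
    tag_counts = Counter()
    trans = defaultdict(Counter)  # previous tag -> counts of the next tag
    feat = defaultdict(Counter)   # tag -> counts of suffix features
    for sent in sentences:
        prev = "<s>"  # sentence-start pseudo tag
        for word, tag in sent:
            tag_counts[tag] += 1
            trans[prev][tag] += 1
            for f in suffix_features(word):
                feat[tag][f] += 1
            prev = tag
    return tag_counts, trans, feat


def predict_unknown(word, prev_tag, model):
    """Pick the tag maximising P(tag | prev_tag) * prod P(suffix | tag)."""
    tag_counts, trans, feat = model
    tags = list(tag_counts)
    feature_vocab = {f for t in tags for f in feat[t]}
    best_tag, best_score = None, -math.inf
    for tag in tags:
        # Transition probability with add-one smoothing, used as the prior.
        t_total = sum(trans[prev_tag].values())
        score = math.log((trans[prev_tag][tag] + 1) / (t_total + len(tags)))
        # Naive Bayes likelihood of each suffix, add-one smoothed, with one
        # extra slot in the denominator for unseen suffixes.
        f_total = sum(feat[tag].values())
        for f in suffix_features(word):
            score += math.log(
                (feat[tag][f] + 1) / (f_total + len(feature_vocab) + 1)
            )
        if score > best_score:
            best_tag, best_score = tag, score
    return best_tag


model = train(TRAIN)
```

With this toy data, `predict_unknown("rota", "NOUN", model)` selects `VERB`, because the suffix `ta` and the NOUN-to-VERB transition both favour it. A full tagger would plug such a score into Viterbi decoding in place of the missing emission probability rather than tagging each unknown word in isolation.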
CITATION STYLE
Chiplunkar, K., Kharche, M., Chaudhari, T., Shaligram, S., & Limkar, S. (2021). Prediction of POS tagging for unknown words for specific Hindi and Marathi language. In Advances in Intelligent Systems and Computing (Vol. 1177, pp. 133–143). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-15-5679-1_13