Prediction of pos tagging for unknown words for specific hindi and marathi language

Kirti Chiplunkar; Meghna Kharche; Tejaswini Chaudhari; Saurabh Shaligram; Suresh Limkar

Conference Proceedings

Prediction of pos tagging for unknown words for specific hindi and marathi language

Advances in Intelligent Systems and Computing (2021) 1177 133-143

DOI: 10.1007/978-981-15-5679-1_13

2Citations

4Readers

Get full text

Abstract

Part of Speech (POS) tagging for Indian languages like Hindi and Marathi is generally not an investigated territory. Some of the best taggers accessible for Indian dialects utilize crossbreeds of machine learning or stochastic techniques and phonetic information. Available corpuses for Hindi and Marathi are limited. Hence, when Natural Language Processing (NLP) is applied to Hindi and Marathi sentences, desired results are not achieved. Current POS tagging techniques give UNKNOWN (UNK) POS tag for words which are not present in the corpus. This paper proposes how Hidden Markov Model (HMM)-based approach for POS tagging can be extended using Naïve Bayes theorem for prediction of UNK POS tag.

Author supplied keywords

Cite

CITATION STYLE

APA

Chiplunkar, K., Kharche, M., Chaudhari, T., Shaligram, S., & Limkar, S. (2021). Prediction of pos tagging for unknown words for specific hindi and marathi language. In Advances in Intelligent Systems and Computing (Vol. 1177, pp. 133–143). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-15-5679-1_13

Prediction of pos tagging for unknown words for specific hindi and marathi language

Abstract

Author supplied keywords

Cite

Register to see more suggestions