A New Method to Measure Similarity of Words in Japanese Twitter Based on Related Images

0Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Twitter, as a popular form of social media in Japan, has emerged as a valuable data resource for various important social network analysis tasks. However, Japanese tweets often contain nonstandard words and variant notations, owing to which several words with the same meaning may be written differently. The use of such words will generate the sparsity problem and decrease the accuracy of similarity measures between users. Furthermore, the performance of user or tweet recommendations may be deteriorated. Therefore, words with the same meaning must be unified in the preprocessing step. In this research, assuming that words with the same meaning have similar or common related images, we propose a method to use word-related images to measure the similarity between words. A manually annotated Japanese data set is created to evaluate the proposed method. Experimental results indicate that the proposed method outperforms the existing methods in most cases.

Cite

CITATION STYLE

APA

Xu, Z., Matsumura, A., & Satoh, T. (2022). A New Method to Measure Similarity of Words in Japanese Twitter Based on Related Images. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13635 LNCS, pp. 489–503). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-21047-1_45

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free