A New Method to Measure Similarity of Words in Japanese Twitter Based on Related Images

Zhelin Xu; Atsushi Matsumura; Tetsuji Satoh

Conference Proceedings

A New Method to Measure Similarity of Words in Japanese Twitter Based on Related Images

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2022) 13635 LNCS 489-503

DOI: 10.1007/978-3-031-21047-1_45

0Citations

3Readers

Get full text

Abstract

Twitter, as a popular form of social media in Japan, has emerged as a valuable data resource for various important social network analysis tasks. However, Japanese tweets often contain nonstandard words and variant notations, owing to which several words with the same meaning may be written differently. The use of such words will generate the sparsity problem and decrease the accuracy of similarity measures between users. Furthermore, the performance of user or tweet recommendations may be deteriorated. Therefore, words with the same meaning must be unified in the preprocessing step. In this research, assuming that words with the same meaning have similar or common related images, we propose a method to use word-related images to measure the similarity between words. A manually annotated Japanese data set is created to evaluate the proposed method. Experimental results indicate that the proposed method outperforms the existing methods in most cases.

Author supplied keywords

Cite

CITATION STYLE

APA

Xu, Z., Matsumura, A., & Satoh, T. (2022). A New Method to Measure Similarity of Words in Japanese Twitter Based on Related Images. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13635 LNCS, pp. 489–503). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-21047-1_45

A New Method to Measure Similarity of Words in Japanese Twitter Based on Related Images

Abstract

Author supplied keywords

Cite

Register to see more suggestions