A deep learning based multimodal fusion model for skin lesion diagnosis using smartphone collected clinical images and metadata

Chubin Ou; Sitong Zhou; Ronghua Yang; Weili Jiang; Haoyang He; Wenjun Gan; Wentao Chen; Xinchi Qin; Wei Luo; Xiaobing Pi; Jiehua Li

Journal Article, Open Access

Frontiers in Surgery (2022) 9

DOI: 10.3389/fsurg.2022.1029991

Abstract

Introduction: Skin cancer is one of the most common types of cancer. A publicly accessible tool can help screen for malignant lesions. We aimed to develop a deep learning model to classify skin lesions using clinical images and metadata collected with smartphones.

Methods: A deep neural network was developed with two encoders, one extracting information from image data and one from metadata. A multimodal fusion module with intra-modality self-attention and inter-modality cross-attention was proposed to combine image features and metadata features effectively. The model was trained and tested on a public dataset and compared with other state-of-the-art methods using five-fold cross-validation.

Results: Including metadata significantly improved model performance. Our model outperformed other metadata fusion methods in terms of accuracy, balanced accuracy and area under the receiver operating characteristic curve, with average values of 0.768±0.022, 0.775±0.022 and 0.947±0.007, respectively.

Conclusion: A deep learning model using smartphone-collected images and metadata for skin lesion diagnosis was successfully developed. The proposed model showed promising performance and could be a potential tool for skin cancer screening.
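
The abstract reports accuracy and balanced accuracy separately, each as a mean ± spread over five folds. The sketch below illustrates how the two metrics can differ on imbalanced data and how per-fold scores might be summarized. It is not the authors' evaluation code: the labels, the fold scores, the function names and the choice of sample standard deviation are all assumptions made for illustration.

```python
from statistics import mean, stdev


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label equals the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall, so every class carries equal weight."""
    recalls = []
    for cls in sorted(set(y_true)):
        idx = [i for i, t in enumerate(y_true) if t == cls]
        hits = sum(y_pred[i] == cls for i in idx)
        recalls.append(hits / len(idx))
    return mean(recalls)


def summarize_folds(scores):
    """Mean and sample standard deviation of per-fold scores.

    Assumption: the abstract does not say which spread estimate was used.
    """
    return mean(scores), stdev(scores)


# Hypothetical, imbalanced toy labels (not taken from the paper's dataset).
y_true = ["benign"] * 8 + ["malignant"] * 2
y_pred = ["benign"] * 8 + ["benign", "malignant"]

acc = accuracy(y_true, y_pred)           # 9 of 10 correct
bal = balanced_accuracy(y_true, y_pred)  # recall 1.0 (benign), 0.5 (malignant)

# Hypothetical per-fold scores for a five-fold cross-validation run.
fold_mean, fold_std = summarize_folds([0.75, 0.77, 0.79, 0.76, 0.78])
print(f"accuracy={acc:.3f} balanced={bal:.3f} folds={fold_mean:.3f}±{fold_std:.3f}")
```

In this toy case plain accuracy looks high because the majority class dominates, while balanced accuracy is pulled down by the missed minority-class sample, which is why both are worth reporting for skin lesion datasets with uneven class frequencies.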

Cite

Ou, C., Zhou, S., Yang, R., Jiang, W., He, H., Gan, W., … Li, J. (2022). A deep learning based multimodal fusion model for skin lesion diagnosis using smartphone collected clinical images and metadata. Frontiers in Surgery, 9. https://doi.org/10.3389/fsurg.2022.1029991
