This research targets general-purpose smart computer vision that eliminates reliance on domain-specific knowledge to reach adaptable generic models for flexible applications. It proposes a novel approach in which several deep learning models are trained for each image. Statistical information of each trained image is then calculated and stored with the loss values of each model used in the training phase. The stored information is finally used to select the appropriate model for each new image data in the testing phase. To efficiently select the appropriate model, a kNN (k Nearest Neighbors) strategy is used to select the best model in the testing phase. The developed framework called KGDL (Knowledge Guided Deep Learning) was evaluated and tested using two computer vision benchmarks, 1) ImageNet for image classification, and 2) COCO for object detection. The results reveal the effectiveness of KGDL in terms of accuracy and competitiveness of inference runtime. In particular, it achieved 94 % of classification rate in ImageNet, and 92% of intersection over union in COCO dataset.
CITATION STYLE
Djenouri, Y., Belbachir, A. N., Jhaveri, R. H., & Djenouri, D. (2023). Knowledge Guided Deep Learning for General-Purpose Computer Vision Applications. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 14184 LNCS, pp. 185–194). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-44237-7_18
Mendeley helps you to discover research relevant for your work.