Automatic Turkish Image Captioning: The Impact of Deep Machine Translation Otomatik Turkfe Goruntu Altyazilama: Derin Makine Qevirisinin Etkisi


Yildiz S., Memis A., Carli S.

8th International Conference on Computer Science and Engineering, UBMK 2023, Burdur, Turkey, 13 - 15 September 2023, pp.414-419 identifier

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/ubmk59864.2023.10286693
  • City: Burdur
  • Country: Turkey
  • Page Numbers: pp.414-419
  • Keywords: deep learning, image captioning, machine translation, Turkish image caption database
  • Yıldız Technical University Affiliated: Yes

Abstract

This paper presents a research study on the impact of deep machine translation on automatic Turkish image captioning. In the current literature of image processing, related studies on Turkish image captioning are quite limited. One of the main reasons why Turkish image captioning studies are quite limited is that large-scale datasets in Turkish for image captioning have not been constructed yet. In this study, an image caption set for Turkish was generated by using a recent deep machine translation model that becomes prominent with its high performance. In this context, for the MS COCO database, which is a commonly known and widely used image set, the original image captions written in English for images in this database were translated into Turkish using the NLLB (No Language Left Behind) deep machine translation model. In addition, a Turkish image captioning model based on the LSTM (Long Short-term Memory) which uses ResNet, ResNext and Swin deep learning structures as a backbone has also been evaluated on this derived Turkish image caption set. In performance evaluation tests, generally close performances were observed for all image encoder backbone models, and an average of 0.31 BLEU-1, 0.10 BLEU-2, 0.04 BLEU-3, 0.02 BLEU-4, 0.11 METEOR, 0.26 ROUGE-L and 0.04 CIDer values were measured. The related caption set created by using the NLLB deep machine translation model within the scope of the study has also been made available for general use over the web so that researchers working on similar topics can also benefit from it.