Automatic content moderation on social media


Karabulut D., Ozcinar C., Anbarjafari G.

MULTIMEDIA TOOLS AND APPLICATIONS, vol.82, pp.4439-4463, 2023 (SCI-Expanded) identifier identifier

  • Publication Type: Article / Article
  • Volume: 82
  • Publication Date: 2023
  • Doi Number: 10.1007/s11042-022-11968-3
  • Journal Name: MULTIMEDIA TOOLS AND APPLICATIONS
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, FRANCIS, ABI/INFORM, Applied Science & Technology Source, Compendex, Computer & Applied Sciences, INSPEC, zbMATH
  • Page Numbers: pp.4439-4463
  • Keywords: Inappropriate scene recognition, Content obfuscation, Convolutional neural networks, WEB PAGES
  • Yıldız Technical University Affiliated: No

Abstract

Millions of users produce and consume billions of content on social media. Therefore, human-reviewed content moderation is not achievable in such volume. Automating content moderation is a scalable solution for social media platforms. In this research work, we propose an automatic content moderation pipeline based on deep neural networks. Our solution consists of two main parts: the first part classifies a given image into granular content classes; and a second part obfuscates the part of a given image that might be inappropriate for the target audience. Our proposed solution is a cost-efficient in terms of human labour and practical for deploying the real-time systems. Our classification network is trained with automatically labelled data using noise-robust techniques. Our automatic obfuscation algorithm uses the information obtained from the classification network and does not require additional annotation or supplementary training. This obfuscation algorithm presents a novel-use case of class-specific activation mappings for censoring regional explicit nudity in images. The classification network achieves a top-1 accuracy of 0.903 and a top-2 accuracy of 0.986. The obfuscation algorithm covers a minimum explicitly nude area of 0.68 on average.