On the Comparative Analysis of Sequence Mining Algorithms: Case Study in Telecommunications


Tiktiklar D., Baltaoglu G., Çakir E., Kücük Z., Aktaş M. S.

6th International Conference on Computer Science and Engineering, UBMK 2021, Ankara, Türkiye, 15-17 September 2021, pp. 145-150

  • Publication Type: Conference Paper / Full-Text Paper
  • DOI: 10.1109/ubmk52708.2021.9558935
  • City of Publication: Ankara
  • Country of Publication: Türkiye
  • Pages: pp. 145-150
  • Keywords: Sequence mining, Sequential pattern mining, Sequential rule mining, Telecommunication
  • Affiliated with Yıldız Technical University: Yes

Abstract

This paper examines existing sequence mining algorithms. Sequence mining algorithms are used in many domains, including cyber-security, telecommunications, user behaviour, and air quality patterns. We outline the underlying principles of representative sequence mining algorithms and introduce a methodology for comparing them. To test the methodology, we provide a prototype testing framework. We conduct a comprehensive experimental study on publicly available data sets, a real-life telecommunication data set, and data sets produced by a data generator. We compare the GSP, PrefixSpan, and CMRules algorithms and conclude that which of the three is fastest may differ from one data set to another. Furthermore, we look for situations in which sequential pattern mining algorithms can be used in place of sequential rule mining algorithms.