A Study on Turkish Meronym Extraction Using a Variety of Lexico-Syntactic Patterns


Yildiz T., Yildirim S., Diri B.

6th Language and Technology Conference (LTC), Poznan, Poland, 7 - 9 December 2013, vol.9561, pp.386-394

  • Publication Type: Conference Paper / Full Text
  • Volume: 9561
  • DOI: 10.1007/978-3-319-43808-5_29
  • City of Publication: Poznan
  • Country of Publication: Poland
  • Pages: pp.386-394
  • Affiliated with Yıldız Technical University: Yes

Abstract

In this paper, we apply lexico-syntactic patterns to extract the meronymy relation from a large raw Turkish corpus. Once the system has taken the raw corpus and extracted the cases that match a given pattern, it proposes a list of whole-part pairs based on their co-occurrence frequencies. For this purpose, we exploit and compare a set of pattern clusters. The clusters examined fall into three types: general patterns, dictionary-based patterns, and bootstrapped patterns. We evaluate how these patterns improve system performance, especially within a corpus-based approach that uses the distributional features of words. Finally, we discuss all the experiments in a comparative analysis and show the advantages and disadvantages of the approaches, with promising results.
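
The pipeline outlined in the abstract (match a pattern against raw text, collect whole-part candidates, then propose pairs by co-occurrence frequency) can be sketched roughly as follows. This is only a minimal illustration under stated assumptions, not the authors' implementation: the English possessive pattern `TOY_PATTERN`, the helper names `extract_pairs` and `propose_pairs`, the `min_freq` threshold, and the toy corpus are all hypothetical stand-ins, and the paper's actual Turkish patterns are not reproduced here.

```python
import re
from collections import Counter

# Hypothetical stand-in pattern "<whole>'s <part>"; NOT one of the paper's
# Turkish lexico-syntactic patterns, which are not given in this record.
TOY_PATTERN = re.compile(r"\b(?P<whole>\w+)'s (?P<part>\w+)\b")


def extract_pairs(sentences, pattern):
    """Count (whole, part) candidates matched by `pattern` in the sentences."""
    counts = Counter()
    for sentence in sentences:
        for match in pattern.finditer(sentence.lower()):
            counts[(match.group("whole"), match.group("part"))] += 1
    return counts


def propose_pairs(counts, min_freq=2):
    """Keep pairs seen at least `min_freq` times, most frequent first.

    The frequency threshold is an assumption made for this sketch.
    """
    return [(pair, n) for pair, n in counts.most_common() if n >= min_freq]


# Toy corpus, invented for illustration only.
toy_corpus = [
    "The car's engine failed.",
    "He repaired the car's engine.",
    "The car's door was open.",
    "The house's roof leaked.",
]

counts = extract_pairs(toy_corpus, TOY_PATTERN)
proposals = propose_pairs(counts, min_freq=2)
print(proposals)
```

In this sketch a pair is proposed only when it recurs, which loosely mirrors the idea of relying on co-occurrence frequency rather than on a single match; a real system would presumably need morphological analysis to handle Turkish suffixes.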

Keywords

Meronym, Lexico-syntactic patterns, Corpus-based approaches