A Study on Turkish Meronym Extraction Using a Variety of Lexico-Syntactic Patterns


Yildiz T., Yildirim S., Diri B.

6th Language and Technology Conference (LTC), Poznan, Poland, 7 - 09 December 2013, vol.9561, pp.386-394 identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Volume: 9561
  • Doi Number: 10.1007/978-3-319-43808-5_29
  • City: Poznan
  • Country: Poland
  • Page Numbers: pp.386-394

Abstract

In this paper, we applied lexico-syntactic patterns to disclose meronymy relation from a huge Turkish raw text. Once, the system takes a huge raw corpus and extract matched cases for a given pattern, it proposes a list of whole-part pairs depending on their co-occur frequencies. For the purpose, we exploited and compared a list of pattern clusters. The clusters to be examined could fall into three types; general patterns, dictionary-based pattern, and bootstrapped pattern. We evaluated how these patterns improve the system performance especially within corpus-based approach and distributional feature of words. Finally, we discuss all the experiments with a comparison analysis and we showed advantage and disadvantage of the approaches with promising results.

Keywords

Meronym Lexico-syntactic patterns Corpus-based approaches 

In this paper, we applied lexico-syntactic patterns to disclose meronymy relation from a huge Turkish raw text. Once, the system takes a huge raw corpus and extract matched cases for a given pattern, it proposes a list of whole-part pairs depending on their co-occur frequencies. For the purpose, we exploited and compared a list of pattern clusters. The clusters to be examined could fall into three types; general patterns, dictionary-based pattern, and bootstrapped pattern. We evaluated how these patterns improve the system performance especially within corpusbased approach and distributional feature of words. Finally, we discuss all the experiments with a comparison analysis and we showed advantage and disadvantage of the approaches with promising results.