AUTOMATIC DISCOVERY OF SIMILAR WORDS BY SUBSTITUTE VECTORS


Creative Commons License

Düzenli İ., Amasyalı M. F.

SIGMA JOURNAL OF ENGINEERING AND NATURAL SCIENCES-SIGMA MUHENDISLIK VE FEN BILIMLERI DERGISI, vol.34, no.1, pp.125-133, 2016 (Peer-Reviewed Journal)

  • Publication Type: Article
  • Volume: 34 Issue: 1
  • Publication Date: 2016
  • Journal Name: SIGMA JOURNAL OF ENGINEERING AND NATURAL SCIENCES-SIGMA MUHENDISLIK VE FEN BILIMLERI DERGISI
  • Journal Indexes: Emerging Sources Citation Index, Academic Search Premier, Directory of Open Access Journals
  • Page Numbers: pp.125-133

Abstract

Patterns between words are commonly used for automatic information extraction. However, patterns can only find related words that occur close to each other. In this study, a method based on substitute vectors is proposed to overcome this difficulty. First, sets of words that share the same substitute vector are constructed. Then, sets of similar words are obtained according to the number of sets in which the words co-occur. In these sets, the ratio of semantically related words is above 70%. The proposed method is unsupervised because it does not require any manually labeled seed words.
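
The two steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify how substitute vectors are represented or compared, so the sketch assumes each occurrence of a word comes with a small set of substitute words and that two vectors are "the same" when those sets are equal. The function names, the `min_shared_sets` threshold, and the toy observations are all hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def group_by_substitute_vector(observations):
    """Step 1 (assumed form): collect words whose occurrences share a substitute vector.

    `observations` is an iterable of (word, substitutes) pairs. Each substitute
    vector is simplified here to an unordered set of substitute words.
    """
    groups = defaultdict(set)
    for word, substitutes in observations:
        groups[frozenset(substitutes)].add(word)
    # A set with a single word says nothing about similarity, so drop it.
    return [words for words in groups.values() if len(words) > 1]


def count_shared_sets(word_sets):
    """Step 2 (assumed form): count, for every word pair, the sets containing both."""
    counts = defaultdict(int)
    for words in word_sets:
        for first, second in combinations(sorted(words), 2):
            counts[(first, second)] += 1
    return dict(counts)


def similar_pairs(observations, min_shared_sets=2):
    """Return word pairs that co-occur in at least `min_shared_sets` sets."""
    counts = count_shared_sets(group_by_substitute_vector(observations))
    return sorted(pair for pair, n in counts.items() if n >= min_shared_sets)


# Invented toy data, for illustration only.
observations = [
    ("cat", ("dog", "pet")),
    ("kitten", ("pet", "dog")),
    ("cat", ("animal", "kitty")),
    ("kitten", ("animal", "kitty")),
    ("car", ("animal", "kitty")),
    ("car", ("truck", "bus")),
]

print(similar_pairs(observations))
```

In this toy input, "cat" and "kitten" fall into two shared sets while "car" joins them in only one, so only the first pair passes the assumed threshold of 2. No seed words are supplied at any point, which matches the unsupervised setting described in the abstract.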