AUTOMATIC DISCOVERY OF SIMILAR WORDS BY SUBSTITUTE VECTORS


Creative Commons License

Düzenli İ., Amasyalı M. F.

SIGMA JOURNAL OF ENGINEERING AND NATURAL SCIENCES-SIGMA MUHENDISLIK VE FEN BILIMLERI DERGISI, cilt.34, sa.1, ss.125-133, 2016 (ESCI) identifier

Özet

Patterns between words are generally used for automatic information extraction. However, the patterns can only find related words close to each other. In this study, a method based on substitute vectors can overcome of this difficulty. Firstly, the word sets having the same substitute vector are constructed. Then, similar word sets are obtained according to the number of co-occurring sets. In this sets, semantically relatedness ratio is above 70%. The proposed method is unsupervised. Because, it does not require any seed words manually labeled.