0 citations0 references

Interpretability analysis for Turkish word embeddings

2018pp. 1–4

Citations Over Time

Lütfi Kerem Şenel, Veysel Yücesoy, Aykut Koç, Tolga Çukur

Abstract

Due to the performance improvements they provided in natural language processing (NLP) applications, word embeddings are commonly studied and used. The algorithms that generate word embeddings, learn low dimensional, dense vector spaces that encode semantic relations among words in an unsupervised manner from large unannotated corpora. However, these vector spaces usually do not have interpretable dimensions making their semantic structure more challenging to be comprehended by the researchers. To have a better understanding of the inner structures of the word embeddings and further improve their utility, learning new, interpretable word embeddings is an active research area. In this study, a semantic category dataset (ANKAT) that contains more than 4000 unique Turkish words grouped under 62 different categories is composed to quantitatively evaluate the interpretability of the word embeddings. An interpretability analysis method based on this dataset is proposed and tested on five different embedding spaces.

Citations Over Time

Abstract

Related Papers