Take and took, gaggle and goose, book and read: Evaluating the utility of vector differences for lexical relation learning

E Vylomova, L Rimell, T Cohn, T Baldwin - arXiv preprint arXiv …, 2015 - arxiv.org
arXiv preprint arXiv:1509.01692, 2015arxiv.org
Recent work on word embeddings has shown that simple vector subtraction over pre-trained
embeddings is surprisingly effective at capturing different lexical relations, despite lacking
explicit supervision. Prior work has evaluated this intriguing result using a word analogy
prediction formulation and hand-selected relations, but the generality of the finding over a
broader range of lexical relation types and different learning settings has not been
evaluated. In this paper, we carry out such an evaluation in two learning settings:(1) spectral …
Recent work on word embeddings has shown that simple vector subtraction over pre-trained embeddings is surprisingly effective at capturing different lexical relations, despite lacking explicit supervision. Prior work has evaluated this intriguing result using a word analogy prediction formulation and hand-selected relations, but the generality of the finding over a broader range of lexical relation types and different learning settings has not been evaluated. In this paper, we carry out such an evaluation in two learning settings: (1) spectral clustering to induce word relations, and (2) supervised learning to classify vector differences into relation types. We find that word embeddings capture a surprising amount of information, and that, under suitable supervised training, vector subtraction generalises well to a broad range of relations, including over unseen lexical items.
arxiv.org
Showing the best result for this search. See all results