Word Embeddings for the Armenian Language: Intrinsic and Extrinsic Evaluation

Avetisyan, Karen; Ghukasyan, Tsolak

Computer Science > Computation and Language

arXiv:1906.03134 (cs)

[Submitted on 7 Jun 2019]

Title:Word Embeddings for the Armenian Language: Intrinsic and Extrinsic Evaluation

Authors:Karen Avetisyan, Tsolak Ghukasyan

View PDF

Abstract:In this work, we intrinsically and extrinsically evaluate and compare existing word embedding models for the Armenian language. Alongside, new embeddings are presented, trained using GloVe, fastText, CBOW, SkipGram algorithms. We adapt and use the word analogy task in intrinsic evaluation of embeddings. For extrinsic evaluation, two tasks are employed: morphological tagging and text classification. Tagging is performed on a deep neural network, using ArmTDP v2.3 dataset. For text classification, we propose a corpus of news articles categorized into 7 classes. The datasets are made public to serve as benchmarks for future models.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1906.03134 [cs.CL]
	(or arXiv:1906.03134v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1906.03134

Submission history

From: Tsolak Ghukasyan [view email]
[v1] Fri, 7 Jun 2019 14:45:49 UTC (864 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2019-06

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Karen Avetisyan
Tsolak Ghukasyan

export BibTeX citation

Computer Science > Computation and Language

Title:Word Embeddings for the Armenian Language: Intrinsic and Extrinsic Evaluation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Word Embeddings for the Armenian Language: Intrinsic and Extrinsic Evaluation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators