Multi-Label Annotation and Classification of Arabic Texts Based on Extracted Seed Keyphrases and Bi-Gram Alphabet Feed Forward Neural Networks Model

Author:

Fatma Elghannam

ACM Transactions on Asian and Low-Resource Language Information Processing, Volume 22, Issue 1

Article No.: 31, Pages 1 - 16

https://doi.org/10.1145/3539607

Published: 25 November 2022

Abstract

Text classification is a fundamental problem in natural language processing. Multi-label text classification, in which an instance can be associated with more than one label, is a particularly challenging variant. This paper presents a multi-label annotation and classification methodology for Arabic text data that has no existing multi-label annotation, with the aim of analyzing and comparing the performance of various multi-label learning approaches. The work has two phases. In the first phase, hotel reviews are automatically annotated with more than one label according to the aspects they mention, using clusters of extracted seed keyphrases. In the second phase, experiments compare the performance of various multi-label classification learning methods. The models introduced in this phase include a feed-forward neural network that learns a vector representation based on the bi-gram alphabet rather than the commonly used bag-of-words model. The bi-gram alphabet vector representation has the advantages of reduced feature dimensions and of not requiring natural language processing tools. The results indicate that the feed-forward neural network with the bi-gram alphabet vector representation is a competitive solution for the multi-label text classification problem. It achieved an accuracy of about 75.2% with a standard deviation of 0.062.
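As a rough illustration of the two ideas named in the abstract, the following is a minimal sketch and not the paper's implementation. It assumes that a bi-gram alphabet vector counts adjacent letter pairs over a fixed 28-letter Arabic alphabet, and that a review receives every label whose seed keyphrase cluster has a phrase occurring in it. The names `ALPHABET`, `bigram_vector`, `SEED_CLUSTERS`, and `annotate`, as well as the two example clusters, are hypothetical.

```python
from itertools import product

# Assumption: the 28 basic Arabic letters; the paper's actual character set is not stated here.
ALPHABET = "ابتثجحخدذرزسشصضطظعغفقكلمنهوي"

# One feature per ordered letter pair, so the vector length is len(ALPHABET) ** 2
# no matter how large the vocabulary of the corpus is.
BIGRAM_INDEX = {a + b: i for i, (a, b) in enumerate(product(ALPHABET, repeat=2))}


def bigram_vector(text):
    """Count adjacent letter pairs; pairs with a character outside ALPHABET are skipped."""
    vec = [0] * len(BIGRAM_INDEX)
    for a, b in zip(text, text[1:]):
        idx = BIGRAM_INDEX.get(a + b)
        if idx is not None:
            vec[idx] += 1
    return vec


# Hypothetical seed keyphrase clusters, one cluster per aspect label.
SEED_CLUSTERS = {
    "room": ["غرفة", "سرير"],
    "staff": ["موظف", "استقبال"],
}


def annotate(review):
    """Return every label whose cluster has at least one keyphrase in the review."""
    return sorted(
        label
        for label, phrases in SEED_CLUSTERS.items()
        if any(phrase in review for phrase in phrases)
    )
```

In this sketch no tokenizer, stemmer, or vocabulary is needed, which is one way to read the abstract's claim about reduced feature dimensions and independence from natural language processing tools. A vector from `bigram_vector` would be the input to a feed-forward network, and the label list from `annotate` would be its multi-label target.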

Acknowledgment

Thank you to all of the reviewers who took the time to read my paper and provide feedback. Their suggestions were immensely helpful, and I have addressed all the concerns they raised.

I redrafted the required portions, explained some areas in more detail, corrected typographical, grammatical, and linguistic issues, added examples, equations, and pseudo-code to clarify the technique, and included experiments to demonstrate the quality of the automatically generated labels.
