Depression Symptoms Modelling from Social Media Text: A Semi-supervised Learning Approach

Farruque, Nawshad; Goebel, Randy; Sivapalan, Sudhakar; Zaiane, Osmar

Computer Science > Computation and Language

arXiv:2209.02765 (cs)

[Submitted on 6 Sep 2022 (v1), last revised 28 Sep 2022 (this version, v3)]

Title:Depression Symptoms Modelling from Social Media Text: A Semi-supervised Learning Approach

Authors:Nawshad Farruque, Randy Goebel, Sudhakar Sivapalan, Osmar Zaiane

View PDF

Abstract:A fundamental component of user-level social media language based clinical depression modelling is depression symptoms detection (DSD). Unfortunately, there does not exist any DSD dataset that reflects both the clinical insights and the distribution of depression symptoms from the samples of self-disclosed depressed population. In our work, we describe a Semi-supervised Learning (SSL) framework which uses an initial supervised learning model that leverages 1) a state-of-the-art large mental health forum text pre-trained language model further fine-tuned on a clinician annotated DSD dataset, 2) a Zero-Shot learning model for DSD, and couples them together to harvest depression symptoms related samples from our large self-curated Depression Tweets Repository (DTR). Our clinician annotated dataset is the largest of its kind. Furthermore, DTR is created from the samples of tweets in self-disclosed depressed users Twitter timeline from two datasets, including one of the largest benchmark datasets for user-level depression detection from Twitter. This further helps preserve the depression symptoms distribution of self-disclosed Twitter users tweets. Subsequently, we iteratively retrain our initial DSD model with the harvested data. We discuss the stopping criteria and limitations of this SSL process, and elaborate the underlying constructs which play a vital role in the overall SSL process. We show that we can produce a final dataset which is the largest of its kind. Furthermore, a DSD and a Depression Post Detection (DPD) model trained on it achieves significantly better accuracy than their initial version.

Comments:	Title and relevant changes are made
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2209.02765 [cs.CL]
	(or arXiv:2209.02765v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2209.02765

Submission history

From: Nawshad Farruque [view email]
[v1] Tue, 6 Sep 2022 18:41:57 UTC (1,546 KB)
[v2] Thu, 8 Sep 2022 04:17:32 UTC (1,546 KB)
[v3] Wed, 28 Sep 2022 19:22:41 UTC (1,611 KB)

Computer Science > Computation and Language

Title:Depression Symptoms Modelling from Social Media Text: A Semi-supervised Learning Approach

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Depression Symptoms Modelling from Social Media Text: A Semi-supervised Learning Approach

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators