Nov 20, 2022 · We find that conceptor post-processing achieves state-of-the-art (SoTA) debiasing results while maintaining LLMs' performance on the GLUE benchmark.
However, such debiasing methods often fail to remove bias effectively and reduce language model performance on downstream tasks (Meade et al., 2022).
The paper uses conceptors to reduce social biases in large language models (LLMs) like BERT and GPT via two approaches: 1) post-processing to remove bias, ...
Oct 30, 2023 · We further show that conceptor-aided debiasing is robust across different LLMs, various layers of the models, and varied types of biases. Moreover, ...
Nov 20, 2022 · Pre-trained language models reflect the inherent social biases of their training corpus. Many methods have been proposed to mitigate this ...
We propose two methods of applying conceptors: (1) bias subspace projection by post-processing with the conceptor NOT operation; and (2) a new architecture, ...
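To make the first method concrete, below is a minimal NumPy sketch of conceptor-NOT post-processing, assuming the standard conceptor construction C = R(R + α⁻²I)⁻¹ from a correlation matrix R of bias-related embeddings and the negation NOT C = I − C; the function names, the aperture value, and the toy data are illustrative assumptions, not the paper's released code.

```python
import numpy as np

def conceptor(X, alpha=2.0):
    """Conceptor matrix C = R (R + alpha^-2 I)^-1, where R is the
    correlation matrix of the row-wise embeddings X (n_samples x dim)."""
    R = X.T @ X / X.shape[0]
    d = R.shape[0]
    return R @ np.linalg.inv(R + (alpha ** -2) * np.eye(d))

def debias(embeddings, bias_embeddings, alpha=2.0):
    """Post-process embeddings with the negated conceptor (I - C),
    softly projecting out the directions of the bias subspace."""
    C = conceptor(bias_embeddings, alpha)
    not_C = np.eye(C.shape[0]) - C  # conceptor NOT operation
    return embeddings @ not_C.T     # not_C is symmetric; transpose kept for clarity

# Toy usage with random 768-dim vectors standing in for BERT embeddings
# of bias-attribute word lists (hypothetical data, for illustration only).
rng = np.random.default_rng(0)
bias_X = rng.normal(size=(200, 768))   # embeddings of bias-attribute terms
sent_X = rng.normal(size=(10, 768))    # embeddings to be debiased
debiased = debias(sent_X, bias_X)
```

Unlike a hard nulling of a bias direction, the conceptor is a soft projection: the aperture α controls how aggressively variance along the bias directions is suppressed.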
Conceptor-Aided Debiasing of Large Language Models. Li Yifei, Lyle Ungar, João Sedoc. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing.
Large language models (LLMs), while powerful, exhibit harmful social biases. Debiasing is often challenging due to computational costs, data constraints, and ...