A Survey on Multilingual Large Language Models: Corpora, Alignment, and Bias
Y Xu, L Hu, J Zhao, Z Qiu, Y Ye, H Gu - arXiv preprint arXiv:2404.00929, 2024 - arxiv.org
… Thirdly, we survey the existing studies on multilingual representations and investigate
whether the … Aligning the representation of diverse languages acts as an integral part of NLP’s …
whether the … Aligning the representation of diverse languages acts as an integral part of NLP’s …
Measuring the Robustness of NLP Models to Domain Shifts
N Calderon, N Porat, E Ben-David, A Chapanin… - arXiv preprint arXiv …, 2023 - arxiv.org
… 2022). Following that, there has been an improvement in the … study the DR challenge in
modern NLP models. To this end, we constructed a new DR benchmark comprising various NLP …
modern NLP models. To this end, we constructed a new DR benchmark comprising various NLP …
Veridark: A large-scale benchmark for authorship verification on the dark web
… We evaluate competitive NLP baselines on the three datasets and perform an analysis of
the predictions to better understand the limitations of such approaches. We make the datasets …
the predictions to better understand the limitations of such approaches. We make the datasets …
Optimization Strategies for BERT-Based Named Entity Recognition
M Monteiro, C Zanchettin - Brazilian Conference on Intelligent Systems, 2023 - Springer
… In recent years, advances in transfer learning with deep neural networks based on … NLP
models, especially in low-resource languages. In this context, transfer learning enables training …
models, especially in low-resource languages. In this context, transfer learning enables training …
Label Alignment and Reassignment with Generalist Large Language Model for Enhanced Cross-Domain Named Entity Recognition
K Bao, C Yang - arXiv preprint arXiv:2407.17344, 2024 - arxiv.org
… sively discussed in the NLP community and made significant … In this study, we introduce a
label alignment and reassignment … 2022): This work utilizes a pluggable prompting method to …
label alignment and reassignment … 2022): This work utilizes a pluggable prompting method to …
Toward Exploring the Code Understanding Capabilities of Pre-trained Code Generation Models
J Lin, Y Xie, Y Yu, Y Yang, L Zhang - arXiv preprint arXiv:2406.12326, 2024 - arxiv.org
… (MLM) objective in natural language processing tasks. They … 2022) incorporate the serialized
AST as part of the input for … in representation learning of deep learning networks. Recent …
AST as part of the input for … in representation learning of deep learning networks. Recent …
Deja vu: Contrastive Historical Modeling with Prefix-tuning for Temporal Knowledge Graph Reasoning
… of various NLP applications including question answering (Yasunaga et al.… 2022), etc.
Considering facts inherently evolve in KGs over time, Temporal Knowledge Graphs (TKGs) are …
Considering facts inherently evolve in KGs over time, Temporal Knowledge Graphs (TKGs) are …
What Learned Representations and Influence Functions Can Tell Us About Adversarial Examples
… In this paper, noting significant differences between inputs in NLP and image processing (…
can help in NLP in detecting adversarial examples using learned representations, and what …
can help in NLP in detecting adversarial examples using learned representations, and what …
Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression
… exceptional performance in various natural language processing (NLP) tasks. However, the …
2022… 2022) that possess extensive language knowledge and can be effectively employed in …
2022… 2022) that possess extensive language knowledge and can be effectively employed in …
DDK: Dynamic structure pruning based on differentiable search and recursive knowledge distillation for BERT
Z Zhang, Y Lu, T Wang, X Wei, Z Wei - Neural Networks, 2024 - Elsevier
Large-scale pre-trained models, such as BERT, have demonstrated outstanding performance
in Natural Language Processing (NLP). Nevertheless, the high number of parameters in …
in Natural Language Processing (NLP). Nevertheless, the high number of parameters in …