Google Scholar

A Survey on Multilingual Large Language Models: Corpora, Alignment, and Bias

Y Xu, L Hu, J Zhao, Z Qiu, Y Ye, H Gu - arXiv preprint arXiv:2404.00929, 2024 - arxiv.org

… Thirdly, we survey the existing studies on multilingual representations and investigate
whether the … Aligning the representation of diverse languages acts as an integral part of NLP’s …

Save Cite Cited by 7 Related articles All 2 versions View as HTML

[PDF] arxiv.org

Measuring the Robustness of NLP Models to Domain Shifts

N Calderon, N Porat, E Ben-David, A Chapanin… - arXiv preprint arXiv …, 2023 - arxiv.org

… 2022). Following that, there has been an improvement in the … study the DR challenge in
modern NLP models. To this end, we constructed a new DR benchmark comprising various NLP …

[PDF] neurips.cc

Veridark: A large-scale benchmark for authorship verification on the dark web

A Manolache, F Brad, A Barbalau… - Advances in …, 2022 - proceedings.neurips.cc

… We evaluate competitive NLP baselines on the three datasets and perform an analysis of
the predictions to better understand the limitations of such approaches. We make the datasets …

Save Cite Cited by 13 Related articles All 6 versions View as HTML

Optimization Strategies for BERT-Based Named Entity Recognition

M Monteiro, C Zanchettin - Brazilian Conference on Intelligent Systems, 2023 - Springer

… In recent years, advances in transfer learning with deep neural networks based on … NLP
models, especially in low-resource languages. In this context, transfer learning enables training …

Save Cite Cited by 2 Related articles All 3 versions

[PDF] arxiv.org

Label Alignment and Reassignment with Generalist Large Language Model for Enhanced Cross-Domain Named Entity Recognition

K Bao, C Yang - arXiv preprint arXiv:2407.17344, 2024 - arxiv.org

… sively discussed in the NLP community and made significant … In this study, we introduce a
label alignment and reassignment … 2022): This work utilizes a pluggable prompting method to …

[PDF] arxiv.org

Toward Exploring the Code Understanding Capabilities of Pre-trained Code Generation Models

J Lin, Y Xie, Y Yu, Y Yang, L Zhang - arXiv preprint arXiv:2406.12326, 2024 - arxiv.org

… (MLM) objective in natural language processing tasks. They … 2022) incorporate the serialized
AST as part of the input for … in representation learning of deep learning networks. Recent …

Save Cite Related articles View as HTML

[PDF] arxiv.org

Deja vu: Contrastive Historical Modeling with Prefix-tuning for Temporal Knowledge Graph Reasoning

M Peng, B Liu, W Xu, Z Jiang, J Zhu, M Peng - arXiv preprint arXiv …, 2024 - arxiv.org

… of various NLP applications including question answering (Yasunaga et al.… 2022), etc.
Considering facts inherently evolve in KGs over time, Temporal Knowledge Graphs (TKGs) are …

[PDF] arxiv.org

What Learned Representations and Influence Functions Can Tell Us About Adversarial Examples

SM Tonni, M Dras - arXiv preprint arXiv:2309.10916, 2023 - arxiv.org

… In this paper, noting significant differences between inputs in NLP and image processing (…
can help in NLP in detecting adversarial examples using learned representations, and what …

[PDF] arxiv.org

Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression

J Liu, J Liu, Q Wang, J Wang, X Cai, D Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org

… exceptional performance in various natural language processing (NLP) tasks. However, the …
2022… 2022) that possess extensive language knowledge and can be effectively employed in …

DDK: Dynamic structure pruning based on differentiable search and recursive knowledge distillation for BERT

Z Zhang, Y Lu, T Wang, X Wei, Z Wei - Neural Networks, 2024 - Elsevier

Large-scale pre-trained models, such as BERT, have demonstrated outstanding performance
in Natural Language Processing (NLP). Nevertheless, the high number of parameters in …

Create alert

Cite

Advanced search

Saved to My library

A Survey on Multilingual Large Language Models: Corpora, Alignment, and Bias

Measuring the Robustness of NLP Models to Domain Shifts

Veridark: A large-scale benchmark for authorship verification on the dark web

Optimization Strategies for BERT-Based Named Entity Recognition

Label Alignment and Reassignment with Generalist Large Language Model for Enhanced Cross-Domain Named Entity Recognition

Toward Exploring the Code Understanding Capabilities of Pre-trained Code Generation Models

Deja vu: Contrastive Historical Modeling with Prefix-tuning for Temporal Knowledge Graph Reasoning

What Learned Representations and Influence Functions Can Tell Us About Adversarial Examples

Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression

DDK: Dynamic structure pruning based on differentiable search and recursive knowledge distillation for BERT