Lawyer LLaMA Technical Report

Huang, Quzhe; Tao, Mingxu; Zhang, Chen; An, Zhenwei; Jiang, Cong; Chen, Zhibin; Wu, Zirui; Feng, Yansong

Computer Science > Computation and Language

arXiv:2305.15062 (cs)

[Submitted on 24 May 2023 (v1), last revised 14 Oct 2023 (this version, v2)]

Title:Lawyer LLaMA Technical Report

Authors:Quzhe Huang, Mingxu Tao, Chen Zhang, Zhenwei An, Cong Jiang, Zhibin Chen, Zirui Wu, Yansong Feng

View PDF

Abstract:Large Language Models (LLMs), like LLaMA, have exhibited remarkable performance across various tasks. Nevertheless, when deployed to specific domains such as law or medicine, the models still confront the challenge of a deficiency in domain-specific knowledge and an inadequate capability to leverage that knowledge to resolve domain-related problems. In this paper, we propose a new framework to adapt LLMs to specific domains and build Lawyer LLaMA, a legal domain LLM, based on this framework. Specifically, we inject domain knowledge during the continual training stage and teach the model to learn professional skills using properly designed supervised fine-tuning tasks. Moreover, to alleviate the hallucination problem during the model's generation, we add a retrieval module and extract relevant legal articles before the model answers any queries. When learning domain-specific skills, we find that experts' experience is much more useful than experiences distilled from ChatGPT, where hundreds of expert-written data outperform tens of thousands of ChatGPT-generated ones. We will release our model and data.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2305.15062 [cs.CL]
	(or arXiv:2305.15062v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2305.15062

Submission history

From: Quzhe Huang [view email]
[v1] Wed, 24 May 2023 11:52:07 UTC (7,455 KB)
[v2] Sat, 14 Oct 2023 02:14:51 UTC (7,400 KB)

Computer Science > Computation and Language

Title:Lawyer LLaMA Technical Report

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Lawyer LLaMA Technical Report

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators