TY - GEN
T1 - The Factuality of Large Language Models in the Legal Domain
AU - El Hamdani, Rajaa
AU - Bonald, Thomas
AU - Malliaros, Fragkiskos D.
AU - Holzenberger, Nils
AU - Suchanek, Fabian
N1 - Publisher Copyright:
© 2024 Owner/Author.
PY - 2024/10/21
Y1 - 2024/10/21
N2 - This paper investigates the factuality of large language models (LLMs) as knowledge bases in the legal domain, in a realistic usage scenario: we allow for acceptable variations in the answer, and let the model abstain from answering when uncertain. First, we design a dataset of diverse factual questions about case law and legislation. We then use the dataset to evaluate several LLMs under different evaluation methods, including exact, alias, and fuzzy matching. Our results show that the performance improves significantly under the alias and fuzzy matching methods. Further, we explore the impact of abstaining and in-context examples, finding that both strategies enhance precision. Finally, we demonstrate that additional pre-training on legal documents, as seen with SaulLM, further improves factual precision from 63% to 81%.
AB - This paper investigates the factuality of large language models (LLMs) as knowledge bases in the legal domain, in a realistic usage scenario: we allow for acceptable variations in the answer, and let the model abstain from answering when uncertain. First, we design a dataset of diverse factual questions about case law and legislation. We then use the dataset to evaluate several LLMs under different evaluation methods, including exact, alias, and fuzzy matching. Our results show that the performance improves significantly under the alias and fuzzy matching methods. Further, we explore the impact of abstaining and in-context examples, finding that both strategies enhance precision. Finally, we demonstrate that additional pre-training on legal documents, as seen with SaulLM, further improves factual precision from 63% to 81%.
KW - factuality
KW - hallucination
KW - knowledge bases
KW - legal domain
KW - llm
U2 - 10.1145/3627673.3679961
DO - 10.1145/3627673.3679961
M3 - Conference contribution
AN - SCOPUS:85210015897
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 3741
EP - 3746
BT - CIKM 2024 - Proceedings of the 33rd ACM International Conference on Information and Knowledge Management
PB - Association for Computing Machinery
T2 - 33rd ACM International Conference on Information and Knowledge Management, CIKM 2024
Y2 - 21 October 2024 through 25 October 2024
ER -