TY - GEN
T1 - Can GPT-3 Perform Statutory Reasoning?
AU - Blair-Stanek, Andrew
AU - Holzenberger, Nils
AU - Van Durme, Benjamin
N1 - Publisher Copyright:
© ICAIL 2023. All rights reserved.
PY - 2023/9/7
Y1 - 2023/9/7
N2 - Statutory reasoning is the task of reasoning with facts and statutes, which are rules written in natural language by a legislature. It is a basic legal skill. In this paper we explore the capabilities of the most capable GPT-3 model, text-davinci-003, on an established statutory-reasoning dataset called SARA. We consider a variety of approaches, including dynamic few-shot prompting, chain-of-thought prompting, and zero-shot prompting. While we achieve results with GPT-3 that are better than the previous best published results, we also identify several types of clear errors it makes. We investigate why these errors happen. We discover that GPT-3 has imperfect prior knowledge of the actual U.S. statutes on which SARA is based. More importantly, we create simple synthetic statutes, which GPT-3 is guaranteed not to have seen during training. We find GPT-3 performs poorly at answering straightforward questions about these simple synthetic statutes.
AB - Statutory reasoning is the task of reasoning with facts and statutes, which are rules written in natural language by a legislature. It is a basic legal skill. In this paper we explore the capabilities of the most capable GPT-3 model, text-davinci-003, on an established statutory-reasoning dataset called SARA. We consider a variety of approaches, including dynamic few-shot prompting, chain-of-thought prompting, and zero-shot prompting. While we achieve results with GPT-3 that are better than the previous best published results, we also identify several types of clear errors it makes. We investigate why these errors happen. We discover that GPT-3 has imperfect prior knowledge of the actual U.S. statutes on which SARA is based. More importantly, we create simple synthetic statutes, which GPT-3 is guaranteed not to have seen during training. We find GPT-3 performs poorly at answering straightforward questions about these simple synthetic statutes.
KW - GPT-3
KW - law
KW - natural language processing
KW - reasoning
KW - statutes
UR - https://www.scopus.com/pages/publications/85177860113
U2 - 10.1145/3594536.3595163
DO - 10.1145/3594536.3595163
M3 - Conference contribution
AN - SCOPUS:85177860113
T3 - 19th International Conference on Artificial Intelligence and Law, ICAIL 2023 - Proceedings of the Conference
SP - 22
EP - 31
BT - 19th International Conference on Artificial Intelligence and Law, ICAIL 2023 - Proceedings of the Conference
PB - Association for Computing Machinery, Inc
T2 - 19th International Conference on Artificial Intelligence and Law, ICAIL 2023
Y2 - 19 June 2023 through 23 June 2023
ER -