Can GPT-3 Perform Statutory Reasoning?

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Statutory reasoning is the task of reasoning with facts and statutes, which are rules written in natural language by a legislature. It is a basic legal skill. In this paper we explore the capabilities of the most capable GPT-3 model, text-davinci-003, on an established statutory-reasoning dataset called SARA. We consider a variety of approaches, including dynamic few-shot prompting, chain-of-thought prompting, and zero-shot prompting. While we achieve results with GPT-3 that are better than the previous best published results, we also identify several types of clear errors it makes. We investigate why these errors happen. We discover that GPT-3 has imperfect prior knowledge of the actual U.S. statutes on which SARA is based. More importantly, we create simple synthetic statutes, which GPT-3 is guaranteed not to have seen during training. We find GPT-3 performs poorly at answering straightforward questions about these simple synthetic statutes.

Original languageEnglish
Title of host publication19th International Conference on Artificial Intelligence and Law, ICAIL 2023 - Proceedings of the Conference
PublisherAssociation for Computing Machinery, Inc
Pages22-31
Number of pages10
ISBN (Electronic)9798400701979
DOIs
Publication statusPublished - 7 Sept 2023
Event19th International Conference on Artificial Intelligence and Law, ICAIL 2023 - Braga, Portugal
Duration: 19 Jun 202323 Jun 2023

Publication series

Name19th International Conference on Artificial Intelligence and Law, ICAIL 2023 - Proceedings of the Conference

Conference

Conference19th International Conference on Artificial Intelligence and Law, ICAIL 2023
Country/TerritoryPortugal
CityBraga
Period19/06/2323/06/23

Keywords

  • GPT-3
  • law
  • natural language processing
  • reasoning
  • statutes

Fingerprint

Dive into the research topics of 'Can GPT-3 Perform Statutory Reasoning?'. Together they form a unique fingerprint.

Cite this