Skip to main navigation Skip to search Skip to main content

Caradoc: A Pragmatic Approach to PDF Parsing and Validation

  • ENAC-IIC-GEL
  • Agence Nationale de la Sécurité des Systèmes d'Information

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

PDF has become a de facto standard for exchanging electronic documents, for visualization as well as for printing. However, it has also become a common delivery channel for malware, and previous work has highlighted features that lead to security issues. In our work, we focus on the structure of the format, independently from specific features. By methodically testing PDF readers against hand-crafted files, we show that the interpretation of PDF files at the structural level may cause some form of denial of service, or be ambiguous and lead to rendering inconsistencies among readers. We then propose a pragmatic solution by restricting the syntax to avoid common errors, and propose a formal grammar for it. We explain how data consistency can be validated at a finer-grained level using a dedicated type checker. Finally, we assess this approach on a set of real-world files and show that our proposals are realistic.

Original languageEnglish
Title of host publicationProceedings - 2016 IEEE Symposium on Security and Privacy Workshops, SPW 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages126-139
Number of pages14
ISBN (Electronic)9781509008247
DOIs
Publication statusPublished - 1 Aug 2016
Externally publishedYes
Event2016 IEEE Symposium on Security and Privacy Workshops, SPW 2016 - San Jose, United States
Duration: 23 May 201625 May 2016

Publication series

NameProceedings - 2016 IEEE Symposium on Security and Privacy Workshops, SPW 2016

Conference

Conference2016 IEEE Symposium on Security and Privacy Workshops, SPW 2016
Country/TerritoryUnited States
CitySan Jose
Period23/05/1625/05/16

Keywords

  • PDF
  • caradoc
  • document structure
  • parser

Fingerprint

Dive into the research topics of 'Caradoc: A Pragmatic Approach to PDF Parsing and Validation'. Together they form a unique fingerprint.

Cite this