P4ce: Consensus over RDMA at Line Speed

  • Rémi Dulong
  • Nathan Felber
  • Pascal Felber
  • Gilles Hopin
  • Baptiste Lepers
  • Valerio Schiavoni
  • Gaël Thomas
  • Sébastien Vaucher
  • University of Neuchatel
  • EPFL
  • Ecole Polytechnique

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

P4ce is the first replication protocol that exhibits the same latency and requires the same network capacity as sending data to a single server. P4ce builds upon previous RDMA-based consensus protocols. These protocols achieve consensus in a single network round-trip, but at reduced network throughput. P4ce also achieves consensus in a single round-trip, but without degrading throughput, because it decouples the consensus decisions from the RDMA communications. The decision part of the consensus protocol runs on a commodity server, while the communication part of P4ce is fully implemented on a programmable switch, which replicates data and aggregates the acknowledgements in the network, avoiding the throughput bottleneck at the leader. Although simple in principle, the implementation of P4ce raises many challenging issues, notably caused by the complexity of RDMA and the underlying network protocols, the intricacies of packet rewriting during replication and aggregation, and the restricted set of operations that can be implemented at wire speed in the programmable switch. We implemented P4ce and deployed it on a commercially available Intel Tofino switch, achieving up to 4x better throughput and better latency than state-of-the-art consensus protocols.
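
The replicate-and-aggregate idea described in the abstract can be illustrated with a toy model. The sketch below is not the authors' implementation: the class `SwitchSketch`, the function `leader_link_packets`, and the `quorum` parameter are hypothetical names introduced here, and the rule for how many acknowledgements are needed is an assumption, since the abstract does not state it. The model also ignores RDMA headers and packet rewriting entirely.

```python
from dataclasses import dataclass, field


@dataclass
class SwitchSketch:
    """Toy model of in-network replication and acknowledgement aggregation."""
    replicas: list[str]
    quorum: int  # assumption: number of acks needed before notifying the leader
    acks: dict[int, set[str]] = field(default_factory=dict)
    notified: set[int] = field(default_factory=set)

    def on_write(self, seq: int, payload: bytes) -> list[tuple[str, int, bytes]]:
        """The leader sends one packet; the switch emits one copy per replica."""
        self.acks[seq] = set()
        return [(replica, seq, payload) for replica in self.replicas]

    def on_ack(self, seq: int, replica: str) -> bool:
        """Return True exactly once per write: when the quorum is first reached."""
        if seq not in self.acks or seq in self.notified:
            return False
        self.acks[seq].add(replica)  # a set, so duplicate acks count once
        if len(self.acks[seq]) >= self.quorum:
            self.notified.add(seq)
            return True
        return False


def leader_link_packets(n_replicas: int, via_switch: bool) -> int:
    """Packets crossing the leader's own link per write (data out plus acks in)."""
    return 2 if via_switch else 2 * n_replicas
```

In this model, a leader that replicates by itself to three replicas puts six packets per write on its own link, whereas with the switch doing the fan-out and the aggregation it sends one packet and receives one acknowledgement, which is the single-server cost the abstract refers to.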

Original language: English
Title of host publication: Proceedings - 2024 IEEE 44th International Conference on Distributed Computing Systems, ICDCS 2024
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 508-519
Number of pages: 12
ISBN (Electronic): 9798350386059
DOIs
Publication status: Published - 1 Jan 2024
Event: 44th IEEE International Conference on Distributed Computing Systems, ICDCS 2024 - Jersey City, United States
Duration: 23 Jul 2024 to 26 Jul 2024

Publication series

Name: Proceedings - International Conference on Distributed Computing Systems
ISSN (Print): 1063-6927
ISSN (Electronic): 2575-8411

Conference

Conference: 44th IEEE International Conference on Distributed Computing Systems, ICDCS 2024
Country/Territory: United States
City: Jersey City
Period: 23/07/24 to 26/07/24

Keywords

  • consensus
  • programmable switch
  • rdma
  • smr
  • tofino
