TY - GEN
T1 - Quarts
T2 - 22nd IEEE International Conference on Emerging Technologies and Factory Automation, ETFA 2017
AU - Saab, Wajeb
AU - Mohiuddin, Maaz
AU - Bliudze, Simon
AU - Le Boudec, Jean Yves
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/6/28
Y1 - 2017/6/28
N2 - Real-time control systems (RTCSs) tolerate delay and crash faults by replicating the controller. Each replica computes and issues setpoints to actuators over a network that might drop or delay messages. Hence, the actuators might receive an inconsistent set of setpoints. Such inconsistency is avoided either by having a single primary replica compute and issue setpoints (in passive replication) or a consensus algorithm select one sending-replica (in active replication). However, due to the impossibility of a perfect failure-detector, passive-replication schemes can have multiple primaries, causing inconsistency, especially in the presence of intermittent delay faults. Furthermore, the impossibility of bounded-latency consensus causes both schemes to have poor real-time performance. We identified three properties of RTCSs that enable active-replication schemes to agree on the measurements before computing, instead of using traditional consensus. As all computing replicas compute with the same state, the resulting setpoints are guaranteed to be consistent. We present the design of Quarts, an agreement solution for active replication that guarantees consistency and bounded latency-overhead. We prove the guarantees and compare the performance of Quarts with existing solutions through simulation. We show that Quarts provides an availability higher than existing solutions, and that the availability improvement is up to 10x with two replicas.
AB - Real-time control systems (RTCSs) tolerate delay and crash faults by replicating the controller. Each replica computes and issues setpoints to actuators over a network that might drop or delay messages. Hence, the actuators might receive an inconsistent set of setpoints. Such inconsistency is avoided either by having a single primary replica compute and issue setpoints (in passive replication) or a consensus algorithm select one sending-replica (in active replication). However, due to the impossibility of a perfect failure-detector, passive-replication schemes can have multiple primaries, causing inconsistency, especially in the presence of intermittent delay faults. Furthermore, the impossibility of bounded-latency consensus causes both schemes to have poor real-time performance. We identified three properties of RTCSs that enable active-replication schemes to agree on the measurements before computing, instead of using traditional consensus. As all computing replicas compute with the same state, the resulting setpoints are guaranteed to be consistent. We present the design of Quarts, an agreement solution for active replication that guarantees consistency and bounded latency-overhead. We prove the guarantees and compare the performance of Quarts with existing solutions through simulation. We show that Quarts provides an availability higher than existing solutions, and that the availability improvement is up to 10x with two replicas.
KW - Agreement
KW - Consensus
KW - Consistency
KW - Control systems
KW - Real-time
KW - Replication
UR - https://www.scopus.com/pages/publications/85044437993
U2 - 10.1109/ETFA.2017.8247590
DO - 10.1109/ETFA.2017.8247590
M3 - Conference contribution
AN - SCOPUS:85044437993
T3 - IEEE International Conference on Emerging Technologies and Factory Automation, ETFA
SP - 1
EP - 8
BT - 2017 22nd IEEE International Conference on Emerging Technologies and Factory Automation, ETFA 2017
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 12 September 2017 through 15 September 2017
ER -