Abstract
The k-set agreement problem is a generalization of the consensus problem: considering a system made up of n processes where each process proposes a value, each non-faulty process has to decide a value such that a decided value is a proposed value, and no more than k different values are decided. It has recently be shown that, in the crash failure model, \(\min(\lfloor \frac{f}{k}\rfloor+2,\lfloor \frac{t}{k}\rfloor +1)\) is a lower bound on the number of rounds for the non-faulty processes to decide (where t is an upper bound on the number of process crashes, and f, 0≤f≤t, the actual number of crashes).
This paper considers the k-set agreement problem in synchronous systems where up to t<n/2 processes can experience general omission failures (i.e., a process can crash or omit sending or receiving messages). It first introduces a new property, called strong termination. This property is on the processes that decide. It is satisfied if, not only every non-faulty process, but any process that neither crashes nor commits receive omission failure decides. The paper then presents a k-set agreement protocol that enjoys the following features. First, it is strongly terminating (to our knowledge, it is the first agreement protocol to satisfy this property, whatever the failure model considered). Then, it is early deciding and stopping in the sense that a process that either is non-faulty or commits only send omission failures decides and halts by round \(\min(\lfloor \frac{f}{k}\rfloor+2,\lfloor \frac{t}{k}\rfloor +1)\) . To our knowledge, this is the first early deciding k-set agreement protocol for the general omission failure model. Moreover, the protocol provides also the following additional early stopping property: a process that commits receive omission failures (and does not crash) executes at most \(\min(\lceil \frac{f}{k}\rceil +2,\lfloor \frac{t}{k}\rfloor +1)\) rounds. It is worth noticing that the protocol allows each property (strong termination vs early deciding/stopping vs early stopping) not to be obtained at the detriment of the two others.
The combination of the fact that \(\min(\lfloor \frac{f}{k}\rfloor+2,\lfloor \frac{t}{k}\rfloor +1)\) is a lower bound on the number of rounds in the crash failure model, and the very existence of the proposed protocol has two noteworthy consequences. First, it shows that, although the general omission failure model is more severe than the crash failure model, both models have the same lower bound for the non-faulty processes to decide. Second, it shows that, in the general omission failure model, this bound applies also the processes that do not crash and commit only send omission failures.
Similar content being viewed by others
References
Aguilera, M.K., Toueg, S.: A simple bivalency proof that t-resilient consensus requires t+1 rounds. Inf. Process. Lett. 71, 155–178 (1999)
Attiya, H., Welch, J.: Distributed Computing, Fundamentals, Simulation and Advanced Topics (2nd edn.). Wiley Series on Parallel and Distributed Computing. Wiley, New York (2004)
Bazzi, R.A., Neiger, G.: Simplifying fault-tolerance: providing the abstraction of crash failures. J. ACM 48(3), 499–554 (2001)
Biran, O., Moran, S., Zaks, S.: A combinatorial characterization of the distributed 1-solvable tasks. J. Algorithms 11(3), 420–440 (1990)
Biran, O., Moran, S., Zaks, S.: Tight bounds on the round complexity of distributed 1-solvable tasks. Theor. Comput. Sci. 145(1-2), 271–290 (1995)
Borowsky, E., Gafni, E.: Generalized FLP impossibility results for t-resilient asynchronous computations. In: Proc. 25th ACM Symposium on Theory of Computation (STOC’93), California, USA, pp. 91–100 (1993)
Charron-Bost, B., Schiper, A.: Uniform consensus is harder than consensus. J. Algorithms 51(1), 15–37 (2004)
Chaudhuri, S.: More Choices allow more faults: set consensus problems in totally asynchronous systems. Inf. Comput. 105, 132–158 (1993)
Chaudhuri, S., Herlihy, M., Lynch, N., Tuttle, M.: Tight bounds for k-set agreement. J. ACM 47(5), 912–943 (2000)
Dolev, D., Reischuk, R., Strong, R.: Early stopping in byzantine agreement. J. ACM 37(4), 720–741 (1990)
Fischer, M.J., Lynch, N.A.: A lower bound on the time to assure interactive consistency. Inf. Process. Lett. 14(4), 183–186 (1982)
Fischer, M.J., Lynch, N.A., Paterson, M.S.: Impossibility of distributed consensus with one faulty process. J. ACM 32(2), 374–382 (1985)
Gafni, E., Guerraoui, R., Pochon, B.: From a static impossibility to an adaptive lower bound: the complexity of early deciding set agreement. In: Proc. 37th ACM Symposium on Theory of Computing (STOC’05), Baltimore, MD, pp. 714–722, May 2005
Guerraoui, R., Herlihy, M., Pochon, B.: A topological treatment of early-deciding set agreement. In: Proc. 10th International Conference on Principles of Distributed Systems (OPODIS’06). Lecture Notes in Computer Science, vol. 4305, pp. 20–35. Springer, Berlin (2006)
Guerraoui, R., Pochon, B.: The complexity of early deciding set agreement: how topology can help? In: Proc. 4th Workshop in Geometry and Topology in Concurrency and Distributed Computing (GETCO’04). BRICS Notes Series, NS-04-2, pp. 26-31, Amsterdam, NL (2004)
Hadzilacos, V.: Issues of fault tolerance in concurrent computations. PhD thesis, Tech. Report 11-84, Harvard University, Cambridge, MA (1985)
Hadzilacos, V., Toueg, S.: Reliable broadcast and related problems. In: Mullender, S. (ed.) Distributed Systems, pp. 97–145. ACM Press, New York (1993)
Herlihy, M.P., Penso, L.D.: Tight bounds for k-set agreement with limited scope accuracy failure detectors. Distrib. Comput. 18(2), 157–166 (2005)
Herlihy, M.P., Shavit, N.: The topological structure of asynchronous computability. J. ACM 46(6), 858–923 (1999)
Keidar, I., Rajsbaum, S.: A simple proof of the uniform consensus synchronous lower bound. Inf. Process. Lett. 85, 47–52 (2003)
Lamport, L., Fischer, M.: Byzantine generals and transaction commit protocols. Unpublished manuscript, 16 pages, April 1982
Lynch, N.A.: Distributed Algorithms. Morgan Kaufmann, San Francisco (1996)
Mostéfaoui, A., Raynal, M.: k-set agreement with limited accuracy failure detectors. In: Proc. 19th ACM Symposium on Principles of Distributed Computing (PODC’00), Portland, pp. 143–152. ACM, New York (2000)
Mostéfaoui, A., Raynal, M.: Randomized set agreement. In: Proc. 13th ACM Symposium on Parallel Algorithms and Architectures (SPAA’01), Hersonissos, Crete, pp. 291–297. ACM, New York (2001)
Neiger, G., Toueg, S.: Automatically increasing the fault-tolerance of distributed algorithms. J. Algorithms 11, 374–419 (1990)
Pease, L., Shostak, R., Lamport, L.: Reaching agreement in presence of faults. J. ACM 27(2), 228–234 (1980)
Perry, K.J., Toueg, S.: Distributed agreement in the presence of processor and communication faults. IEEE Trans. Softw. Eng. SE-12(3), 477–482 (1986)
Raïpin Parvédy, Ph., Raynal, M.: Optimal early stopping uniform consensus in synchronous systems with process omission failures. In: Proc. 16th ACM Symposium on Parallel Algorithms and Architectures (SPAA’04), Barcelona, pp. 302–310. ACM, New York (2004)
Raïpin Parvédy, Ph., Raynal, M., Travers, C.: Early-stopping k-set agreement in synchronous systems prone to any number of process crashes. In: 8th International Conference on Parallel Computing Technologies (PaCT’05), Krasnoyarsk, Russia. Lecture Notes in Computer Science, vol. 3606, pp. 49–58. Springer, Berlin (2005)
Raïpin Parvédy, Ph., Raynal, M., Travers, C.: Decision optimal early-stopping k-set agreement in synchronous systems prone to send omission failures. In: Proc. 11th IEEE Pacific Rim International Symposium on Dependable Computing (PRDC’05), Changsha, pp. 23–30. IEEE Computer Press, New York (2005)
Raïpin Parvédy, Ph., Raynal, M., Travers, C.: strongly terminating early-stopping k-set agreement in synchronous systems with general omission failures. In: Proc. 13th Colloquium on Structural Information and Communication Complexity (SIROCCO’06), Liverpool. Lecture Notes in Computer Science, vol. 4056, pp. 182–196. Springer, Berlin (2006)
Raynal, M.: Consensus in synchronous systems: a concise guided tour. In: Proc. 9th IEEE Pacific Rim International Symposium on Dependable Computing (PRDC’02), Tsukuba, Japan, pp. 221–228. IEEE Computer Press, Berlin (2002)
Raynal, M., Travers, C.: Synchronous set agreement: a concise guided tour (including a new algorithm and a list of open problems). In: Proc. 12th IEEE Pacific Rim International Symposium on Dependable Computing (PRDC’05), Riverside, CA, pp. 267–274. IEEE Computer Press, Berlin (2006)
Saks, M., Zaharoglou, F.: Wait-free k-set agreement is impossible: the topology of public knowledge. SIAM J. Comput. 29(5), 1449–1483 (2000)
Yang, J., Neiger, G., Gafni, E.: Structured derivations of consensus algorithms for failure detectors. In: Proc. 17th International ACM Symposium on Principles of Distributed Computing (PODC’98), Puerto Vallarta, Mexico, pp. 297–308. ACM Press, New York (1998)
Author information
Authors and Affiliations
Corresponding author
Additional information
An extended abstract of a preliminary version of this paper has appeared in the proceedings of SIROCCO 2006 [31].
Rights and permissions
About this article
Cite this article
Raïpin Parvédy, P., Raynal, M. & Travers, C. Strongly Terminating Early-Stopping k-Set Agreement in Synchronous Systems with General Omission Failures. Theory Comput Syst 47, 259–287 (2010). https://doi.org/10.1007/s00224-008-9157-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00224-008-9157-3