Abstract
Parallel programs can be nondeterministic: consecutive runs with the same input can result in different executions. Therefore we cannot use cyclic debugging techniques. In order to be able to use those techniques we need a tool that traces information about an execution so it can be replayed for debugging. Because the recording interferes with the program, possibly perturbating the execution, we must limit the amount of information and keep the algorithm simple. This paper presents an implementation of the ROLT replay mechanism for a multithreaded operating system (Solaris).
Michiel Ronsse is supported by a grant from the Flemish Institute for the Promotion of the Scientific-Technological Research in the Industry (IWT).
Chapter PDF
Similar content being viewed by others
References
Leslie Lamport. Time, clocks, and the ordering of events in a distributed system. Communications of the ACM, 21(7):558–565, July 1978.
Luk J. Levrouw, Koenraad M. Audenaert, and Jan M. Van Campenhout. A new trace and replay system for shared memory programs based on Lamport Clocks. In Proceedings of the Second Euromicro Workshop on Parallel and Distributed Processing, pages 471–478. IEEE Computer Society Press, January 1994.
Thomas J. LeBlanc and John M. Mellor-Crummey. Debugging parallel programs with Instant Replay. IEEE Transactions on Computers, C-36(4):471–482, April 1987.
Robert H.B. Netzer. Optimal tracing and replay for debugging sharedmemory parallel programs. In Proceedings ACM/ONR Workshop on Parallel and Distributed Debugging, pages 1–11, May 1993.
M.A. Ronsse, L.J. Levrouw, and K. Bastiaens. Efficient coding of execution-traces of parallel programs. In J. P. Veen, editor, Proceedings of the ProRISC/IEEE Benelux Workshop on Circuits, Systems and Signal Processing, pages 251–258. STW, Utrecht, March 1995.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1996 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ronsse, M., Levrouw, L. (1996). On the implementation of a replay mechanism. In: Bougé, L., Fraigniaud, P., Mignotte, A., Robert, Y. (eds) Euro-Par'96 Parallel Processing. Euro-Par 1996. Lecture Notes in Computer Science, vol 1123. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-61626-8_7
Download citation
DOI: https://doi.org/10.1007/3-540-61626-8_7
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-61626-9
Online ISBN: 978-3-540-70633-5
eBook Packages: Springer Book Archive