Abstract
This paper emphasizes the power of monitoring distributed real-time systems as a promising tool for both scientific work and practical purposes. Starting from a number of well-known problems with today's (industrial) real-time systems, it gives a classification of remedial monitoring applications. The most important features of a monitoring system suitable for such purposes are discussed and related to current research on monitoring (general) distributed systems. Finally, some of the resulting conceptual issues underlying our prototype VTA monitoring system, currently under development at our department, are presented.
Cite this article
Schmid, U. Monitoring distributed real-time systems. Real-Time Syst 7, 33–56 (1994). https://doi.org/10.1007/BF01088621
DOI: https://doi.org/10.1007/BF01088621