Abstract
Applications executing on complex computational systems provide a challenge for the development of runtime performance monitoring software. We discuss a computational model, application monitoring, data access models, and profiler functionality. We define data consistency within and across threads as well as across contexts and nodes. We describe the TAU runtime monitoring framework which enables on-demand, low-interference data access to TAU profile data and provides the flexibility to enforce data consistency at the thread, context or node level. We present an example of a Java-based runtime performance monitor utilizing the framework.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Gannon, D., Beckman, P., Johnson, E., Green, T., Levine, M.: HPC++ and the HPC++LIB Toolkit, Technical Report Department of Computer Science, Indiana University (1998)
Laure, E., Mehrotra, P., Zima, H.: Opus: Heterogeneous Computing With Data Parallel Tasks, Technical Report TR 99-04, Institute for Software Technology and Parallel Systems, University of Vienna (1999), http://www.par.univie.ac.at
OpenMP: OpenMP Fortran Interpretations Versions 1.0(1999), http://www.openmp.org
Chandy, K., Kesselman, C.: CC++: A Declarative Concurrent Object Oriented Programming Notation. In: Agha, G., Wegner, P., Yonesawa, A. (eds.) Research Directions in Concurrent Object Oriented Programming, pp. 218–313. MIT Press, Cambridge (1993)
O.M.G.: CORBA/IIOP 2.2 Specification (1998), http://www.omg.org
The Staff, Advanced Computing Laboratory, Los Alamos National Laboratory: Taming Complexity in High-Performance Computing. White Paper (November 1998), Accessible from http://www.acl.lanl.gov/software
Reynders, J., et al.: Pooma: A Framework for Scientific Simulation on Parallel Architectures. In: Wilson, G., Lu, P. (eds.) Parallel Programming using C++, pp. 553–594. MIT Press, Cambridge (1996)
Miller, B., Callaghan, M., Cargille, J., Hollingsworth, J., Irvin, R., Karavanic, K., Kunchithapadam, K., Newhall, T.: The Paradyne Parallel Performance Measurement Tools. IEEE Computer 28(11) (1995)
Shende, S., Cuny, J., Hansen, L., Kundu, J., McLaughry, S., Wolf, O.: Event and State Based Debugging in TAU: A Prototype. In: Proc. of ACM SIGMETRICS Symp. on Parallel and Distributed Tools, pp. 21–30 (May 1996)
Cuny, J., Dunn, R., Hackstadt, S., Harrop, C., Hersey, H., Malony, A., Toomey, D.: Building Domain-Specific Environments for Computational Science: A Case Study in Seismic Tomography. Intl. Jour. of Supercomputing Applications and High Performance Computing 11 (March 1997)
Shende, S.: Profiling and Tracing in Linux. In: Proc. Second Extreme Linux Workshop #2, USENIX Annual Technical Conference, pp. 26–30 (1999)
Hackstadt, S., Harrop, C., Malony, A.: A Framework for Interacting with Distributed Programs and Data. In: Proc. of the Seventh Int’l Symp. on High Performance Distributed Computing 1998 (HPDC-7). IEEE, Los Alamitos (July 1998)
Hackstadt, S., Malony, A.: DAQV: Distributed Array Query and Visualization Framework. Journal of Theoretical Computer Science, special issue on Parallel Computing 196(1-2), 289–317 (1998)
Shende, S., Malony, A.D., Hackstadt, S.: Dynamic Performance Callstack Sampling: Merging TAU and DAQV. In: Kågström, B., Elmroth, E., Waśniewski, J., Dongarra, J. (eds.) PARA 1998. LNCS, vol. 1541, pp. 515–520. Springer, Heidelberg (1998)
Shende, S., Malony, A.D., Cuny, J., Lindlan, K., Beckman, P., Karmesin, S.: Portable Profiling and Tracing for Parallel, Scientific Applications using C++. In: Proc. of ACM SIGMETRICS Symp. on Parallel and Distributed Tools, pp. 134–145 (August 1998)
Advanced Computing Laboratory (LANL): TAU Portable Profiling (1998), http://www.acl.lanl.gov/tau
Ludwig, T., Wismüller, R., Sunderam, V., Bode, A.: OMIS – On-line Monitoring Interface Specification (Version 2.0), Vol. 9, LRR-TUM Research Report Series. Springer, Germany (1997), ISBN 3-8265-3035-7
Wismüller, R., Trinitis, J., Ludwig, T.: OCM – A Monitoring System for Interoperable Tools. In: Proceedings of the SIGMETRICS Symposium on Parallel and Distributed Tools, pp. 1–9 (August 1998)
Ribler, R., Vetter, J., Simitci, H., Reed, D.: Autopilot: Adaptive Control of Distributed Applications. In: Proceedings of the 7th IEEE International Symposium on High Performance Distributed Computing, pp. 172–179 (July 1998)
Eisenhauer, G., Schwan, K.: An Object-Based Infrastructure for Program Monitoring and Steering. In: Proceedings of the SIGMETRICS Symposium on Parallel and Distributed Tools, pp. 10–20 (August 1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sheehan, T.J., Malony, A.D., Shende, S.S. (1999). A Runtime Monitoring Framework for the TAU Profiling System. In: Matsuoka, S., Tholburn, M. (eds) Computing in Object-Oriented Parallel Environments. ISCOPE 1999. Lecture Notes in Computer Science, vol 1732. Springer, Berlin, Heidelberg. https://doi.org/10.1007/10704054_18
Download citation
DOI: https://doi.org/10.1007/10704054_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66818-3
Online ISBN: 978-3-540-46697-0
eBook Packages: Springer Book Archive