Performance Analytics: Understanding Parallel Applications Using Cluster and Sequence Analysis

Gonzalez, Juan; Gimenez, Judit; Labarta, Jesus

doi:10.1007/978-3-319-08144-1_1

Juan Gonzalez⁵,
Judit Gimenez⁵ &
Jesus Labarta⁵

492 Accesses

Abstract

Due to the increasing complexity of High Performance Computing (HPC) systems and applications it is necessary to maximize the insight of the performance data extracted from an application execution. This is the mission of the Performance Analytics field. In this chapter, we present three different contributions to this field. First, we demonstrate how it is possible to capture the computation structure of parallel applications at fine grain by using density-based clustering algorithms. Second, we introduce the use of multiple sequence alignment algorithms to asses the quality of a computation structure provided by the cluster analysis. Third, we propose a new clustering algorithm to maximize the quality of the computation structure detected minimizing the user intervention. To demonstrate the usefulness of the different techniques, we also present three use cases.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Ocelotl: Large Trace Overviews Based on Multidimensional Data Aggregation

MPI Trace Compression Using Event Flow Graphs

Performance Analysis Tool for HPC and Big Data Applications on Scientific Clusters

Notes

1.
This scenario also mimics using OpenMP to parallelize the large computation regions.

References

Ahn, D.H., Vetter, J.S.: Scalable analysis techniques for microprocessor performance counter metrics. In: ACM/IEEE Conference on Supercomputing (SC), Baltimore (2002)
Google Scholar
Carrington, L., Snavely, A., Gao, X., Wolter, N.: A performance prediction framework for scientific applications. In: 3rd International Conference on Computational Science (ICCS), Saint Petersburg/Melbourne (2003)
Google Scholar
Ester, M., Kriegel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: 2nd International Conference on Knowledge Discovery and Data Mining (KDD), Portland (1996)
Google Scholar
Hartigan, J., Wong, M.: Algorithm AS 136: a K-means clustering algorithm. J. R. Stat. Soc. Ser. C (Appl. Stat.) 28, 100–108 (1979)
Google Scholar
Huck, K.A., Malony, A.D.: PerfExplorer: a performance data mining framework for large-scale parallel computing. In: ACM/IEEE Conference on Supercomputing (SC), Seattle (2005)
Google Scholar
Joshi, A., Phansalkar, A., Eeckhout, L., John, L.K.: Measuring benchmark similarity using inherent program characteristics. IEEE Trans. Comput. 55(6), 769–782 (2006)
Article Google Scholar
Nickolayev, O.Y., Roth, P.C., Reed, D.A.: Real-time statistical clustering for event trace reduction. Int. J. Supercomput. Appl. High Perform. Comput. 11(2), 144–159 (1997)
Article Google Scholar
Pelleg, D., Moore, A.W.: X-means: extending K-means with efficient estimation of the number of clusters. In: 17th International Conference on Machine Learning (ICML), Stanford (2000)
Google Scholar
Sherwood, T., Perelman, E., Hamerly, G., Calder, B.: Automatically characterizing large scale program behavior. In: 10th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), San Jose (2002)
Google Scholar
Sprunt, B.: The basics of performance-monitoring hardware. IEEE Micro. 22(4), 64–71 (2002)
Article Google Scholar
Vianney, D., Mericas, A., Maron, B., Chen, T., Kunkel, S., Olszewski, B.: CPI analysis on POWER5, Part 2: introducing the CPI breakdown model. http://www-128.ibm.com/developerworks/library/pa-cpipower2

Download references

Acknowledgements

The work presented in this chapter has been partially founded by IBM, through the IBM-BSC MareIncognito collaboration agreement, the Spanish Ministry of Education under grant BES-2005-7919 and project TIN2007-60625, and the EU/Russia joint project HOPSA.

Author information

Authors and Affiliations

Barcelona Supercomputing Center/Polytechnic University of Catalonia, c/Jordi Girona, 31, 08034, Barcelona, Catalunya, Spain
Juan Gonzalez, Judit Gimenez & Jesus Labarta

Authors

Juan Gonzalez
View author publications
You can also search for this author in PubMed Google Scholar
Judit Gimenez
View author publications
You can also search for this author in PubMed Google Scholar
Jesus Labarta
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Juan Gonzalez .

Editor information

Editors and Affiliations

Zentrum für Informationsdienste und Hochleistungsrechnen (ZIH), Technische Universität Dresden, Dresden, Germany
Andreas Knüpfer
Höchstleistungsrechenzentrum Stuttgart (HLRS), Universität Stuttgart, Stuttgart, Germany
José Gracia
Zentrum für Informationsdienst und Hochleistungsrechnen (ZIH), Technische Universität Dresden, Dresden, Germany
Wolfgang E. Nagel
Höchstleistungsrechenzentrum Stuttgart (HLRS), Universität Stuttgart, Stuttgart, Germany
Michael M. Resch

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gonzalez, J., Gimenez, J., Labarta, J. (2014). Performance Analytics: Understanding Parallel Applications Using Cluster and Sequence Analysis. In: Knüpfer, A., Gracia, J., Nagel, W., Resch, M. (eds) Tools for High Performance Computing 2013. Springer, Cham. https://doi.org/10.1007/978-3-319-08144-1_1

Download citation

DOI: https://doi.org/10.1007/978-3-319-08144-1_1
Published: 02 September 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08143-4
Online ISBN: 978-3-319-08144-1
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics

Performance Analytics: Understanding Parallel Applications Using Cluster and Sequence Analysis

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Ocelotl: Large Trace Overviews Based on Multidimensional Data Aggregation

MPI Trace Compression Using Event Flow Graphs

Performance Analysis Tool for HPC and Big Data Applications on Scientific Clusters

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Performance Analytics: Understanding Parallel Applications Using Cluster and Sequence Analysis

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Ocelotl: Large Trace Overviews Based on Multidimensional Data Aggregation

MPI Trace Compression Using Event Flow Graphs

Performance Analysis Tool for HPC and Big Data Applications on Scientific Clusters

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation