default search action
10th VECPAR 2012: Kobe, Japan
- Michel J. Daydé, Osni Marques, Kengo Nakajima:
High Performance Computing for Computational Science - VECPAR 2012, 10th International Conference, Kobe, Japan, July 17-20, 2012, Revised Selected Papers. Lecture Notes in Computer Science 7851, Springer 2013, ISBN 978-3-642-38717-3
Invited Presentations
- Horst D. Simon:
Barriers to Exascale Computing. 1-3 - Richard W. Vuduc, Kenneth Czechowski:
Toward a Theory of Algorithm-Architecture Co-design. 4-8 - Takashi Furumura:
Visualization of Strong Ground Motion from the 2011 Off Tohoku, Japan (Mw=9.0) Earthquake Obtained from Dense Nation-Wide Seismic Network and Large-Scale Parallel FDM Simulation. 9-16 - Ryutaro Himeno:
Grand Challenge in Life Science on K Computer. 17-22 - Kenji Ono, Tomohiro Kawanabe, Toshio Hatada:
HPC/PF - High Performance Computing Platform: An Environment That Accelerates Large-Scale Simulations. 23-27
GPU Computing
- Jakub Kurzak, Piotr Luszczek, Mathieu Faverge, Jack J. Dongarra:
Programming the LU Factorization for a Multicore System with Accelerators. 28-35 - Rohit Gupta, Martin B. van Gijzen, Cornelis Vuik:
Efficient Two-Level Preconditioned Conjugate Gradient Method on the GPU. 36-49 - Andrés Tomás, Zhaojun Bai, Vicente Hernández:
Parallelization of the QR Decomposition with Column Pivoting Using Column Cyclic Distribution on Multicore and GPU Processors. 50-58 - Toshiyuki Imamura, Susumu Yamada, Masahiko Machida:
A High Performance SYMV Kernel on a Fermi-core GPU. 59-71 - Ahmad Abdelfattah, Jack J. Dongarra, David E. Keyes, Hatem Ltaief:
Optimizing Memory-Bound SYMV Kernel on GPU Hardware Accelerators. 72-79
Applications
- Hajime Yamamoto, Shinichi Nanai, Keni Zhang, Pascal Audigane, Christophe Chiaberge, Ryusei Ogata, Noriaki Nishikawa, Yuichi Hirokawa, Satoru Shingu, Kengo Nakajima:
Numerical Simulation of Long-Term Fate of CO2 Stored in Deep Reservoir Rocks on Massively Parallel Vector Supercomputer. 80-92 - Jinfang Gao, Huilin Xing:
High Performance Simulation of Complicated Fluid Flow in 3D Fractured Porous Media with Permeable Material Matrix Using LBM. 93-104 - M. L. L. Wijerathne, Muneo Hori, Tsuyoshi Ichimura, Seizo Tanaka:
Parallel Scalability Enhancements of Seismic Response and Evacuation Simulations of Integrated Earthquake Simulator. 105-117 - Anthony Scemama, Michel Caffarel, Emmanuel Oseret, William Jalby:
QMC=Chem: A Quantum Monte Carlo Program for Large-Scale Simulations in Chemistry at the Petascale Level and beyond. 118-127
Finite Element Method from Various Viewpoints
- Niclas Jansson:
Optimizing Sparse Matrix Assembly in Finite Element Solvers with One-Sided Communication. 128-139 - Satoshi Ohshima, Masae Hayashi, Takahiro Katagiri, Kengo Nakajima:
Implementation and Evaluation of 3D Finite Element Method Application for CUDA. 140-148 - Alberto F. De Souza, Lucas de Paula Veronese, Leonardo Muniz de Lima, Claudine Badue, Lucia Catabriga:
Evaluation of Two Parallel Finite Element Implementations of the Time-Dependent Advection Diffusion Problem: GPU versus Cluster Considering Time and Energy Consumption. 149-162
Cloud and Visualization
- Germán Moltó, Amanda Calatrava, Vicente Hernández:
A Service-Oriented Architecture for Scientific Computing on Cloud Infrastructures. 163-176 - Alexandre Solon Nery, Nadia Nedjah, Felipe M. G. França, Lech Józwiak:
Interactive Volume Rendering Based on Ray-Casting for Multi-core Architectures. 177-186
Performance
- Franz Franchetti, Yevgen Voronenko, Gheorghe Almási:
Automatic Generation of the HPC Challenge's Global FFT Benchmark for BlueGene/P. 187-200 - Edgar Solomonik, James Demmel:
Matrix Multiplication on Multidimensional Torus Networks. 201-215
Methods and Tools for Advanced Scientific Computing
- Babak Hejazialhosseini, Christian Conti, Diego Rossinelli, Petros Koumoutsakos:
High Performance CPU Kernels for Multiphase Compressible Flows. 216-225 - Yasunori Futamura, Tetsuya Sakurai, Shinnosuke Furuya, Jun-ichi Iwata:
Efficient Algorithm for Linear Systems Arising in Solutions of Eigenproblems and Its Application to Electronic-Structure Calculations. 226-235 - Takahiro Katagiri, Takao Sakurai, Mitsuyoshi Igai, Satoshi Ohshima, Hisayasu Kuroda, Ken Naono, Kengo Nakajima:
Control Formats for Unsymmetric and Symmetric Sparse Matrix-Vector Multiplications on OpenMP Implementations. 236-248
Algorithms and Data Analysis
- Sandrine Mouysset, Ronan Guivarch:
Sparsification on Parallel Spectral Clustering. 249-260 - Prasanna Balaprakash, Stefan M. Wild, Paul D. Hovland:
An Experimental Study of Global and Local Search Algorithms in Empirical Performance Tuning. 261-269 - Aleksandr Drozd, Naoya Maruyama, Satoshi Matsuoka:
A Multi GPU Read Alignment Algorithm with Model-Based Performance Optimization. 270-277
Parallel Iterative Solvers on Multicore Architectures
- Masae Hayashi, Kengo Nakajima:
OpenMP/MPI Hybrid Parallel ILU(k) Preconditioner for FEM Based on Extended Hierarchical Interface Decomposition for Multi-core Clusters. 278-291 - Masatoshi Kawai, Takeshi Iwashita, Hiroshi Nakashima, Osni Marques:
Parallel Smoother Based on Block Red-Black Ordering for Multigrid Poisson Solver. 292-299 - Vincent Heuveline, Sven Janko, Wolfgang Karl, Björn Rocker, Martin Schindewolf:
Software Transactional Memory, OpenMP and Pthread Implementations of the Conjugate Gradients Method - A Preliminary Evaluation. 300-313
The Seventh International Workshop on Automatic Performance Tuning
- Takahiro Katagiri, Pierre-Yves Aquilanti, Serge G. Petiton:
A Smart Tuning Strategy for Restart Frequency of GMRES(m) with Hierarchical Cache Sizes. 314-328 - Lu Li, Usman Dastgeer, Christoph W. Kessler:
Adaptive Off-Line Tuning for Optimized Composition of Components for Heterogeneous Many-Core Systems. 329-345 - Diego Fabregat-Traver, Paolo Bientinesi:
A Domain-Specific Compiler for Linear Algebra Operations. 346-361 - Bryan Marker, Jack Poulson, Don S. Batory, Robert A. van de Geijn:
Designing Linear Algebra Algorithms by Transformation: Mechanizing the Expert Developer. 362-378 - Hiroki Toyokawa, Hiroyuki Ishigami, Kinji Kimura, Masami Takata, Yoshimasa Nakamura:
Accelerating the Reorthogonalization of Singular Vectors with a Multi-core Processor. 379-390 - Jeffrey Morlan, Shoaib Kamil, Armando Fox:
Auto-tuning the Matrix Powers Kernel with SEJITS. 391-403 - Tatsuya Abe, Mitsuhisa Sato:
Auto-tuning of Numerical Programs by Block Multi-color Ordering Code Generation and Job-Level Parallel Execution. 404-419 - Ayumu Tomiyama, Reiji Suda:
Automatic Parameter Optimization for Edit Distance Algorithm on GPU. 420-434 - Kengo Nakajima:
Automatic Tuning of Parallel Multigrid Solvers Using OpenMP/MPI Hybrid Parallel Programming Models. 435-450 - Andreas Schäfer, Dietmar Fey:
A Predictive Performance Model for Stencil Codes on Multicore CPUs. 451-466
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.