29th HiPC 2022: Bengaluru, India
- 29th IEEE International Conference on High Performance Computing, Data, and Analytics, HiPC 2022, Bengaluru, India, December 18-21, 2022. IEEE 2022, ISBN 978-1-6654-9423-6
- Arjun Menon Vadakkeveedu, Debabrata Mandal, Pradeep Ramachandran, Nitin Chandrachoodan:
Split-Knit Convolution: Enabling Dense Evaluation of Transpose and Dilated Convolutions on GPUs. 1-10
- Bingyi Zhang, Hanqing Zeng, Viktor K. Prasanna:
Low-latency Mini-batch GNN Inference on CPU-FPGA Heterogeneous Platform. 11-21
- Qinghua Zhou, Quentin Anthony, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda:
Accelerating Broadcast Communication with GPU Compression for Deep Learning Workloads. 22-31
- Nawras Alnaasan, Arpan Jain, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda:
AccDP: Accelerated Data-Parallel Distributed DNN Training for Modern GPU-Based HPC Clusters. 32-41
- Manohar Lal Das, Vishwesh Jatala, Gagan Raj Gupta:
Joint Partitioning and Sampling Algorithm for Scaling Graph Neural Network. 42-47
- Zhongyi Lin, Louis Feng, Ehsan K. Ardestani, Jaewon Lee, John Lundell, Changkyu Kim, Arun Kejariwal, John D. Owens:
Building a Performance Model for Deep Learning Recommendation Model Training on GPUs. 48-58
- Kartik Lakhotia, Fabrizio Petrini, Rajgopal Kannan, Viktor K. Prasanna:
Accelerating Prefix Scan with in-network computing on Intel PIUMA. 59-68
- Ravi Shreyas Anupindi, Swaroop Kotni, Arkaprava Basu:
memwalkd: Accelerating Key-value stores using Page Table Walkers. 69-74
- Manolis Katsaragakis, Christos Baloukas, Lazaros Papadopoulos, Verena Kantere, Francky Catthoor, Dimitrios Soudris:
Energy Consumption Evaluation of Optane DC Persistent Memory for Indexing Data Structures. 75-84
- Rohit Singh, K. P. Arun, Debadatta Mishra:
LDT: Lightweight Dirty Tracking of Memory Pages for x86 Systems. 85-94
- Bharath Ramesh, Qinghua Zhou, Aamir Shafi, Mustafa Abduljabbar, Hari Subramoni, Dhabaleswar K. Panda:
Designing Efficient Pipelined Communication Schemes using Compression in MPI Libraries. 95-99
- Kaushik Kandadi Suresh, Akshay Paniraja Guptha, Benjamin Michalowicz, Bharath Ramesh, Mustafa Abduljabbar, Aamir Shafi, Hari Subramoni, Dhabaleswar K. Panda:
Efficient Personalized and Non-Personalized Alltoall Communication for Modern Multi-HCA GPU-Based Clusters. 100-104
- Zhihui Du, Joseph Patchett, Oliver Alvarado Rodriguez, Fuhuan Li, David A. Bader:
High-Performance Truss Analytics in Arkouda. 105-114
- Arindam Khanda, Sanjukta Bhowmick, Xin Liang, Sajal K. Das:
Parallel Vertex Color Update on Large Dynamic Networks. 115-124
- Reet Barik, Marco Minutoli, Mahantesh Halappanavar, Ananth Kalyanaraman:
IMpart: A Partitioning-based Parallel Approach to Accelerate Influence Maximization. 125-134
- Benoît Gallet, Michael Gowanlock:
Leveraging GPU Tensor Cores for Double Precision Euclidean Distance Calculations. 135-144
- Fazlay Rabbi, Christopher S. Daley, Ümit V. Çatalyürek, Hasan Metin Aktulga:
A Portable Sparse Solver Framework for Large Matrices on Heterogeneous Architectures. 145-155
- Nischay Ram Mamidi, Dhruv Saxena, Kumar Prasun, Anil Nemili, Bharatkumar Sharma, S. M. Deshpande:
Performance analysis of GPU accelerated meshfree q-LSKUM solvers in Fortran, C, Python, and Julia. 156-165
- Abir Mukherjee, Preeti Malakar:
A Deep Learning-Based In Situ Analysis Framework for Tropical Cyclogenesis Prediction. 166-175
- Weicong Chen, Curtis Tatsuoka, Xiaoyi Lu:
HiBGT: High-Performance Bayesian Group Testing for COVID-19. 176-185
- Chang Su, Linglin Wei, Xianzhong Xie:
Churn Prediction in Telecommunications Industry Based on Conditional Wasserstein GAN. 186-191
- Yoichi Shimomura, Akihiro Musa, Yoshihiko Sato, Atsuhiko Konja, Guoqing Cui, Rei Aoyagi, Keichi Takahashi, Hiroyuki Takizawa:
A Real-time Flood Inundation Prediction on SX-Aurora TSUBASA. 192-197
- Harshvardhan Das, Suraj Kumar, Subodh Kumar:
Precise Parallel FEM-based Interactive Cutting Simulation of Deformable Bodies. 198-203
- David Redon, Bilel Derbel, Pierre Fortin:
Scaling the SOO Global Blackbox Optimizer on a 128-core Architecture. 204-214
- Tri Nguyen, Michela Becchi:
A GPU-accelerated Data Transformation Framework Rooted in Pushdown Transducers. 215-225
- Tania Banerjee, Jong Choi, Jaemoon Lee, Qian Gong, Ruonan Wang, Scott Klasky, Anand Rangarajan, Sanjay Ranka:
An Algorithmic and Software Pipeline for Very Large Scale Scientific Data Compression with Error Guarantees. 226-235
- Ryan Kirkpatrick, Christopher Brown, Vladimir Janjic:
COMPROF and COMPLACE: Shared-Memory Communication Profiling and Automated Thread Placement via Dynamic Binary Instrumentation. 236-245
- Keith Bateman, Neeraj Rajesh, Jaime Cernuda Garcia, Luke Logan, Jie Ye, Stephen Herbein, Anthony Kougkas, Xian-He Sun:
LuxIO: Intelligent Resource Provisioning and Auto-Configuration for Storage Services. 246-255
- Narasinga Rao Miniskar, Mohammad Alaul Haque Monil, Pedro Valero-Lara, Frank Liu, Jeffrey S. Vetter:
IRIS-BLAS: Towards a Performance Portable and Heterogeneous BLAS Library. 256-261
- Avinash Maurya, Bogdan Nicolae, M. Mustafa Rafique, Amr M. Elsayed, Thierry Tonellot, Franck Cappello:
Towards Efficient Cache Allocation for High-Frequency Checkpointing. 262-271
- Conglong Li, Ammar Ahmad Awan, Hanlin Tang, Samyam Rajbhandari, Yuxiong He:
1-bit LAMB: Communication Efficient Large-Scale Large-Batch Training with LAMB's Convergence Speed. 272-281
- Jason Yik, Sanmukh R. Kuppannagari, Hanqing Zeng, Viktor K. Prasanna:
Input Feature Pruning for Accelerating GNN Inference on Heterogeneous Platforms. 282-291
- Yuta Nakamura, Tanu Malik, Iyad Kanj, Ashish Gehani:
Provenance-based Workflow Diagnostics Using Program Specification. 292-301
- Himani Sikarwar, Debasis Das:
EECAAP: Efficient Edge-Computing based Anonymous Authentication Protocol for IoV. 302-307