default search action
22nd PPOPP 2017: Austin, TX, USA
- Vivek Sarkar, Lawrence Rauchwerger:
Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Austin, TX, USA, February 4-8, 2017. ACM 2017, ISBN 978-1-4503-4493-7
Keynote
- Guy L. Steele Jr.:
It's Time for a New Old Language. 1
Session 1: GPU I
- Guoyang Chen, Yue Zhao, Xipeng Shen, Huiyang Zhou:
EffiSha: A Software Framework for Enabling Effficient Preemptive Scheduling of GPU. 3-16 - Nachshon Cohen, Arie Tal, Erez Petrank:
Layout Lock: A Scalable Locking Paradigm for Concurrent Data Layout Modifications. 17-29 - Xiuxia Zhang, Guangming Tan, Shuangbai Xue, Jiajia Li, Keren Zhou, Mingyu Chen:
Understanding the GPU Microarchitecture to Achieve Bare-Metal Performance Tuning. 31-43
Session 2: Concurrency
- Peizhao Ou, Brian Demsky:
Checking Concurrent Data Structures Under the C/C++11 Memory Model. 45-59 - Milind Chabbi, Abdelhalim Amer, Shasha Wen, Xu Liu:
An Efficient Abortable-locking Protocol for Multi-level NUMA Systems. 61-74 - Umut A. Acar, Naama Ben-David, Mike Rainey:
Contention in Structured Concurrency: Provably Efficient Dynamic Non-Zero Indicators for Nested Parallelism. 75-88 - Kento Sato, Dong H. Ahn, Ignacio Laguna, Gregory L. Lee, Martin Schulz, Christopher M. Chambreau:
Noise Injection Techniques to Expose Subtle and Unintended Message Races. 89-101
Session 3: Tools
- Hao Luo, Pengcheng Li, Chen Ding:
Thread Data Sharing in Cache: Theory and Measurement. 103-115 - Bin Ren, Sriram Krishnamoorthy, Kunal Agrawal, Milind Kulkarni:
Exploiting Vector and Multicore Parallelism for Recursive, Data- and Task-Parallel Programs. 117-130 - Sergei Shudler, Alexandru Calotoiu, Torsten Hoefler, Felix Wolf:
Isoefficiency in Practice: Configuring and Understanding the Performance of Task-based Applications. 131-143 - Robert Utterback, Kunal Agrawal, I-Ting Angelina Lee, Milind Kulkarni:
Processor-Oblivious Record and Replay. 145-161
Session 4: GPU II
- Nirmal Prajapati, Waruna Ranasinghe, Sanjay V. Rajopadhye, Rumen Andonov, Hristo N. Djidjev, Tobias Grosser:
Simple, Accurate, Analytical Time Modeling and Optimal Tile Size Selection for GPGPU Stencils. 163-177 - Peng Jiang, Gagan Agrawal:
Combining SIMD and Many/Multi-core Parallelism for Finite State Machines with Enumerative Speculation. 179-191 - Ammar Ahmad Awan, Khaled Hamidouche, Jahanzeb Maqbool Hashmi, Dhabaleswar K. Panda:
S-Caffe: Co-designing MPI Runtimes and Caffe for Scalable Deep Learning on Modern GPU Clusters. 193-205 - Amit Sabne, Xiao Wang, Sherman J. Kisner, Charles A. Bouman, Anand Raghunathan, Samuel P. Midkiff:
Model-based Iterative CT Image Reconstruction on GPUs. 207-220
Session 5: Best Papers
- Tsung Tai Yeh, Amit Sabne, Putt Sakdhnagool, Rudolf Eigenmann, Timothy G. Rogers:
Pagoda: Fine-Grained GPU Resource Virtualization for Narrow Tasks. 221-234 - Tal Ben-Nun, Michael Sutton, Sreepathi Pai, Keshav Pingali:
Groute: An Asynchronous Multi-GPU Programming Model for Irregular Computations. 235-248 - Tao B. Schardl, William S. Moses, Charles E. Leiserson:
Tapir: Embedding Fork-Join Parallelism into LLVM's Intermediate Representation. 249-265 - Alexander Matveev, Yaron Meirovitch, Hayk Saribekyan, Wiktor Jakubiuk, Tim Kaler, Gergely Ódor, David M. Budden, Aleksandar Zlateski, Nir Shavit:
A Multicore Path to Connectomics-on-Demand. 267-281
Session 6: Languages & Compilers
- Michael Vollmer, Ryan G. Scott, Madanlal Musuvathi, Ryan R. Newton:
SC-Haskell: Sequential Consistency in Languages That Minimize Mutable Shared Heap. 283-298 - Martin Bättig, Thomas R. Gross:
Synchronized-by-Default Concurrency for Shared-Memory Systems. 299-312 - Rubens E. A. Moreira, Caroline Collange, Fernando Magno Quintão Pereira:
Function Call Re-Vectorization. 313-326 - Samyam Rajbhandari, Fabrice Rastello, Karol Kowalski, Sriram Krishnamoorthy, P. Sadayappan:
Optimizing the Four-Index Integral Transform Using Data Movement Lower Bounds Analysis. 327-340
Session 7: Data Analytics
- Guy L. Steele Jr., Jean-Baptiste Tristan:
Using Butterfly-Patterned Partial Sums to Draw from Discrete Distributions. 341-355 - Dmitry Basin, Edward Bortnikov, Anastasia Braginsky, Guy Golan-Gueta, Eshcar Hillel, Idit Keidar, Moshe Sulamy:
KiWi: A Key-Value Map for Scalable Real-Time Analytics. 357-369 - Lin Jiang, Zhijia Zhao:
Grammar-aware Parallelization for Scalable XPath Querying. 371-383 - Xin Wang, Weihua Zhang, Zhaoguo Wang, Ziyun Wei, Haibo Chen, Wenyun Zhao:
Eunomia: Scaling Concurrent Search Trees under Contention Using HTM. 385-399
Session 8: Fault Tolerance
- Xiongchao Tang, Jidong Zhai, Bowen Yu, Wenguang Chen, Weimin Zheng:
Self-Checkpoint: An In-Memory Checkpoint Method Using Less Space and Its Practice on Fault-Tolerant HPL. 401-413 - Panruo Wu, Nathan DeBardeleben, Qiang Guan, Sean Blanchard, Jieyang Chen, Dingwen Tao, Xin Liang, Kaiming Ouyang, Zizhong Chen:
Silent Data Corruption Resilient Two-sided Matrix Factorizations. 415-427
Session 9: Posters
- Maya Arbel-Raviv, Trevor Brown:
POSTER: Reuse, don't Recycle: Transforming Algorithms that Throw Away Descriptors. 429-430 - Vignesh Balaji, Dhruva Tirumala, Brandon Lucia:
POSTER: An Architecture and Programming Model for Accelerating Parallel Commutative Computations via Privatization. 431-432 - Arnamoy Bhattacharyya, Mike Dai Wang, Mihai Burcea, Yi Ding, Allen Deng, Sai Varikooty, Shafaaf Hossain, Cristiana Amza:
POSTER: HythTM: Extending the Applicability of Intel TSX Hardware Transactional Support. 433-434 - Rezaul Chowdhury, Pramod Ganapathi, Yuan Tang, Jesmin Jahan Tithi:
POSTER: Provably Efficient Scheduling of Cache-Oblivious Wavefront Algorithms. 435-436 - Nachshon Cohen, Maurice Herlihy, Erez Petrank, Elias Wald:
POSTER: State Teleportation via Hardware Transactional Memory. 437-438 - Dong Dai, Wei Zhang, Yong Chen:
POSTER: IOGP: An Incremental Online Graph Partitioning for Large-Scale Distributed Graph Databases. 439-440 - Jesun Sahariar Firoz, Thejaka Amila Kanewala, Marcin Zalewski, Martina Barnas, Andrew Lumsdaine:
POSTER: Distributed Control: The Benefits of Eliminating Global Synchronization via Effective Scheduling. 441-442 - Gangwon Jo, Jaehoon Jung, Jiyoung Park, Jaejin Lee:
POSTER: MAPA: An Automatic Memory Access Pattern Analyzer for GPU Applications. 443-444 - Shigang Li, Yunquan Zhang, Torsten Hoefler:
POSTER: Cache-Oblivious MPI All-to-All Communications on Many-Core Architectures. 445-446 - Harshitha Menon, Kavitha Chandrasekar, Laxmikant V. Kalé:
POSTER: Automated Load Balancer Selection Based on Application Characteristics. 447-448 - Nurit Moscovici, Nachshon Cohen, Erez Petrank:
POSTER: A GPU-Friendly Skiplist Algorithm. 449-450 - Pedro Ramalhete, Andreia Correia:
POSTER: Poor Man's URCU. 451-452 - Pedro Ramalhete, Andreia Correia:
POSTER: A Wait-Free Queue with Wait-Free Memory Reclamation. 453-454 - Yuan Tang, Ronghui You:
POSTER: STAR (Space-Time Adaptive and Reductive) Algorithms for Real-World Space-Time Optimality. 455-456 - Mingyu Wu, Haibing Guan, Binyu Zang, Haibo Chen:
POSTER: Recovering Performance for Vector-based Machine Learning on Managed Runtime. 457-458 - Minjia Zhang, Swarnendu Biswas, Michael D. Bond:
POSTER: On the Problem of Consistency Exceptions in the Context of Strong Memory Models. 459-460 - Yue Zhao, Chunhua Liao, Xipeng Shen:
POSTER: An Infrastructure for HPC Knowledge Sharing and Reuse. 461-462
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.