Xulong Tang
2020 – today
- 2024
- [c53] Yueqi Wang, Bingyao Li, Aamer Jaleel, Jun Yang, Xulong Tang: GRIT: Enhancing Multi-GPU Performance with Fine-Grained Dynamic Page Placement. HPCA 2024: 1080-1094
- [c52] Sheng Li, Chao Wu, Ao Li, Yanzhi Wang, Xulong Tang, Geng Yuan: Waxing-and-Waning: a Generic Similarity-based Framework for Efficient Self-Supervised Learning. ICLR 2024
- [c51] Kaixing Yang, Xukun Zhou, Xulong Tang, Ran Diao, Hongyan Liu, Jun He, Zhaoxin Fan: BeatDance: A Beat-Based Model-Agnostic Contrastive Learning Framework for Music-Dance Retrieval. ICMR 2024: 11-19
- [c50] Kaixing Yang, Xulong Tang, Ran Diao, Hongyan Liu, Jun He, Zhaoxin Fan: CoDancers: Music-Driven Coherent Group Dance Generation with Choreographic Unit. ICMR 2024: 675-683
- [i16] Sheng Li, Geng Yuan, Yawen Wu, Yue Dai, Chao Wu, Alex K. Jones, Jingtong Hu, Yanzhi Wang, Xulong Tang: EdgeOL: Efficient in-situ Online Learning on Edge Devices. CoRR abs/2401.16694 (2024)
- [i15] Sheng Li, Geng Yuan, Yue Dai, Youtao Zhang, Yanzhi Wang, Xulong Tang: SmartFRZ: An Efficient Training Framework using Attention-Based Layer Freezing. CoRR abs/2401.16720 (2024)
- [i14] Bingyao Li, Yueqi Wang, Tianyu Wang, Lieven Eeckhout, Jun Yang, Aamer Jaleel, Xulong Tang: Improving Multi-Instance GPU Efficiency via Sub-Entry Sharing TLB Design. CoRR abs/2404.18361 (2024)
- [i13] Tianyu Wang, Sheng Li, Bingyao Li, Yue Dai, Ao Li, Geng Yuan, Yufei Ding, Youtao Zhang, Xulong Tang: Improving GPU Multi-Tenancy Through Dynamic Multi-Instance GPU Reconfiguration. CoRR abs/2407.13126 (2024)
- 2023
- [j10] Sébastien Ollivier, Sheng Li, Yue Tang, Stephen Cahoon, Ryan Caginalp, Chayanika Chaudhuri, Peipei Zhou, Xulong Tang, Jingtong Hu, Alex K. Jones: Sustainable AI Processing at the Edge. IEEE Micro 43(1): 19-28 (2023)
- [c49] Yingheng Li, Aditya Pawar, Mohadeseh Azari, Yanan Guo, Youtao Zhang, Jun Yang, Kaushik Parasuram Seshadreesan, Xulong Tang: Orchestrating Measurement-Based Quantum Computation over Photonic Quantum Processors. DAC 2023: 1-6
- [c48] Bingyao Li, Yueqi Wang, Xulong Tang: Orchestrated Scheduling and Partitioning for Improved Address Translation in GPUs. DAC 2023: 1-6
- [c47] Mehrnoosh Raoufi, Jun Yang, Xulong Tang, Youtao Zhang: EP-ORAM: Efficient NVM-Friendly Path Eviction for Ring ORAM in Hybrid Memory. DAC 2023: 1-6
- [c46] Mehrnoosh Raoufi, Jun Yang, Xulong Tang, Youtao Zhang: AB-ORAM: Constructing Adjustable Buckets for Space Reduction in Ring ORAM. HPCA 2023: 361-373
- [c45] Bingyao Li, Jieming Yin, Anup Holey, Youtao Zhang, Jun Yang, Xulong Tang: Trans-FW: Short Circuiting Page Table Walk in Multi-GPU Systems via Remote Forwarding. HPCA 2023: 456-470
- [c44] Yue Dai, Youtao Zhang, Xulong Tang: CEGMA: Coordinated Elastic Graph Matching Acceleration for Graph Matching Networks. HPCA 2023: 584-597
- [c43] Yue Dai, Xulong Tang, Youtao Zhang: FlexGM: An Adaptive Runtime System to Accelerate Graph Matching Networks on GPUs. ICCD 2023: 348-356
- [c42] Sheng Li, Geng Yuan, Yue Dai, Youtao Zhang, Yanzhi Wang, Xulong Tang: SmartFRZ: An Efficient Training Framework using Attention-Based Layer Freezing. ICLR 2023
- [c41] Zhengang Li, Geng Yuan, Tomoharu Yamauchi, Masoud Zabihi, Yanyue Xie, Peiyan Dong, Xulong Tang, Nobuyuki Yoshikawa, Devesh Tiwari, Yanzhi Wang, Olivia Chen: SupeRBNN: Randomized Binary Neural Network Using Adiabatic Superconductor Josephson Devices. MICRO 2023: 584-598
- [c40] Bingyao Li, Yanan Guo, Yueqi Wang, Aamer Jaleel, Jun Yang, Xulong Tang: IDYLL: Enhancing Page Translation in Multi-GPUs via Light Weight PTE Invalidations. MICRO 2023: 1163-1177
- [i12] Zhengang Li, Geng Yuan, Tomoharu Yamauchi, Masoud Zabihi, Yanyue Xie, Peiyan Dong, Xulong Tang, Nobuyuki Yoshikawa, Devesh Tiwari, Yanzhi Wang, Olivia Chen: SupeRBNN: Randomized Binary Neural Network Using Adiabatic Superconductor Josephson Devices. CoRR abs/2309.12212 (2023)
- [i11] Kaixing Yang, Xukun Zhou, Xulong Tang, Ran Diao, Hongyan Liu, Jun He, Zhaoxin Fan: BeatDance: A Beat-Based Model-Agnostic Contrastive Learning Framework for Music-Dance Retrieval. CoRR abs/2310.10300 (2023)
- [i10] Aditya Pawar, Yingheng Li, Zewei Mo, Yanan Guo, Youtao Zhang, Xulong Tang, Jun Yang: Integrated Qubit Reuse and Circuit Cutting for Large Quantum Circuit Evaluation. CoRR abs/2312.10298 (2023)
- [i9] Yingheng Li, Aditya Pawar, Zewei Mo, Youtao Zhang, Jun Yang, Xulong Tang: Minimizing Photonic Cluster State Depth in Measurement-Based Quantum Computing. CoRR abs/2312.10865 (2023)
- 2022
- [j9] Yue Dai, Xulong Tang, Youtao Zhang: An efficient segmented quantization for graph neural networks. CCF Trans. High Perform. Comput. 4(4): 461-473 (2022)
- [j8] Geng Yuan, Peiyan Dong, Mengshu Sun, Wei Niu, Zhengang Li, Yuxuan Cai, Yanyu Li, Jun Liu, Weiwen Jiang, Xue Lin, Bin Ren, Xulong Tang, Yanzhi Wang: Mobile or FPGA? A Comprehensive Evaluation on Energy Efficiency and a Unified Optimization Framework. ACM Trans. Embed. Comput. Syst. 21(5): 65:1-65:22 (2022)
- [j7] Yifan Gong, Geng Yuan, Zheng Zhan, Wei Niu, Zhengang Li, Pu Zhao, Yuxuan Cai, Sijia Liu, Bin Ren, Xue Lin, Xulong Tang, Yanzhi Wang: Automatic Mapping of the Best-Suited DNN Pruning Schemes for Real-Time Mobile Acceleration. ACM Trans. Design Autom. Electr. Syst. 27(5): 47:1-47:26 (2022)
- [c39] Geng Yuan, Sung-En Chang, Qing Jin, Alec Lu, Yanyu Li, Yushu Wu, Zhenglun Kong, Yanyue Xie, Peiyan Dong, Minghai Qin, Xiaolong Ma, Xulong Tang, Zhenman Fang, Yanzhi Wang: You Already Have It: A Generator-Free Low-Precision DNN Training Framework Using Stochastic Rounding. ECCV (12) 2022: 34-51
- [c38] Yilun Zhao, Yanan Guo, Yuan Yao, Amanda Dumi, Devin M. Mulvey, Shiv Upadhyay, Youtao Zhang, Kenneth D. Jordan, Jun Yang, Xulong Tang: Q-GPU: A Recipe of Optimizations for Quantum Circuit Simulation Using GPUs. HPCA 2022: 726-740
- [c37] Mahmut T. Kandemir, Xulong Tang, Jagadish Kotra, Mustafa Karaköy: Fine-Granular Computation and Data Layout Reorganization for Improving Locality. ICCAD 2022: 5:1-5:9
- [c36] Yajuan Du, Mingyang Liu, Yuqi Yang, Mingzhe Zhang, Xulong Tang: Enhancing GPU Performance via Neighboring Directory Table Based Inter-TLB Sharing. ICCD 2022: 146-153
- [c35] Geng Yuan, Yanyu Li, Sheng Li, Zhenglun Kong, Sergey Tulyakov, Xulong Tang, Yanzhi Wang, Jian Ren: Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training. NeurIPS 2022
- [c34] Bingyao Li, Qi Xue, Geng Yuan, Sheng Li, Xiaolong Ma, Yanzhi Wang, Xulong Tang: Optimizing Data Layout for Training Deep Neural Networks. WWW (Companion Volume) 2022: 548-554
- [i8] Sébastien Ollivier, Sheng Li, Yue Tang, Chayanika Chaudhuri, Peipei Zhou, Xulong Tang, Jingtong Hu, Alex K. Jones: Sustainable AI Processing at the Edge. CoRR abs/2207.01209 (2022)
- [i7] Zhendong Wang, Xiaoming Zeng, Xulong Tang, Danfeng Zhang, Xing Hu, Yang Hu: Demystifying Arch-hints for Model Extraction: An Attack in Unified Memory System. CoRR abs/2208.13720 (2022)
- [i6] Geng Yuan, Yanyu Li, Sheng Li, Zhenglun Kong, Sergey Tulyakov, Xulong Tang, Yanzhi Wang, Jian Ren: Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training. CoRR abs/2209.11204 (2022)
- 2021
- [j6] Xulong Tang, Mahmut Taylan Kandemir, Mustafa Karaköy: Mix and Match: Reorganizing Tasks for Enhancing Data Locality. Proc. ACM Meas. Anal. Comput. Syst. 5(2): 20:1-20:24 (2021)
- [j5] Xinyi Zhang, Yawen Wu, Peipei Zhou, Xulong Tang, Jingtong Hu: Algorithm-hardware Co-design of Attention Mechanism on FPGA Devices. ACM Trans. Embed. Comput. Syst. 20(5s): 71:1-71:24 (2021)
- [c33] Yuxuan Cai, Hongjia Li, Geng Yuan, Wei Niu, Yanyu Li, Xulong Tang, Bin Ren, Yanzhi Wang: YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design. AAAI 2021: 955-963
- [c32] Yuxuan Cai, Geng Yuan, Hongjia Li, Wei Niu, Yanyu Li, Xulong Tang, Bin Ren, Yanzhi Wang: A Compression-Compilation Co-Design Framework Towards Real-Time Object Detection on Mobile Devices. AAAI 2021: 15997-16000
- [c31] Zhendong Wang, Rujia Wang, Zihang Jiang, Xulong Tang, Shouyi Yin, Yang Hu: Towards a Secure Integrated Heterogeneous Platform via Cooperative CPU/GPU Encryption. ATS 2021: 115-120
- [c30] Weizheng Xu, Ashutosh Pattnaik, Geng Yuan, Yanzhi Wang, Youtao Zhang, Xulong Tang: ScaleDNN: Data Movement Aware DNN Training on Multi-GPU. ICCAD 2021: 1-9
- [c29] Fuxun Yu, Shawn Bray, Di Wang, Longfei Shangguan, Xulong Tang, Chenchen Liu, Xiang Chen: Automated Runtime-Aware Scheduling for Multi-Tenant DNN Inference on GPU. ICCAD 2021: 1-9
- [c28] Bingyao Li, Jieming Yin, Youtao Zhang, Xulong Tang: Improving Address Translation in Multi-GPUs via Sharing and Spilling aware TLB Design. MICRO 2021: 1154-1168
- [c27] Shixiong Jing, Qinkun Bao, Pei Wang, Xulong Tang, Dinghao Wu: Characterizing AI Model Inference Applications Running in the SGX Environment. NAS 2021: 1-4
- [c26] Huaipan Jiang, Haibo Zhang, Xulong Tang, Vineetha Govindaraj, Jack Sampson, Mahmut Taylan Kandemir, Danfeng Zhang: Fluid: a framework for approximate concurrency via controlled dependency relaxation. PLDI 2021: 252-267
- [c25] Mahmut Taylan Kandemir, Xulong Tang, Hui Zhao, Jihyun Ryoo, Mustafa Karaköy: Distance-in-time versus distance-in-space. PLDI 2021: 665-680
- [c24] Mahmut Taylan Kandemir, Jihyun Ryoo, Xulong Tang, Mustafa Karaköy: Compiler support for near data computing. PPoPP 2021: 90-104
- [c23] Geng Yuan, Peiyan Dong, Mengshu Sun, Wei Niu, Zhengang Li, Yuxuan Cai, Jun Liu, Weiwen Jiang, Xue Lin, Bin Ren, Xulong Tang, Yanzhi Wang: Work in Progress: Mobile or FPGA? A Comprehensive Evaluation on Energy Efficiency and a Unified Optimization Framework. RTAS 2021: 493-496
- [c22] Xulong Tang, Mahmut Taylan Kandemir, Mustafa Karaköy: Mix and Match: Reorganizing Tasks for Enhancing Data Locality. SIGMETRICS (Abstracts) 2021: 47-48
- [c21] Weizheng Xu, Youtao Zhang, Xulong Tang: Parallelizing DNN Training on GPUs: Challenges and Opportunities. WWW (Companion Volume) 2021: 174-178
- [i5] Yifan Gong, Geng Yuan, Zheng Zhan, Wei Niu, Zhengang Li, Pu Zhao, Yuxuan Cai, Sijia Liu, Bin Ren, Xue Lin, Xulong Tang, Yanzhi Wang: Automatic Mapping of the Best-Suited DNN Pruning Schemes for Real-Time Mobile Acceleration. CoRR abs/2111.11581 (2021)
- [i4] Fuxun Yu, Di Wang, Longfei Shangguan, Minjia Zhang, Xulong Tang, Chenchen Liu, Xiang Chen: A Survey of Large-Scale Deep Learning Serving System Optimization: Challenges and Opportunities. CoRR abs/2111.14247 (2021)
- [i3] Fuxun Yu, Shawn Bray, Di Wang, Longfei Shangguan, Xulong Tang, Chenchen Liu, Xiang Chen: Automated Runtime-Aware Scheduling for Multi-Tenant DNN Inference on GPU. CoRR abs/2111.14255 (2021)
- 2020
- [j4] Zhendong Wang, Zihang Jiang, Zhen Wang, Xulong Tang, Cong Liu, Shouyi Yin, Yang Hu: Enabling Latency-Aware Data Initialization for Integrated CPU/GPU Heterogeneous Platform. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 39(11): 3433-3444 (2020)
- [c20] Xulong Tang, Ziyu Zhang, Weizheng Xu, Mahmut Taylan Kandemir, Rami G. Melhem, Jun Yang: Enhancing Address Translations in Throughput Processors via Compression. PACT 2020: 191-204
- [i2] Shasha Guo, Lianhua Qu, Lei Wang, Xulong Tang, Shuo Tian, Shiming Li, Weixia Xu: Exploration of Input Patterns for Enhancing the Performance of Liquid State Machines. CoRR abs/2004.02540 (2020)
- [i1] Yuxuan Cai, Hongjia Li, Geng Yuan, Wei Niu, Yanyu Li, Xulong Tang, Bin Ren, Yanzhi Wang: YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design. CoRR abs/2009.05697 (2020)
2010 – 2019
- 2019
- [j3] Mustafa Karaköy, Orhan Kislal, Xulong Tang, Mahmut Taylan Kandemir, Meenakshi Arunachalam: Architecture-Aware Approximate Computing. Proc. ACM Meas. Anal. Comput. Syst. 3(2): 38:1-38:24 (2019)
- [c19] Jihyun Ryoo, Mengran Fan, Xulong Tang, Huaipan Jiang, Meena Arunachalam, Sharada Naveen, Mahmut T. Kandemir: Architecture-Centric Bottleneck Analysis for Deep Neural Network Applications. HiPC 2019: 205-214
- [c18] Ashutosh Pattnaik, Xulong Tang, Onur Kayiran, Adwait Jog, Asit K. Mishra, Mahmut T. Kandemir, Anand Sivasubramaniam, Chita R. Das: Opportunistic computing in GPU architectures. ISCA 2019: 210-223
- [c17] Xulong Tang, Mahmut Taylan Kandemir, Mustafa Karaköy, Meenakshi Arunachalam: Co-optimizing memory-level parallelism and cache-level parallelism. PLDI 2019: 935-949
- [c16] Mustafa Karaköy, Orhan Kislal, Xulong Tang, Mahmut Taylan Kandemir, Meenakshi Arunachalam: Architecture-Aware Approximate Computing. SIGMETRICS (Abstracts) 2019: 23-24
- [c15] Xulong Tang, Ashutosh Pattnaik, Onur Kayiran, Adwait Jog, Mahmut Taylan Kandemir, Chita R. Das: Quantifying Data Locality in Dynamic Parallelism in GPUs. SIGMETRICS (Abstracts) 2019: 25-26
- [c14] Xulong Tang, Mahmut Taylan Kandemir, Hui Zhao, Myoungsoo Jung, Mustafa Karaköy: Computing with Near Data. SIGMETRICS (Abstracts) 2019: 27-28
- 2018
- [j2] Xulong Tang, Ashutosh Pattnaik, Onur Kayiran, Adwait Jog, Mahmut Taylan Kandemir, Chita R. Das: Quantifying Data Locality in Dynamic Parallelism in GPUs. Proc. ACM Meas. Anal. Comput. Syst. 2(3): 39:1-39:24 (2018)
- [j1] Xulong Tang, Mahmut Taylan Kandemir, Hui Zhao, Myoungsoo Jung, Mustafa Karaköy: Computing with Near Data. Proc. ACM Meas. Anal. Comput. Syst. 2(3): 42:1-42:30 (2018)
- [c13] Jihyun Ryoo, Orhan Kislal, Xulong Tang, Mahmut T. Kandemir: Quantifying and Optimizing Data Access Parallelism on Manycores. MASCOTS 2018: 131-144
- [c12] Orhan Kislal, Jagadish Kotra, Xulong Tang, Mahmut Taylan Kandemir, Myoungsoo Jung: Enhancing computation-to-core assignment with physical location information. PLDI 2018: 312-327
- [c11] Sooraj Puthoor, Xulong Tang, Joseph Gross, Bradford M. Beckmann: Oversubscribed Command Queues in GPUs. GPGPU@PPoPP 2018: 50-60
- 2017
- [c10] Orhan Kislal, Jagadish Kotra, Xulong Tang, Mahmut Taylan Kandemir, Myoungsoo Jung: POSTER: Location-Aware Computation Mapping for Manycore Processors. PACT 2017: 138-139
- [c9] Xulong Tang, Ashutosh Pattnaik, Huaipan Jiang, Onur Kayiran, Adwait Jog, Sreepathi Pai, Mohamed Assem Ibrahim, Mahmut T. Kandemir, Chita R. Das: Controlled Kernel Launch for Dynamic Parallelism in GPUs. HPCA 2017: 649-660
- [c8] Akbar Sharifi, Wei Ding, Diana R. Guttman, Hui Zhao, Xulong Tang, Mahmut T. Kandemir, Chita R. Das: DEMM: A Dynamic Energy-Saving Mechanism for Multicore Memories. MASCOTS 2017: 210-220
- [c7] Xulong Tang, Orhan Kislal, Mahmut T. Kandemir, Mustafa Karaköy: Data movement aware computation partitioning. MICRO 2017: 730-744
- 2016
- [c6] Onur Kayiran, Adwait Jog, Ashutosh Pattnaik, Rachata Ausavarungnirun, Xulong Tang, Mahmut T. Kandemir, Gabriel H. Loh, Onur Mutlu, Chita R. Das: μC-States: Fine-grained GPU Datapath Power Management. PACT 2016: 17-30
- [c5] Ashutosh Pattnaik, Xulong Tang, Adwait Jog, Onur Kayiran, Asit K. Mishra, Mahmut T. Kandemir, Onur Mutlu, Chita R. Das: Scheduling Techniques for GPU Architectures with Processing-In-Memory Capabilities. PACT 2016: 31-44
- [c4] Xulong Tang, Mahmut T. Kandemir, Praveen Yedlapalli, Jagadish Kotra: Improving bank-level parallelism for irregular applications. MICRO 2016: 57:1-57:12
- 2015
- [c3] Wei Ding, Xulong Tang, Mahmut T. Kandemir, Yuanrui Zhang, Emre Kultursay: Optimizing off-chip accesses in multicores. PLDI 2015: 131-142
- [c2] Mahmut T. Kandemir, Hui Zhao, Xulong Tang, Mustafa Karaköy: Memory Row Reuse Distance and its Role in Optimizing Application Performance. SIGMETRICS 2015: 137-149
- 2012
- [c1] Gu Liu, Hong An, Wenting Han, Xiaoqiang Li, Tao Sun, Wei Zhou, Xuechao Wei, Xulong Tang: FlexBFS: a parallelism-aware implementation of breadth-first search on GPU. PPoPP 2012: 279-280
last updated on 2024-10-31 20:16 CET by the dblp team
all metadata released as open data under CC0 1.0 license