Yunquan Zhang
2020 – today
- 2024
- [j48]Yunquan Zhang, Guangming Tan, Liang Yuan:
Special issue of HPCChina 2023. CCF Trans. High Perform. Comput. 6(1): 1-2 (2024) - [j47]Cunyang Wei, Haipeng Jia, Yunquan Zhang, Jianyu Yao, Chendi Li, Wenxuan Cao:
IrGEMM: An Input-Aware Tuning Framework for Irregular GEMM on ARM and X86 CPUs. IEEE Trans. Parallel Distributed Syst. 35(9): 1672-1689 (2024) - [c90]Lei Xu, Haipeng Jia, Yunquan Zhang, Luhan Wang, Xianmeng Jiang:
HAM-SpMSpV: an Optimized Parallel Algorithm for Masked Sparse Matrix-Sparse Vector Multiplications on multi-core CPUs. HPDC 2024: 160-173 - [c89]Wenxuan Zhao, Liang Yuan, Baicheng Yan, Penghao Ma, Yunquan Zhang, Long Wang, Zhe Wang:
Stencil Computation with Vector Outer Product. ICS 2024: 247-258 - [c88]Luhan Wang, Haipeng Jia, Lei Xu, Cunyang Wei, Kun Li, Xianmeng Jiang, Yunquan Zhang:
VNEC: A Vectorized Non-Empty Column Format for SpMV on CPUs. IPDPS 2024: 14-25 - [c87]Zhiqian Xu, Honghui Shang, Yi Fan, Xiongzhi Zeng, Yunquan Zhang, Chu Guo:
Scalable and Differentiable Simulator for Quantum Computational Chemistry. IPDPS 2024: 230-240 - [c86]Ruge Zhang, Haipeng Jia, Yunquan Zhang, Baicheng Yan, Penghao Ma, Long Wang, Wenxuan Zhao:
OpenFFT-SME: An Efficient Outer Product Pattern FFT Library on ARM SME CPUs. IPDPS 2024: 938-949 - [c85]Yuetao Chen, Kun Li, Yuhao Wang, Donglin Bai, Lei Wang, Lingxiao Ma, Liang Yuan, Yunquan Zhang, Ting Cao, Mao Yang:
ConvStencil: Transform Stencil Computation to Matrix Multiplication on Tensor Cores. PPoPP 2024: 333-347 - 2023
- [j46]Yan Zeng, Yong Ding, Dongyang Ou, Jilin Zhang, Yongjian Ren, Yunquan Zhang:
MP-DPS: adaptive distributed training for deep learning based on node merging and path prediction. CCF Trans. High Perform. Comput. 5(4): 429-441 (2023) - [j45]Yan Zeng, Yuankai Mu, Junfeng Yuan, Siyuan Teng, Jilin Zhang, Jian Wan, Yongjian Ren, Yunquan Zhang:
Adaptive Federated Learning With Non-IID Data. Comput. J. 66(11): 2758-2772 (2023) - [j44]Hang Cao, Liang Yuan, He Zhang, Yunquan Zhang, Baodong Wu, Kun Li, Shigang Li, Minghua Zhang, Pengqi Lu, Junmin Xiao:
AGCM-3DLF: Accelerating Atmospheric General Circulation Model via 3-D Parallelization and Leap-Format. IEEE Trans. Parallel Distributed Syst. 34(3): 766-780 (2023) - [j43]Lei Xu, Honghui Shang, Xin Chen, Yunquan Zhang, Lifang Wang, Xingyu Gao, Haifeng Song:
Redesigning OpenKMC for Multi-Component Trillion-Atom Simulations on the New Sunway Supercomputer. IEEE Trans. Parallel Distributed Syst. 34(7): 1997-2010 (2023) - [c84]Yan Zeng, Chengchuang Huang, Yijie Ni, Chunbao Zhou, Jilin Zhang, Jue Wang, Mingyao Zhou, Meiting Xue, Yunquan Zhang:
An Auto-Parallel Method for Deep Learning Models Based on Genetic Algorithm. ICPADS 2023: 230-235 - [c83]Rongyuan Guo, Haipeng Jia, Yunquan Zhang, Mingsen Deng, Cunyang Wei, Wenbin Chang, Xiang Zhao:
SA_TRSM: A Shape-Aware Auto-Tuning Framework for Small-Scale Irregular-Shaped TRSM. ICPADS 2023: 765-774 - [c82]Tun Chen, Haipeng Jia, Yunquan Zhang, Kun Li, Zhihao Li, Xiang Zhao, Jianyu Yao, Chendi Li:
OpenFFT: An Adaptive Tuning Framework for 3D FFT on ARM Multicore CPUs. ICS 2023: 398-409 - [c81]Daning Cheng, Shigang Li, Yunquan Zhang:
Asynch-SGBDT: Train Stochastic Gradient Boosting Decision Trees in an Asynchronous Parallel Manner. IPDPS 2023: 256-267 - [c80]Zhihao Li, Haipeng Jia, Yunquan Zhang, Yuyan Sun, Yiwei Zhang, Tun Chen:
Generating Fast FFT Kernels on CPUs via FFT-Specific Intrinsics. PPoPP 2023: 427-428 - [i18]Kun Li, Zhichun Li, Yuetao Chen, Zixuan Wang, Yiwei Zhang, Liang Yuan, Haipeng Jia, Yunquan Zhang, Ting Cao, Mao Yang:
Gamify Stencil Dwarf on Cloud for Democratizing Scientific Computing. CoRR abs/2303.08365 (2023) - [i17]Wenxuan Zhao, Liang Yuan, Baicheng Yan, Penghao Ma, Yunquan Zhang, Long Wang, Zhe Wang:
Stencil Computation with Vector Outer Product. CoRR abs/2310.16298 (2023) - 2022
- [j42]Yan Zeng, Jiyang Wu, Jilin Zhang, Yongjian Ren, Yunquan Zhang:
Trinity: Neural Network Adaptive Distributed Parallel Training Method Based on Reinforcement Learning. Algorithms 15(4): 108 (2022) - [j41]Yuetao Chen, Keni Qiu, Li Chen, Haipeng Jia, Yunquan Zhang, Limin Xiao, Lei Liu:
Smart scheduler: an adaptive NVM-aware thread scheduling approach on NUMA systems. CCF Trans. High Perform. Comput. 4(4): 394-406 (2022) - [j40]Yuetao Chen, Keni Qiu, Li Chen, Haipeng Jia, Yunquan Zhang, Limin Xiao, Lei Liu:
Publisher Correction: Smart scheduler: an adaptive NVM-aware thread scheduling approach on NUMA systems. CCF Trans. High Perform. Comput. 4(4): 492 (2022) - [j39]Mingchuan Wu, Yangjun Wu, Honghui Shang, Ying Liu, Huimin Cui, Fang Li, Xiaohui Duan, Yunquan Zhang, Xiaobing Feng:
Scaling Poisson Solvers on Many Cores via MMEwald. IEEE Trans. Parallel Distributed Syst. 33(8): 1888-1901 (2022) - [j38]Kun Li, Liang Yuan, Yunquan Zhang, Gongwei Chen:
An Accurate and Efficient Large-Scale Regression Method Through Best Friend Clustering. IEEE Trans. Parallel Distributed Syst. 33(11): 3129-3140 (2022) - [c79]Cunyang Wei, Haipeng Jia, Yunquan Zhang, Kun Li, Luhan Wang:
LBBGEMM: A Load-balanced Batch GEMM Framework on ARM CPUs. HPCC/DSS/SmartCity/DependSys 2022: 59-66 - [c78]Luhan Wang, Haipeng Jia, Yunquan Zhang, Kun Li, Cunyang Wei:
EgpuIP: An Embedded GPU Accelerated Library for Image Processing. HPCC/DSS/SmartCity/DependSys 2022: 914-921 - [c77]Yan Zeng, Guangzheng Yi, Yuyu Yin, Jiyang Wu, Meiting Xue, Jilin Zhang, Jian Wan, Yunquan Zhang:
Aware: Adaptive Distributed Training with Computation, Communication and Position Awareness for Deep Learning Model. HPCC/DSS/SmartCity/DependSys 2022: 1299-1306 - [c76]Yunquan Zhang, Jidong Zhai, Rajiv Ranjan:
Message from the High Performance Computing and Communications 2022 Program Chairs. HPCC/DSS/SmartCity/DependSys 2022: lv - [c75]Cunyang Wei, Haipeng Jia, Yunquan Zhang, Liusha Xu, Ji Qi:
IATF: An Input-Aware Tuning Framework for Compact BLAS Based on ARMv8 CPUs. ICPP 2022: 66:1-66:11 - [c74]Kun Li, Liang Yuan, Yunquan Zhang, Yue Yue, Hang Cao:
An Efficient Vectorization Scheme for Stencil Computation. IPDPS 2022: 650-660 - [c73]Honghui Shang, Li Shen, Yi Fan, Zhiqian Xu, Chu Guo, Jie Liu, Wenhao Zhou, Huan Ma, Rongfen Lin, Yuling Yang, Fang Li, Zhuoya Wang, Yunquan Zhang, Zhenyu Li:
Large-Scale Simulation of Quantum Computational Chemistry on a New Sunway Supercomputer. SC 2022: 14:1-14:14 - [i16]Chendi Li, Haipeng Jia, Hang Cao, Jianyu Yao, Boqian Shi, Chunyang Xiang, Jinbo Sun, Pengqi Lu, Yunquan Zhang:
AutoTSMM: An Auto-tuning Framework for Building High-Performance Tall-and-Skinny Matrix-Matrix Multiplication on CPUs. CoRR abs/2208.08088 (2022) - [i15]Jianyu Yao, Boqian Shi, Chunyang Xiang, Haipeng Jia, Chendi Li, Hang Cao, Yunquan Zhang:
IAAT: A Input-Aware Adaptive Tuning framework for Small GEMM. CoRR abs/2208.09822 (2022) - 2021
- [j37]Zhixiang Ren, Yongheng Liu, Tianhui Shi, Lei Xie, Yue Zhou, Jidong Zhai, Youhui Zhang, Yunquan Zhang, Wenguang Chen:
AIPerf: Automated machine learning as an AI-HPC benchmark. Big Data Min. Anal. 4(3): 208-220 (2021) - [j36]Honghui Shang, WanZhen Liang, Yunquan Zhang, Jinlong Yang:
Efficient parallel linear scaling method to get the response density matrix in all-electron real-space density-functional perturbation theory. Comput. Phys. Commun. 258: 107613 (2021) - [j35]Honghui Shang, Xiaohui Duan, Fang Li, Libo Zhang, Zhiqian Xu, Kan Liu, Haiwen Luo, Yingrui Ji, Wenxuan Zhao, Wei Xue, Li Chen, Yunquan Zhang:
Many-core acceleration of the first-principles all-electron quantum perturbation calculations. Comput. Phys. Commun. 267: 108045 (2021) - [j34]Daning Cheng, Shigang Li, Hanping Zhang, Fen Xia, Yunquan Zhang:
Why Dataset Properties Bound the Scalability of Parallel Machine Learning Training Algorithms. IEEE Trans. Parallel Distributed Syst. 32(7): 1702-1712 (2021) - [c72]Tun Chen, Haipeng Jia, Zhihao Li, Chendi Li, Yunquan Zhang:
A Transpose-free Three-dimensional FFT Algorithm on ARM CPUs. HPCC/DSS/SmartCity/DependSys 2021: 1-8 - [c71]Pengqi Lu, Yue Yue, Liang Yuan, Yunquan Zhang:
AutoFlow: Hotspot-Aware, Dynamic Load Balancing for Distributed Stream Processing. ICA3PP (3) 2021: 133-151 - [c70]Jianyu Yao, Boqian Shi, Chunyang Xiang, Haipeng Jia, Chendi Li, Hang Cao, Yunquan Zhang:
IAAT: A Input-Aware Adaptive Tuning framework for Small GEMM. ICPADS 2021: 899-906 - [c69]Chendi Li, Haipeng Jia, Hang Cao, Jianyu Yao, Boqian Shi, Chunyang Xiang, Jinbo Sun, Pengqi Lu, Yunquan Zhang:
AutoTSMM: An Auto-tuning Framework for Building High-Performance Tall-and-Skinny Matrix-Matrix Multiplication on CPUs. ISPA/BDCloud/SocialCom/SustainCom 2021: 159-166 - [c68]Honghui Shang, Fang Li, Yunquan Zhang, Libo Zhang, You Fu, Yingxiang Gao, Yangjun Wu, Xiaohui Duan, Rongfen Lin, Xin Liu, Ying Liu, Dexun Chen:
Extreme-scale ab initio quantum raman spectra simulations on the leadership HPC system in China. SC 2021: 6 - [c67]Honghui Shang, Fang Li, Yunquan Zhang, Ying Liu, Libo Zhang, Mingchuan Wu, Yangjun Wu, Di Wei, Huimin Cui, Xin Liu, Fei Wang, Yuxi Ye, Yingxiang Gao, Shuang Ni, Xin Chen, Dexun Chen:
Accelerating all-electron ab initio simulation of raman spectra for biological systems. SC 2021: 41 - [c66]Honghui Shang, Xin Chen, Xingyu Gao, Rongfen Lin, Lifang Wang, Fang Li, Qian Xiao, Lei Xu, Qiang Sun, Leilei Zhu, Fei Wang, Yunquan Zhang, Haifeng Song:
TensorKMC: kinetic Monte Carlo simulation of 50 trillion atoms driven by deep learning on a new generation of Sunway supercomputer. SC 2021: 73 - [c65]Liang Yuan, Hang Cao, Yunquan Zhang, Kun Li, Pengqi Lu, Yue Yue:
Temporal vectorization for stencils. SC 2021: 82 - [c64]Kun Li, Liang Yuan, Yunquan Zhang, Yue Yue:
Reducing redundancy in data organization and arithmetic calculation for stencil computations. SC 2021: 84 - [i14]Kun Li, Liang Yuan, Yunquan Zhang, Yue Yue, Hang Cao, Pengqi Lu:
An Efficient Vectorization Scheme for Stencil Computation. CoRR abs/2103.08825 (2021) - [i13]Pengqi Lu, Liang Yuan, Yunquan Zhang, Hang Cao, Kun Li:
AutoFlow: Hotspot-Aware, Dynamic Load Balancing for Distributed Stream Processing. CoRR abs/2103.08888 (2021) - [i12]Kun Li, Liang Yuan, Yunquan Zhang, Yue Yue, Hang Cao, Pengqi Lu:
Reducing Redundancy in Data Organization and Arithmetic Calculation for Stencil Computations. CoRR abs/2103.09235 (2021) - [i11]Hang Cao, Liang Yuan, He Zhang, Yunquan Zhang:
Enhanced AGCM3D: A Highly Scalable Dynamical Core of Atmospheric General Circulation Model Based on Leap-Format. CoRR abs/2103.10114 (2021) - [i10]Kun Li, Liang Yuan, Yunquan Zhang, Gongwei Chen:
An Accurate and Efficient Large-scale Regression Method through Best Friend Clustering. CoRR abs/2104.10819 (2021) - 2020
- [j33]Honghui Shang, Lei Xu, Baodong Wu, Xinming Qin, Yunquan Zhang, Jinlong Yang:
The dynamic parallel distribution algorithm for hybrid density-functional calculations in HONPAS package. Comput. Phys. Commun. 254: 107204 (2020) - [j32]Wei Li, Jun Liang, Yunquan Zhang, Haipeng Jia, Lin Xiao, Qing Li:
Accelerated LiDAR data processing algorithm for self-driving cars on the heterogeneous computing platform. IET Comput. Digit. Tech. 14(5): 201-209 (2020) - [j31]Daobi Chen, Liang Yuan, Yunquan Zhang, Jingfu Yan, David K. Kahaner:
HPC software capability landscape in China. Int. J. High Perform. Comput. Appl. 34(1) (2020) - [j30]Xinming Qin, Honghui Shang, Lei Xu, Wei Hu, Jinlong Yang, Shigang Li, Yunquan Zhang:
The static parallel distribution algorithms for hybrid density-functional calculations in HONPAS package. Int. J. High Perform. Comput. Appl. 34(2) (2020) - [j29]Daning Cheng, Shigang Li, Yunquan Zhang:
WP-SGD: Weighted parallel SGD for distributed unbalanced-workload training system. J. Parallel Distributed Comput. 145: 202-216 (2020) - [j28]Liang Yuan, Yunquan Zhang, Xuerui Bai, Guangting Zhang:
并行程序设计语言中局部性机制的研究 (Research on Locality-aware Design Mechanism of State-of-the-art Parallel Programming Languages). 计算机科学 47(1): 7-16 (2020) - [j27]Kun Li, Shigang Li, Shan Huang, Yifeng Chen, Yunquan Zhang:
FastNBL: fast neighbor lists establishment for molecular dynamics simulation based on bitwise operations. J. Supercomput. 76(7): 5501-5520 (2020) - [j26]Zhihao Li, Haipeng Jia, Yunquan Zhang, Tun Chen, Liang Yuan, Richard W. Vuduc:
Automatic Generation of High-Performance FFT Kernels on Arm and X86 CPUs. IEEE Trans. Parallel Distributed Syst. 31(8): 1925-1941 (2020) - [c63]Ke Zhan, Zhonghua Lu, Yunquan Zhang:
Performance Optimization for Feature Extraction Section of DeepChem. ICA3PP (1) 2020: 290-304 - [c62]Hang Cao, Liang Yuan, He Zhang, Baodong Wu, Shigang Li, Pengqi Lu, Yunquan Zhang, Yongjun Xu, Minghua Zhang:
A Highly Efficient Dynamical Core of Atmospheric General Circulation Model based on Leap-Format. IPDPS 2020: 95-104 - [i9]Zhixiang Ren, Yongheng Liu, Tianhui Shi, Lei Xie, Yue Zhou, Jidong Zhai, Youhui Zhang, Yunquan Zhang, Wenguang Chen:
AIPerf: Automated machine learning as an AI-HPC benchmark. CoRR abs/2008.07141 (2020) - [i8]Liang Yuan, Hang Cao, Yunquan Zhang, Kun Li, Pengqi Lu, Yue Yue:
Temporal Vectorization for Stencils. CoRR abs/2010.04868 (2020)
2010 – 2019
- 2019
- [j25]Di Zhang, Yunquan Zhang, Qiang Niu, Xingbao Qiu:
Mining concise patterns on graph-connected itemsets. Neurocomputing 336: 27-35 (2019) - [j24]Zhihao Li, Haipeng Jia, Yunquan Zhang, Shice Liu, Shigang Li, Xiao Wang, Hao Zhang:
Efficient parallel optimizations of a high-performance SIFT on GPUs. J. Parallel Distributed Comput. 124: 78-91 (2019) - [j23]Yunquan Zhang:
2018年中国高性能计算机发展现状分析与展望 (State-of-the-art Analysis and Perspectives of 2018 China HPC Development). 计算机科学 46(1): 1-5 (2019) - [j22]Liang Yuan, Chen Ding, Wesley Smith, Peter J. Denning, Yunquan Zhang:
A Relational Theory of Locality. ACM Trans. Archit. Code Optim. 16(3): 33:1-33:26 (2019) - [j21]Kun Li, Shigang Li, Shan Huang, Yifeng Chen, Yunquan Zhang:
Correction to: FastNBL: fast neighbor lists establishment for molecular dynamics simulation based on bitwise operations. J. Supercomput. 75(12): 8339-8340 (2019) - [c61]Liang Yuan, Shan Huang, Yunquan Zhang, Hang Cao:
Tessellating Star Stencils. ICPP 2019: 43:1-43:10 - [c60]Daning Cheng, Hanping Zhang, Fen Xia, Shigang Li, Yunquan Zhang:
Using Gradient Based Multikernel Gaussian Process and Meta-Acquisition Function to Accelerate SMBO. ICTAI 2019: 440-447 - [c59]Kun Li, Shigang Li, Bei Wang, Yifeng Chen, Yunquan Zhang:
swMD: Performance Optimizations for Molecular Dynamics Simulation on Sunway Taihulight. ISPA/BDCloud/SocialCom/SustainCom 2019: 511-518 - [c58]Zhihao Li, Haipeng Jia, Yunquan Zhang, Tun Chen, Liang Yuan, Luning Cao, Xiao Wang:
AutoFFT: a template-based FFT codes auto-generation framework for ARM and X86 CPUs. SC 2019: 25:1-25:15 - [c57]Kun Li, Honghui Shang, Yunquan Zhang, Shigang Li, Baodong Wu, Dong Wang, Libo Zhang, Fang Li, Dexun Chen, Zhiqiang Wei:
OpenKMC: a KMC design for hundred-billion-atom simulation using millions of cores on Sunway Taihulight. SC 2019: 68:1-68:16 - [i7]Zihan Jiang, Wanling Gao, Lei Wang, Xingwang Xiong, Yuchen Zhang, Xu Wen, Chunjie Luo, Hainan Ye, Yunquan Zhang, Shengzhong Feng, Kenli Li, Weijia Xu, Jianfeng Zhan:
HPC AI500: A Benchmark Suite for HPC AI Systems. CoRR abs/1908.02607 (2019) - [i6]Daning Cheng, Hanping Zhang, Fen Xia, Shigang Li, Yunquan Zhang:
The Scalability for Parallel Machine Learning Training Algorithm: Dataset Matters. CoRR abs/1910.11510 (2019) - 2018
- [j20]Shigang Li, Yunquan Zhang, Torsten Hoefler:
Cache-Oblivious MPI All-to-All Communications Based on Morton Order. IEEE Trans. Parallel Distributed Syst. 29(3): 542-555 (2018) - [c56]Zihan Jiang, Wanling Gao, Lei Wang, Xingwang Xiong, Yuchen Zhang, Xu Wen, Chunjie Luo, Hainan Ye, Xiaoyi Lu, Yunquan Zhang, Shengzhong Feng, Kenli Li, Weijia Xu, Jianfeng Zhan:
HPC AI500: A Benchmark Suite for HPC AI Systems. Bench 2018: 10-22 - [c55]Xiao Wang, Haipeng Jia, Zhihao Li, Yunquan Zhang:
Implementation and Optimization of Multi-dimensional Real FFT on ARMv8 Platform. ICA3PP (2) 2018: 338-353 - [c54]Baodong Wu, Shigang Li, Hang Cao, Yunquan Zhang, He Zhang, Junmin Xiao, Minghua Zhang:
AGCM3D: A Highly Scalable Finite-Difference Dynamical Core of Atmospheric General Circulation Model Based on 3D Decomposition. ICPADS 2018: 355-364 - [c53]Junmin Xiao, Shigang Li, Baodong Wu, He Zhang, Kun Li, Erlin Yao, Yunquan Zhang, Guangming Tan:
Communication-Avoiding for Dynamical Core of Atmospheric General Circulation Model. ICPP 2018: 12:1-12:10 - [c52]Shigang Li, Baodong Wu, Yunquan Zhang, Xianmeng Wang, Jianjiang Li, Changjun Hu, Jue Wang, Yangde Feng, Ningming Nie:
Massively Scaling the Metal Microscopic Damage Simulation on Sunway TaihuLight Supercomputer. ICPP 2018: 47:1-47:11 - [c51]Liang Yuan, Wesley Smith, Sicong Fan, Zixu Chen, Chen Ding, Yunquan Zhang:
Footmark: A New Formulation for Working Set Statistics. LCPC 2018: 61-69 - [c50]Di Zhang, Yunquan Zhang, Qiang Niu, Xingbao Qiu:
Rolling Forecasting Forward by Boosting Heterogeneous Kernels. PAKDD (1) 2018: 248-260 - [e2]Zongben Xu, Xinbo Gao, Qiguang Miao, Yunquan Zhang, Jiajun Bu:
Big Data - 6th CCF Conference, Big Data 2018, Xi'an, China, October 11-13, 2018, Proceedings. Communications in Computer and Information Science 945, Springer 2018, ISBN 978-981-13-2921-0 [contents] - [i5]Liang Yuan, Chen Ding, Peter J. Denning, Yunquan Zhang:
A Measurement Theory of Locality. CoRR abs/1802.01254 (2018) - [i4]Daning Cheng, Fen Xia, Shigang Li, Yunquan Zhang:
Asynchronous Parallel Sampling Gradient Boosting Decision Tree. CoRR abs/1804.04659 (2018) - [i3]Daning Cheng, Hanping Zhang, Fen Xia, Shigang Li, Yunquan Zhang:
Using Known Information to Accelerate HyperParameters Optimization Based on SMBO. CoRR abs/1811.03322 (2018) - 2017
- [j19]Baodong Wu, Shigang Li, Yunquan Zhang, Ningming Nie:
Hybrid-optimization strategy for the communication of large-scale Kinetic Monte Carlo simulation. Comput. Phys. Commun. 211: 113-123 (2017) - [j18]Vijayalakshmi Srinivasan, Yunquan Zhang:
Special Issue on Network and Parallel Computing. Int. J. Parallel Program. 45(1): 1-3 (2017) - [c49]Zhihao Li, Haipeng Jia, Yunquan Zhang:
HartSift: A High-Accuracy and Real-Time SIFT Based on GPU. ICPADS 2017: 135-142 - [c48]Shigang Li, Yunquan Zhang, Torsten Hoefler:
POSTER: Cache-Oblivious MPI All-to-All Communications on Many-Core Architectures. PPoPP 2017: 445-446 - [c47]Liang Yuan, Yunquan Zhang, Peng Guo, Shan Huang:
Tessellating stencils. SC 2017: 49 - [i2]Daning Cheng, Shigang Li, Yunquan Zhang:
Weighted parallel SGD for distributed unbalanced-workload training system. CoRR abs/1708.04801 (2017) - [i1]Daning Cheng, Shigang Li, Yunquan Zhang:
Asynchronous COMID: the theoretic basis for transmitted data sparsification tricks on Parameter Server. CoRR abs/1709.02091 (2017) - 2016
- [j17]Yunquan Zhang, Ji-Lin Zhang:
Workshop on high performance data intensive computing. Concurr. Comput. Pract. Exp. 28(6): 1695-1696 (2016) - [j16]Renbo Pang, Yunquan Zhang, Guangming Tan, Jianliang Xu, Haipeng Jia, Qingchun Xie:
边缘海静力数值预报模式并行算法研究 (Parallelization of Hydrostatic Numerical Forecasting Model of Marginal Sea). 计算机科学 43(1): 14-17 (2016) - [j15]Tao Luo, Yin Liao, Guoliang Chen, Yunquan Zhang:
P-DOT: a model of computation for big data. Int. J. Parallel Emergent Distributed Syst. 31(3): 233-253 (2016) - [j14]Yunquan Zhang, Ting Cao, Shigang Li, Xinhui Tian, Liang Yuan, Haipeng Jia, Athanasios V. Vasilakos:
Parallel Processing Systems for Big Data: A Survey. Proc. IEEE 104(11): 2114-2136 (2016) - [j13]Yunquan Zhang, Shigang Li, Shengen Yan, Huiyang Zhou:
A Cross-Platform SpMV Framework on Many-Core Architectures. ACM Trans. Archit. Code Optim. 13(4): 33:1-33:25 (2016) - [c46]Chenxi Wang, Ting Cao, John N. Zigman, Fang Lv, Yunquan Zhang, Xiaobing Feng:
Efficient Management for Hybrid Memory in Managed Language Runtime. NPC 2016: 29-42 - 2015
- [j12]Shigang Li, Changjun Hu, Junchao Zhang, Yunquan Zhang:
Automatic tuning of sparse matrix-vector multiplication on multicore clusters. Sci. China Inf. Sci. 58(9): 1-14 (2015) - [j11]Qingkui Gong, Changyou Zhang, Xianyi Zhang, Yunquan Zhang:
基于Julia语言的并行计算方法初探 (Primary Investigation into Parallel Computing in Julia Language). 计算机科学 42(1): 44-46 (2015) - [j10]Ke Zhan, Yunquan Zhang, Ting Wang, Jingjing Zheng, Peng Zhang:
基于Pthreads的并行DSRC压缩算法设计与实现 (Design and Implementation of Parallel DSRC Compression Algorithm Based on Pthreads). 计算机科学 42(1): 90-91 (2015) - [j9]Xiaojing An, Yunquan Zhang, Haipeng Jia:
基于OpenCL的直方图生成算法优化方法研究 (Research on Histogram Generation Algorithm Optimization Based on OpenCL). 计算机科学 42(11): 32-36 (2015) - [c45]Renbo Pang, Jianliang Xu, Yunquan Zhang:
Parallel Solving Method of SOR Based on the Numerical Marine Forecasting Model. CCGRID 2015: 733-736 - [c44]Xiaomin Zhu, Junchao Zhang, Kazutomo Yoshii, Shigang Li, Yunquan Zhang, Pavan Balaji:
Analyzing MPI-3.0 Process-Level Shared Memory: A Case Study with Stencil Computations. CCGRID 2015: 1099-1106 - [c43]Shigang Li, Yunquan Zhang, Chunyang Xiang, Lei Shi:
Fast Convolution Operations on Many-Core Architectures. HPCC/CSS/ICESS 2015: 316-323 - [c42]Xiaojing An, Haipeng Jia, Yunquan Zhang:
Optimized Password Recovery for Encrypted RAR on GPUs. HPCC/CSS/ICESS 2015: 591-598 - [c41]Mengran Fan, Haipeng Jia, Yunquan Zhang, Xiaojing An, Ting Cao:
Optimizing Image Sharpening Algorithm on GPU. ICPP 2015: 230-239 - [c40]James Dinan, Wenguang Chen, Xiaosong Ma, Pavan Balaji, Satoshi Matsuoka, Jiayuan Meng, Yunquan Zhang:
AsHES Introduction and Committees. IPDPS Workshops 2015: 591-592 - [e1]Xiaohua Jia, Tharam S. Dillon, Kuan-Ching Li, Yong Zhang, Nei Kato, Kui Wu, Yunquan Zhang:
Ninth International Conference on Frontier of Computer Science and Technology, FCST 2015, Dalian, China, August 26-28, 2015. IEEE Computer Society 2015, ISBN 978-1-4673-9295-2 [contents] - 2014
- [j8]Yiqung Liu, Yan Li, Yunquan Zhang, Xianyi Zhang:
Memory Efficient Two-Pass 3D FFT Algorithm for Intel® Xeon Phi™ Coprocessor. J. Comput. Sci. Technol. 29(6): 989-1002 (2014) - [j7]Ke Zhan, Yunquan Zhang:
Function Prediction of Proteins in Yeast Networks Based on the MCL Algorithm. J. Softw. 9(5): 1157-1162 (2014) - [c39]Qingchun Xie, Yunquan Zhang, Haipeng Jia, Yongquan Lu:
Research on Mahalanobis Distance Algorithm Optimization Based on OpenCL. HPCC/CSS/ICESS 2014: 84-91 - [c38]Changmao Wu, Yunquan Zhang, Congli Yang, Yutong Lu:
Physically based parallel ray tracer for the Metropolis light transport algorithm on the Tianhe-2 supercomputer. ICPADS 2014: 444-453 - [c37]Yunquan Zhang:
AsHES Introduction and Committees. IPDPS Workshops 2014: 904-906 - [c36]Shengen Yan, Chao Li, Yunquan Zhang, Huiyang Zhou:
yaSpMV: yet another SpMV framework on GPUs. PPoPP 2014: 107-118 - 2013
- [j6]Yan Li, Yunquan Zhang, Yiqung Liu, Guoping Long, Haipeng Jia:
MPFFT: An Auto-Tuning FFT Library for OpenCL GPUs. J. Comput. Sci. Technol. 28(1): 90-105 (2013) - [c35]Tao Luo, Yin Liao, Guoliang Chen, Yunquan Zhang:
P-DOT: A model of computation for big data. IEEE BigData 2013: 31-37 - [c34]Weiyan Wang, Yunquan Zhang, Guoping Long, Shengen Yan, Haipeng Jia:
CLSIFT: An Optimization Study of the Scale Invariance Feature Transform on GPUs. HPCC/EUC 2013: 93-100 - [c33]Changmao Wu, Yunquan Zhang, Congli Yang:
Large Scale Satellite Imagery Simulations with Physically Based Ray Tracing on Tianhe-1A Supercomputer. HPCC/EUC 2013: 549-556 - [c32]Tao Luo, Guoliang Chen, Yunquan Zhang:
H-DB: Yet Another Big Data Hybrid System of Hadoop and DBMS. ICA3PP (1) 2013: 324-335 - [c31]Palden Lama, Yan Li, Ashwin M. Aji, Pavan Balaji, James Dinan, Shucai Xiao, Yunquan Zhang, Wu-chun Feng, Rajeev Thakur, Xiaobo Zhou:
pVOCL: Power-Aware Dynamic Placement and Migration in Virtualized GPU Environments. ICDCS 2013: 145-154 - [c30]Shengen Yan, Guoping Long, Yunquan Zhang:
StreamScan: fast scan algorithms for GPUs without global barrier synchronization. PPoPP 2013: 229-238 - [c29]Qian Wang, Xianyi Zhang, Yunquan Zhang, Qing Yi:
AUGEM: automatically generate high performance dense linear algebra kernels on x86 CPUs. SC 2013: 25:1-25:12 - 2012
- [c28]Liang Yuan, Yunquan Zhang:
A Locality-based Performance Model for Load-and-Compute Style Computation. CLUSTER 2012: 566-571 - [c27]Haipeng Jia, Yunquan Zhang, Guoping Long, Jianliang Xu, Shengen Yan, Yan Li:
GPURoofline: A Model for Guiding Performance Optimizations on GPUs. Euro-Par 2012: 920-932 - [c26]Haipeng Jia, Yunquan Zhang, Weiyan Wang, Jianliang Xu:
Accelerating Viola-Jones Face Detection Algorithm on GPUs. HPCC-ICESS 2012: 396-403 - [c25]Haipeng Jia, Yunquan Zhang, Guoping Long, Shengen Yan:
An Insightful Program Performance Tuning Chain for GPU Computing. ICA3PP (1) 2012: 502-516 - [c24]Xianyi Zhang, Qian Wang, Yunquan Zhang:
Model-driven Level 3 BLAS Performance Optimization on Loongson 3A Processor. ICPADS 2012: 684-691 - [c23]Liang Yuan, Chen Ding, Daniel Stefankovic, Yunquan Zhang:
Modeling the Locality in Graph Traversals. ICPP 2012: 138-147 - [c22]Chao Li, Yunquan Zhang, Changwen Zheng, Xiaohui Hu:
Implementing High-performance Intensity Model with Blur Effect on GPUs for Large-scale Star Image Simulation. IPDPS Workshops 2012: 1879-1888 - 2011
- [c21]Xiangzheng Sun, Yunquan Zhang, Ting Wang, Guoping Long, Xianyi Zhang, Yan Li:
CRSD: Application Specific Auto-tuning of SpMV for Diagonal Sparse Matrices. Euro-Par (2) 2011: 316-327 - [c20]Yan Li, Yunquan Zhang, Haipeng Jia, Guoping Long, Ke Wang:
Automatic FFT Performance Tuning on OpenCL GPUs. ICPADS 2011: 228-235 - [c19]Xiangzheng Sun, Yunquan Zhang, Ting Wang, Xianyi Zhang, Liang Yuan, Li Rao:
Optimizing SpMV for Diagonal Sparse Matrices on GPU. ICPP 2011: 492-501 - 2010
- [j5]Yunquan Zhang, Jiachang Sun, Guoxing Yuan, Linbo Zhang:
Perspectives of China's HPC system development: a view from the 2009 China HPC TOP100 list. Frontiers Comput. Sci. China 4(4): 437-444 (2010) - [c18]Lei Wang, Yunquan Zhang, Xianyi Zhang, Fangfang Liu:
Accelerating Linpack Performance with Mixed Precision Algorithm on CPU+GPGPU Heterogeneous Cluster. CIT 2010: 1169-1174 - [c17]Jing Wang, Yunquan Zhang, Xianyi Zhang, Xiangzheng Sun, Quanhu Sheng:
QuantWiz: A scalable parallel software package for label-free protein quantification. BIC-TA 2010: 976-980 - [c16]Liang Yuan, Yunquan Zhang, Yuxin Tang, Li Rao, Xiangzheng Sun:
LogGPH: A Parallel Computational Model with Hierarchical Communication Awareness. CSE 2010: 268-274 - [c15]Chao Yang, Yunquan Zhang, Ligang Li:
Numerical Simulation of the Thermal Convection in the Earth's Outer Core. HPCC 2010: 552-555 - [c14]Liang Yuan, Yunquan Zhang, Xiangzheng Sun, Ting Wang:
Optimizing Sparse Matrix Vector Multiplication Using Diagonal Storage Matrix Format. HPCC 2010: 585-590 - [c13]Yan Li, Yunquan Zhang, Ke Wang, Wenhua Guan:
Heterogeneous Multi-core Parallel SGEMM Performance Testing and Analysis on Cell/B.E Processor. NAS 2010: 202-207
2000 – 2009
- 2009
- [j4]Yuxin Tang, Yunquan Zhang, Hu Chen:
A parallel shortest path algorithm based on graph-partitioning and iterative correcting. Comput. Syst. Sci. Eng. 24(5) (2009) - [c12]Chao Yang, Ligang Li, Yunquan Zhang:
Development of a Scalable Solver for the Earth's Core Convection. HPCA (China) 2009: 497-502 - [c11]Shengfei Liu, Yunquan Zhang, Xiangzheng Sun, RongRong Qiu:
Performance Evaluation of Multithreaded Sparse Matrix-Vector Multiplication Using OpenMP. HPCC 2009: 659-665 - [c10]Jing Wang, Yunquan Zhang, Xianyi Zhang, Xiangzheng Sun, Zelin Hu, Sujun Li, Rong Zeng:
QuantWiz: A Parallel Software Package for LC-MS-based Label-Free Protein Quantification. HPCC 2009: 683-687 - [c9]Yuan Yu, Yunquan Zhang, Ting Wang, Jiachang Sun, Xianyi Zhang, Yuxin Tang, Li Rao:
Early Performance Evaluation of Dawning 5000A and DeepComp 7000. ICPADS 2009: 578-585 - 2008
- [j3]Jian Zhang, Wenhui Zhang, Naijun Zhan, Yi-Dong Shen, Haiming Chen, Yunquan Zhang, Yongji Wang, Enhua Wu, Hongan Wang, Xueyang Zhu:
Basic research in computer science and software engineering at SKLCS. Frontiers Comput. Sci. China 2(1): 1-11 (2008) - [c8]Yuxin Tang, Yunquan Zhang, Hu Chen:
A Parallel Shortest Path Algorithm Based on Graph-Partitioning and Iterative Correcting. HPCC 2008: 155-161 - [c7]Di Zhang, Yunquan Zhang, Shengfei Liu, Xiaodi Huang:
Parallelization of FM-Index. HPCC 2008: 169-173 - [c6]Yuan Tang, Yunquan Zhang:
Utilizing the Multi-threading Techniques to Improve the Two-Level Checkpoint/Rollback System for MPI Applications. HPCC 2008: 864-869 - [c5]E. Yuan, Yunquan Zhang, Xiangzheng Sun:
Memory Access Complexity Analysis of SpMV in RAM (h) Model. HPCC 2008: 913-920 - 2007
- [j2]Yunquan Zhang, Guoliang Chen, Guangzhong Sun, Qiankun Miao:
Models of parallel computation: a survey and classification. Frontiers Comput. Sci. China 1(2): 156-165 (2007) - [c4]Di Zhang, Yunquan Zhang, Jing Chen:
Efficient Construction of FM-index Using Overlapping Block Processing for Large Scale Texts. ECIR 2007: 113-123 - [c3]Yunquan Zhang, Jiachang Sun, Guoxing Yuan, Linbo Zhang:
A brief introduction to China HPC TOP100: from 2002 to 2006. China HPC 2007: 32-36 - [c2]Yunquan Zhang, Ying Chen, Yuan Tang:
Block size selection of parallel LU and QR on PVP-based and RISC-based supercomputers. China HPC 2007: 115-125 - 2006
- [j1]Guoliang Chen, Guangzhong Sun, Yunquan Zhang, Zeyao Mo:
Study on Parallel Computing. J. Comput. Sci. Technol. 21(5): 665-673 (2006) - 2003
- [c1]Yuan Tang, Yunquan Zhang, Jiachang Sun, Yu-Cheng Li:
Hardware Impact on Communication Performance of Beowulf LINUX Cluster. Applied Informatics 2003: 495-500
last updated on 2024-10-21 20:29 CEST by the dblp team
all metadata released as open data under CC0 1.0 license