{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,10,22]],"date-time":"2024-10-22T22:27:25Z","timestamp":1729636045757,"version":"3.28.0"},"publisher-location":"New York, NY, USA","reference-count":131,"publisher":"ACM","funder":[{"DOI":"10.13039\/100006785","name":"Google","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100006785","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100002418","name":"Intel Corporation","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100002418","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2022,3,28]]},"DOI":"10.1145\/3492321.3519583","type":"proceedings-article","created":{"date-parts":[[2022,3,28]],"date-time":"2022-03-28T14:28:18Z","timestamp":1648477698000},"page":"747-764","update-policy":"http:\/\/dx.doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":10,"title":["APT-GET"],"prefix":"10.1145","author":[{"given":"Saba","family":"Jamilan","sequence":"first","affiliation":[{"name":"University of California"}]},{"given":"Tanvir Ahmed","family":"Khan","sequence":"additional","affiliation":[{"name":"University of Michigan"}]},{"given":"Grant","family":"Ayers","sequence":"additional","affiliation":[{"name":"Google"}]},{"given":"Baris","family":"Kasikci","sequence":"additional","affiliation":[{"name":"University of Michigan"}]},{"given":"Heiner","family":"Litz","sequence":"additional","affiliation":[{"name":"University of California"}]}],"member":"320","published-online":{"date-parts":[[2022,3,28]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"2008. scipy.signal. https:\/\/docs.scipy.org\/doc\/scipy\/reference\/signal.html"},{"key":"e_1_3_2_1_2_1","unstructured":"2008. scipy.signal.find_peaks_cwt. 
https:\/\/docs.scipy.org\/doc\/scipy\/reference\/generated\/scipy.signal.find_peaks_cwt.html"},{"key":"e_1_3_2_1_3_1","unstructured":"2018. Support for inserting profile-directed cache prefetches. https:\/\/reviews.llvm.org\/D54052"},{"volume-title":"llvm-mca - LLVM Machine Code Analyzer. https:\/\/llvm.org\/docs\/CommandGuide\/llvm-mca.html. [Online","year":"2021","key":"e_1_3_2_1_4_1","unstructured":"2021. llvm-mca - LLVM Machine Code Analyzer. https:\/\/llvm.org\/docs\/CommandGuide\/llvm-mca.html. [Online; accessed 9-October-2021]."},{"key":"e_1_3_2_1_5_1","unstructured":"2022. Profile Guided Software Prefetching. https:\/\/github.com\/SabaJamilan\/Profile-Guided-Software-Prefetching"},{"volume-title":"TensorFlow: A System for Large-Scale Machine Learning. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16)","year":"2016","author":"Abadi Mart\u00edn","key":"e_1_3_2_1_6_1","unstructured":"Mart\u00edn Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, Manjunath Kudlur, Josh Levenberg, Rajat Monga, Sherry Moore, Derek G. Murray, Benoit Steiner, Paul Tucker, Vijay Vasudevan, Pete Warden, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng. 2016. TensorFlow: A System for Large-Scale Machine Learning. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16). USENIX Association, Savannah, GA, 265--283. 
https:\/\/www.usenix.org\/conference\/osdi16\/technical-sessions\/presentation\/abadi"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC.2015.11"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/2925426.2926254"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2017.7863749"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/3296957.3173189"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/PACT.2003.1238005"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/379240.379251"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2018.00061"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3373376.3378498"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/3307650.3322234"},{"volume-title":"Alex Woo, and Maurice Yarrow.","year":"1995","author":"Bailey David","key":"e_1_3_2_1_16_1","unstructured":"David Bailey, Tim Harris, William Saphir, Rob Van Der Wijngaart, Alex Woo, and Maurice Yarrow. 1995. The NAS parallel benchmarks 2.0. Technical Report. 
Technical Report NAS-95-020, NASA Ames Research Center."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2018.00021"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2019.00053"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2013.6544839"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2019.00051"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/3352460.3358325"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1145\/3302424.3303989"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/3190508.3190532"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1145\/3173162.3173177"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3302424.3303988"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/106972.106979"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/195470.195557"},{"volume-title":"Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 427--443","year":"2020","author":"Chakraborttii Chandranil","key":"e_1_3_2_1_28_1","unstructured":"Chandranil Chakraborttii and Heiner Litz. 2020. Learning i\/o access patterns to improve prefetching in ssds. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases. 
Springer, 427--443."},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/HiPC.2014.7116904"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/2854038.2854044"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2004.1319989"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/376284.375688"},{"volume-title":"13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18)","year":"2018","author":"Chen Tianqi","key":"e_1_3_2_1_34_1","unstructured":"Tianqi Chen, Thierry Moreau, Ziheng Jiang, Lianmin Zheng, Eddie Yan, Haichen Shen, Meghan Cowan, Leyuan Wang, Yuwei Hu, Luis Ceze, et al. 2018. TVM: An automated end-to-end optimizing compiler for deep learning. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18). 578--594."},{"key":"e_1_3_2_1_35_1","first-page":"69","article-title":"Data access microarchitectures for superscalar processors with compiler-assisted data prefetching","volume":"24","author":"Chen William Y","year":"1991","unstructured":"William Y Chen, Scott A Mahlke, Pohua P Chang, and Wen-mei W Hwu. 1991. Data access microarchitectures for superscalar processors with compiler-assisted data prefetching. In MICRO, Vol. 24. 
69--73.","journal-title":"MICRO"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1109\/IISWC47752.2019.9042166"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/512529.512554"},{"key":"e_1_3_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.5555\/774861.774869"},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2001.937427"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/605432.605427"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/2815400.2815409"},{"key":"e_1_3_2_1_42_1","doi-asserted-by":"crossref","unstructured":"Fredrik Dahlgren and Per Stenstr\u00f6m. 1995. Effectiveness of Hardware-Based Stride and Sequential Prefetching in Shared-Memory Multiprocessors.. In hpca. 68--77.","DOI":"10.1109\/HPCA.1995.386554"},{"volume-title":"Slides from Linux Kongress","author":"De Melo Arnaldo Carvalho","key":"e_1_3_2_1_43_1","unstructured":"Arnaldo Carvalho De Melo. 2010. The new linux'perf'tools. In Slides from Linux Kongress, Vol. 18. 1--42."},{"key":"e_1_3_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1145\/1250727.1250732"},{"volume-title":"Improved peak detection in mass spectrum by incorporating continuous wavelet transform-based pattern matching. bioinformatics 22, 17","year":"2006","author":"Du Pan","key":"e_1_3_2_1_45_1","unstructured":"Pan Du, Warren A Kibbe, and Simon M Lin. 2006. 
Improved peak detection in mass spectrum by incorporating continuous wavelet transform-based pattern matching. bioinformatics 22, 17 (2006), 2059--2065."},{"key":"e_1_3_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/263580.263597"},{"key":"e_1_3_2_1_47_1","unstructured":"Stephane Eranian. 2021. Add AMD Fam19h Branch Sampling support. https:\/\/lwn.net\/Articles\/875869\/."},{"key":"e_1_3_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.2200\/S00581ED1V01Y201405CAC028"},{"key":"e_1_3_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/3302424.3303977"},{"volume-title":"Anastasia Ailamaki, and Babak Falsafi.","year":"2012","author":"Ferdman Michael","key":"e_1_3_2_1_50_1","unstructured":"Michael Ferdman, Almutaz Adileh, Onur Kocberber, Stavros Volos, Mohammad Alisafaee, Djordje Jevdjic, Cansu Kaynak, Adrian Daniel Popescu, Anastasia Ailamaki, and Babak Falsafi. 2012. Clearing the clouds: a study of emerging scale-out workloads on modern hardware. In ACM SIGPLAN Notices, Vol. 47. ACM, 37--48."},{"volume-title":"1st JILP Data Prefetching Championship 29","year":"2009","author":"Ferdman Michael","key":"e_1_3_2_1_51_1","unstructured":"Michael Ferdman, Stephen Somogyi, and Babak Falsafi. 2009. Spatial memory streaming with rotated patterns. 1st JILP Data Prefetching Championship 29 (2009)."},{"volume-title":"Propeller: Profile Guided Optimizing Large Scale LLVM-based Relinker. 
https:\/\/github.com\/google\/llvm-propeller.","year":"2020","key":"e_1_3_2_1_52_1","unstructured":"Google. 2020. Propeller: Profile Guided Optimizing Large Scale LLVM-based Relinker. https:\/\/github.com\/google\/llvm-propeller."},{"key":"e_1_3_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/2591635.2667162"},{"key":"e_1_3_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2016.7783764"},{"volume-title":"Learning memory access patterns. arXiv preprint arXiv:1803.02329","year":"2018","author":"Hashemi Milad","key":"e_1_3_2_1_55_1","unstructured":"Milad Hashemi, Kevin Swersky, Jamie A Smith, Grant Ayers, Heiner Litz, Jichuan Chang, Christos Kozyrakis, and Parthasarathy Ranganathan. 2018. Learning memory access patterns. arXiv preprint arXiv:1803.02329 (2018)."},{"key":"e_1_3_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2006.32"},{"key":"e_1_3_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1145\/780822.781161"},{"volume-title":"Intel Architecture Code Analyzer. https:\/\/software.intel.com\/content\/www\/us\/en\/develop\/articles\/intel-architecture-code-analyzer.html. [Online","year":"2021","author":"Intel Corporation","key":"e_1_3_2_1_58_1","unstructured":"Intel Corporation. 2019. Intel Architecture Code Analyzer. https:\/\/software.intel.com\/content\/www\/us\/en\/develop\/articles\/intel-architecture-code-analyzer.html. 
[Online; accessed 9-October-2021]."},{"key":"e_1_3_2_1_59_1","first-page":"1","article-title":"Access map pattern matching for high performance data cache prefetch","volume":"13","author":"Ishii Yasuo","year":"2011","unstructured":"Yasuo Ishii, Mary Inaba, and Kei Hiraki. 2011. Access map pattern matching for high performance data cache prefetch. Journal of Instruction-Level Parallelism 13, 2011 (2011), 1--24.","journal-title":"Journal of Instruction-Level Parallelism"},{"key":"e_1_3_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/2540708.2540730"},{"key":"e_1_3_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1145\/325096.325162"},{"key":"e_1_3_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1145\/2749469.2750392"},{"key":"e_1_3_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPP.2014.19"},{"volume-title":"Twig: Profile-Guided BTB Prefetching for Data Center Applications. In MICRO-54: 54th Annual IEEE\/ACM International Symposium on Microarchitecture. 816--829","year":"2021","author":"Khan Tanvir Ahmed","key":"e_1_3_2_1_64_1","unstructured":"Tanvir Ahmed Khan, Nathan Brown, Akshitha Sriraman, Niranjan K Soundararajan, Rakesh Kumar, Joseph Devietti, Sreenivas Subramoney, Gilles A Pokam, Heiner Litz, and Baris Kasikci. 2021. Twig: Profile-Guided BTB Prefetching for Data Center Applications. In MICRO-54: 54th Annual IEEE\/ACM International Symposium on Microarchitecture. 
816--829."},{"volume-title":"Proceedings of the 15th USENIX Symposium on Operating Systems Design and Implementation (OSDI) (OSDI","year":"2021","author":"Khan Tanvir Ahmed","key":"e_1_3_2_1_65_1","unstructured":"Tanvir Ahmed Khan, Ian Neal, Gilles Pokam, Barzan Mozafari, and Baris Kasikci. 2021. DMon: Efficient Detection and Correction of Data Locality Problems using Selective Profiling. In Proceedings of the 15th USENIX Symposium on Operating Systems Design and Implementation (OSDI) (OSDI 2021). USENIX Association."},{"volume-title":"I-SPY: Context-Driven Conditional Instruction Prefetching with Coalescing. In 2020 53rd Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO). IEEE, 146--159","year":"2020","author":"Khan Tanvir Ahmed","key":"e_1_3_2_1_66_1","unstructured":"Tanvir Ahmed Khan, Akshitha Sriraman, Joseph Devietti, Gilles Pokam, Heiner Litz, and Baris Kasikci. 2020. I-SPY: Context-Driven Conditional Instruction Prefetching with Coalescing. In 2020 53rd Annual IEEE\/ACM International Symposium on Microarchitecture (MICRO). 
IEEE, 146--159."},{"key":"e_1_3_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA52012.2021.00063"},{"key":"e_1_3_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2016.7783763"},{"key":"e_1_3_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.1145\/3093336.3037701"},{"key":"e_1_3_2_1_70_1","unstructured":"Andi Kleen. 2016. An Introduction to Last Branch Records. https:\/\/lwn.net\/Articles\/680985\/"},{"key":"e_1_3_2_1_71_1","unstructured":"Andi Kleen. 2022. GitHub - andikleen\/pmu-tools: Intel PMU profiling tools. https:\/\/github.com\/andikleen\/pmu-tools"},{"key":"e_1_3_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2004.1281665"},{"volume-title":"Automatic throughput and critical path analysis of x86 and ARM assembly kernels. In 2019 IEEE\/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS)","author":"Laukemann Jan","key":"e_1_3_2_1_73_1","unstructured":"Jan Laukemann, Julian Hammer, Georg Hager, and Gerhard Wellein. 2019. Automatic throughput and critical path analysis of x86 and ARM assembly kernels. In 2019 IEEE\/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS). IEEE, 1--6."},{"volume-title":"Automated instruction stream throughput prediction for intel and amd microarchitectures. 
In 2018 IEEE\/ACM performance modeling, benchmarking and simulation of high performance computer systems (PMBS)","author":"Laukemann Jan","key":"e_1_3_2_1_74_1","unstructured":"Jan Laukemann, Julian Hammer, Johannes Hofmann, Georg Hager, and Gerhard Wellein. 2018. Automated instruction stream throughput prediction for intel and amd microarchitectures. In 2018 IEEE\/ACM performance modeling, benchmarking and simulation of high performance computer systems (PMBS). IEEE, 121--131."},{"key":"e_1_3_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2008.224"},{"key":"e_1_3_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.1145\/2133382.2133384"},{"key":"e_1_3_2_1_77_1","unstructured":"Jure Leskovec and Andrej Krevl. 2014. 
SNAP Datasets: Stanford large network dataset collection."},{"key":"e_1_3_2_1_78_1","doi-asserted-by":"publisher","DOI":"10.1145\/1272996.1273017"},{"key":"e_1_3_2_1_79_1","doi-asserted-by":"publisher","DOI":"10.1145\/3503222.3507745"},{"key":"e_1_3_2_1_80_1","doi-asserted-by":"publisher","DOI":"10.1109\/MDT.2005.134"},{"key":"e_1_3_2_1_81_1","doi-asserted-by":"publisher","DOI":"10.5555\/956417.956549"},{"key":"e_1_3_2_1_82_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO.2005.18"},{"key":"e_1_3_2_1_83_1","doi-asserted-by":"publisher","DOI":"10.1145\/237090.237190"},{"key":"e_1_3_2_1_84_1","doi-asserted-by":"publisher","DOI":"10.1145\/514191.514217"},{"key":"e_1_3_2_1_85_1","doi-asserted-by":"publisher","DOI":"10.1145\/1188455.1188677"},{"key":"e_1_3_2_1_86_1","doi-asserted-by":"publisher","DOI":"10.1145\/3461648.3463853"},{"volume-title":"International Conference on machine learning. PMLR, 4505--4515","year":"2019","author":"Mendis Charith","key":"e_1_3_2_1_87_1","unstructured":"Charith Mendis, Alex Renda, Saman Amarasinghe, and Michael Carbin. 2019. Ithemal: Accurate, portable and fast basic block throughput estimation using deep neural networks. In International Conference on machine learning. 
PMLR, 4505--4515."},{"key":"e_1_3_2_1_88_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2016.7446087"},{"key":"e_1_3_2_1_89_1","doi-asserted-by":"publisher","DOI":"10.1016\/0743-7315(91)90014-Z"},{"key":"e_1_3_2_1_90_1","doi-asserted-by":"publisher","DOI":"10.1145\/238721.238734"},{"key":"e_1_3_2_1_91_1","doi-asserted-by":"publisher","DOI":"10.1145\/143365.143488"},{"key":"e_1_3_2_1_92_1","first-page":"45","article-title":"Introducing the graph 500","volume":"19","author":"Murphy Richard C","year":"2010","unstructured":"Richard C Murphy, Kyle B Wheeler, Brian W Barrett, and James A Ang. 2010. Introducing the graph 500. Cray Users Group (CUG) 19 (2010), 45--74.","journal-title":"Cray Users Group (CUG)"},{"key":"e_1_3_2_1_93_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2005.49"},{"key":"e_1_3_2_1_94_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2003.1183532"},{"key":"e_1_3_2_1_95_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2004.10030"},{"key":"e_1_3_2_1_96_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2019.8661201"},{"key":"e_1_3_2_1_97_1","doi-asserted-by":"publisher","DOI":"10.1145\/3446804.3446843"},{"key":"e_1_3_2_1_98_1","doi-asserted-by":"publisher","DOI":"10.1145\/1755913.1755947"},{"key":"e_1_3_2_1_99_1","unstructured":"Lucas Prates. 2020. Add support for the Branch Record Buffer extension. 
https:\/\/reviews.llvm.org\/D92389."},{"key":"e_1_3_2_1_100_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2014.6835971"},{"key":"e_1_3_2_1_101_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2008.4658635"},{"volume-title":"Google-wide profiling: A continuous profiling infrastructure for data centers","year":"2010","author":"Ren Gang","key":"e_1_3_2_1_102_1","unstructured":"Gang Ren, Eric Tune, Tipp Moseley, Yixin Shi, Silvius Rus, and Robert Hundt. 2010. Google-wide profiling: A continuous profiling infrastructure for data centers. IEEE micro 30, 4 (2010), 65--79."},{"key":"e_1_3_2_1_103_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO50266.2020.00045"},{"key":"e_1_3_2_1_104_1","doi-asserted-by":"publisher","DOI":"10.1145\/384265.291034"},{"key":"e_1_3_2_1_105_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.1999.765944"},{"key":"e_1_3_2_1_106_1","doi-asserted-by":"publisher","DOI":"10.1109\/TC.2003.1183943"},{"key":"e_1_3_2_1_107_1","doi-asserted-by":"publisher","DOI":"10.1145\/2830772.2830793"},{"key":"e_1_3_2_1_108_1","doi-asserted-by":"publisher","DOI":"10.1145\/3445814.3446752"},{"key":"e_1_3_2_1_109_1","doi-asserted-by":"publisher","DOI":"10.1109\/C-M.1978.218016"},{"key":"e_1_3_2_1_110_1","doi-asserted-by":"publisher","DOI":"10.1145\/1150019.1136508"},{"key":"e_1_3_2_1_111_1","doi-asserted-by":"publisher","DOI":"10.1145\/1594835.1504208"},{"key":"e_1_3_2_1_112_1","doi-asserted-by":"publisher","DOI":"10.1145\/3470496.3527430"},{"key":"e_1_3_2_1_113_1","doi-asserted-by":"publisher","DOI":"10.1145\/3307650.3322227"},{"key":"e_1_3_2_1_114_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA51647.2021.00061"},{"volume-title":"Disclosure of hardware prefetcher control on some Intel processors. 
Intel SW Developer Zone","year":"2014","author":"Viswanathan Vish","key":"e_1_3_2_1_115_1","unstructured":"Vish Viswanathan. 2014. Disclosure of hardware prefetcher control on some Intel processors. Intel SW Developer Zone (2014)."},{"key":"e_1_3_2_1_116_1","doi-asserted-by":"publisher","DOI":"10.1145\/859618.859663"},{"key":"e_1_3_2_1_117_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2009.4798239"},{"key":"e_1_3_2_1_118_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2005.50"},{"key":"e_1_3_2_1_119_1","unstructured":"Thomas Willhalm and Roman Dementiev. 2012. Intel Performance Counter Monitor - A Better Way to Measure CPU Utilization. https:\/\/software.intel.com\/en-us\/articles\/intel-performance-counter-monitor#abstracting."},{"volume-title":"High performance compilers for parallel computing","author":"Wolfe Michael Joseph","key":"e_1_3_2_1_120_1","unstructured":"Michael Joseph Wolfe and Michael Wolfe. 1996. High performance compilers for parallel computing. Vol. 102. 
Addison-Wesley Reading."},{"key":"e_1_3_2_1_121_1","doi-asserted-by":"publisher","DOI":"10.1145\/3352460.3358300"},{"key":"e_1_3_2_1_122_1","doi-asserted-by":"publisher","DOI":"10.1145\/3307650.3322225"},{"key":"e_1_3_2_1_123_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-89740-8_16"},{"key":"e_1_3_2_1_124_1","doi-asserted-by":"publisher","DOI":"10.1145\/512529.512555"},{"key":"e_1_3_2_1_125_1","doi-asserted-by":"publisher","DOI":"10.1145\/216585.216588"},{"key":"e_1_3_2_1_126_1","doi-asserted-by":"publisher","DOI":"10.1145\/2830772.2830807"},{"key":"e_1_3_2_1_127_1","doi-asserted-by":"publisher","DOI":"10.1145\/3190508.3190551"},{"volume-title":"Spark: Cluster computing with working sets. HotCloud","year":"2010","author":"Zaharia Matei","key":"e_1_3_2_1_128_1","unstructured":"Matei Zaharia, Mosharaf Chowdhury, Michael J Franklin, Scott Shenker, Ion Stoica, et al. 2010. Spark: Cluster computing with working sets. 
HotCloud (2010)."},{"key":"e_1_3_2_1_129_1","doi-asserted-by":"publisher","DOI":"10.1109\/MICRO50266.2020.00057"},{"key":"e_1_3_2_1_130_1","doi-asserted-by":"publisher","DOI":"10.1145\/3190508.3190534"},{"key":"e_1_3_2_1_131_1","doi-asserted-by":"publisher","DOI":"10.1109\/CGO.2006.4"},{"key":"e_1_3_2_1_132_1","doi-asserted-by":"publisher","DOI":"10.1109\/HPCA.2007.346187"}],"event":{"name":"EuroSys '22: Seventeenth European Conference on Computer Systems","sponsor":["SIGOPS ACM Special Interest Group on Operating Systems"],"location":"Rennes France","acronym":"EuroSys '22"},"container-title":["Proceedings of the Seventeenth European Conference on Computer Systems"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3492321.3519583","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,9,21]],"date-time":"2024-09-21T03:51:37Z","timestamp":1726890697000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3492321.3519583"}},"subtitle":["profile-guided