iBet uBet web content aggregator. Adding the entire web to your favor.
iBet uBet web content aggregator. Adding the entire web to your favor.



Link to original content: https://api.crossref.org/works/10.1002/CPE.1337
{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,9,12]],"date-time":"2023-09-12T05:40:33Z","timestamp":1694497233319},"reference-count":10,"publisher":"Wiley","issue":"1","license":[{"start":{"date-parts":[[2008,6,6]],"date-time":"2008-06-06T00:00:00Z","timestamp":1212710400000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/onlinelibrary.wiley.com\/termsAndConditions#vor"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Concurrency and Computation"],"published-print":{"date-parts":[[2009,1]]},"abstract":"Abstract<\/jats:title>An optimizing compiler cannot generate one best code pattern for all input data. There is no \u2018one optimization fits all\u2019 inputs. To attain high performance for a large range of inputs, it is therefore desirable to resort to some kind of specialization. Data specialization significantly improves the performance delivered by the compiler\u2010generated codes. Specialization is, however, limited by code expansion and introduces a time overhead for the selection of the appropriate version. We propose a new method to specialize the code at the assembly level for loop structures. Our specialization scheme focuses on different ranges of loop trip count and combines all these versions into a code that switches smoothly from one to the other while the iteration count increases. Hence, the resulting code achieves the same level of performance than each version on its specific iteration interval. We illustrate the benefit of our method on the SPEC benchmarks with detailed experimental results. Copyright \u00a9 2008 John Wiley & Sons, Ltd.<\/jats:p>","DOI":"10.1002\/cpe.1337","type":"journal-article","created":{"date-parts":[[2008,6,6]],"date-time":"2008-06-06T19:46:41Z","timestamp":1212781601000},"page":"71-84","source":"Crossref","is-referenced-by-count":1,"title":["Compositional approach applied to loop specialization"],"prefix":"10.1002","volume":"21","author":[{"given":"L.","family":"Djoudi","sequence":"first","affiliation":[]},{"given":"J.\u2010T.","family":"Acquaviva","sequence":"additional","affiliation":[]},{"given":"D.","family":"Barthou","sequence":"additional","affiliation":[]}],"member":"311","published-online":{"date-parts":[[2008,6,6]]},"reference":[{"key":"e_1_2_8_2_2","doi-asserted-by":"publisher","DOI":"10.1016\/0743-7315(91)90118-S"},{"key":"e_1_2_8_3_2","doi-asserted-by":"publisher","DOI":"10.1006\/jpdc.1995.1105"},{"key":"e_1_2_8_4_2","unstructured":"Ramakrishna RauB.Iterative modulo scheduling: An algorithm for software pipelining loops. Proceedings of the International Symposium on Microarchitecture San Jose CA 1994;63\u201374."},{"key":"e_1_2_8_5_2","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007516818651"},{"key":"e_1_2_8_6_2","unstructured":"DjoudiL BarthouD CarribaultP LemuetC AcquavivaJ\u2010T JalbyW.Exploring application performance: A new tool for a static\/dynamic approach. Los Alamos Computer Science Institute Symposium Santa Fe NM 2005."},{"key":"e_1_2_8_7_2","unstructured":"Intel Itanium2 Processor Reference Manual for Software Development and Optimization.http:\/\/download.intel.com\/design\/Itanium2\/manuals\/25111003.pdf[May2004]."},{"key":"e_1_2_8_8_2","unstructured":"SiasJW UengS\u2010Z KentGA SteinerIM NysomEM HwuW\u2010M.Field\u2010testing IMPACT EPIC research results on Itanium 2. Proceedings of the Annual Symposium on Computer Architecture ISCA Munchen Germany 2004."},{"key":"e_1_2_8_9_2","doi-asserted-by":"publisher","DOI":"10.1145\/212094.212131"},{"key":"e_1_2_8_10_2","doi-asserted-by":"crossref","unstructured":"DinizPC RinardMC.Dynamic feedback: An effective technique for adaptive computing. Proceedings of PLDI Las Vegas Nevada 1997;71\u201384.","DOI":"10.1145\/258916.258923"},{"key":"e_1_2_8_11_2","series-title":"Lecture Notes in Computer Science","volume-title":"Languages and Compilers for Performance Computing","author":"Carribault P","year":"2004"}],"container-title":["Concurrency and Computation: Practice and Experience"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/api.wiley.com\/onlinelibrary\/tdm\/v1\/articles\/10.1002%2Fcpe.1337","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/cpe.1337","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,9,12]],"date-time":"2023-09-12T00:20:47Z","timestamp":1694478047000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1002\/cpe.1337"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,6,6]]},"references-count":10,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2009,1]]}},"alternative-id":["10.1002\/cpe.1337"],"URL":"http:\/\/dx.doi.org\/10.1002\/cpe.1337","archive":["Portico"],"relation":{},"ISSN":["1532-0626","1532-0634"],"issn-type":[{"value":"1532-0626","type":"print"},{"value":"1532-0634","type":"electronic"}],"subject":[],"published":{"date-parts":[[2008,6,6]]}}}