iBet uBet web content aggregator. Adding the entire web to your favor.
iBet uBet web content aggregator. Adding the entire web to your favor.



Link to original content: https://doi.org/10.1007/978-3-030-79025-7_15
Transparent Near-Memory Computing with a Reconfigurable Processor | SpringerLink
Skip to main content

Transparent Near-Memory Computing with a Reconfigurable Processor

  • Conference paper
  • First Online:
Applied Reconfigurable Computing. Architectures, Tools, and Applications (ARC 2021)

Abstract

Data intensive applications like machine learning or big data analysis have stressed the requirements on memory subsystems. They involve computational kernels whose performance is not limited by the algorithmic complexity, but by the large amount of data they need to process. To counteract the growing gap between computing power and memory bandwidth, near-memory processing techniques have been addressed to improve the performance in such applications significantly. In this paper, we leverage a general purpose processor extended with a reconfigurable framework to execute hardware-accelerated instructions. This framework features a high-bandwidth memory interface to the nearest memory controller, allowing for greatly increased bandwidth compared to the standard system bus. We introduce region-based data processing, which allows to trigger operations by merely storing data and is especially suitable for large many-core designs. We show two different approaches to trigger the architecture for near-memory operations, one using interrupts for software-assisted processing and one directly interfacing with the hardware accelerator. Our evaluations show a performance gain of 72% on SAD kernels, with memory performance improved by 48%. Benchmarking AES encryption, we can show a speed up of 70%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Corda, S., Veenboer, B., Awan, A.J., Kumar, A., Jordans, R., Corporaal, H.: Near memory acceleration on high resolution radio astronomy imaging. In: 2020 9th Mediterranean Conference on Embedded Computing (MECO), pp. 1–6 (2020)

    Google Scholar 

  2. Damschen, M., Rapp, M., Bauer, L., Henkel, J.: i-Core: a runtime-reconfigurable processor platform for cyber-physical systems (2020)

    Google Scholar 

  3. Grudnitsky, A., Bauer, L., Henkel, J.: COREFAB: concurrent reconfigurable fabric utilization in heterogeneous multi-core systems. In: 2014 International Conference on Compilers, Architecture and Synthesis for Embedded Systems (CASES), pp. 1–10 (2014)

    Google Scholar 

  4. Gu, P., et al.: iPIM: programmable in-memory image processing accelerator using near-bank architecture. In: 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA), pp. 804–817 (2020)

    Google Scholar 

  5. Harbaum, T., et al.: Auto-Si: an adaptive reconfigurable processor with run-time loop detection and acceleration. In: 2017 30th IEEE International System-on-Chip Conference (SOCC), pp. 153–158 (2017)

    Google Scholar 

  6. Koo, G., et al.: Summarizer: trading communication with computing near storage. In: 2017 50th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), pp. 219–231 (2017)

    Google Scholar 

  7. University of Michigan: MiBench Version 1.0 (2001). https://vhosts.eecs.umich.edu/mibench/

  8. Wong, S., Vassiliadis, S., Cotofana, S.: A sum of absolute differences implementation in FPGA hardware. In: Proceedings 28th Euromicro Conference, pp. 183–188 (2002)

    Google Scholar 

Download references

Acknowledgement

This work was supported by the German Research Foundation (DFG) as part of the Transregional Collaborative Research Center Invasive Computing [SFB/TR 89].

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fabian Lesniak .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Lesniak, F., Kreß, F., Becker, J. (2021). Transparent Near-Memory Computing with a Reconfigurable Processor. In: Derrien, S., Hannig, F., Diniz, P.C., Chillet, D. (eds) Applied Reconfigurable Computing. Architectures, Tools, and Applications. ARC 2021. Lecture Notes in Computer Science(), vol 12700. Springer, Cham. https://doi.org/10.1007/978-3-030-79025-7_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-79025-7_15

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-79024-0

  • Online ISBN: 978-3-030-79025-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics