Abstract
The increasing importance of depending on Convolutional Neural Networks (CNN) in many real-time applications especially for image classifications and Humanoid Robots leads to the search for an optimum solution to accelerate the computational process capabilities for the hardware-based systems. Multiply-Accumulate (MAC) is the most computational demanding unit in any CNN architectures. In this paper, three optimized 2D MAC hardware-based architecture units have been designed using VHDL and synthesized for the operation on the FPGA platform due to its parallelism-architecture support feature. The logic utilization, power dissipation, and timing analyze of the three proposed 2D MAC have been made using Quartus ii tools and showed that the 3rd MAC design can achieve a 18.34 Giga Operation per Second (GOPS) while keeping the core dynamic thermal power dissipation level at 303.67 mW.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Véstias, M., Neto, H.: Trends of CPU, GPU and FPGA for high-performance computing. In: 2014 24th International Conference on Field Programmable Logic and Applications (FPL), Munich, Germany. IEEE (2014)
Moini, S., Alizadeh, B., Emad, M., Ebrahimpour, R.: A resource-limited hardware accelerator for convolutional neural networks in embedded vision applications. IEEE Trans. Circuits Syst. II Express. Briefs 64(10), 1217–1221 (2017)
Ahmed, H.O., Ghoneima, M., Dessouky, M.: Concurrent MAC unit design using VHDL for deep learning networks on FPGA. Presented at the IEEE Symposium on Computer Applications & Industrial Electronics (ISCAIE 2018), Penang Island, Malaysia (2018, in Press)
Wang, H., Shao, M., Liu, Y., Zhao, W.: Enhanced efficiency 3D convolution based on optimal FPGA accelerator. IEEE Access 5, 6909–6916 (2017)
Saravanan, R., Balaji, P., Prabu, R.: Design of 16-bit floating point multiply and accumulate unit. IJMTES Int. J. Mod. Trends Eng. Sci. 03(01) (2015)
Shaikh, T., Beleri, M.: FPGA implementation of multiply accumulate (MAC) unit based on block enable technique. Int. J. Innov. Res. Comput. Commun. Eng. 3(4) (2015)
Ashwini, N., Rao, T.K., Rao, D.S.: Low power multiply accumulate unit (MAC) for DSP applications. Int. J. Res. Stud. Sci. Eng. Technol. IJRSSET 2(8), 49–54 (2015)
Nain, P., Virdi, G.S.: Multiplier-accumulator (MAC) unit. Int. J. Digit. Appl. Contemp. Res. 5(3) (2016)
SaiKumar, M., Kumar, D.A., Samundiswary, P.: Design and performance analysis of multiply-accumulate (MAC) unit. Presented at the International Conference on Circuit, Power and Computing Technologies, ICCPCT, Nagercoil, India (2014)
Duarte, R.P., Véstias, M., de Sousa, J.T., Neto, H.: Parallel dot-products for deep learning on FPGA. Presented at the 2017 27th International Conference on Field Programmable Logic and Applications (FPL), Ghent, Belgium, September 2017
Taylor, G., Lacey, G., Areibi, S.: Deep learning on FPGAs: past, present, and future, 13 February 2016
Dettmers, T.: 8-bit approximations for parallelism in deep learning. Presented at the ICLR 2016, San Juan, Puerto Rico, 2–4 May 2016 (2016)
Wu, E., Fu, Y., Sirasao, A., Attia, S., Khan, K., Wittig, R.: Deep learning with INT8 optimization on Xilinx devices. In: UltraScale and UltraScale+FPGAs, vol. v1.0.1, no. WP486, 24 April 2017
Gysel, P., Motamedi, M., Ghiasi, S.: Hardware-oriented approximation of convolutional neural networks. Presented at the ICLR, San Juan, Puerto Rico (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Ahmed, H.O., Ghoneima, M., Dessouky, M. (2019). High-Speed 2D Parallel MAC Unit Hardware Accelerator for Convolutional Neural Network. In: Arai, K., Kapoor, S., Bhatia, R. (eds) Intelligent Systems and Applications. IntelliSys 2018. Advances in Intelligent Systems and Computing, vol 868. Springer, Cham. https://doi.org/10.1007/978-3-030-01054-6_47
Download citation
DOI: https://doi.org/10.1007/978-3-030-01054-6_47
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01053-9
Online ISBN: 978-3-030-01054-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)