Treating Artificial Neural Net Training as a Nonsmooth Global Optimization Problem

Griewank, Andreas; Rojas, Ángel

doi:10.1007/978-3-030-37599-7_64

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 11943)

Included in the following conference series:

International Conference on Machine Learning, Optimization, and Data Science

Abstract

We attack the classical neural network training problem by successive piecewise linearization, applying three different methods for the global optimization of the local piecewise linear models. The methods are compared with each other, and with steepest descent and stochastic gradient, on the regression problem for the Griewank function.
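As a rough illustration of the test setting named in the abstract, the sketch below builds a regression data set from the Griewank function and evaluates a least-squares loss for a small network. It is a minimal sketch under stated assumptions, not the authors' setup: the standard textbook form of the Griewank function is used, and the sampling box, the sample count, the single hidden layer, the ReLU activation, and all function names (`griewank`, `make_regression_data`, `relu_net`, `mse_loss`) are hypothetical choices made for illustration only.

```python
import math
import random


def griewank(x):
    """Standard Griewank test function; its global minimum is 0 at the origin."""
    quad = sum(xi * xi for xi in x) / 4000.0
    osc = math.prod(math.cos(xi / math.sqrt(i)) for i, xi in enumerate(x, start=1))
    return 1.0 + quad - osc


def make_regression_data(n_samples, dim, low=-10.0, high=10.0, seed=0):
    """Sample inputs uniformly from a box (an assumed domain) and label them."""
    rng = random.Random(seed)
    inputs = [[rng.uniform(low, high) for _ in range(dim)] for _ in range(n_samples)]
    targets = [griewank(x) for x in inputs]
    return inputs, targets


def relu_net(x, W, b, v, c):
    """One hidden layer with ReLU units (assumed activation).

    The output is piecewise linear in x, and the kinks of max(0, .) are what
    make the resulting training loss nonsmooth in the weights.
    """
    hidden = [
        max(0.0, sum(wij * xj for wij, xj in zip(wi, x)) + bi)
        for wi, bi in zip(W, b)
    ]
    return sum(vi * hi for vi, hi in zip(v, hidden)) + c


def mse_loss(model, inputs, targets):
    """Mean squared error of a callable model on the data set."""
    return sum((model(x) - y) ** 2 for x, y in zip(inputs, targets)) / len(inputs)


if __name__ == "__main__":
    X, y = make_regression_data(n_samples=50, dim=2)
    # Two hidden units with hand-picked weights, purely for demonstration.
    W, b, v, c = [[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0], [0.1, 0.1], 0.0
    print(mse_loss(lambda x: relu_net(x, W, b, v, c), X, y))
```

Any optimizer, whether a gradient method or a piecewise linear model solver, would then be judged by how far it reduces `mse_loss` over the network weights.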

Author information

Authors and Affiliations

Research Center on Mathematical Modelling (MODEMAT), Escuela Politécnica Nacional, Quito, Ecuador
Andreas Griewank
School of Mathematical and Computational Sciences, Yachay Tech, Urcuquí, Ecuador
Ángel Rojas

Corresponding author

Correspondence to Andreas Griewank.

Editor information

Editors and Affiliations

University of Cambridge, Cambridge, UK
Giuseppe Nicosia
University of Florida, Gainesville, FL, USA
Panos Pardalos
Harvard University, Cambridge, MA, USA
Renato Umeton
Università di Catania, Catania, Catania, Italy
Giovanni Giuffrida
Almawave, Rome, Roma, Italy
Vincenzo Sciacca

Cite this paper

Griewank, A., Rojas, Á. (2019). Treating Artificial Neural Net Training as a Nonsmooth Global Optimization Problem. In: Nicosia, G., Pardalos, P., Umeton, R., Giuffrida, G., Sciacca, V. (eds) Machine Learning, Optimization, and Data Science. LOD 2019. Lecture Notes in Computer Science, vol 11943. Springer, Cham. https://doi.org/10.1007/978-3-030-37599-7_64

DOI: https://doi.org/10.1007/978-3-030-37599-7_64
Published: 03 January 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37598-0
Online ISBN: 978-3-030-37599-7
eBook Packages: Computer Science, Computer Science (R0)
