Dissertation/Thesis

Optimal Learning Rates for Neural Networks

Bibliographic Details
Title: Optimal Learning Rates for Neural Networks
Authors: Moncur, Tyler
Source: Theses and Dissertations.
Publication Information: BYU ScholarsArchive.
Collection: Brigham Young University
Subject Terms: neural network, learning rate, crelu, Physical Sciences and Mathematics
Description: Neural networks have long been known as universal function approximators and have more recently been shown to be powerful and versatile in practice. But it can be extremely challenging to find the right set of parameters and hyperparameters. Model training is both expensive and difficult due to the large number of parameters and sensitivity to hyperparameters such as learning rate and architecture. Hyperparameter searches are notorious for requiring tremendous amounts of processing power and human resources. This thesis provides an analytic approach to estimating the optimal value of one of the key hyperparameters in neural networks, the learning rate. Where possible, the analysis is computed exactly, and where necessary, approximations and assumptions are used and justified. The result is a method that estimates the optimal learning rate for a certain type of network, a fully connected CReLU network.
Original Identifier: oai:scholarsarchive.byu.edu:etd-9662
Document Type: Text
File Description: application/pdf
Availability: https://scholarsarchive.byu.edu/etd/8662
https://scholarsarchive.byu.edu/cgi/viewcontent.cgi?article=9662&context=etd
Rights: URL: https://lib.byu.edu/about/copyright/
Accession Number: edsndl.BGMYU2.oai.scholarsarchive.byu.edu.etd.9662
Database: Networked Digital Library of Theses & Dissertations