Skip to content

Mohammad8921/UniformStabilityOfAdam

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

48 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Uniform Stability of Adam Algorithm and its Effect on Generalization Error of Deep Neural Networks

Info

This repo contains all of the codes of my theoretical thesis in MSc supervised by Dr. Amin Gheibi, conducted at Amirkabir University of Technology (Tehran Polytechnic).

Abstract

  • We theoretically connected the Lipschitz constant and maximum value of a loss function to the generalization error of deep learning models trained by the Adam and AdamW optimizers under the uniform stability theory.
  • Using the theoretical results, we proposed a novel loss function for training deep classification models to improve the generalization performance and overcome the over-fitting issue.
  • We assessed our theorems in human age estimation based on face images.
  • We trained deep neural networks using our new loss function in the image and node classification problems in order to stabilize the output models and increase their accuracy.
  • Requirements:
    • Python 3.7.8
    • SciPy 1.7.3
    • OpenCV 4.5.5
    • PyTorch 1.11
    • CUDA 11.3
    • Torchvision 0.12
    • PyTorch Geometric 2.1

Paper