A variable metric proximal stochastic gradient method: an application to classification problems

Abstract

Due to the continued success of machine learning and deep learning in particular, supervised classification problems are ubiquitous in numerous scientific fields. Training these models typically involves the minimization of the empirical risk over large data sets along with a possibly non-differentiable regularization. In this paper, we introduce a stochastic gradient method for the considered classification problem. To control the variance of the objective's gradients, we use an automatic sample size selection along with a variable metric to precondition the stochastic gradient directions. Further, we utilize a non-monotone line search to automatize step size selection. Convergence results are provided for both convex and non-convex objective functions. Extensive numerical experiments verify that the suggested approach performs on par with state-of-the-art methods for training both statistical models for binary classification and artificial neural networks for multi-class image classification.
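As a rough illustration of the idea in the abstract, the sketch below performs one proximal stochastic gradient step preconditioned by a variable metric. It is not the implementation in this repository: the diagonal metric, the l1 regularizer, and the names `soft_threshold` and `prox_sg_step` are assumptions chosen to keep the example short, and the sample size selection and non-monotone line search are omitted.

```python
import math


def soft_threshold(z, tau):
    """Proximal operator of tau * |.| applied to a scalar z."""
    return math.copysign(max(abs(z) - tau, 0.0), z)


def prox_sg_step(x, grad, metric_diag, step, lam):
    """One scaled proximal gradient step for an l1-regularized objective.

    x, grad and metric_diag are equal-length lists of floats.
    metric_diag holds the positive diagonal entries d_i of the metric
    (assumed diagonal here), grad is a mini-batch gradient estimate,
    step is the step size and lam the l1 regularization weight.
    """
    new_x = []
    for xi, gi, di in zip(x, grad, metric_diag):
        # Preconditioned forward step: x_i - step * g_i / d_i
        zi = xi - step * gi / di
        # Proximal step in the metric: soft-thresholding with
        # the coordinate-wise threshold step * lam / d_i
        new_x.append(soft_threshold(zi, step * lam / di))
    return new_x


if __name__ == "__main__":
    x_next = prox_sg_step(
        x=[1.0, -1.0, 0.1],
        grad=[0.5, -0.5, 0.0],
        metric_diag=[1.0, 2.0, 1.0],
        step=1.0,
        lam=0.2,
    )
    print(x_next)
```

With a diagonal metric the scaled proximal operator of the l1 norm decouples across coordinates, which is why each entry only needs a soft-thresholding with its own threshold.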

Non-convex multi-class classification

Credits

dataset_TinyImageNet.py contains the auxiliary functions to load the dataset. It's a modified version of this loader.
AdaBelief.py contains the implementation of the AdaBelief optimizer available here.
adam.py contains an implementation of the Adam optimizer in PyTorch.

How to download Tiny ImageNet

Dataset on Paperswithcode

mkdir data && cd data
wget http://cs231n.stanford.edu/tiny-imagenet-200.zip
unzip -q tiny-imagenet-200.zip
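
After extraction, a quick sanity check of the folder can catch an incomplete download. The sketch below is not part of this repository: the function name `summarize_tiny_imagenet` is hypothetical, and it assumes the archive unpacks to a `tiny-imagenet-200` folder containing `wnids.txt` (one class id per line) together with `train` and `val` subfolders.

```python
from pathlib import Path


def summarize_tiny_imagenet(root):
    """Report basic facts about an extracted Tiny ImageNet folder.

    Assumes root contains wnids.txt plus train/ and val/ subfolders,
    with one train/<class id> directory per class.
    """
    root = Path(root)
    lines = (root / "wnids.txt").read_text().splitlines()
    wnids = [line.strip() for line in lines if line.strip()]
    train_dir = root / "train"
    # Class ids listed in wnids.txt that have no training folder
    missing = [w for w in wnids if not (train_dir / w).is_dir()]
    return {
        "num_classes": len(wnids),
        "missing_train_dirs": missing,
        "has_val": (root / "val").is_dir(),
    }
```

Run from the directory that contains `data`, a call such as `summarize_tiny_imagenet("data/tiny-imagenet-200")` should report an empty `missing_train_dirs` list if the extraction completed.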

Citing

Please consider citing this work if you find it helpful.

@article{cascarano2024variable,
  title={A variable metric proximal stochastic gradient method: an application to classification problems},
  author={Cascarano, Pasquale and Franchini, Giorgia and Kobler, Erich and Porta, Federica and Sebastiani, Andrea},
  journal={EURO Journal on Computational Optimization},
  volume={},
  pages={},
  year={2024},
  publisher={Elsevier}
}
