A description of the algorithms is available here.
This repository contains my naive Python implementations, written to help me understand the different methods.
A related talk is available here.
SGD is like GD, but each step uses only a "partial" gradient. This partial gradient is obtained by randomly sampling examples (or subsets of them, as in bagging), which also makes it possible to parallelize the gradient computation. The sketch below contrasts the two.
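The following is a minimal sketch, not taken from this repository: a toy least-squares problem, with full-batch GD computing the gradient over the whole dataset and SGD using a single randomly sampled example per step. All names and data here are made up for illustration.

```python
# Hypothetical example: GD vs SGD on f(w) = 1/(2n) * ||X w - y||^2.
import numpy as np

rng = np.random.default_rng(0)
n, d = 200, 5
X = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
y = X @ w_true + 0.1 * rng.normal(size=n)

def full_gradient(w):
    # Gradient over the whole dataset (what GD uses at every step).
    return X.T @ (X @ w - y) / n

def stochastic_gradient(w):
    # "Partial" gradient: a single randomly sampled example.
    i = rng.integers(n)
    return X[i] * (X[i] @ w - y[i])

def gd(steps=100, lr=0.1):
    w = np.zeros(d)
    for _ in range(steps):
        w -= lr * full_gradient(w)
    return w

def sgd(steps=2000, lr=0.05):
    w = np.zeros(d)
    for _ in range(steps):
        w -= lr * stochastic_gradient(w)
    return w

print("GD  error:", np.linalg.norm(gd() - w_true))
print("SGD error:", np.linalg.norm(sgd() - w_true))
```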
SGD is more sensitive to the step size than GD. Convergence is stable at the start, but the closer the iterates get to the solution, the more they fluctuate around it. In machine learning this trade-off is usually acceptable, since reaching a reasonable solution quickly is what matters most. A decaying step size damps those fluctuations, as the sketch below shows.
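Another small sketch on an assumed least-squares setup (not the repository's code): with a constant step, SGD stalls at a noisy plateau near the solution, while a decaying step (here 1/sqrt(t), a common choice) reduces the fluctuations.

```python
# Hypothetical example: constant vs decaying step size in SGD.
import numpy as np

rng = np.random.default_rng(1)
n, d = 500, 5
X = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
y = X @ w_true + 0.1 * rng.normal(size=n)

def sgd_error(steps=20000, lr0=0.1, decay=False):
    w = np.zeros(d)
    for t in range(1, steps + 1):
        i = rng.integers(n)
        g = X[i] * (X[i] @ w - y[i])              # stochastic gradient
        lr = lr0 / np.sqrt(t) if decay else lr0   # decaying vs constant step
        w -= lr * g
    return np.linalg.norm(w - w_true)

print("constant step, final error:", sgd_error(decay=False))
print("decaying step, final error:", sgd_error(decay=True))
```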
The stochastic gradient is an unbiased estimate of the full gradient with respect to the randomness used for sampling.
The speed of SGD's convergence depends on the level of noise in the stochastic gradients, i.e. on their variance.
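A quick numerical check of both points, again on a hypothetical least-squares setup: averaged over the sampling, mini-batch gradients match the full gradient (unbiasedness), and their variance shrinks as the mini-batch grows.

```python
# Hypothetical example: bias and variance of mini-batch gradients.
import numpy as np

rng = np.random.default_rng(3)
n, d = 500, 5
X = rng.normal(size=(n, d))
y = X @ rng.normal(size=d) + 0.1 * rng.normal(size=n)
w = rng.normal(size=d)                       # an arbitrary query point

full_grad = X.T @ (X @ w - y) / n            # gradient of 1/(2n) * ||X w - y||^2

def minibatch_grad(batch_size):
    idx = rng.choice(n, size=batch_size, replace=False)
    Xb, yb = X[idx], y[idx]
    return Xb.T @ (Xb @ w - yb) / batch_size

for b in (1, 8, 64):
    grads = np.stack([minibatch_grad(b) for _ in range(10000)])
    bias = np.linalg.norm(grads.mean(axis=0) - full_grad)
    var = grads.var(axis=0).sum()
    print(f"batch={b:3d}  bias={bias:.4f}  variance={var:.4f}")
```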
Training a neural network by back-propagation is essentially SGD: back-propagation computes the gradient of the loss for a randomly chosen example (or mini-batch), and the SGD update rule applies that gradient to the weights.
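To make the distinction explicit, here is a minimal sketch (hypothetical, not the repository's code) of a one-hidden-layer network where back-propagation produces the gradients and SGD consumes them.

```python
# Hypothetical example: manual back-propagation + SGD on a tiny regression net.
import numpy as np

rng = np.random.default_rng(2)
n, d, h = 500, 3, 16
X = rng.normal(size=(n, d))
y = np.sin(X[:, 0]) + 0.1 * rng.normal(size=n)   # toy regression target

# One hidden layer with tanh activation.
W1 = rng.normal(size=(d, h)) * 0.5
b1 = np.zeros(h)
W2 = rng.normal(size=h) * 0.5
b2 = 0.0

lr = 0.05
for step in range(5000):
    i = rng.integers(n)                    # SGD: one random example per step
    x, t = X[i], y[i]

    # Forward pass.
    z = x @ W1 + b1
    a = np.tanh(z)
    pred = a @ W2 + b2

    # Back-propagation: chain rule, from the loss back to each parameter.
    d_pred = pred - t                      # d(0.5*(pred - t)^2)/d pred
    dW2 = d_pred * a
    db2 = d_pred
    da = d_pred * W2
    dz = da * (1.0 - a ** 2)               # tanh'(z) = 1 - tanh(z)^2
    dW1 = np.outer(x, dz)
    db1 = dz

    # SGD update: move each parameter against its gradient.
    W1 -= lr * dW1; b1 -= lr * db1
    W2 -= lr * dW2; b2 -= lr * db2

pred_all = np.tanh(X @ W1 + b1) @ W2 + b2
print("final mean squared error:", np.mean((pred_all - y) ** 2))
```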