1. SGDR

This is a PyTorch implementation of training a model (ResNet-50) using a differential learning rate. The optimizer is Stochastic Gradient Descent with Warm Restarts (SGDR), which uses cosine annealing to decrease the learning rate along half a cosine curve, then restarts it at the top of the curve. Cycling the learning rate in this way helps the network escape sharp minima and settle into flatter, more robust ones.
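The restart schedule described above can be sketched as a plain function of the training step. This is a minimal illustration of the SGDR formula, not code from this repo; the hyperparameter values (`eta_max`, `eta_min`, `t_0`, `t_mult`) are arbitrary examples.

```python
import math

def sgdr_lr(step, eta_max=0.1, eta_min=0.0, t_0=10, t_mult=2):
    """Learning rate at `step` under SGDR cosine annealing with warm restarts.

    eta_max/eta_min bound the cosine curve; t_0 is the length of the first
    cycle and t_mult grows each subsequent cycle. All values here are
    illustrative, not the hyperparameters used in this repo.
    """
    t_i = t_0
    t_cur = step
    # Skip past completed cycles until `step` falls inside the current one.
    while t_cur >= t_i:
        t_cur -= t_i
        t_i *= t_mult
    # Half-cosine decay from eta_max down to eta_min within the cycle.
    return eta_min + 0.5 * (eta_max - eta_min) * (1 + math.cos(math.pi * t_cur / t_i))
```

At `step=0` the rate starts at `eta_max`, decays toward `eta_min` over the cycle, then jumps back to `eta_max` at each restart. In practice, PyTorch provides this schedule built in as `torch.optim.lr_scheduler.CosineAnnealingWarmRestarts`.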