ICML 2015

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Ioffe, Szegedy

TL;DR
Normalize each layer's activations over the mini-batch, then rescale with learned parameters. Dramatically speeds up training and reduces sensitivity to initialization.

What it says

The authors argue that the distribution of each layer’s inputs shifts during training (“internal covariate shift”) and slows down learning. Their fix: for each mini-batch, normalize each feature to zero mean and unit variance, then apply a learned scale and shift. At inference time, running averages from training are used. They show faster convergence, higher learning rates, and a mild regularization effect on ImageNet and other benchmarks.
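The procedure described above can be sketched in a few lines of NumPy. This is an illustrative forward pass only (no backward pass), with hypothetical names; `gamma` and `beta` are the learned scale and shift, and the running averages are what inference falls back on.

```python
import numpy as np

def batchnorm_forward(x, gamma, beta, running_mean, running_var,
                      training=True, momentum=0.1, eps=1e-5):
    """Sketch of batch norm over a (N, D) mini-batch: normalize each
    feature to zero mean / unit variance, then scale and shift."""
    if training:
        # Per-feature statistics computed over the mini-batch (axis 0).
        mu = x.mean(axis=0)
        var = x.var(axis=0)
        # Running averages accumulated for use at inference time.
        running_mean = (1 - momentum) * running_mean + momentum * mu
        running_var = (1 - momentum) * running_var + momentum * var
    else:
        # Inference: reuse the statistics collected during training.
        mu, var = running_mean, running_var
    x_hat = (x - mu) / np.sqrt(var + eps)  # normalize each feature
    return gamma * x_hat + beta, running_mean, running_var
```

With `gamma = 1` and `beta = 0`, the training-mode output of each feature has (near-)zero mean and unit variance over the batch; the learned parameters let the network undo the normalization if that helps.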

Why it matters

BatchNorm became standard in CNNs for years and remains the default in most CV architectures. The “internal covariate shift” framing has since been challenged (the real mechanism is closer to smoothing the loss landscape), but the empirical result held. It also inspired the whole normalization family — LayerNorm, GroupNorm, InstanceNorm, RMSNorm — that keeps modern transformers trainable.

  • Layer Normalization (Ba et al., 2016) — the variant that powers transformers.
  • Group Normalization (Wu & He, 2018) — stable at small batch sizes.
  • How Does Batch Normalization Help Optimization? (Santurkar et al., 2018) — the loss-landscape explanation.
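The variants above differ mainly in which axis the statistics are taken over. A minimal sketch of that distinction (learned scale and shift omitted for brevity):

```python
import numpy as np

def batch_norm(x, eps=1e-5):
    # BatchNorm: statistics per feature, computed across the batch (axis 0).
    return (x - x.mean(axis=0)) / np.sqrt(x.var(axis=0) + eps)

def layer_norm(x, eps=1e-5):
    # LayerNorm: statistics per example, computed across the features
    # (axis 1), so it needs no batch statistics at all.
    mu = x.mean(axis=1, keepdims=True)
    var = x.var(axis=1, keepdims=True)
    return (x - mu) / np.sqrt(var + eps)
```

Because LayerNorm's statistics are per-example, it behaves identically at any batch size, which is one reason it displaced BatchNorm in transformers.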