In the first session we will look more closely at techniques common to widely used neural network architectures, such as batch normalisation, dropout and stochastic optimisers. We will also touch on regularisation ideas and various activation functions.
The session will be roughly based on the following papers:
1. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
2. Dropout: A Simple Way to Prevent Neural Networks from Overfitting
3. Adam: A Method for Stochastic Optimization
The first session will be given by the organisers, but participants are expected to be familiar with the papers. More information about the reading group can be found at mathsml.com