What is the total number of parameters, including bias, in a single 1x1 convolution filter when the input image is of size 64x64x16?
Your model is underperforming during training. The last hidden layer outputs very small values. What do you do?
Which of the following techniques can be used to address the issue of exploding gradients in deep learning models?
Feature maps at the beginning of a CNN, as opposed to those towards the end of the network:
The training phase shows no improvement in the model after:
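
As a worked check for the 1x1 convolution question above, here is a minimal sketch of the parameter arithmetic. It assumes the standard convention that a filter spans every input channel and carries a single bias term. The helper name `conv_filter_params` is hypothetical and only for illustration.

```python
def conv_filter_params(kernel_h, kernel_w, in_channels, bias=True):
    """Count the parameters in ONE convolution filter.

    Assumes the filter spans all input channels: one weight per kernel
    position per input channel, plus one optional bias term.
    """
    weights = kernel_h * kernel_w * in_channels
    return weights + (1 if bias else 0)


# For a 64x64x16 input, the 64x64 spatial size does not enter the count;
# only the 16 input channels do.
n_params = conv_filter_params(1, 1, 16)
print(n_params)  # 1*1*16 weights + 1 bias = 17
```

The spatial dimensions affect the size of the output feature map, not the number of learned parameters, because the same filter weights are reused at every position.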

