You can't train fully connected deep models with backprop, or at least not easil...

mark_l_watson · on Nov 24, 2012

That is correct. The problem is that the gradients get smaller and smaller as you back propagate back towards the input layer. So learning on the front part of the net is slow. Hinton has a lot of good material about htis in his Coursera lectures.

wookietrader · on Nov 24, 2012

Yes you can.

Check out the publications by Ciresan on MNIST, have a look at Hinton's dropout paper or at the Kaggle competition that used deep nets. Or try it yourself and spend a descent amount of time on hyper parameter tuning. :)

iskander · on Nov 25, 2012

Which of Ciresan's projects are you referring to? Everything I've seen by him uses convolutional layers of some sort.