Reading Ian Goodfellow's new deep learning book and can't figure out how to derive a conditional probability. Can someone help? • /r/MachineLearning
It's a constant that you use to normalize, right? And what comes after the normalizing constant in the equation is a vector, right? The authors write Z' so you know the vector gets renormalized every time. You don't calculate a constant once at the start of training and reuse it on every later calculation, because the vector stops being normalized as it changes.
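Here's a rough sketch of what I mean, not the book's actual derivation. I'm assuming the "vector" is a list of unnormalized, non-negative scores and that the normalizing constant is just their sum. The names `normalize`, `p_tilde`, and `z` are made up for illustration.

```python
def normalize(unnormalized):
    """Divide each entry by the sum so the result sums to 1.

    Returns the normalized list and the constant that was used.
    Assumes the entries are non-negative scores (an illustrative assumption).
    """
    z = sum(unnormalized)  # normalizing constant, recomputed for this vector
    if z <= 0:
        raise ValueError("normalizing constant must be positive")
    return [p / z for p in unnormalized], z


# Normalize once: the constant matches this particular vector.
p_tilde = [1.0, 3.0]
probs, z = normalize(p_tilde)

# The vector changes (e.g. after a parameter update).
p_tilde_new = [2.0, 6.0]

# Reusing the old constant leaves a vector that no longer sums to 1.
stale = [p / z for p in p_tilde_new]

# Recomputing the constant (the Z' idea) restores normalization.
fresh, z_new = normalize(p_tilde_new)
```

The point is that `stale` is not a valid distribution while `fresh` is, which is why the constant has to be recomputed whenever the thing being normalized changes.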
Apr-9-2016, 23:35:28 GMT