[D] Max-over-time pooling vs no max-pooling for text classification? • r/MachineLearning
Kim 2014 and Collobert 2011 argue that max-over-time pooling helps getting the words from a sentence that are most important to the semantics. Then I read a blog post from the Googler Lakshmanan V on text classification. The author argues that spatial invariance isn't wanted because it's important where words are placed in a sentence. Thus he doesn't recommend maxpool. Are there empirical studies that compares the two approaches?
Dec-26-2017, 12:50:16 GMT
- Technology: