'Why do we use MaxPooling 2x2? Can we use any other size like 3x3 or 5x5? And how to select which pooling to choose in what scenrio?

Greating,

I've searched it everywhere on YouTube, Google and also read some articles and research papers but can't seem to find the exact answer to my questions

I've few questions regarding CONVOLUTIONAL NEURAL NETWORK, I'm confused with this question: why do we use MaxPooling size 2x2 why don't we use any other size like 3x3, 4x4 ... nxn(of course less than the size of input) and can we even use any other than 2x2? And my other question is that: why do we always use MaxPooling most of the times? Does it depend on the images? For example if we have some noisy images then would it be suitable to use MaxPooling or should we use any other type of pooling?

Thank you!



Sources

This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.

Source: Stack Overflow

Solution Source