Revisiting the Information Capacity of Neural Network Watermarks: Upper Bound Estimation and Beyond