Learnings from Scaling Visual Tokenizers for Reconstruction and Generation