Is the Number of Trainable Parameters All That Actually Matters?