Entropy and type-token ratio in gigaword corpora