PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word Tokenization on Downstream Applications