Is BERT Always the Better Cheaper Faster Answer in NLP? Apparently Not.