Semantic similarity estimation for domain specific data using BERT and other techniques