Natural language processing for clusterization of genes according to their functions