Text Classification using Watson NLP
You can downsample the dataset in the data processing step to reduce the model training time. Some of the product categories have fewer instances compared to others. So, you can drop those categories before training the model. Finally, you can carry out the train-test split using the sampling method on the Pandas dataframe. One crucial step required here is to convert the dataframe into the JSON or CSV format as required by the Watson NLP classification algorithm.
Nov-21-2022, 22:20:05 GMT