r/MachineLearning - [D] Text classification on a small dataset

May-14-2018, 19:38:30 GMT–@machinelearnbot

I am trying to perform multiclass text classification (24 classes) on a set of documents, but I currently have a very small dataset (1,200 examples in total). The data collection process is a bit tedious in my case, hence the small dataset size. The best result I have achieved so far is 58% accuracy with an SVM model and a single-layer CNN model. Is there any other approach I can try besides collecting more data? I have tried oversampling the training set, but it didn't seem to improve performance.
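To put the numbers in the post in context, here is a rough sketch of the arithmetic, not anything the poster ran. It assumes the 24 classes are roughly balanced (the post does not say, and the oversampling attempt suggests they may not be) and a hypothetical 20% held-out test split. The function name `accuracy_context` is made up for illustration.

```python
import math

def accuracy_context(n_examples, n_classes, accuracy, test_fraction):
    """Put a reported accuracy in context for a small multiclass dataset."""
    # Average number of examples available per class
    per_class = n_examples / n_classes
    # Accuracy of uniform random guessing (assumes balanced classes)
    chance = 1.0 / n_classes
    # Size of the held-out set (the split fraction is a hypothetical choice)
    n_test = round(n_examples * test_fraction)
    # Binomial standard error of an accuracy measured on n_test examples
    std_err = math.sqrt(accuracy * (1.0 - accuracy) / n_test)
    return per_class, chance, n_test, std_err

per_class, chance, n_test, std_err = accuracy_context(1200, 24, 0.58, 0.2)
print(f"avg examples per class: {per_class:.0f}")
print(f"chance accuracy:        {chance:.3f}")
print(f"test examples:          {n_test}")
print(f"std. error of accuracy: {std_err:.3f}")
```

Under these assumptions there are about 50 examples per class, and 58% is far above the roughly 4% chance level. The standard error on a test set this small is about three percentage points, so small differences between models or between runs with and without oversampling may just be noise. Cross-validation would likely give a steadier estimate.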

machine learning, natural language, text classification


