Addestramento con Dataset Sbilanciati
–arXiv.org Artificial Intelligence
The following document pursues the objective of comparing some useful methods to balance a dataset and obtain a trained model. The dataset used for training is made up of short and medium length sentences, such as simple phrases or extracts from conversations that took place on web channels. The training of the models will take place with the help of the structures made available by the Apache Spark framework, the models may subsequently be useful for a possible implementation of a solution capable of classifying sentences using the distributed environment, as described in "New frontier of textual classification: Big data and distributed calculation" by Massimiliano Morrelli et al.
arXiv.org Artificial Intelligence
Aug-18-2020
- Country:
- Asia > Middle East
- Saudi Arabia > Ḥaʼil Province > Ha'il (0.04)
- Europe
- Italy > Basilicata
- Potenza Province > Potenza (0.04)
- Latvia > Riga Municipality
- Riga (0.05)
- Italy > Basilicata
- Asia > Middle East
- Genre:
- Research Report (0.41)
- Technology: