Addestramento con Dataset Sbilanciati

Aug-18-2020–arXiv.org Artificial Intelligence

The following document pursues the objective of comparing some useful methods to balance a dataset and obtain a trained model. The dataset used for training is made up of short and medium length sentences, such as simple phrases or extracts from conversations that took place on web channels. The training of the models will take place with the help of the structures made available by the Apache Spark framework, the models may subsequently be useful for a possible implementation of a solution capable of classifying sentences using the distributed environment, as described in "New frontier of textual classification: Big data and distributed calculation" by Massimiliano Morrelli et al.

artificial intelligence, data mining, machine learning, (19 more...)

arXiv.org Artificial Intelligence

Aug-18-2020

arXiv.org PDF

Add feedback

Country:
- Europe
  - Latvia > Riga Municipality
    - Riga (0.05)
  - Italy > Basilicata
    - Potenza Province > Potenza (0.04)
- Asia > Middle East
  - Saudi Arabia > Ḥaʼil Province > Ha'il (0.04)

Genre:
- Research Report (0.41)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (0.35)
  - Artificial Intelligence > Machine Learning
    - Performance Analysis > Accuracy (0.48)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found