When it comes to AI, can we ditch the datasets?

Mar-15-2022, 21:38:19 GMT–#artificialintelligence

Huge amounts of data are needed to train machine-learning models to perform image classification tasks, such as identifying damage in satellite photos following a natural disaster. However, these data are not always easy to come by. Datasets may cost millions of dollars to generate, if usable data exist in the first place, and even the best datasets often contain biases that negatively impact a model's performance. To circumvent some of the problems presented by datasets, MIT researchers developed a method for training a machine learning model that, rather than using a dataset, uses a special type of machine-learning model to generate extremely realistic synthetic data that can train another model for downstream vision tasks. Their results show that a contrastive representation learning model trained using only these synthetic data is able to learn visual representations that rival or even outperform those learned from real data.

dataset, generative model, machine-learning model, (11 more...)

#artificialintelligence

Mar-15-2022, 21:38:19 GMT

News Web Page

Add feedback

Country:
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.40)

Genre:
- Research Report > New Finding (0.36)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Vision > Image Understanding (0.62)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found