ODIN: On-demand Data Formulation to Mitigate Dataset Lock-in
Choi, SP, Lee, Jihun, Ahn, Hyeongseok, Jung, Sanghee, Kang, Bumsoo
–arXiv.org Artificial Intelligence
ODIN is an innovative approach that addresses the problem of dataset constraints by integrating generative AI models. Traditional zero-shot learning methods are constrained by the training dataset. To fundamentally overcome this limitation, ODIN attempts to mitigate the dataset constraints by generating on-demand datasets based on user requirements. ODIN consists of three main modules: a prompt generator, a text-to-image generator, and an image post-processor. To generate high-quality prompts and images, we adopted a large language model (e.g., ChatGPT), and a text-to-image diffusion model (e.g., Stable Diffusion), respectively. We evaluated ODIN on various datasets in terms of model accuracy and data diversity to demonstrate its potential, and conducted post-experiments for further investigation. Overall, ODIN is a feasible approach that enables Al to learn unseen knowledge beyond the training dataset.
arXiv.org Artificial Intelligence
Mar-16-2023
- Country:
- North America
- United States > New York
- New York County > New York City (0.04)
- Canada > Ontario
- Toronto (0.04)
- United States > New York
- Europe > Netherlands
- North Holland > Amsterdam (0.04)
- North America
- Genre:
- Research Report
- New Finding (0.68)
- Promising Solution (0.48)
- Research Report
- Technology: