Appendix (LAION-5B: An open large-scale dataset for training next generation image-text models) A Datasheet for LAION-5B dataset A.1 Motivation Q1
–Neural Information Processing Systems
For what purpose was the dataset created? Was there a specific task in mind? YFCC with 100 million image/videos and associated metadata. Who created the dataset (e.g., which team, research group) and on behalf of which Who funded the creation of the dataset? This work was sponsored by Hugging Face and Stability AI. What do the instances that comprise the dataset represent (e.g., documents, photos, Are there multiple types of instances (e.g., movies, users, and ratings; We provide 5.8 billion image-text pairs.
Neural Information Processing Systems
Feb-11-2026, 02:26:53 GMT
- Genre:
- Research Report (0.46)
- Industry:
- Information Technology > Security & Privacy (0.93)
- Law (1.00)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning > Neural Networks
- Deep Learning (0.48)
- Natural Language (1.00)
- Vision (1.00)
- Machine Learning > Neural Networks
- Communications (1.00)
- Security & Privacy (0.93)
- Sensing and Signal Processing > Image Processing (0.93)
- Artificial Intelligence
- Information Technology