MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens

Oct-10-2025, 00:27:22 GMT–Neural Information Processing Systems

Multimodal interleaved datasets featuring free-form interleaved sequences of images and text are crucial for training frontier large multimodal models (LMMs).

corpusid, dataset, semanticscholar, (17 more...)

Neural Information Processing Systems

Oct-10-2025, 00:27:22 GMT

Conferences PDF

Country:
- Europe > Monaco (0.04)
- North America > United States
  - Texas > Travis County
    - Austin (0.04)
  - California > Alameda County
    - Berkeley (0.04)
- Asia
  - Middle East > Jordan (0.04)
  - Japan > Honshū
    - Chūbu > Toyama Prefecture > Toyama (0.04)

Industry:
- Law (1.00)
- Information Technology (1.00)
- Government (0.67)

Technology:
- Information Technology
  - Data Science (1.00)
  - Communications (1.00)
  - Artificial Intelligence
    - Natural Language
      - Large Language Model (1.00)
      - Chatbot (0.93)
    - Machine Learning > Neural Networks
      - Deep Learning (0.68)

Duplicate Docs Excel Report

Title
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens

Similar Docs Excel Report more

Title	Similarity	Source
None found