MERLOT: Multimodal Neural Script Knowledge Models

Feb-11-2026, 02:47:50 GMT–Neural Information Processing Systems

By pretraining with a mix of frame-level (spatial) and video-level (temporal) objectives, our model learns not only to match images to temporally corresponding words, but also to contextualize what is happening globally over time.

artificial intelligence, machine learning, natural language

Country:
- North America > United States
  - Hawaii > Honolulu County
    - Honolulu (0.04)
  - California > San Francisco County
    - San Francisco (0.14)

Industry:
- Education (0.47)
- Consumer Products & Services > Food, Beverage, Tobacco & Cannabis
  - Beverages (0.43)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Machine Learning (1.00)
