AITopics | pre-training method

Pre-training has achieved remarkable success when transferred to downstream tasks. In machine learning, we care about not only the good performance of a model but also its behavior under reasonable shifts of condition.

artificial intelligence, machine learning, optimization problem, (15 more...)

Neural Information Processing Systems

Country:

Asia > China (0.04)
North America > United States > Washington > King County > Seattle (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

Add feedback

Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning

Neural Information Processing SystemsDec-26-2025, 05:26:16 GMT

Unsupervised pre-training methods utilizing large and diverse datasets have achieved tremendous success across a range of domains. Recent work has investigated such unsupervised pre-training methods for model-based reinforcement learning (MBRL) but is limited to domain-specific or simulated data. In this paper, we study the problem of pre-training world models with abundant in-the-wild videos for efficient learning of downstream visual control tasks. However, in-the-wild videos are complicated with various contextual factors, such as intricate backgrounds and textured appearance, which precludes a world model from extracting shared world knowledge to generalize better. To tackle this issue, we introduce Contextualized World Models (ContextWM) that explicitly separate context and dynamics modeling to overcome the complexity and diversity of in-the-wild videos and facilitate knowledge transfer between distinct scenes. Specifically, a contextualized extension of the latent dynamics model is elaborately realized by incorporating a context encoder to retain contextual information and empower the image decoder, which encourages the latent dynamics model to concentrate on essential temporal variations. Our experiments show that in-the-wild video pre-training equipped with ContextWM can significantly improve the sample efficiency of MBRL in various domains, including robotic manipulation, locomotion, and autonomous driving.

in-the-wild video, name change, pre-training contextualized world model, (6 more...)

Neural Information Processing Systems

Industry: Education > Educational Technology > Educational Software > Computer Based Training (0.43)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning

Neural Information Processing SystemsDec-24-2025, 14:52:02 GMT

Recently, vision model pre-training has evolved from relying on manually annotated datasets to leveraging large-scale, web-crawled image-text data. Despite these advances, there is no pre-training method that effectively exploits the interleaved image-text data, which is very prevalent on the Internet. Inspired by the recent success of compression learning in natural language processing, we propose a novel vision model pre-training method called Latent Compression Learning (LCL) for interleaved image-text data. This method performs latent compression learning by maximizing the mutual information between the inputs and outputs of a causal attention model. The training objective can be decomposed into two basic tasks: 1) contrastive learning between visual representation and preceding context, and 2) generating subsequent text based on visual representation. Our experiments demonstrate that our method not only matches the performance of CLIP on paired pre-training datasets (e.g., LAION), but can also leverage interleaved pre-training data (e.g., MMC4) to learn robust visual representations from scratch, showcasing the potential of vision model pre-training with interleaved image-text data.

artificial intelligence, interleaved image-text data, machine learning, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

A Graph Laplacian Eigenvector-based Pre-training Method for Graph Neural Networks

Dai, Howard, Njenga, Nyambura, Madhu, Hiren, Viswanath, Siddharth, Pellico, Ryan, Adelstein, Ian, Krishnaswamy, Smita

arXiv.org Artificial IntelligenceNov-11-2025

The development of self-supervised graph pre-training methods is a crucial ingredient in recent efforts to design robust graph foundation models (GFMs). Structure-based pre-training methods are under-explored yet crucial for downstream applications which rely on underlying graph structure. In addition, pre-training traditional message passing GNNs to capture global and regional structure is often challenging due to the risk of oversmoothing as network depth increases. We address these gaps by proposing the Laplacian Eigenvector Learning Module (LELM), a novel pre-training module for graph neural networks (GNNs) based on predicting the low-frequency eigenvectors of the graph Laplacian. Moreover, LELM introduces a novel architecture that overcomes oversmoothing, allowing the GNN model to learn long-range interdependencies. Empirically, we show that models pre-trained via our framework outperform baseline models on downstream molecular property prediction tasks.

artificial intelligence, eigenvector, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2509.02803

Country: Asia (0.28)

Genre:

Research Report > Experimental Study (1.00)
Instructional Material (0.88)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Filters

Collaborating Authors

pre-training method

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

1e4322fddd833f83c855660ac65e428d-Paper-Conference.pdf

873c86d9a979ab80d8e2919510d4446b-Supplemental-Conference.pdf

Pre-Training Protein Encoder via Siamese Sequence-Structure Diffusion Trajectory Prediction

SimMTM: A Simple Pre-Training Framework for Masked Time-Series Modeling

5e0da5da69b71349ae0bd7ad716e4bc9-Supplemental-Conference.pdf

5e0da5da69b71349ae0bd7ad716e4bc9-Paper-Conference.pdf

Task-Robust Pre-Training for Worst-Case Downstream Adaptation Jianghui Wang, Y ang Chen

Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning

Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning

A Graph Laplacian Eigenvector-based Pre-training Method for Graph Neural Networks