AITopics | fine-tuning step

RoMa: ARobust Model Watermarking Scheme for Protecting IP in Diffusion Models

Neural Information Processing SystemsJun-16-2026, 00:47:07 GMT

In this regard, model watermarking is a common practice for IP protection that embeds traceable information within models and allows for further verification. Nevertheless, existing watermarking schemes often face challenges due to their vulnerability to fine-tuning, limiting their practical application in general pretraining and fine-tuning paradigms. Inspired by using mode connectivity to analyze model performance between a pair of connected models, we investigate watermark vulnerability by leveraging Linear Mode Connectivity (LMC) as a proxy to analyze the fine-tuning dynamics of watermark performance. Our results show that existing watermarked models tend to converge to sharp minima in the loss landscape, thus making them vulnerable to fine-tuning. To tackle this challenge, we propose RoMa, a Robust Model watermarking scheme that improves the robustness of watermarks against fine-tuning. Specifically, RoMa decomposes watermarking into two components, including Embedding Functionality, which preserves reliable watermark detection capability, and Path-specific Smoothness, which enhances the smoothness along the watermark-connected path to improve robustness. Extensive experiments on benchmark datasets MS-COCO-2017 and CUB-200-2011 demonstrate that RoMa significantly improves watermark robustness against fine-tuning while maintaining generation quality, outperforming baselines. The code is available at https://github.com/xiekks/RoMa.

machine learning, natural language, roma, (19 more...)

Neural Information Processing Systems

Country:

Europe (0.67)
Asia > China (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
(4 more...)

Add feedback

4b5721f7fcc1672930d860e0dfcfee84-Paper-Conference.pdf

Neural Information Processing SystemsApr-27-2026, 03:02:49 GMT

artificial intelligence, machine learning, molecule, (19 more...)

Neural Information Processing Systems

Genre: Research Report (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Efficient Meta Neural Heuristic for Multi-Objective Combinatorial Optimization (Appendix) A Model architecture The architecture of the base model in meta-learning is the same as POMO [ 26

Neural Information Processing SystemsFeb-16-2026, 15:04:45 GMT

Each sublayer adds a skip-connection (ADD) and batch normalization (BN). The decoder sequentially chooses a node according to a probability distribution produced by the node embeddings to construct a solution. The scaled symmetric sampling method is shown in Algorithm 2. The scaled factor The uniform division of the weight space is illustrated as follows. Thus, its approximate Pareto optimal solutions are commonly pursued. V ehicles must serve all the customers and finally return to the depot.

Add feedback

Efficient Meta Neural Heuristic for Multi-Objective Combinatorial Optimization Jinbiao Chen

Neural Information Processing SystemsFeb-16-2026, 15:04:41 GMT

Recently, neural heuristics based on deep reinforcement learning have exhibited promise in solving multi-objective combinatorial optimization problems (MO-COPs).

artificial intelligence, machine learning, optimization problem, (16 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.04)
Asia > China > Guangdong Province > Guangzhou (0.04)

Genre: Research Report (0.68)

Industry: Transportation (0.96)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.86)

Add feedback

4b5721f7fcc1672930d860e0dfcfee84-Paper-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 11:36:40 GMT

geometry, molecule, wavefunction, (16 more...)

Neural Information Processing Systems

Country: Europe > Austria > Vienna (0.04)

Genre: Research Report (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback

Transformer-Gather, Fuzzy-Reconsider: A Scalable Hybrid Framework for Entity Resolution

Sharifi, Mohammadreza, Ahmadzadeh, Danial

arXiv.org Artificial IntelligenceOct-27-2025

Entity resolution plays a significant role in enterprise systems where data integrity must be rigorously maintained. Traditional methods often struggle with handling noisy data or semantic understanding, while modern methods suffer from computational costs or the excessive need for parallel computation. In this study, we introduce a scalable hybrid framework, which is designed to address several important problems, including scalability, noise robustness, and reliable results. We utilized a pre-trained language model to encode each structured data into corresponding semantic embedding vectors. Subsequently, after retrieving a semantically relevant subset of candidates, we apply a syntactic verification stage using fuzzy string matching techniques to refine classification on the unlabeled data. This approach was applied to a real-world entity resolution task, which exposed a linkage between a central user management database and numerous shared hosting server records. Compared to other methods, this approach exhibits an outstanding performance in terms of both processing time and robustness, making it a reliable solution for a server-side product. Crucially, this efficiency does not compromise results, as the system maintains a high retrieval recall of approximately 0.97. The scalability of the framework makes it deployable on standard CPU-based infrastructure, offering a practical and effective solution for enterprise-level data integrity auditing.

information retrieval, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2509.1747

Country: Asia (0.14)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Add feedback

b1efde53be364a73914f58805a001731-Supplemental-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 05:07:08 GMT

artificial intelligence, machine learning, weight vector, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Add feedback

b1efde53be364a73914f58805a001731-Paper-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 05:07:05 GMT

artificial intelligence, machine learning, optimization problem, (16 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.04)
Asia > China > Guangdong Province > Guangzhou (0.04)

Genre: Research Report (0.68)

Industry: Transportation (0.96)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Model-agnostic Meta-learning for Adaptive Gait Phase and Terrain Geometry Estimation with Wearable Soft Sensors

Zhu, Zenan, Chen, Wenxi, Kao, Pei-Chun, Clark, Janelle, Behnke, Lily, Kramer-Bottiglio, Rebecca, Yanco, Holly, Gu, Yan

arXiv.org Artificial IntelligenceAug-6-2025

This letter presents a model-agnostic meta-learning (MAML) based framework for simultaneous and accurate estimation of human gait phase and terrain geometry using a small set of fabric-based wearable soft sensors, with efficient adaptation to unseen subjects and strong generalization across different subjects and terrains. Compared to rigid alternatives such as inertial measurement units, fabric-based soft sensors improve comfort but introduce nonlinearities due to hysteresis, placement error, and fabric deformation. Moreover, inter-subject and inter-terrain variability, coupled with limited calibration data in real-world deployments, further complicate accurate estimation. To address these challenges, the proposed framework integrates MAML into a deep learning architecture to learn a generalizable model initialization that captures subject- and terrain-invariant structure. This initialization enables efficient adaptation (i.e., adaptation with only a small amount of calibration data and a few fine-tuning steps) to new users, while maintaining strong generalization (i.e., high estimation accuracy across subjects and terrains). Experiments on nine participants walking at various speeds over five terrain conditions demonstrate that the proposed framework outperforms baseline approaches in estimating gait phase, locomotion mode, and incline angle, with superior accuracy, adaptation efficiency, and generalization.

artificial intelligence, estimation, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2508.0293

Country: North America > United States > Maryland (0.68)

Genre:

Research Report > New Finding (0.68)
Research Report > Experimental Study (0.68)

Industry: Health & Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Add feedback

A Generalized Meta Federated Learning Framework with Theoretical Convergence Guarantees

Jamali, Mohammad Vahid, Saber, Hamid, Bae, Jung Hyun

arXiv.org Artificial IntelligenceMay-14-2025

Meta federated learning (FL) is a personalized variant of FL, where multiple agents collaborate on training an initial shared model without exchanging raw dat a samples. The initial model should be trained in a way that current or new agents can easily adapt it to their local datasets after one or a few fine-tuning steps, thus improving the model personaliza tion. Conventional meta FL approaches minimize the average loss of agents on the local models obtai ned after one step of fine-tuning. In practice, agents may need to apply several fine-tuning steps to adapt the global model to their local data, especially under highly heterogeneous data dis tributions across agents. To this end, we present a generalized framework for the meta FL by minimizin g the average loss of agents on their local model after any arbitrary number ν of fine-tuning steps. For this generalized framework, we present a variant of the well-known federated averaging ( FedAvg) algorithm and conduct a comprehensive theoretical convergence analysis to charac terize the convergence speed as well as behavior of the meta loss functions in both the exact and appr oximated cases. Our experiments on real-world datasets demonstrate superior accuracy and fas ter convergence for the proposed scheme compared to conventional approaches.

artificial intelligence, fine-tuning step, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2504.21327

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.87)

Add feedback

Filters

Collaborating Authors

fine-tuning step

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

RoMa: ARobust Model Watermarking Scheme for Protecting IP in Diffusion Models

4b5721f7fcc1672930d860e0dfcfee84-Paper-Conference.pdf

Efficient Meta Neural Heuristic for Multi-Objective Combinatorial Optimization (Appendix) A Model architecture The architecture of the base model in meta-learning is the same as POMO [ 26

Efficient Meta Neural Heuristic for Multi-Objective Combinatorial Optimization Jinbiao Chen

4b5721f7fcc1672930d860e0dfcfee84-Paper-Conference.pdf

Transformer-Gather, Fuzzy-Reconsider: A Scalable Hybrid Framework for Entity Resolution

b1efde53be364a73914f58805a001731-Supplemental-Conference.pdf

b1efde53be364a73914f58805a001731-Paper-Conference.pdf

Model-agnostic Meta-learning for Adaptive Gait Phase and Terrain Geometry Estimation with Wearable Soft Sensors

A Generalized Meta Federated Learning Framework with Theoretical Convergence Guarantees