AITopics | representation extractor

Collaborating Authors

representation extractor

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Enhancing Adversarial Contrastive Learning via Adversarial Invariant Regularization

Neural Information Processing SystemsOct-8-2025, 10:46:49 GMT

However, it is unclear how the style-independence property benefits ACL-learned robust representations.

representation, robustness transferability, transferability, (14 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.04)
Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

DP$^2$-FedSAM: Enhancing Differentially Private Federated Learning Through Personalized Sharpness-Aware Minimization

Zhang, Zhenxiao, Guo, Yuanxiong, Gong, Yanmin

arXiv.org Artificial IntelligenceSep-20-2024

Federated learning (FL) is a distributed machine learning approach that allows multiple clients to collaboratively train a model without sharing their raw data. To prevent sensitive information from being inferred through the model updates shared in FL, differentially private federated learning (DPFL) has been proposed. DPFL ensures formal and rigorous privacy protection in FL by clipping and adding random noise to the shared model updates. However, the existing DPFL methods often result in severe model utility degradation, especially in settings with data heterogeneity. To enhance model utility, we propose a novel DPFL method named DP$^2$-FedSAM: Differentially Private and Personalized Federated Learning with Sharpness-Aware Minimization. DP$^2$-FedSAM leverages personalized partial model-sharing and sharpness-aware minimization optimizer to mitigate the adverse impact of noise addition and clipping, thereby significantly improving model utility without sacrificing privacy. From a theoretical perspective, we provide a rigorous theoretical analysis of the privacy and convergence guarantees of our proposed method. To evaluate the effectiveness of DP$^2$-FedSAM, we conduct extensive evaluations based on common benchmark datasets. Our results verify that our method improves the privacy-utility trade-off compared to the existing DPFL methods, particularly in heterogeneous data settings.

dp 2, model update, representation extractor, (13 more...)

arXiv.org Artificial Intelligence

2409.13645

Country: North America > United States > Texas (0.04)

Genre: Research Report > New Finding (0.66)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

Improved Generalization Bounds for Communication Efficient Federated Learning

Gholami, Peyman, Seferoglu, Hulya

arXiv.org Artificial IntelligenceMay-27-2024

This paper focuses on reducing the communication cost of federated learning by exploring generalization bounds and representation learning. We first characterize a tighter generalization bound for one-round federated learning based on local clients' generalizations and heterogeneity of data distribution (non-iid scenario). We also characterize a generalization bound in R-round federated learning and its relation to the number of local updates (local stochastic gradient descents (SGDs)). Then, based on our generalization bound analysis and our representation learning interpretation of this analysis, we show for the first time that less frequent aggregations, hence more local updates, for the representation extractor (usually corresponds to initial layers) leads to the creation of more generalizable models, particularly for non-iid scenarios. We design a novel Federated Learning with Adaptive Local Steps (FedALS) algorithm based on our generalization bound and representation learning analysis. FedALS employs varying aggregation frequencies for different parts of the model, so reduces the communication cost. The paper is followed with experimental results showing the effectiveness of FedALS.

generalization, generalization error, improved generalization bound, (12 more...)

arXiv.org Artificial Intelligence

2404.11754

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Label-Agnostic Forgetting: A Supervision-Free Unlearning in Deep Models

Shen, Shaofei, Zhang, Chenhao, Zhao, Yawen, Bialkowski, Alina, Chen, Weitong Tony, Xu, Miao

arXiv.org Artificial IntelligenceMay-7-2024

Machine unlearning aims to remove information derived from forgotten data while preserving that of the remaining dataset in a well-trained model. With the increasing emphasis on data privacy, several approaches to machine unlearning have emerged. However, these methods typically rely on complete supervision throughout the unlearning process. Unfortunately, obtaining such supervision, whether for the forgetting or remaining data, can be impractical due to the substantial cost associated with annotating real-world datasets. This challenge prompts us to propose a supervision-free unlearning approach that operates without the need for labels during the unlearning process. Specifically, we introduce a variational approach to approximate the distribution of representations for the remaining data. Leveraging this approximation, we adapt the original model to eliminate information from the forgotten data at the representation level. To further address the issue of lacking supervision information, which hinders alignment with ground truth, we introduce a contrastive loss to facilitate the matching of representations between the remaining data and those of the original model, thus preserving predictive performance. Experimental results across various unlearning tasks demonstrate the effectiveness of our proposed method, Label-Agnostic Forgetting (LAF) without using any labels, which achieves comparable performance to state-of-the-art methods that rely on full supervision information. Furthermore, our approach excels in semi-supervised scenarios, leveraging limited supervision information to outperform fully supervised baselines. This work not only showcases the viability of supervision-free unlearning in deep models but also opens up a new possibility for future research in unlearning at the representation level.

dataset, information, representation, (17 more...)

arXiv.org Artificial Intelligence

2404.00506

Country:

Oceania > Australia > Queensland (0.04)
North America > United States > California (0.04)
North America > Canada > Ontario > Toronto (0.04)

Genre:

Research Report > Promising Solution (0.75)
Research Report > New Finding (0.46)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

Are Synthetic Time-series Data Really not as Good as Real Data?

Fu, Fanzhe, Chen, Junru, Zhang, Jing, Yang, Carl, Ma, Lvbin, Yang, Yang

arXiv.org Artificial IntelligenceFeb-1-2024

Integrating universal Issues: The fine-tuning process for temporal data needs data synthesis methods holds promise in improving to be handled carefully as it may contain adversarial or noisy generalization. However, current methods cannot examples, which could impact the model's robustness; (2) guarantee that the generator's output covers Bias and Vulnerabilities: The use of temporal data may all unseen real data. In this paper, we introduce cause the model to inherit biases or vulnerabilities from the InfoBoost-a highly versatile cross-domain data data, thereby reducing its robustness in real-world applications; synthesizing framework with time series representation (3) Generalization Problems: Despite being trained learning capability. We have developed on vast datasets, time-series models may not generalize a method based on synthetic data that enables well to unseen or out-of-distribution data. Time-series and model training without the need for real data, surpassing spatio-temporal data may exhibit sudden shifts or trends, the performance of models trained with potentially leading to unreliable outputs, highlighting the real data. Additionally, we have trained a universal need for robust generalization (Jin et al., 2023).

information, real data, synthetic data, (15 more...)

arXiv.org Artificial Intelligence

2402.00607

Country:

Asia > China > Zhejiang Province > Hangzhou (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
Europe > Switzerland > Basel-City > Basel (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Data Science > Data Mining (0.93)
Information Technology > Data Science > Data Quality (0.68)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)

Add feedback

Enhancing Adversarial Contrastive Learning via Adversarial Invariant Regularization

Xu, Xilie, Zhang, Jingfeng, Liu, Feng, Sugiyama, Masashi, Kankanhalli, Mohan

arXiv.org Artificial IntelligenceOct-23-2023

Adversarial contrastive learning (ACL) is a technique that enhances standard contrastive learning (SCL) by incorporating adversarial data to learn a robust representation that can withstand adversarial attacks and common corruptions without requiring costly annotations. To improve transferability, the existing work introduced the standard invariant regularization (SIR) to impose style-independence property to SCL, which can exempt the impact of nuisance style factors in the standard representation. However, it is unclear how the style-independence property benefits ACL-learned robust representations. In this paper, we leverage the technique of causal reasoning to interpret the ACL and propose adversarial invariant regularization (AIR) to enforce independence from style factors. We regulate the ACL using both SIR and AIR to output the robust representation. Theoretically, we show that AIR implicitly encourages the representational distance between different views of natural data and their adversarial variants to be independent of style factors. Empirically, our experimental results show that invariant regularization significantly improves the performance of state-of-the-art ACL methods in terms of both standard generalization and robustness on downstream tasks. To the best of our knowledge, we are the first to apply causal reasoning to interpret ACL and develop AIR for enhancing ACL-learned robust representations.

representation, robustness transferability, transferability, (14 more...)

arXiv.org Artificial Intelligence

2305.00374

Country:

Asia > Singapore (0.04)
Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
(2 more...)

Genre: Research Report > New Finding (0.34)

Industry: Information Technology (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

Enriching Disentanglement: Definitions to Metrics

Zhang, Yivan, Sugiyama, Masashi

arXiv.org Artificial IntelligenceMay-19-2023

Disentangled representation learning is a challenging task that involves separating multiple factors of variation in complex data. Although various metrics for learning and evaluating disentangled representations have been proposed, it remains unclear what these metrics truly quantify and how to compare them. In this work, we study the definitions of disentanglement given by first-order equational predicates and introduce a systematic approach for transforming an equational definition into a compatible quantitative metric based on enriched category theory. Specifically, we show how to replace (i) equality with metric or divergence, (ii) logical connectives with order operations, (iii) universal quantifier with aggregation, and (iv) existential quantifier with the best approximation. Using this approach, we derive metrics for measuring the desired properties of a disentangled representation extractor and demonstrate their effectiveness on synthetic data. Our proposed approach provides practical guidance for researchers in selecting appropriate evaluation metrics and designing effective learning algorithms for disentangled representation learning.

artificial intelligence, machine learning, representation, (12 more...)

arXiv.org Artificial Intelligence

2305.11512

Country: