AITopics | Wen, Zhenyu

Collaborating Authors

Wen, Zhenyu

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

FLMarket: Enabling Privacy-preserved Pre-training Data Pricing for Federated Learning

Wen, Zhenyu, Feng, Wanglei, Wu, Di, Hu, Haozhen, Xu, Chang, Qian, Bin, Hong, Zhen, Wang, Cong, Ji, Shouling

arXiv.org Artificial IntelligenceNov-18-2024

Federated Learning (FL), as a mainstream privacy-preserving machine learning paradigm, offers promising solutions for privacy-critical domains such as healthcare and finance. Although extensive efforts have been dedicated from both academia and industry to improve the vanilla FL, little work focuses on the data pricing mechanism. In contrast to the straightforward in/post-training pricing techniques, we study a more difficult problem of pre-training pricing without direct information from the learning process. We propose FLMarket that integrates a two-stage, auction-based pricing mechanism with a security protocol to address the utility-privacy conflict. Through comprehensive experiments, we show that the client selection according to FLMarket can achieve more than 10% higher accuracy in subsequent FL training compared to state-of-the-art methods. In addition, it outperforms the in-training baseline with more than 2% accuracy increase and 3x run-time speedup.

artificial intelligence, enabling privacy-preserved pre-training data pricing, machine learning, (2 more...)

arXiv.org Artificial Intelligence

2411.11713

Genre: Research Report > Promising Solution (0.53)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Dataset Distillation-based Hybrid Federated Learning on Non-IID Data

Shi, Xiufang, Zhang, Wei, Wu, Mincheng, Liu, Guangyi, Wen, Zhenyu, He, Shibo, Shah, Tejal, Ranjan, Rajiv

arXiv.org Artificial IntelligenceSep-25-2024

In federated learning, the heterogeneity of client data has a great impact on the performance of model training. Many heterogeneity issues in this process are raised by non-independently and identically distributed (Non-IID) data. This study focuses on the issue of label distribution skew. To address it, we propose a hybrid federated learning framework called HFLDD, which integrates dataset distillation to generate approximately independent and equally distributed (IID) data, thereby improving the performance of model training. Particularly, we partition the clients into heterogeneous clusters, where the data labels among different clients within a cluster are unbalanced while the data labels among different clusters are balanced. The cluster headers collect distilled data from the corresponding cluster members, and conduct model training in collaboration with the server. This training process is like traditional federated learning on IID data, and hence effectively alleviates the impact of Non-IID data on model training. Furthermore, we compare our proposed method with typical baseline methods on public datasets. Experimental results demonstrate that when the data labels are severely imbalanced, the proposed HFLDD outperforms the baseline methods in terms of both test accuracy and communication cost.

artificial intelligence, dataset, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2409.17517

Country:

Asia > China (0.14)
Europe > United Kingdom (0.14)

Genre: Research Report > New Finding (0.48)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.68)

Add feedback

Data Augmentation on Graphs: A Technical Survey

Zhou, Jiajun, Xie, Chenxuan, Wen, Zhenyu, Zhao, Xiangyu, Xuan, Qi

arXiv.org Artificial IntelligenceFeb-18-2023

In recent years, graph representation learning has achieved remarkable success while suffering from low-quality data problems. As a mature technology to improve data quality in computer vision, data augmentation has also attracted increasing attention in graph domain. For promoting the development of this emerging research direction, in this survey, we comprehensively review and summarize the existing graph data augmentation (GDAug) techniques. Specifically, we first summarize a variety of feasible taxonomies, and then classify existing GDAug studies based on fine-grained graph elements. Furthermore, for each type of GDAug technique, we formalize the general definition, discuss the technical details, and give schematic illustration. In addition, we also summarize common performance metrics and specific design metrics for constructing a GDAug evaluation system. Finally, we summarize the applications of GDAug from both data and model levels, as well as future directions. Latest advances in GDAug are summarized in a GitHub repository: https://github.com/jjzhou012/GDAug-Survey.

data mining, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2212.0997

Country:

North America > United States (0.46)
Europe (0.28)

Genre: Overview (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Add feedback