Goto

Collaborating Authors

 specific data


Exploring Incremental Unlearning: Techniques, Challenges, and Future Directions

arXiv.org Artificial Intelligence

The growing demand for data privacy in Machine Learning (ML) applications has seen Machine Unlearning (MU) emerge as a critical area of research. As the `right to be forgotten' becomes regulated globally, it is increasingly important to develop mechanisms that delete user data from AI systems while maintaining performance and scalability of these systems. Incremental Unlearning (IU) is a promising MU solution to address the challenges of efficiently removing specific data from ML models without the need for expensive and time-consuming full retraining. This paper presents the various techniques and approaches to IU. It explores the challenges faced in designing and implementing IU mechanisms. Datasets and metrics for evaluating the performance of unlearning techniques are discussed as well. Finally, potential solutions to the IU challenges alongside future research directions are offered. This survey provides valuable insights for researchers and practitioners seeking to understand the current landscape of IU and its potential for enhancing privacy-preserving intelligent systems.


Survey of Security and Data Attacks on Machine Unlearning In Financial and E-Commerce

arXiv.org Artificial Intelligence

Machine learning in financial and e-commerce sector employs vast amounts of data are used to predict trends, detect fraud, and optimize decision-making processes. However, as these models become more widespread, concerns over security and privacy have also increased. In response to such challenges, machine unlearning has been introduced as a solution to enable models to forget specific data points when necessary, particularly for compliance with data regulations like the General Data Protection Regulation (GDPR). While machine unlearning provides an avenue for users to request the deletion of data from ML models, it also introduces new vulnerabilities to both privacy and security. Privacy and security attacks on machine unlearning are growing areas of concern, especially in sensitive financial applications where personal data is paramount. Two main categories of attacks can exploit this process: privacy attacks and security attacks. Privacy attacks target the confidentiality of data by attempting to reveal sensitive information, whereas security attacks aim to compromise the integrity and functionality of the machine unlearning process. In this paper, we aim to survey the types of privacy and security data attacks specific to machine unlearning in financial applications.


A More Practical Approach to Machine Unlearning

arXiv.org Artificial Intelligence

Machine learning models often incorporate vast amounts of data, raising significant privacy concerns. Machine unlearning, the ability to remove the influence of specific data points from a trained model, addresses these concerns. This paper explores practical methods for implementing machine unlearning, focusing on a first-epoch gradient-ascent approach. Key findings include: 1. Single vs. Multi-Epoch Unlearning: First-epoch gradient unlearning is more effective than multi-epoch gradients. 2. Layer-Based Unlearning: The embedding layer in GPT-2 is crucial for effective unlearning. Gradients from the output layers (11 and 12) have no impact. Efficient unlearning can be achieved using only the embedding layer, halving space complexity. 3. Influence Functions & Scoring: Techniques like Hessian Vector Product and the dot product of activations and tensors are used for quantifying unlearning. 4. Gradient Ascent Considerations: Calibration is necessary to avoid overexposing the model to specific data points during unlearning, which could prematurely terminate the process. 5. Fuzzy Matching vs. Iterative Unlearning: Fuzzy matching techniques shift the model to a new optimum, while iterative unlearning provides a more complete modality. Our empirical evaluation confirms that first-epoch gradient ascent for machine unlearning is more effective than whole-model gradient ascent. These results highlight the potential of machine unlearning for enhancing data privacy and compliance with regulations such as GDPR and CCPA. The study underscores the importance of formal methods to comprehensively evaluate the unlearning process.


Confusion Matrix without Confused

#artificialintelligence

As we know, the output for classification problem is consists from two target variables, either 0 or 1; Yes or No; Positive or Negative; etc. and our model is trying to classify whether a specific data is 0 or 1; Yes or No; etc. The columns are representing the True Class, which means true or real label for the specific data. The rows are representing the Predicted Class, which means the prediction results derived from our model for the specific use case. True Positive (TP) TP is simply the count of data where the Predicted value is Positive and True value is Positive too. True Negative (TN) TN is simply the count of data where the Predicted value is Negative and True value is Negative too.


Machine learning in knee arthroplasty: specific data are key--a systematic review - Knee Surgery, Sports Traumatology, Arthroscopy

#artificialintelligence

Artificial intelligence (AI) in healthcare is rapidly growing and offers novel options of data analysis. Machine learning (ML) represents a distinct application of AI, which is capable of generating predictions and has already been tested in different medical specialties with various approaches such as diagnostic applications, cost predictions or identification of risk factors. In orthopaedics, this technology has only recently been introduced and the literature on ML in knee arthroplasty is scarce. In this review, we aim to investigate which predictions are already feasible using ML models in knee arthroplasty to identify prerequisites for the effective use of this novel approach. For this reason, we conducted a systematic review of ML algorithms for outcome prediction in knee arthroplasty.


Data science: the key to growing your business in Africa

#artificialintelligence

Do you want to understand why your competitors are winning the business of potential clients? How do you predict future trends within your marketplace? Are you looking to predict government policy decisions that will affect you and your business? These are the questions that can be answered by a team of data scientists that will improve your business and your understanding of your clients. Let's take the first question: "Why are your competitors winning the business of potential clients that you may be missing out on."


How To Verify The Memory Loss Of A Machine Learning Model

#artificialintelligence

It is a known fact that deep learning models get better with diversity in the data they are fed with. For instance, data in a use case related to healthcare data will be taken from several providers such as patient data, history, workflows of professionals, insurance providers, etc. to ensure such data diversity. These data points that are collected through various interactions of people are fed into a machine learning model, which sits remotely in a data haven spewing predictions without exhausting. However, consider a scenario where one of the providers ceases to offer data to the healthcare project and later requests to delete the provided information. In such a case, does the model remember or forget its learnings from this data?


GDPR panic may spur data and AI innovation

#artificialintelligence

If AI innovation runs on data, the new European Union's General Data Protection Regulations (GDPR) seem poised to freeze AI advancement. The regulations prescribe a utopian data future where consumers can refuse companies access to their personally identifiable information (PII). Although the enforcement deadline has passed, the technical infrastructure and manpower needed to meet these requirements still do not exist in most companies today. Coincidentally, the barriers to GDPR compliance are also bottlenecks of widespread AI adoption. Despite the hype, enterprise AI is still nascent: Companies may own petabytes of data that can be used for AI, but fully digitizing that data, knowing what the data tables actually contain and understanding who, where and how to access that data remains a herculean coordination effort for even the most empowered internal champion.