AITopics | Mekala, Anmol

Mekala, Anmol

Alternate Preference Optimization for Unlearning Factual Knowledge in Large Language Models

Mekala, Anmol, Dorna, Vineeth, Dubey, Shreya, Lalwani, Abhishek, Koleczek, David, Rungta, Mukund, Hasan, Sadid, Lobo, Elita

arXiv.org Artificial Intelligence, Dec-17-2024

Machine unlearning aims to efficiently eliminate the influence of specific training data, known as the forget set, from the model. However, existing unlearning methods for Large Language Models (LLMs) face a critical challenge: they rely solely on negative feedback to suppress responses related to the forget set, which often results in nonsensical or inconsistent outputs, diminishing model utility and posing potential privacy risks. To address this limitation, we propose a novel approach called Alternate Preference Optimization (AltPO), which combines negative feedback with in-domain positive feedback on the forget set. Additionally, we introduce new evaluation metrics to assess the quality of responses related to the forget set. Extensive experiments show that our approach not only enables effective unlearning but also avoids undesirable model behaviors while maintaining overall model performance. Our implementation can be found at https://github.com/molereddy/Alternate-Preference-Optimization.
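The abstract does not spell out how negative and positive feedback are combined. One plausible reading, offered only as a sketch, is a DPO-style preference loss in which an in-domain alternate answer is the preferred response and the original forget-set answer is the dispreferred one. The function names, the `beta` value, and the use of a frozen reference model are assumptions made for illustration, not details confirmed by the abstract.

```python
import math


def log_sigmoid(x: float) -> float:
    """Numerically stable log(sigmoid(x))."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def preference_unlearning_loss(
    logp_alt: float,       # policy log-prob of the alternate (positive) answer
    logp_orig: float,      # policy log-prob of the original forget-set answer
    ref_logp_alt: float,   # same quantities under a frozen reference model (assumed)
    ref_logp_orig: float,
    beta: float = 0.1,     # hypothetical scaling factor
) -> float:
    """DPO-style loss: reward the alternate answer, penalize the original one."""
    margin = beta * ((logp_alt - ref_logp_alt) - (logp_orig - ref_logp_orig))
    return -log_sigmoid(margin)


# At initialization the policy equals the reference, so the margin is zero.
start = preference_unlearning_loss(-5.0, -2.0, -5.0, -2.0)
# After the policy shifts probability toward the alternate answer, the loss falls.
later = preference_unlearning_loss(-3.0, -6.0, -5.0, -2.0)
```

Under this reading, the positive term gives the model something coherent to say about forget-set prompts, rather than only pushing probability away from the original answer.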

artificial intelligence, large language model, natural language, (17 more...)

2409.13474

Country:

North America > United States > Massachusetts (0.14)
Asia > Middle East > Iraq (0.14)
Asia > Middle East > Iran (0.14)

Genre:

Research Report > Promising Solution (0.48)
Research Report > New Finding (0.46)

Industry:

Health & Medicine (0.46)
Media (0.46)
Information Technology > Security & Privacy (0.46)
Government (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Automated Model Selection for Tabular Data

Amballa, Avinash, Mekala, Anmol, Akkinapalli, Gayathri, Madine, Manas, Yarrabolu, Naga Pavana Priya, Grabowicz, Przemyslaw A.

arXiv.org Artificial Intelligence, Jan-1-2024

Structured data in the form of tabular datasets contains features that are distinct and discrete, with varying individual and relative importance to the target. Combinations of one or more features may be more predictive and meaningful than simple individual feature contributions. R's mixed-effects linear models library allows users to specify such feature interactions in the model design. However, given many features and possible interactions to select from, model selection becomes an exponentially difficult task. We aim to automate the model selection process for predictions on tabular datasets, incorporating feature interactions while keeping computational costs small. The framework includes two distinct approaches for feature selection: a Priority-based Random Grid Search and a Greedy Search method. The Priority-based approach efficiently explores feature combinations using prior probabilities to guide the search. The Greedy method builds the solution iteratively by adding or removing features based on their impact. Experiments on synthetic data demonstrate the ability to effectively capture predictive feature combinations.
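The greedy search described above can be sketched as a loop that toggles one term at a time, adding or removing it, and keeps the change only if it improves a score. This is a minimal illustration, not the authors' implementation: `candidate_terms`, `greedy_select`, and `toy_score` are hypothetical names, the score is a stand-in for whatever model-fit criterion the framework uses, and only pairwise interactions are enumerated (written `a:b`, as in R formula syntax).

```python
from itertools import combinations
from typing import Callable, FrozenSet, List, Tuple


def candidate_terms(features: List[str]) -> List[str]:
    """Main effects plus all pairwise interaction terms, e.g. 'a' and 'a:b'."""
    feats = sorted(features)
    return feats + [f"{a}:{b}" for a, b in combinations(feats, 2)]


def greedy_select(
    terms: List[str],
    score: Callable[[FrozenSet[str]], float],  # higher is better (assumed)
    min_gain: float = 1e-9,
) -> Tuple[FrozenSet[str], float]:
    """Repeatedly apply the single add-or-remove move that most improves the score."""
    selected: FrozenSet[str] = frozenset()
    best = score(selected)
    while True:
        # Every model reachable by toggling exactly one term in or out.
        moves = [selected ^ {t} for t in terms]
        top = max(moves, key=score, default=None)
        if top is None or score(top) - best <= min_gain:
            return selected, best
        selected, best = top, score(top)


# Toy score: reward the truly predictive terms, penalize model size.
TRUE_TERMS = {"a", "b:c"}


def toy_score(model: FrozenSet[str]) -> float:
    return len(model & TRUE_TERMS) - 0.1 * len(model)


chosen, chosen_score = greedy_select(candidate_terms(["a", "b", "c"]), toy_score)
```

Each accepted move strictly increases the score, so the loop terminates. The cost per step is linear in the number of candidate terms, which is the appeal of a greedy search over exhaustive enumeration of term subsets.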

artificial intelligence, interaction, machine learning, (18 more...)

2401.00961

Country: North America > United States (0.14)

Genre: Research Report > Experimental Study (0.31)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.34)

Using Early Readouts to Mediate Featural Bias in Distillation

Tiwari, Rishabh, Sivasubramanian, Durga, Mekala, Anmol, Ramakrishnan, Ganesh, Shenoy, Pradeep

arXiv.org Artificial Intelligence, Nov-8-2023

Deep networks tend to learn spurious feature-label correlations in real-world supervised learning tasks. This vulnerability is aggravated in distillation, where a student model may have less representational capacity than the corresponding teacher model. Often, knowledge of specific spurious correlations is used to reweight instances and rebalance the learning process. We propose a novel early readout mechanism whereby we attempt to predict the label using representations from earlier network layers. We show that these early readouts automatically identify problem instances or groups in the form of confident, incorrect predictions. Leveraging these signals to modulate the distillation loss on an instance level allows us to substantially improve not only group fairness measures across benchmark datasets, but also overall accuracy of the student model. We also provide secondary analyses that bring insight into the role of feature learning in supervision and distillation.
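The abstract does not give the exact form of the instance-level modulation. As a rough sketch under stated assumptions, an early readout that is confidently wrong on an instance could raise that instance's weight in the distillation loss. The weighting rule `1 + alpha * confidence`, the function names, and `alpha` are all hypothetical and chosen only to make the idea concrete.

```python
import math
from typing import List


def softmax(logits: List[float]) -> List[float]:
    """Convert logits to probabilities, shifting by the max for stability."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def readout_weight(readout_logits: List[float], label: int, alpha: float = 1.0) -> float:
    """Return 1.0 for correct readouts; upweight confident, incorrect ones."""
    probs = softmax(readout_logits)
    pred = max(range(len(probs)), key=probs.__getitem__)
    if pred == label:
        return 1.0
    return 1.0 + alpha * probs[pred]  # confidence of the wrong prediction


def weighted_distillation_loss(per_instance_losses: List[float], weights: List[float]) -> float:
    """Weighted mean of per-instance distillation losses."""
    return sum(w * l for w, l in zip(weights, per_instance_losses)) / sum(weights)


# Two instances: the readout is right on the first, confidently wrong on the second.
weights = [readout_weight([5.0, 0.0], label=0), readout_weight([5.0, 0.0], label=1)]
loss = weighted_distillation_loss([1.0, 3.0], weights)
```

Here the second instance contributes more to the loss than it would under uniform weighting, which is one way to read "modulate the distillation loss on an instance level".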

artificial intelligence, dataset, machine learning, (18 more...)

2310.1859

Country: North America > United States > Massachusetts (0.14)

Genre: Research Report (0.82)

Industry: Education (0.55)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)
