AITopics | Baku

Machine unlearning aims to remove specific information, e.g. sensitive or undesirable content, from large language models (LLMs) while preserving overall performance. We propose an inference-time unlearning algorithm that uses contrastive decoding, leveraging two auxiliary smaller models, one trained without the forget set and one trained with it, to guide the outputs of the original model using their difference during inference. Our strategy substantially improves the tradeoff between unlearning effectiveness and model utility. We evaluate our approach on two unlearning benchmarks, TOFU and MUSE. Results show notable gains in both forget quality and retained performance in comparison to prior approaches, suggesting that incorporating contrastive decoding can offer an efficient, practical avenue for unlearning concepts in large-scale models.

arxiv preprint arxiv, large language model, machine learning, (17 more...)

arXiv.org Machine Learning

2506.12097

Country:

North America > United States > California (0.14)
Europe (0.14)
Oceania > Australia (0.04)
(3 more...)

Genre: Research Report > New Finding (0.66)

Industry:

Information Technology > Security & Privacy (1.00)
Government (1.00)
Law > Statutes (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

Add feedback

Olympus: A Jumping Quadruped for Planetary Exploration Utilizing Reinforcement Learning for In-Flight Attitude Control

Olsen, Jørgen Anker, Malczyk, Grzegorz, Alexis, Kostas

arXiv.org Artificial IntelligenceMar-5-2025

Exploring planetary bodies with lower gravity, such as the moon and Mars, allows legged robots to utilize jumping as an efficient form of locomotion thus giving them a valuable advantage over traditional rovers for exploration. Motivated by this fact, this paper presents the design, simulation, and learning-based "in-flight" attitude control of Olympus, a jumping legged robot tailored to the gravity of Mars. First, the design requirements are outlined followed by detailing how simulation enabled optimizing the robot's design - from its legs to the overall configuration - towards high vertical jumping, forward jumping distance, and in-flight attitude reorientation. Subsequently, the reinforcement learning policy used to track desired in-flight attitude maneuvers is presented. Successfully crossing the sim2real gap, extensive experimental studies of attitude reorientation tests are demonstrated.

configuration, olympus, robot, (12 more...)

arXiv.org Artificial Intelligence

2503.03574

Country:

North America > United States (0.14)
North America > Canada > Quebec > Montreal (0.04)
Europe > Norway > Central Norway > Trøndelag > Trondheim (0.04)
(3 more...)

Genre: Research Report (1.00)

Industry: Transportation > Air (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.72)
Information Technology > Artificial Intelligence > Robots > Locomotion (0.70)

Add feedback

Agricultural Field Boundary Detection through Integration of "Simple Non-Iterative Clustering (SNIC) Super Pixels" and "Canny Edge Detection Method"

Gayibov, Artughrul

arXiv.org Artificial IntelligenceFeb-6-2025

Efficient use of cultivated areas is a necessary factor for sustainable development of agriculture and ensuring food security. Along with the rapid development of satellite technologies in developed countries, new methods are being searched for accurate and operational identification of cultivated areas. In this context, identification of cropland boundaries based on spectral analysis of data obtained from satellite images is considered one of the most optimal and accurate methods in modern agriculture. This article proposes a new approach to determine the suitability and green index of cultivated areas using satellite data obtained through the "Google Earth Engine" (GEE) platform. In this approach, two powerful algorithms, "SNIC (Simple Non-Iterative Clustering) Super Pixels" and "Canny Edge Detection Method", are combined. The SNIC algorithm combines pixels in a satellite image into larger regions (super pixels) with similar characteristics, thereby providing better image analysis. The Canny Edge Detection Method detects sharp changes (edges) in the image to determine the precise boundaries of agricultural fields. This study, carried out using high-resolution multispectral data from the Sentinel-2 satellite and the Google Earth Engine JavaScript API, has shown that the proposed method is effective in accurately and reliably classifying randomly selected agricultural fields. The combined use of these two tools allows for more accurate determination of the boundaries of agricultural fields by minimizing the effects of outliers in satellite images. As a result, more accurate and reliable maps can be created for agricultural monitoring and resource management over large areas based on the obtained data. By expanding the application capabilities of cloud-based platforms and artificial intelligence methods in the agricultural field.

artificial intelligence, boundary, machine learning, (11 more...)

arXiv.org Artificial Intelligence

2502.04529

Country:

Asia > Azerbaijan > Ganja-Dashkasan Economic Region > Ganja (0.05)
Asia > Azerbaijan > Central Aran Economic Region > Imishli District > Imishli (0.05)
North America > United States (0.04)
Asia > Azerbaijan > Baku Economic Region > Baku (0.04)

Genre: Research Report (0.65)

Industry: Food & Agriculture > Agriculture (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Diagnosis (0.81)

Add feedback

Uncertainty Quantification for Transformer Models for Dark-Pattern Detection

Muñoz, Javier, Huertas-García, Álvaro, Martí-González, Carlos, Ambite, Enrique De Miguel

arXiv.org Artificial IntelligenceDec-6-2024

The opaque nature of transformer-based models, particularly in applications susceptible to unethical practices such as dark-patterns in user interfaces, requires models that integrate uncertainty quantification to enhance trust in predictions. This study focuses on dark-pattern detection, deceptive design choices that manipulate user decisions, undermining autonomy and consent. We propose a differential fine-tuning approach implemented at the final classification head via uncertainty quantification with transformer-based pre-trained models. Employing a dense neural network (DNN) head architecture as a baseline, we examine two methods capable of quantifying uncertainty: Spectral-normalized Neural Gaussian Processes (SNGPs) and Bayesian Neural Networks (BNNs). These methods are evaluated on a set of open-source foundational models across multiple dimensions: model performance, variance in certainty of predictions and environmental impact during training and inference phases. Results demonstrate that integrating uncertainty quantification maintains performance while providing insights into challenging instances within the models. Moreover, the study reveals that the environmental impact does not uniformly increase with the incorporation of uncertainty quantification techniques. The study's findings demonstrate that uncertainty quantification enhances transparency and provides measurable confidence in predictions, improving the explainability and clarity of black-box models. This facilitates informed decision-making and mitigates the influence of dark-patterns on user interfaces. These results highlight the importance of incorporating uncertainty quantification techniques in developing machine learning models, particularly in domains where interpretability and trustworthiness are critical.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2412.05251

Country:

Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > New York > New York County > New York City (0.04)
Europe > Switzerland (0.04)
(5 more...)

Genre: Research Report > New Finding (0.86)

Industry:

Health & Medicine (1.00)
Energy (0.94)
Transportation (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Breadboarding the European Moon Rover System: discussion and results of the analogue field test campaign

Luna, Cristina, Eguíluz, Augusto Gómez, Barrientos-Díez, Jorge, Moreno, Almudena, Guerra, Alba, Esquer, Manuel, Seoane, Marina L., Kay, Steven, Cameron, Angus, Camañes, Carmen, Haas, Philipp, Papantoniou, Vassilios, Wedler, Armin, Rebele, Bernhard, Reynolds, Jennifer, Landgraf, Markus

arXiv.org Artificial IntelligenceNov-21-2024

Abstract-- This document compiles results obtained from the test campaign of the European Moon Rover System (EMRS) project. The test campaign, conducted at the Planetary Exploration Lab of DLR in Wessling, aimed to understand the scope of the EMRS breadboard design, its strengths, and the benefits of the modular design. The discussion of test results is based on rover traversal analyses, robustness assessments, wheel deflection analyses, and the overall transportation cost of the rover. This not only enables the comparison of locomotion modes on lunar regolith but also facilitates critical decisionmaking in the design of future lunar missions. I. INTRODUCTION Humanity has had its gaze set on the stars since an early age.

deflection, rover, test campaign, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/iSpaRo60631.2024.10687700

2411.13978

Country:

Europe > Germany (0.04)
Asia > Japan (0.04)
Europe > United Kingdom > England > Oxfordshire (0.04)
(8 more...)

Genre: Research Report (0.64)

Industry:

Energy (0.47)
Government > Space Agency (0.35)

Technology: Information Technology > Artificial Intelligence (0.49)

Add feedback

The Download: understanding AI, and what to expect from the UN's climate conference

MIT Technology ReviewNov-14-2024, 13:10:00 GMT

We don't know exactly how AI works, or why it works so well. That's a problem: It could lead us to deploy an AI system in a highly sensitive field like medicine without understanding that it could have critical flaws embedded in its workings. A team at Google DeepMind that studies something called mechanistic interpretability has been working on new ways to let us peer under the hood. It recently released a tool to help researchers understand what is happening when AI is generating an output. It's all part of a push to get a better understanding of exactly what is happening inside an AI model.

climate change, climate conference, download, (1 more...)

MIT Technology Review

Country: Asia > Azerbaijan > Baku Economic Region > Baku (0.08)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.62)

Add feedback

Universal approximation theorem for neural networks with inputs from a topological vector space

Ismailov, Vugar

arXiv.org Artificial IntelligenceSep-19-2024

We study feedforward neural networks with inputs from a topological vector space (TVS-FNNs). Unlike traditional feedforward neural networks, TVS-FNNs can process a broader range of inputs, including sequences, matrices, functions and more. We prove a universal approximation theorem for TVS-FNNs, which demonstrates their capacity to approximate any continuous function defined on this expanded input space.

activation function, approximation theorem, neural network, (12 more...)

arXiv.org Artificial Intelligence

2409.12913

Country:

North America > United States > New York (0.04)
Asia > Azerbaijan > Baku Economic Region > Baku (0.04)
North America > United States > Texas > Travis County > Austin (0.04)
(2 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.62)

Add feedback