Goto

Collaborating Authors

 Accuracy


Algebraic and machine learning approach to hierarchical triple-star stability

arXiv.org Artificial Intelligence

We present two approaches to determine the dynamical stability of a hierarchical triple-star system. The first is an improvement on the Mardling-Aarseth stability formula from 2001, where we introduce a dependence on inner orbital eccentricity and improve the dependence on mutual orbital inclination. The second involves a machine learning approach, where we use a multilayer perceptron (MLP) to classify triple-star systems as `stable' and `unstable'. To achieve this, we generate a large training data set of 10^6 hierarchical triples using the N-body code MSTAR. Both our approaches perform better than previous stability criteria, with the MLP model performing the best. The improved stability formula and the machine learning model have overall classification accuracies of 93 % and 95 % respectively. Our MLP model, which accurately predicts the stability of any hierarchical triple-star system within the parameter ranges studied with almost no computation required, is publicly available on Github in the form of an easy-to-use Python script.


Avast-CTU Public CAPE Dataset

arXiv.org Artificial Intelligence

There is a plethora of methods for detecting malicious samples (e.g., see surveys [19, 13]). Broadly speaking, we can distinguish two main categories: (1) detecting the samples based on their static features and (2) detecting the samples based on a behavioral analysis. The static features typically consist of considering the whole sample (e.g., as an image [9]) and/or properties of its most important parts (e.g., by examining in details header of a Windows portable executable (PE) file) [18]. The behavioral analysis consists of executing (or simulating the execution) of the sample and logging performed actions in order to determine whether these actions have characteristics of malicious behavior [13]. The main advantage of the first approach is the computational efficiency since extracting static features from the file itself can be much faster compared to the (simulated) execution. On the other hand, the main disadvantage of the static approach is the inability to reliably distinguish malicious samples from benign samples in case the sample is encrypted and/or the clean file is altered in a minor way to exhibit malicious behavior. The methods relying on behavioral analysis can discover malicious behavior even in encrypted samples, however, they require significantly more resources to run or simulate the instructions of the analyzed sample. In either case, the growing number of new, previously unseen samples makes the usage of automated decision/classification methods of artificial intelligence (AI) and machine learning (ML) inevitable in the malware-detection domain. Framing the problem of malware detection as an AI/ML problem reveals interesting and unique properties of the domain that are less prevalent in other domains.


Combining Sequential and Aggregated Data for Churn Prediction in Casual Freemium Games

arXiv.org Artificial Intelligence

In freemium games, the revenue from a player comes from the in-app purchases made and the advertisement to which that player is exposed. The longer a player is playing the game, the higher will be the chances that he or she will generate a revenue within the game. Within this scenario, it is extremely important to be able to detect promptly when a player is about to quit playing (churn) in order to react and attempt to retain the player within the game, thus prolonging his or her game lifetime. In this article we investigate how to improve the current state-of-the-art in churn prediction by combining sequential and aggregate data using different neural network architectures. The results of the comparative analysis show that the combination of the two data types grants an improvement in the prediction accuracy over predictors based on either purely sequential or purely aggregated data.


Change Detection for Local Explainability in Evolving Data Streams

arXiv.org Artificial Intelligence

As complex machine learning models are increasingly used in sensitive applications like banking, trading or credit scoring, there is a growing demand for reliable explanation mechanisms. Local feature attribution methods have become a popular technique for post-hoc and model-agnostic explanations. However, attribution methods typically assume a stationary environment in which the predictive model has been trained and remains stable. As a result, it is often unclear how local attributions behave in realistic, constantly evolving settings such as streaming and online applications. In this paper, we discuss the impact of temporal change on local feature attributions. In particular, we show that local attributions can become obsolete each time the predictive model is updated or concept drift alters the data generating distribution. Consequently, local feature attributions in data streams provide high explanatory power only when combined with a mechanism that allows us to detect and respond to local changes over time. To this end, we present CDLEEDS, a flexible and model-agnostic framework for detecting local change and concept drift. CDLEEDS serves as an intuitive extension of attribution-based explanation techniques to identify outdated local attributions and enable more targeted recalculations. In experiments, we also show that the proposed framework can reliably detect both local and global concept drift. Accordingly, our work contributes to a more meaningful and robust explainability in online machine learning.


Multi-Modal Hypergraph Diffusion Network with Dual Prior for Alzheimer Classification

arXiv.org Artificial Intelligence

The automatic early diagnosis of prodromal stages of Alzheimer's disease is of great relevance for patient treatment to improve quality of life. We address this problem as a multi-modal classification task. Multi-modal data provides richer and complementary information. However, existing techniques only consider either lower order relations between the data and single/multi-modal imaging data. In this work, we introduce a novel semi-supervised hypergraph learning framework for Alzheimer's disease diagnosis. Our framework allows for higher-order relations among multi-modal imaging and non-imaging data whilst requiring a tiny labelled set. Firstly, we introduce a dual embedding strategy for constructing a robust hypergraph that preserves the data semantics. We achieve this by enforcing perturbation invariance at the image and graph levels using a contrastive based mechanism. Secondly, we present a dynamically adjusted hypergraph diffusion model, via a semi-explicit flow, to improve the predictive uncertainty. We demonstrate, through our experiments, that our framework is able to outperform current techniques for Alzheimer's disease diagnosis.


Robust and Efficient Imbalanced Positive-Unlabeled Learning with Self-supervision

arXiv.org Artificial Intelligence

Learning from positive and unlabeled (PU) data is a setting where the learner only has access to positive and unlabeled samples while having no information on negative examples. Such PU setting is of great importance in various tasks such as medical diagnosis, social network analysis, financial markets analysis, and knowledge base completion, which also tend to be intrinsically imbalanced, i.e., where most examples are actually negatives. Most existing approaches for PU learning, however, only consider artificially balanced datasets and it is unclear how well they perform in the realistic scenario of imbalanced and long-tail data distribution. This paper proposes to tackle this challenge via robust and efficient self-supervised pretraining. However, training conventional self-supervised learning methods when applied with highly imbalanced PU distribution needs better reformulation. In this paper, we present \textit{ImPULSeS}, a unified representation learning framework for \underline{Im}balanced \underline{P}ositive \underline{U}nlabeled \underline{L}earning leveraging \underline{Se}lf-\underline{S}upervised debiase pre-training. ImPULSeS uses a generic combination of large-scale unsupervised learning with debiased contrastive loss and additional reweighted PU loss. We performed different experiments across multiple datasets to show that ImPULSeS is able to halve the error rate of the previous state-of-the-art, even compared with previous methods that are given the true prior. Moreover, our method showed increased robustness to prior misspecification and superior performance even when pretraining was performed on an unrelated dataset. We anticipate such robustness and efficiency will make it much easier for practitioners to obtain excellent results on other PU datasets of interest. The source code is available at \url{https://github.com/JSchweisthal/ImPULSeS}


Achieving Model Fairness in Vertical Federated Learning

arXiv.org Artificial Intelligence

Vertical federated learning (VFL) has attracted greater and greater interest since it enables multiple parties possessing non-overlapping features to strengthen their machine learning models without disclosing their private data and model parameters. Similar to other machine learning algorithms, VFL faces demands and challenges of fairness, i.e., the learned model may be unfairly discriminatory over some groups with sensitive attributes. To tackle this problem, we propose a fair VFL framework in this work. First, we systematically formulate the problem of training fair models in VFL, where the learning task is modelled as a constrained optimization problem. To solve it in a federated and privacy-preserving manner, we consider the equivalent dual form of the problem and develop an asynchronous gradient coordinate-descent ascent algorithm, where some active data parties perform multiple parallelized local updates per communication round to effectively reduce the number of communication rounds. The messages that the server sends to passive parties are deliberately designed such that the information necessary for local updates is released without intruding on the privacy of data and sensitive attributes. We rigorously study the convergence of the algorithm when applied to general nonconvex-concave min-max problems. We prove that the algorithm finds a $\delta$-stationary point of the dual objective in $\mathcal{O}(\delta^{-4})$ communication rounds under mild conditions. Finally, the extensive experiments on three benchmark datasets demonstrate the superior performance of our method in training fair models.


Group-$k$ Consistent Measurement Set Maximization for Robust Outlier Detection

arXiv.org Artificial Intelligence

This paper presents a method for the robust selection of measurements in a simultaneous localization and mapping (SLAM) framework. Existing methods check consistency or compatibility on a pairwise basis, however many measurement types are not sufficiently constrained in a pairwise scenario to determine if either measurement is inconsistent with the other. This paper presents group-$k$ consistency maximization (G$k$CM) that estimates the largest set of measurements that is internally group-$k$ consistent. Solving for the largest set of group-$k$ consistent measurements can be formulated as an instance of the maximum clique problem on generalized graphs and can be solved by adapting current methods. This paper evaluates the performance of G$k$CM using simulated data and compares it to pairwise consistency maximization (PCM) presented in previous work.


Ethical AI, Monetizing False Negatives and Growing Total Addressable Market - DataScienceCentral.com

#artificialintelligence

What if I told you that companies that don't embrace Ethical AI are leaving significant amounts of "Money on the Table"; that they are not only missing out on potentially profitable customers, but that over time they are eroding their Total Addressable Market (TAM)? Do I have your attention now? After I published the blog "The Ethical AI Application Pyramid", a question from Karrie Sullivan coupled with a mentoring session with the startup unfog.ai "If your AI model doesn't take into consideration the ultimate outcomes of the AI model's False Negatives, then confirmation bias in the AI model could set in and eventually the company's Total Addressable Market (TAM) could shrink to a point where the business might no longer be viable." Yea, not only is Ethical AI the right thing to do from a cultural and society perspective, but there are direct bottom-line financial ramifications if your AI models are not learning and adapting from the AI model's False Negatives.


Authentication of Underwater Assets

arXiv.org Artificial Intelligence

Secure digital wireless communication underwater has become a key issue as maritime operations shift towards employing a heterogeneous mix of robotic assets and as the security of digital systems becomes challenged across all domains. At the same time, a proliferation of underwater signal coding and physical layer options are delivering greater bandwidth and flexibility, but mostly without the standards necessary for interoperability. We address here an essential requirement for security, namely a confirmation of asset identities also known as authentication. We propose, implement, verify and validate an authentication protocol based on the first digital underwater communications standard. Our scheme is applicable primarily to AUVs operating around offshore oil and gas facilities, but also to other underwater devices that may in the future have acoustic modems. It makes communication including command and control significantly more secure and provides a foundation for the development of more sophisticated security mechanisms.