AITopics | Bontempi, Gianluca

Collaborating Authors

Bontempi, Gianluca

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Streaming Active Learning Strategies for Real-Life Credit Card Fraud Detection: Assessment and Visualization

Carcillo, Fabirzio, Borgne, Yann-Aël Le, Caelen, Olivier, Bontempi, Gianluca

arXiv.org Machine LearningApr-20-2018

Credit card fraud detection is a very challenging problem because of the specific nature of transaction data and the labeling process. The transaction data is peculiar because they are obtained in a streaming fashion, they are strongly imbalanced and prone to non-stationarity. The labeling is the outcome of an active learning process, as every day human investigators contact only a small number of cardholders (associated to the riskiest transactions) and obtain the class (fraud or genuine) of the related transactions. An adequate selection of the set of cardholders is therefore crucial for an efficient fraud detection process. In this paper, we present a number of active learning strategies and we investigate their fraud detection accuracies. We compare different criteria (supervised, semi-supervised and unsupervised) to query unlabeled transactions. Finally, we highlight the existence of an exploitation/exploration trade-off for active learning in the context of fraud detection, which has so far been overlooked in the literature.

law enforcement, public safety, transaction, (22 more...)

arXiv.org Machine Learning

doi: 10.1007/s41060-018-0116-z

1804.07481

Country:

North America > United States > Wisconsin (0.14)
North America > United States > New York (0.14)
North America > United States > California (0.14)

Genre: Research Report > New Finding (0.68)

Industry: Law Enforcement & Public Safety > Fraud (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science > Data Mining > Anomaly Detection (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.67)

Add feedback

Feature selection in high-dimensional dataset using MapReduce

Reggiani, Claudio, Borgne, Yann-Aël Le, Bontempi, Gianluca

arXiv.org Machine LearningSep-7-2017

This paper describes a distributed MapReduce implementation of the minimum Redundancy Maximum Relevance algorithm, a popular feature selection method in bioinformatics and network inference problems. The proposed approach handles both tall/narrow and wide/short datasets. We further provide an open source implementation based on Hadoop/Spark, and illustrate its scalability on datasets involving millions of observations or features.

big data, dataset, survey article, (20 more...)

arXiv.org Machine Learning

1709.02327

Country:

Europe > Belgium (0.15)
North America > United States (0.14)
Europe > Germany (0.14)

Genre: Research Report (0.83)

Technology:

Information Technology > Data Science > Data Mining > Big Data (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.30)

Add feedback

From dependency to causality: a machine learning approach

Bontempi, Gianluca, Flauder, Maxime

arXiv.org Machine LearningDec-19-2014

The relationship between statistical dependency and causality lies at the heart of all statistical approaches to causal inference. Recent results in the ChaLearn cause-effect pair challenge have shown that causal directionality can be inferred with good accuracy also in Markov indistinguishable configurations thanks to data driven approaches. This paper proposes a supervised machine learning approach to infer the existence of a directed causal link between two variables in multivariate settings with $n>2$ variables. The approach relies on the asymmetry of some conditional (in)dependence relations between the members of the Markov blankets of two variables causally connected. Our results show that supervised learning methods may be successfully used to extract causal information on the basis of asymmetric statistical descriptors also for $n>2$ variate distributions.

algorithm, artificial intelligence, inductive learning, (19 more...)

arXiv.org Machine Learning

1412.6285

Country:

North America > United States (0.14)
Europe > Belgium (0.14)

Genre:

Research Report > New Finding (0.54)
Research Report > Promising Solution (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Add feedback

A review and comparison of strategies for multi-step ahead time series forecasting based on the NN5 forecasting competition

Taieb, Souhaib Ben, Bontempi, Gianluca, Atiya, Amir, Sorjamaa, Antti

arXiv.org Machine LearningAug-16-2011

Multi-step ahead forecasting is still an open challenge in time series forecasting. Several approaches that deal with this complex problem have been proposed in the literature but an extensive comparison on a large number of tasks is still missing. This paper aims to fill this gap by reviewing existing strategies for multi-step ahead forecasting and comparing them in theoretical and practical terms. To attain such an objective, we performed a large scale comparison of these different strategies using a large experimental benchmark (namely the 111 series from the NN5 forecasting competition). In addition, we considered the effects of deseasonalization, input variable selection, and forecast combination on these strategies and on multi-step ahead forecasting at large. The following three findings appear to be consistently supported by the experimental results: Multiple-Output strategies are the best performing approaches, deseasonalization leads to uniformly improved forecast accuracy, and input selection is more effective when performed in conjunction with deseasonalization.

forecasting strategy, fuzzy logic, survey article, (20 more...)

arXiv.org Machine Learning

1108.3259

Country:

Europe (1.00)
Africa > Middle East > Egypt (0.28)
North America > United States > California > San Francisco County > San Francisco (0.14)

Genre:

Workflow (0.93)
Research Report > New Finding (0.92)

Industry: Government (0.46)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.45)

Add feedback

Lazy Learning Meets the Recursive Least Squares Algorithm

Birattari, Mauro, Bontempi, Gianluca, Bersini, Hugues

Neural Information Processing SystemsDec-31-1999

Lazy learning is a memory-based technique that, once a query is received, extracts a prediction interpolating locally the neighboring examples of the query which are considered relevant according to a distance measure. In this paper we propose a data-driven method to select on a query-by-query basis the optimal number of neighbors to be considered for each prediction. As an efficient way to identify and validate local models, the recursive least squares algorithm is introduced in the context of local approximation and lazy learning. Furthermore, beside the winner-takes-all strategy for model selection, a local combination of the most promising models is explored. The method proposed is tested on six different datasets and compared with a state-of-the-art approach.

artificial intelligence, machine learning, prediction, (13 more...)

Neural Information Processing Systems

Country:

Europe (0.14)
North America > United States (0.14)

Genre: Research Report > Promising Solution (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

Lazy Learning Meets the Recursive Least Squares Algorithm

Birattari, Mauro, Bontempi, Gianluca, Bersini, Hugues

Neural Information Processing SystemsDec-31-1999

Lazy learning is a memory-based technique that, once a query is received, extractsa prediction interpolating locally the neighboring examples of the query which are considered relevant according to a distance measure. In this paper we propose a data-driven method to select on a query-by-query basis the optimal number of neighbors to be considered for each prediction. As an efficient way to identify and validate local models, the recursive least squares algorithm is introduced in the context oflocal approximation and lazy learning. Furthermore, beside the winner-takes-all strategy for model selection, a local combination of the most promising models is explored. The method proposed is tested on six different datasets and compared with a state-of-the-art approach.

artificial intelligence, machine learning, prediction, (13 more...)

Neural Information Processing Systems

Country:

Europe (0.14)
North America > United States (0.14)

Genre: Research Report > Promising Solution (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.50)

Add feedback