Collaborating Authors: Hernández-Orallo, José


Your Prompt is My Command: On Assessing the Human-Centred Generality of Multimodal Models

Journal of Artificial Intelligence Research

Even with obvious deficiencies, large prompt-commanded multimodal models are proving to be flexible cognitive tools exhibiting unprecedented generality. But the directness, diversity, and degree of user interaction create a distinctive “human-centred generality” (HCG), rather than a fully autonomous one. HCG implies that, for a specific user, a system is only as general as it is effective for the user’s relevant tasks and their prevalent ways of prompting. A human-centred evaluation of general-purpose AI systems therefore needs to reflect the personal nature of interaction, tasks and cognition. We argue that the best way to understand these systems is as highly coupled cognitive extenders, and to analyse the bidirectional cognitive adaptations between them and humans. In this paper, we give a formulation of HCG, as well as a high-level overview of the elements and trade-offs involved in the prompting process. We end the paper by outlining some essential research questions and suggestions for improving evaluation practices, which we envision as characteristic of the evaluation of general artificial intelligence in the future. This paper appears in the AI & Society track.
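
As a rough illustration of the idea (not the paper's actual formulation), human-centred generality can be sketched as an expectation of effectiveness over a specific user's own tasks and prevalent prompting habits; every symbol below is an assumption made for this sketch.

```latex
% Rough sketch only; the symbols \pi, D_u, P_u and eff are assumed here and
% do not come from the paper's formulation.
\[
  \mathrm{HCG}(\pi, u)
  \;=\;
  \mathbb{E}_{t \sim D_u}\,
  \mathbb{E}_{p \sim P_u(\cdot \mid t)}
  \left[ \mathrm{eff}(\pi, t, p) \right]
\]
% \pi: the prompt-commanded system; u: a specific user;
% D_u: the distribution of tasks relevant to u;
% P_u(\cdot \mid t): u's prevalent ways of prompting task t;
% eff: the effectiveness of the system's output for the task.
```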


Compute and Energy Consumption Trends in Deep Learning Inference

arXiv.org Artificial Intelligence

The progress of some AI paradigms, such as deep learning, is said to be linked to an exponential growth in the number of parameters. There are many studies corroborating these trends, but does this translate into an exponential increase in energy consumption? To answer this question we focus on inference costs rather than training costs, as the former account for most of the computing effort, solely because of the multiplicative factors. Also, apart from algorithmic innovations, we account for more specific and powerful hardware (leading to higher FLOPS) that is usually accompanied by important energy-efficiency optimisations. We also move the focus from the first implementation of a breakthrough paper to the consolidated version of the techniques one or two years later. Under this distinctive and comprehensive perspective, we study relevant models in the areas of computer vision and natural language processing: for a sustained increase in performance we see a much softer growth in energy consumption than previously anticipated. The only caveat is, yet again, the multiplicative factor: as AI penetration increases and the technology becomes more pervasive, the aggregate energy footprint can still grow substantially.
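
A minimal back-of-the-envelope sketch of why the multiplicative factor matters, with made-up placeholder figures rather than measurements from the paper:

```python
# Illustrative arithmetic only: the figures below are made-up placeholders,
# not measurements from the paper.

def total_energy_kwh(training_kwh: float,
                     energy_per_inference_kwh: float,
                     num_inferences: float) -> float:
    """Lifetime energy = one-off training cost + per-query cost times usage volume."""
    return training_kwh + energy_per_inference_kwh * num_inferences

# A modest per-inference cost dominates once a model is served at scale:
training = 1.0e5            # hypothetical training cost (kWh)
per_inference = 1.0e-4      # hypothetical cost per query (kWh)
for queries in (1e6, 1e9, 1e12):
    total = total_energy_kwh(training, per_inference, queries)
    share = per_inference * queries / total
    print(f"{queries:.0e} queries -> {total:.3e} kWh total, "
          f"{share:.1%} of it from inference")
```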


Conditional Teaching Size

arXiv.org Artificial Intelligence

Recent research in machine teaching has explored the instruction of any concept expressed in a universal language. In this compositional context, new experimental results have shown that there exist data teaching sets surprisingly shorter than the concept description itself. However, these remarkable findings are bounded by a relation between teaching size and concept complexity, which we explore further here. As concepts are rarely taught in isolation, we investigate the best configuration in which to teach a given set of concepts, where those acquired first can be reused in the description of new ones. This new notion of conditional teaching size uncovers new insights, such as the interposition phenomenon: certain prior knowledge generates simpler compatible concepts that increase the teaching size of the concept we want to teach. This does not happen for conditional Kolmogorov complexity. Furthermore, we provide an algorithm that constructs optimal curricula based on interposition avoidance. This paper presents a series of theoretical results, including their proofs, and some directions for future work. New research possibilities in curriculum teaching in compositional scenarios are now wide open to exploration.
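
A notation sketch of the quantities involved may help; the symbols below (TS, L, B, K) are chosen for illustration and may differ from the paper's formal definitions.

```latex
% Notation sketch for illustration; symbols may differ from the paper's.
% L(S) = c means the learner identifies concept c from witness set S;
% L(S | B) = c is the same with background library B already acquired.
\[
  TS(c) = \min_{S \,:\, L(S) = c} \mathrm{size}(S),
  \qquad
  TS(c \mid B) = \min_{S \,:\, L(S \mid B) = c} \mathrm{size}(S).
\]
% Interposition: for some background B the conditional teaching size can grow,
\[
  TS(c \mid B) > TS(c),
\]
% whereas conditional Kolmogorov complexity never does (up to a constant):
% K(c \mid B) \le K(c) + O(1).
```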


The Animal-AI Environment: Training and Testing Animal-Like Artificial Cognition

arXiv.org Artificial Intelligence

Recent advances in artificial intelligence have been strongly driven by the use of game environments for training and evaluating agents. Games are often accessible and versatile, with well-defined state transitions and goals allowing for intensive training and experimentation. However, agents trained in a particular environment are usually tested on the same or only slightly varied distributions, and finding solutions does not necessarily imply any understanding. If we want AI systems that can model and understand their environment, we need environments that explicitly test for this. Inspired by the extensive literature on animal cognition, we present an environment that keeps all the positive elements of standard gaming environments, but is explicitly designed for the testing of animal-like artificial cognition. All source code is publicly available (see appendix).


Fairness and Missing Values

arXiv.org Artificial Intelligence

The causes underlying unfair decision making are complex, being internalised in different ways by decision makers, other actors dealing with data and models, and ultimately by the individuals affected by these decisions. One frequent manifestation of all these latent causes arises in the form of missing values: protected groups are more reluctant to give information that could be used against them, delicate information for some groups can be erased by human operators, or data acquisition may simply be less complete and systematic for minority groups. As a result, missing values and bias in data are two tightly coupled phenomena. However, most recent techniques, libraries and experimental results dealing with fairness in machine learning have simply ignored missing data. In this paper, we claim that fairness research should not miss the opportunity to deal properly with missing data. To support this claim, (1) we analyse the sources of missing data and bias, and map their common causes, (2) we find that rows containing missing values are usually fairer than the rest, and should not be treated as the uncomfortable, ugly data that techniques and libraries discard at the first opportunity, and (3) we study the trade-off between performance and fairness when the rows with missing values are used (either because the technique deals with them directly or through imputation). We end the paper with a series of recommended procedures about what to do with missing data when aiming for fair decision making.
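
A minimal sketch of the kind of comparison in point (3), on synthetic data with a simple demographic-parity gap; the data, model and metric are illustrative assumptions, not the paper's experimental setup.

```python
# Illustrative sketch only: synthetic data and a simple demographic-parity gap,
# not the paper's datasets, techniques or metrics.
import numpy as np
import pandas as pd
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n = 5000
group = rng.integers(0, 2, n)            # protected attribute (0 = majority, 1 = minority)
x1 = rng.normal(size=n) + 0.3 * group
x2 = rng.normal(size=n)
y = (x1 + x2 + rng.normal(scale=0.5, size=n) > 0).astype(int)

# Missingness is more frequent for the minority group (a non-random pattern).
mask = rng.random(n) < np.where(group == 1, 0.4, 0.05)
x2_missing = np.where(mask, np.nan, x2)

df = pd.DataFrame({"x1": x1, "x2": x2_missing, "group": group, "y": y})
train, test = train_test_split(df, test_size=0.3, random_state=0)

def dp_gap(pred, grp):
    """Demographic parity gap: |P(pred=1 | group=0) - P(pred=1 | group=1)|."""
    return abs(pred[grp == 0].mean() - pred[grp == 1].mean())

def fit_and_score(tr, te):
    clf = LogisticRegression().fit(tr[["x1", "x2"]], tr["y"])
    pred = clf.predict(te[["x1", "x2"]])
    acc = (pred == te["y"]).mean()
    return acc, dp_gap(pred, te["group"].to_numpy())

# (a) drop rows with missing values (also from the test set, for simplicity)
print("drop   :", fit_and_score(train.dropna(), test.dropna()))

# (b) impute missing values with the training mean
imp = SimpleImputer(strategy="mean").fit(train[["x1", "x2"]])
tr_imp, te_imp = train.copy(), test.copy()
tr_imp[["x1", "x2"]] = imp.transform(train[["x1", "x2"]])
te_imp[["x1", "x2"]] = imp.transform(test[["x1", "x2"]])
print("impute :", fit_and_score(tr_imp, te_imp))
```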


Analysing Results from AI Benchmarks: Key Indicators and How to Obtain Them

arXiv.org Artificial Intelligence

Item response theory (IRT) can be applied to the analysis of results from AI benchmarks. The two-parameter IRT model provides two indicators (difficulty and discrimination) on the side of the item (or AI problem) but only one indicator (ability) on the side of the respondent (or AI agent). In this paper we analyse how to make this set of indicators dual by adding a fourth indicator, generality, on the side of the respondent. Generality is meant to be dual to discrimination, and it is based on difficulty: it is defined as a new metric that evaluates whether an agent is consistently good at easy problems and bad at difficult ones. With the addition of generality, we see that this set of four key indicators can give us more insight into the results of AI benchmarks. In particular, we explore two popular benchmarks in AI, the Arcade Learning Environment (Atari 2600 games) and the General Video Game AI competition, and we provide guidelines to estimate and interpret these indicators for other AI benchmarks and competitions.
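
A minimal sketch of the two-parameter logistic (2PL) model and of one possible way to score generality from responses ordered by difficulty; the generality proxy below is an illustrative assumption, not the paper's exact estimator.

```python
# Illustrative sketch: the standard two-parameter logistic (2PL) IRT model and
# one hypothetical "generality" score; names and formulas are assumptions.
import numpy as np

def p_correct(ability: float, difficulty: float, discrimination: float) -> float:
    """2PL model: probability that a respondent answers an item correctly."""
    return 1.0 / (1.0 + np.exp(-discrimination * (ability - difficulty)))

def generality_score(responses: np.ndarray, difficulties: np.ndarray) -> float:
    """Crude generality proxy: how step-like the results are when items are sorted
    by difficulty (1 = solves everything up to some difficulty and nothing beyond;
    lower values = successes scattered across all difficulties)."""
    order = np.argsort(difficulties)
    r = responses[order]
    k = int(r.sum())                          # number of items solved
    ideal = np.zeros_like(r); ideal[:k] = 1   # best case: the k easiest items
    return 1.0 - np.abs(r - ideal).sum() / len(r)

print(p_correct(ability=1.0, difficulty=0.0, discrimination=1.5))  # ~0.82

difficulties = np.linspace(-3, 3, 20)
step_agent = (difficulties < 0).astype(float)            # solves only the easy items
scattered = np.random.default_rng(1).integers(0, 2, 20).astype(float)
print(generality_score(step_agent, difficulties))        # -> 1.0
print(generality_score(scattered, difficulties))         # lower
```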


General-purpose Declarative Inductive Programming with Domain-Specific Background Knowledge for Data Wrangling Automation

arXiv.org Artificial Intelligence

Given one or two examples, humans are good at understanding how to solve a problem independently of its domain, because they are able to detect what the problem is and to choose the appropriate background knowledge according to the context. For instance, presented with the string "8/17/2017" to be transformed to "17th of August of 2017", humans will process this in two steps: (1) they recognise that it is a date and (2) they map the date to the 17th of August of 2017. Inductive Programming (IP) aims at learning declarative (functional or logic) programs from examples. Two key advantages of IP are the use of background knowledge and the ability to synthesise programs from a few input/output examples (as humans do). In this paper we propose to use IP as a means for automating repetitive data manipulation tasks that frequently appear during the process of data wrangling. We show that, with the use of general-purpose declarative (programming) languages jointly with generic IP systems and the definition of domain-specific knowledge, many specific data wrangling problems from different application domains can be automatically solved from very few examples. We also propose an integrated benchmark for data wrangling, which we share publicly with the community.
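
A minimal sketch of the two-step idea on the abstract's own example (domain detection followed by a domain-specific transformation); this is plain Python for illustration, not the declarative IP system described in the paper.

```python
# Illustrative sketch of the two-step idea only (domain detection, then a
# domain-specific transformation); not the paper's inductive programming system.
import re
from datetime import datetime

def detect_domain(value: str) -> str:
    """Step 1: recognise what kind of value we are looking at."""
    if re.fullmatch(r"\d{1,2}/\d{1,2}/\d{4}", value):
        return "date"
    return "unknown"

def ordinal(day: int) -> str:
    suffix = {1: "st", 2: "nd", 3: "rd"}.get(
        day % 10 if day % 100 not in (11, 12, 13) else 0, "th")
    return f"{day}{suffix}"

def transform_date(value: str) -> str:
    """Step 2: map the recognised date to a verbose format using background knowledge."""
    dt = datetime.strptime(value, "%m/%d/%Y")   # assumes US-style month/day/year
    return f"{ordinal(dt.day)} of {dt.strftime('%B')} of {dt.year}"

value = "8/17/2017"
if detect_domain(value) == "date":
    print(transform_date(value))                # -> 17th of August of 2017
```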


A multidisciplinary task-based perspective for evaluating the impact of AI autonomy and generality on the future of work

arXiv.org Artificial Intelligence

This paper presents a multidisciplinary task-based approach for assessing the impact of artificial intelligence on the future of work. We provide definitions of a task from two main perspectives: socio-economic and computational. We propose to explore ways in which these perspectives can be integrated or mapped onto each other, and linked with the skills or capabilities required to perform the tasks, for both humans and AI systems. Finally, we argue that in order to understand the dynamics of tasks, we have to explore the relevance of the autonomy and generality of AI systems for the automation or alteration of the workplace.


Assessing the impact of machine intelligence on human behaviour: an interdisciplinary endeavour

arXiv.org Artificial Intelligence

This document contains the outcome of the first Human behaviour and machine intelligence (HUMAINT) workshop, which took place on 5-6 March 2018 in Barcelona, Spain. The workshop was organized in the context of a new research programme at the Centre for Advanced Studies, Joint Research Centre of the European Commission, which focuses on studying the potential impact of artificial intelligence on human behaviour. The workshop gathered an interdisciplinary group of experts to establish the state of the art of research in the field and a list of future research challenges to be addressed on the topics of human and machine intelligence, the potential impact of algorithms on human cognitive capabilities and decision making, and evaluation and regulation needs. The document consists of short position statements and challenges identified by each expert, and incorporates the results of the discussions carried out during the workshop. In the conclusion section, we provide a list of emerging research topics and strategies to be addressed in the near future.


Accounting for the Neglected Dimensions of AI Progress

arXiv.org Artificial Intelligence

We analyze and reframe AI progress. In addition to the prevailing metrics of performance, we highlight the usually neglected costs paid in the development and deployment of a system, including: data, expert knowledge, human oversight, software resources, computing cycles, hardware and network facilities, development time, etc. These costs are paid throughout the life cycle of an AI system, fall differentially on different individuals, and vary in magnitude depending on the replicability and generality of the AI solution. The multidimensional performance and cost space can be collapsed to a single utility metric for a user with transitive and complete preferences. Even absent a single utility function, AI advances can be generically assessed by whether they expand the Pareto (optimal) surface. We explore a subset of these neglected dimensions using the two case studies of Alpha* and ALE. This broadened conception of progress in AI should lead to novel ways of measuring success in AI, and can help set milestones for future progress.
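
A minimal sketch of what "expanding the Pareto surface" means in a two-dimensional (performance, cost) space; the systems and figures are hypothetical.

```python
# Illustrative sketch only: a minimal check of whether a new system expands the
# Pareto frontier in a (higher-is-better performance, lower-is-better cost) space.
from typing import List, Tuple

System = Tuple[float, float]   # (performance, total cost)

def dominates(a: System, b: System) -> bool:
    """a dominates b if it is at least as good on both dimensions and strictly better on one."""
    better_perf, lower_cost = a[0] >= b[0], a[1] <= b[1]
    strictly = a[0] > b[0] or a[1] < b[1]
    return better_perf and lower_cost and strictly

def expands_pareto(new: System, existing: List[System]) -> bool:
    """A new system counts as progress if no existing system dominates it."""
    return not any(dominates(old, new) for old in existing)

existing = [(0.70, 10.0), (0.80, 50.0), (0.90, 500.0)]   # hypothetical systems
print(expands_pareto((0.85, 40.0), existing))   # True: better trade-off than (0.80, 50.0)
print(expands_pareto((0.75, 200.0), existing))  # False: dominated by (0.80, 50.0)
```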