AITopics | Serrà, Joan

Collaborating Authors

Serrà, Joan

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks

Pascual, Santiago, Ravanelli, Mirco, Serrà, Joan, Bonafonte, Antonio, Bengio, Yoshua

arXiv.org Machine LearningApr-6-2019

Learning good representations without supervision is still an open issue in machine learning, and is particularly challenging for speech signals, which are often characterized by long sequences with a complex hierarchical structure. Some recent works, however, have shown that it is possible to derive useful speech representations by employing a self-supervised encoder-discriminator approach. This paper proposes an improved self-supervised method, where a single neural encoder is followed by multiple workers that jointly solve different self-supervised tasks. The needed consensus across different tasks naturally imposes meaningful constraints to the encoder, contributing to discover general representations and to minimize the risk of learning superficial ones. Experiments show that the proposed approach can learn transferable, robust, and problem-agnostic features that carry on relevant information from the speech signal, such as speaker identity, phonemes, and even higher-level features such as emotional cues. In addition, a number of design choices make the encoder easily exportable, facilitating its direct usage or adaptation to different problems.

deep learning, neural network, representation, (21 more...)

arXiv.org Machine Learning

1904.03416

Country: North America > Canada > Quebec (0.14)

Genre: Research Report (0.82)

Industry: Education > Focused Education > Special Education (0.40)

Technology:

Information Technology > Artificial Intelligence > Speech (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Training neural audio classifiers with few data

Pons, Jordi, Serrà, Joan, Serra, Xavier

arXiv.org Artificial IntelligenceNov-3-2018

These studies are mostly based on publiclyavailable datasets, where each class typically contains more than 100 audio examples [5, 6, 7, 8, 9]. Contrastingly, only few works study the problem of training neural audio classifiers with few audio examples (for instance, less than 10 per class) [10, 11, 12, 13]. In this work, we study how a number of neural network architectures perform in such situation. Two primary reasons motivate our work: (i) given that humans are able to learn novel concepts from few examples, we aim to quantify up to what extent such behavior is possible in current neural machine listening systems; and (ii) provided that data curation processes are tedious and expensive, it is unreasonable to assume that sizable amounts of annotated audio are always available for training neural network classifiers. The challenge of training neural networks with few audio data has been previously addressed.

deep learning, neural network, prototypical network, (18 more...)

arXiv.org Artificial Intelligence

1810.10274

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.34)

Add feedback

Assessing the impact of machine intelligence on human behaviour: an interdisciplinary endeavour

Gómez, Emilia, Castillo, Carlos, Charisi, Vicky, Dahl, Verónica, Deco, Gustavo, Delipetrev, Blagoj, Dewandre, Nicole, González-Ballester, Miguel Ángel, Gouyon, Fabien, Hernández-Orallo, José, Herrera, Perfecto, Jonsson, Anders, Koene, Ansgar, Larson, Martha, de Mántaras, Ramón López, Martens, Bertin, Miron, Marius, Moreno-Bote, Rubén, Oliver, Nuria, Gallardo, Antonio Puertas, Schweitzer, Heike, Sebastian, Nuria, Serra, Xavier, Serrà, Joan, Tolan, Songül, Vold, Karina

arXiv.org Artificial IntelligenceJun-7-2018

This document contains the outcome of the first Human behaviour and machine intelligence (HUMAINT) workshop that took place 5-6 March 2018 in Barcelona, Spain. The workshop was organized in the context of a new research programme at the Centre for Advanced Studies, Joint Research Centre of the European Commission, which focuses on studying the potential impact of artificial intelligence on human behaviour. The workshop gathered an interdisciplinary group of experts to establish the state of the art research in the field and a list of future research challenges to be addressed on the topic of human and machine intelligence, algorithm's potential impact on human cognitive capabilities and decision making, and evaluation and regulation needs. The document is made of short position statements and identification of challenges provided by each expert, and incorporates the result of the discussions carried out during the workshop. In the conclusion section, we provide a list of emerging research topics and strategies to be addressed in the near future.

algorithm, deep learning, neural network, (22 more...)

arXiv.org Artificial Intelligence

1806.03192

Country:

North America > United States (1.00)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.24)
Europe > United Kingdom > England > Cambridgeshire (0.14)

Genre:

Instructional Material > Course Syllabus & Notes (0.67)
Research Report > Experimental Study (0.46)
Research Report > New Finding (0.46)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)
Information Technology > Security & Privacy (1.00)
(8 more...)

Technology:

Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.45)

Add feedback

Towards a universal neural network encoder for time series

Serrà, Joan, Pascual, Santiago, Karatzoglou, Alexandros

arXiv.org Machine LearningMay-10-2018

We study the use of a time series encoder to learn representations that are useful on data set types with which it has not been trained on. The encoder is formed of a convolutional neural network whose temporal output is summarized by a convolutional attention mechanism. This way, we obtain a compact, fixed-length representation from longer, variable-length time series. We evaluate the performance of the proposed approach on a well-known time series classification benchmark, considering full adaptation, partial adaptation, and no adaptation of the encoder to the new data type. Results show that such strategies are competitive with the state-of-the-art, often outperforming conceptually-matching approaches. Besides accuracy scores, the facility of adaptation and the efficiency of pre-trained encoders make them an appealing option for the processing of scarcely- or non-labeled time series.

deep learning, neural network, representation, (19 more...)

arXiv.org Machine Learning

1805.03908

Country: Oceania > Australia (0.14)

Genre: Research Report > New Finding (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

Overcoming catastrophic forgetting with hard attention to the task

Serrà, Joan, Surís, Dídac, Miron, Marius, Karatzoglou, Alexandros

arXiv.org Artificial IntelligenceFeb-14-2018

Catastrophic forgetting occurs when a neural network loses the information learned in a previous task after training on subsequent tasks. This problem remains a hurdle for artificial intelligence systems with sequential learning capabilities. In this paper, we propose a task-based hard attention mechanism that preserves previous tasks' information without affecting the current task's learning. A hard attention mask is learned concurrently to every task, through stochastic gradient descent, and previous masks are exploited to condition such learning. We show that the proposed mechanism is effective for reducing catastrophic forgetting, cutting current rates by 45 to 80%. We also show that it is robust to different hyperparameter choices, and that it offers a number of monitoring capabilities. The approach features the possibility to control both the stability and compactness of the learned knowledge, which we believe makes it also attractive for online learning or network compression applications.

computer based training, deep learning, overcoming catastrophic forgetting, (20 more...)

arXiv.org Artificial Intelligence

1801.01423

Country: North America > Canada > Ontario > Toronto (0.14)

Industry: Education > Educational Setting > Online (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

An Empirical Evaluation of Similarity Measures for Time Series Classification

Serrà, Joan, Arcos, Josep Lluis

arXiv.org Machine LearningJan-16-2014

Time series are ubiquitous, and a measure to assess their similarity is a core part of many computational systems. In particular, the similarity measure is the most essential ingredient of time series clustering and classification systems. Because of this importance, countless approaches to estimate time series similarity have been proposed. However, there is a lack of comparative studies using empirical, rigorous, quantitative, and large-scale assessment strategies. In this article, we provide an extensive evaluation of similarity measures for time series classification following the aforementioned principles. We consider 7 different measures coming from alternative measure `families', and 45 publicly-available time series data sets coming from a wide variety of scientific domains. We focus on out-of-sample classification accuracy, but in-sample accuracies and parameter choices are also discussed. Our work is based on rigorous evaluation methodologies and includes the use of powerful statistical significance tests to derive meaningful conclusions. The obtained results show the equivalence, in terms of accuracy, of a number of measures, but with one single candidate outperforming the rest. Such findings, together with the followed methodology, invite researchers on the field to adopt a more consistent evaluation criteria and a more informed decision regarding the baseline measures to which new developments should be compared.

artificial intelligence, health & medicine, time series, (17 more...)

arXiv.org Machine Learning

doi: 10.1016/j.knosys.2014.04.035

1401.3973

Country:

North America > United States (0.28)
Europe > Spain (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.48)

Add feedback

Unsupervised Detection of Music Boundaries by Time Series Structure Features

Serrà, Joan (Artificial Intelligence Research Institute, Spanish National Research Council (IIIA-CSIC)) | Müller, Meinard (Max Planck Institute for Computer Science and Saarland University) | Grosche, Peter (Max Planck Institute for Computer Science and Saarland University) | Arcos, Josep Lluis (Artificial Intelligence Research Institute, Spanish National Research Council (IIIA-CSIC))

AAAI ConferencesJul-21-2012

Locating boundaries between coherent and/or repetitive segments of a time series is a challenging problem pervading many scientific domains. In this paper we propose an unsupervised method for boundary detection, combining three basic principles: novelty, homogeneity, and repetition. In particular, the method uses what we call structure features, a representation encapsulating both local and global properties of a time series. We demonstrate the usefulness of our approach in detecting music structure boundaries, a task that has received much attention in recent years and for which exist several benchmark datasets and publicly available annotations. We find our method to significantly outperform the best accuracies published so far. Importantly, our boundary approach is generic, thus being applicable to a wide range of time series beyond the music and audio domains.

artificial intelligence, boundary, machine learning, (16 more...)

AAAI Conferences

Twenty-Sixth AAAI Conference on Artificial Intelligence

Country:

Europe > Austria > Vienna (0.14)
North America > Canada > Quebec > Montreal (0.14)
Europe > Germany > Saarland (0.14)

Genre: Research Report (0.46)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science (0.68)

Add feedback

Characterization and exploitation of community structure in cover song networks

Serrà, Joan, Zanin, Massimiliano, Herrera, Perfecto, Serra, Xavier

arXiv.org Machine LearningSep-12-2011

The use of community detection algorithms is explored within the framework of cover song identification, i.e. the automatic detection of different audio renditions of the same underlying musical piece. Until now, this task has been posed as a typical query-by-example task, where one submits a query song and the system retrieves a list of possible matches ranked by their similarity to the query. In this work, we propose a new approach which uses song communities to provide more relevant answers to a given query. Starting from the output of a state-of-the-art system, songs are embedded in a complex weighted network whose links represent similarity (related musical content). Communities inside the network are then recognized as groups of covers and this information is used to enhance the results of the system. In particular, we show that this approach increases both the coherence and the accuracy of the system. Furthermore, we provide insight into the internal organization of individual cover song communities, showing that there is a tendency for the original song to be central within the community. We postulate that the methods and results presented here could be relevant to other query-by-example tasks.

algorithm, artificial intelligence, survey article, (19 more...)

arXiv.org Machine Learning

doi: 10.1016/j.patrec.2012.02.013

1108.6003

Country:

North America > United States (0.46)
Europe > Spain (0.28)
Europe > United Kingdom > England (0.14)

Genre: Research Report > New Finding (0.68)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback