Goto

Collaborating Authors

 South America


Learning Behaviors in Agents Systems with Interactive Dynamic Influence Diagrams

AAAI Conferences

Interactive dynamic influence diagrams(I-DIDs) are a well recognized decision model that explicitly considers how multiagent interaction affects individual decision making. To predict behavior of other agents, I-DIDs require models of the other agents to be known ahead of time and manually encoded. This becomes a barrier to I-DID applications in a human-agent interaction setting, such as development of intelligent non-player characters(NPCs) in real-time strategy(RTS) games, where models of other agents or human players are often inaccessible to domain experts. In this paper, we use automatic techniques for learning behavior of other agents from replay data in RTS games. We propose a learning algorithm with improvement over existing work by building a full profile of agent behavior. This is the first time that data-driven learning techniques are embedded into the I-DID decision making framework. We evaluate the performance of our approach on two test cases.



Stochastic Density Ratio Estimation and Its Application to Feature Selection

AAAI Conferences

In this work, we deal with a relatively new statistical tool in machine learning: the estimation of the ratio of two probability densities, or density ratio estimation for short. As a side piece of research that gained its own traction, we also tackle the task of parameter selection in learning algorithms based on kernel methods.


Combining Existential Rules with the Power of CP-Theories

AAAI Conferences

The tastes of a user can be represented in a natural way by using qualitative preferences. In this paper, we explore how ontological knowledge expressed via existential rules can be combined with CP-theories to (i) represent qualitative preferences along with domain knowledge, and (ii) perform preference-based answering of conjunctive queries (CQs). We call these combinations ontological CP-theories (OCP-theories). We define skyline and k-rank answers to CQs based on the user’s preferences encoded in an OCP-theory, and provide an algorithm for computing them. We also provide precise complexity (including data tractability) results for deciding consistency, dominance, and CQ skyline membership for OCP-theories.


Using A* for Inference in Probabilistic Classifier Chains

AAAI Conferences

Probabilistic Classifiers Chains (PCC) offers interesting properties to solve multi-label classification tasks due to its ability to estimate the joint probability of the labels. However, PCC presents the major drawback of having a high computational cost in the inference process required to predict new samples. Lately, several approaches have been proposed to overcome this issue, including beam search and an epsilon-Approximate algorithm based on uniform-cost search. Surprisingly, the obvious possibility of using heuristic search has not been considered yet. This paper studies this alternative and proposes an admisible heuristic that, applied in combination with A* algorithm, guarantees, not only optimal predictions in terms of subset 0/1 loss, but also that it always explores less nodes than epsilon-Approximate algorithm. In the experiments reported, the number of nodes explored by our method is less than two times the number of labels for all datasets analyzed. But, the difference in explored nodes must be large enough to compensate the overhead of the heuristic in order to improve prediction time. Thus, our proposal may be a good choice for complex multi-label problems.


Feature Selection for Multi-Label Learning

AAAI Conferences

Feature Selection plays an important role in machine learning and data mining, and it is often applied as a data pre-processing step. This task can speed up learning algorithms and sometimes improve their performance. In multi-label learning, label dependence is considered another aspect that can contribute to improve learning performance. A replicable and wide systematic review performed by us corroborates this idea. Based on this information, it is believed that considering label dependence during feature selection can lead to better learning performance. The hypothesis of this work is that multi-label feature selection algorithms that consider label dependence will perform better than the ones that disregard it. To this end, we propose multi-label feature selection algorithms that take into account label relations. These algorithms were experimentally compared to the standard approach for feature selection, showing good performance in terms of feature reduction and predictability of the classifiers built using the selected features.


RoTuEl: A Semi-Automated Method for Labeling Political Tweets

AAAI Conferences

The latest research on prediction of the outcome of elections using Twitter data, the election tweets labeling area has hardly been explored. Therefore, the authors of this paper propose to develop a semi-automated model for labeling political tweets. The expected result of this study is to contribute to enhance the quality of the choice of messages used in the labeling process by reducing the time selection of messages and the efficiency of classifying the messages and, thus, to increase the accuracy of the models using this approach. The proposed method could label 2200 messages from the analysis of only 60 messages by 20 users. The first results obtained by the method were higher than the process carried out manually by humans.


Bipartite Graph for Topic Extraction

AAAI Conferences

This article presents a bipartite graph propagation method to be applied to different tasks in the machine learning unsupervised domain, such as topic extraction and clustering. We introduce the objectives and hypothesis that motivate the use of graph based method, and we give the intuition of the proposed Bipartite Graph Propagation Algorithm. The contribution of this study is the development of new method that allows the use of heuristic knowledge to discover topics in textual data easier than it is possible in the traditional mathematical formalism based on Latent Dirichlet Allocation (LDA). Initial experiments demonstrate that our Bipartite Graph Propagation algorithm return good results in a static context (offline algorithm). Now, our research is focusing on big amount of data and dynamic context (online algorithm).


Information Extraction of Texts in the Biomedical Domain

AAAI Conferences

Automatic detection of relevant terms in medical reports is useful for educational purposes and for clinical research. Natural language processing techniques can be applied in order to identify them. The main goal of this research is to develop a method to identify whether medical reports of imaging studies (usually called radiology reports) written in Spanish are important (in the sense that they have non-negated pathological findings) or not. We also try to identify which finding is present and if possible its relationship with anatomical entities.


A Distributed Platform to Ease the Development of Recommendation Algorithms on Large-Scale Graphs

AAAI Conferences

The creation of novel recommendation algorithms for social networks is currently struggling with the volume of available data originating in such environments. Given that social networks can be modeled as graphs, a distributed graph-oriented support to exploit the computing capabilities of clusters arises as a necessity. In this thesis, a platform for graph storage and processing named Graphly is proposed along with GraphRec, an API for easy specification of recommendation algorithms. Graphly and GraphRec hide distributed programming concerns from the user while still allowing fine-tuning of the remote execution. For example, users may customize an algorithm execution using job distribution strategies, without modifying the original code. GraphRec also simplifies the design of graph-based recommender systems by implementing well-known algorithms as “primitives” that can be reused.