Case-Based Reasoning
Exploiting Markov Random Fields to Enhance Retrieval in Case-Based Reasoning
Portinale, Luigi (Universitá del Piemonte Orientale)
The similarity assumption in Case-Based Reasoning (similar problems have similar solutions) has been questioned by several researchers. If knowledge about the adaptability of solutions is available, it can be exploited in order to guide retrieval. Several approaches have been proposed in this context, often assuming a similarity or cost measure defined over the solution space. In this paper, we propose a novel approach where the adaptability of the solutions is captured inside a metric Markov Random Field (MRF). Each case is represented as a node in the MRF, and edges connect cases whose solutions are close in the solution space. States of the nodes represent the adaptability effort with respect to the query. Potentals are defined to enforce connected nodes to share the same state; this models the fact that cases having similar solutions should have the same adaptability effort with respect to the query. The main goal is to enlarge the set of potentially adaptable cases that are retrieved (the recall) without significantly sacrificing the precision of retrieval. We will report on some experiments concerning a retrieval architecture where a simple kNN retrieval is followed by a further retrieval step based on MRF inference.
What Is the Next Step? Supporting Architectural Room Configuration Process with Case-Based Reasoning and Recurrent Neural Networks
Eisenstadt, Viktor (University of Hildesheim) | Althoff, Klaus-Dieter (University of Hildesheim)
This paper presents the first results of the research into AI-based support of the room configuration process during the early design phases in architecture. Room configuration (also: room layout or space layout) is an essential stage of the initial design phase: its results are crucial for user-friendliness and success of the planned utilization of the architectural object. Our approach takes into account different possible actions of the configuration process, such as adding, removing, or (re)assigning of the room type. Its mode of operation is based on specific process chain clusters, where each cluster represents a contextual subset of previous configuration steps and provides a recurrent neural network trained on this cluster data only to suggest the next step, and a case base that is used to determine if the current process chain belongs to this cluster. The most similar cluster then tries to suggest the next step of the process. The approach is implemented in a distributed CBR framework for support of early conceptual design in architecture and was evaluated with a high number of process chain queries to prove its general suitability.
C2C Trace Retrieval: Fast Classification Using Class-to-Class Weighting
Ye, Xiaomeng (Indiana University Bloomington)
Traditional case-based classification methods are based on feature similarity. In contrast, class-to-class (C2C) weighting also considers whether the difference between two cases has been seen before. Combined with instance-specific weighting, C2C weighting learns the local patterns of both similarities and differences (shortened as patterns). Once C2C weightings has learned the pattern between case A of class C_1 and some set of cases R of class C_2, given a query Q whose difference from A matches the pattern between A and R, then we can skip cases around A and continue the search for near neighbors around R. Based on this, we developed an algorithm, C2C trace retrieval, which quickly traverses promising cases, retrieves relevant cases from different classes, and provides an informed hypothesis of the query's class. C2C trace retrieval achieves great efficiency at a reasonable cost of accuracy. Therefore, C2C trace retrieval can be used as a fast classification method or as the first pass for a more sophisticated method.
A Reachability-Based Complexity Measure for Case-Based Reasoners
Ganesan, Devi (Indian Institute of Technology Madras) | Chakraborti, Sutanu (Indian Institute of Technology Madras)
Case-Based Reasoning relies on the underlying hypothesis that similar problems have similar solutions. The extent to which this hypothesis holds good in the case base has been used by CBR designers as a measure of case base complexity, which in turn gives insights on the generalization ability of the reasoner. Several local and global complexity measures have been proposed in the literature. However, the existing measures rely only on the similarity knowledge to compute complexity. We propose a new complexity measure called Reachability-Based Complexity Measure (RBCM) that goes beyond the similarity knowledge to include the effects of all knowledge containers in the reasoner. The proposed measure is evaluated on several real-world datasets and results suggest that RBCM corroborates well with the generalization accuracy of the reasoner.
Similarity Measures for Case-Based Retrieval of Natural Language Argument Graphs in Argumentation Machines
Bergmann, Ralph (University of Trier) | Lenz, Mirko (University of Trier) | Ollinger, Stefan (University of Trier) | Pfister, Maximilian (University of Trier)
In the field of argumentation, the vision of robust argumentation machines is investigated. They explore natural language arguments from available information sources on the web and reason with them on the knowledge level to actively support the deliberation and synthesis of arguments for a particular query of a user. We aim at combining methods from case-based reasoning (CBR), information retrieval, and computational argumentation to contribute to the foundations of such argumentation machines. In this paper, we focus on the retrieval phase of a CBR approach for an argumentation machine and propose similarity measures for arguments represented as argument graphs. We evaluate the similarity measures on a corpus of annotated micro texts containing different topics and demonstrate the benefit of semantic similarity measures as well as the relevance of structural aspects.
Probabilistic Energy Forecasting using Quantile Regressions based on a new Nearest Neighbors Quantile Filter
Ordiano, Jorge Ángel González, Gröll, Lutz, Mikut, Ralf, Hagenmeyer, Veit
Parametric quantile regressions are a useful tool for creating probabilistic energy forecasts. Nonetheless, since classical quantile regressions are trained using a non-differentiable cost function, their creation using complex data mining techniques (e.g., artificial neural networks) may be complicated. This article presents a method that uses a new nearest neighbors quantile filter to obtain quantile regressions independently of the utilized data mining technique and without the non-differentiable cost function. Thereafter, a validation of the presented method using the dataset of the Global Energy Forecasting Competition of 2014 is undertaken. The results show that the presented method is able to solve the competition's task with a similar accuracy and in a similar time as the competition's winner, but requiring a much less powerful computer. This property may be relevant in an online forecasting service for which the fast computation of probabilistic forecasts using not so powerful machines is required.
Readings in Medical Artificial Intelligence: The First Decade
A survey of early work exploring how AI can be used in medicine, with somewhat more technical expositions than in the complementary volume Artificial Intelligence in Medicine."Each chapter is preceded by a brief introduction that outlines our view of its contribution to the field, the reason it was selected for inclusion in this volume, an overview of its content, and a discussion of how the work evolved after the article appeared and how it relates to other chapters in the book.
Weak consistency of the 1-nearest neighbor measure with applications to missing data and covariate shift
When data is partially missing at random, imputation and importance weighting are often used to estimate moments of the unobserved population. In this paper, we study 1-nearest neighbor (1NN) imputation, which replaces missing data with the complete data that is the nearest neighbor in the non-missing covariate space. We define an empirical measure, the 1NN measure, and show that it is weakly consistent for the measure of the missing data. The main idea behind this result is that 1NN imputation is performing inverse probability weighting in the limit. We study applications to missing data and assessing the impact of covariate shift in prediction tasks. We conclude with a discussion of using 1NN imputation for domain adaptation in order to alleviate the impact of covariate shift.
A Theory-Based Evaluation of Nearest Neighbor Models Put Into Practice
Fichtenberger, Hendrik, Rohde, Dennis
In the $k$-nearest neighborhood model ($k$-NN), we are given a set of points $P$, and we shall answer queries $q$ by returning the $k$ nearest neighbors of $q$ in $P$ according to some metric. This concept is crucial in many areas of data analysis and data processing, e.g., computer vision, document retrieval and machine learning. Many $k$-NN algorithms have been published and implemented, but often the relation between parameters and accuracy of the computed $k$-NN is not explicit. We study property testing of $k$-NN graphs in theory and evaluate it empirically: given a point set $P \subset \mathbb{R}^\delta$ and a directed graph $G=(P,E)$, is $G$ a $k$-NN graph, i.e., every point $p \in P$ has outgoing edges to its $k$ nearest neighbors, or is it $\epsilon$-far from being a $k$-NN graph? Here, $\epsilon$-far means that one has to change more than an $\epsilon$-fraction of the edges in order to make $G$ a $k$-NN graph. We develop a randomized algorithm with one-sided error that decides this question, i.e., a property tester for the $k$-NN property, with complexity $O(\sqrt{n} k^2 / \epsilon^2)$ measured in terms of the number of vertices and edges it inspects, and we prove a lower bound of $\Omega(\sqrt{n / \epsilon k})$. We evaluate our tester empirically on the $k$-NN models computed by various algorithms and show that it can be used to detect $k$-NN models with bad accuracy in significantly less time than the building time of the $k$-NN model.
The Nearest Neighbor Information Estimator is Adaptively Near Minimax Rate-Optimal
Jiao, Jiantao, Gao, Weihao, Han, Yanjun
We analyze the Kozachenko–Leonenko (KL) fixed k-nearest neighbor estimator for the differential entropy. We obtain the first uniform upper bound on its performance for any fixed k over H\"{o}lder balls on a torus without assuming any conditions on how close the density could be from zero. Accompanying a recent minimax lower bound over the H\"{o}lder ball, we show that the KL estimator for any fixed k is achieving the minimax rates up to logarithmic factors without cognizance of the smoothness parameter s of the H\"{o}lder ball for $s \in (0,2]$ and arbitrary dimension d, rendering it the first estimator that provably satisfies this property.