Search
Colonial Beach Teen Tops in State With Rubik's Cube
The son of Paul Christie and Sonya Stagnoli, Ben and his sister Bella are home-schooled students who also take college courses. He'll graduate with an associate's degree from Germanna Community College next spring, at about the same time that he receives his high school diploma. She takes classes at Rappahannock Community College.
Distributional Equivalence and Structure Learning for Bow-free Acyclic Path Diagrams
Nowzohour, Christopher, Maathuis, Marloes H., Evans, Robin J., Bühlmann, Peter
We consider the problem of structure learning for bow-free acyclic path diagrams (BAPs). BAPs can be viewed as a generalization of linear Gaussian DAG models that allow for certain hidden variables. We present a first method for this problem using a greedy score-based search algorithm. We also prove some necessary and some sufficient conditions for distributional equivalence of BAPs which are used in an algorithmic ap- proach to compute (nearly) equivalent model structures. This allows us to infer lower bounds of causal effects. We also present applications to real and simulated datasets using our publicly available R-package.
Chamberlin--Courant Rule with Approval Ballots: Approximating the MaxCover Problem with Bounded Frequencies in FPT Time
Skowron, Piotr, Faliszewski, Piotr
We consider the problem of winner determination under Chamberlin--Courant's multiwinner voting rule with approval utilities. This problem is equivalent to the well-known NP-complete MaxCover problem and, so, the best polynomial-time approximation algorithm for it has approximation ratio 1 - 1/e. We show exponential-time/FPT approximation algorithms that, on one hand, achieve arbitrarily good approximation ratios and, on the other hand, have running times much better than known exact algorithms. We focus on the cases where the voters have to approve of at most/at least a given number of candidates.
Hierarchical Policy Search via Return-Weighted Density Estimation
Osa, Takayuki, Sugiyama, Masashi
Learning an optimal policy from a multi-modal reward function is a challenging problem in reinforcement learning (RL). Hierarchical RL (HRL) tackles this problem by learning a hierarchical policy, where multiple option policies are in charge of different strategies corresponding to modes of a reward function and a gating policy selects the best option for a given context. Although HRL has been demonstrated to be promising, current state-of-the-art methods cannot still perform well in complex real-world problems due to the difficulty of identifying modes of the reward function. In this paper, we propose a novel method called hierarchical policy search via return-weighted density estimation (HPSDE), which can efficiently identify the modes through density estimation with return-weighted importance sampling. Our proposed method finds option policies corresponding to the modes of the return function and automatically determines the number and the location of option policies, which significantly reduces the burden of hyper-parameters tuning. Through experiments, we demonstrate that the proposed HPSDE successfully learns option policies corresponding to modes of the return function and that it can be successfully applied to a challenging motion planning problem of a redundant robotic manipulator.
Learning Certifiably Optimal Rule Lists for Categorical Data
Angelino, Elaine, Larus-Stone, Nicholas, Alabi, Daniel, Seltzer, Margo, Rudin, Cynthia
We present the design and implementation of a custom discrete optimization technique for building rule lists over a categorical feature space. Our algorithm produces rule lists with optimal training performance, according to the regularized empirical risk, with a certificate of optimality. By leveraging algorithmic bounds, efficient data structures, and computational reuse, we achieve several orders of magnitude speedup in time and a massive reduction of memory consumption. We demonstrate that our approach produces optimal rule lists on practical problems in seconds. Our results indicate that it is possible to construct optimal sparse rule lists that are approximately as accurate as the COMPAS proprietary risk prediction tool on data from Broward County, Florida, but that are completely interpretable. This framework is a novel alternative to CART and other decision tree methods for interpretable modeling.
Google says it is struggling to 'understand truth' because people are confusing its search algorithm
Google has admitted it is having trouble working out what's true and what's false. People are managing to confuse the company's search algorithm, says Eric Schmidt, the executive chairman of Alphabet. As a result, it's struggling to rank search results correctly, in order of accuracy. "Let's say that this group believes Fact A and this group believes Fact B, and you passionately disagree with each other and you are all publishing and writing about it and so forth and so on," Mr Schmidt said at the Halifax International Security Forum last weekend, reports CNBC. "It is very difficult for us to understand truth.
Google's search algorithm struggles to rank information
With millions of views published online every day, it can be difficult for Google to rank information correctly within its search engine. Speaking this week, Eric Schmidt, Chairman of Google's parent firm, Alphabet, explained that it is'very difficult' for the search algorithm to weed out the truth in a sea of opposing articles. Thankfully, Schmidt believes the problem should be easy to address by tweaking the algorithm. Speaking this week, Eric Schmidt, Chairman of Google's parent firm, Alphabet, explained that it is'very difficult' for the search algorithm to weed out the truth in a sea of opposing articles Inaccurate results are often down to "Google bombing" used by groups to be ranked highly. These include linking to a fake news site from several other sources and hiding text on a page that is invisible to humans but visible to the search engine's algorithms.
Relief-Based Feature Selection: Introduction and Review
Urbanowicz, Ryan J., Meeker, Melissa, LaCava, William, Olson, Randal S., Moore, Jason H.
Feature selection plays a critical role in data mining, driven by increasing feature dimensionality in target problems and growing interest in advanced but computationally expensive methodologies able to model complex associations. Specifically, there is a need for feature selection methods that are computationally efficient, yet sensitive to complex patterns of association, e.g. interactions, so that informative features are not mistakenly eliminated prior to downstream modeling. This paper focuses on Relief-based algorithms (RBAs), a unique family of filter-style feature selection algorithms that strike an effective balance between these objectives while flexibly adapting to various data characteristics, e.g. classification vs. regression. First, this work broadly examines types of feature selection and defines RBAs within that context. Next, we introduce the original Relief algorithm and associated concepts, emphasizing the intuition behind how it works, how feature weights generated by the algorithm can be interpreted, and why it is sensitive to feature interactions without evaluating combinations of features. Lastly, we include an expansive review of RBA methodological research beyond Relief and its popular descendant, ReliefF. In particular, we characterize branches of RBA research, and provide comparative summaries of RBA algorithms including contributions, strategies, functionality, time complexity, adaptation to key data characteristics, and software availability.
Time and Space Bounds for Planning
Bäckström, Christer, Jonsson, Peter
There is an extensive literature on the complexity of planning, but explicit bounds on time and space complexity are very rare. On the other hand, problems like the constraint satisfaction problem (CSP) have been thoroughly analysed in this respect. We provide a number of upper- and lower-bound results (the latter based on various complexity-theoretic assumptions such as the Exponential Time Hypothesis) for both satisficing and optimal planning. We show that many classes of planning instances exhibit a dichotomy: either they can be solved in polynomial time or they cannot be solved in subexponential time. In many cases, we can even prove closely matching upper and lower bounds. Our results also indicate, analogously to CSPs, the existence of sharp phase transitions. We finally study and discuss the trade-off between time and space. In particular, we show that depth-first search may sometimes be a viable option for planning under severe space constraints.
Sparse High-Dimensional Linear Regression. Algorithmic Barriers and a Local Search Algorithm
We consider a sparse high dimensional regression model where the goal is to recover a k-sparse unknown vector \beta^* from n noisy linear observations of the form Y=X\beta^*+W \in R^n where X \in R^{n \times p} has iid N(0,1) entries and W \in R^n has iid N(0,\sigma^2) entries. Under certain assumptions on the parameters, an intriguing assymptotic gap appears between the minimum value of n, call it n^*, for which the recovery is information theoretically possible, and the minimum value of n, call it n_{alg}, for which an efficient algorithm is known to provably recover \beta^*. In a recent paper it was conjectured that the gap is not artificial, in the sense that for sample sizes n \in [n^*,n_{alg}] the problem is algorithmically hard. We support this conjecture in two ways. Firstly, we show that a well known recovery mechanism called Basis Pursuit Denoising Scheme provably fails to \ell_2-stably recover the vector when n \in [n^*,c n_{alg}], for some sufficiently small constant c>0. Secondly, we establish that n_{alg}, up to a multiplicative constant factor, is a phase transition point for the appearance of a certain Overlap Gap Property (OGP) over the space of k-sparse vectors. The presence of such an Overlap Gap Property phase transition, which originates in statistical physics, is known to provide evidence of an algorithmic hardness. Finally we show that if n>C n_{alg} for some large enough constant C>0, a very simple algorithm based on a local search improvement is able to infer correctly the support of the unknown vector \beta^*, adding it to the list of provably successful algorithms for the high dimensional linear regression problem.