Goto

Collaborating Authors

 Search


r/artificial - Is it 100% naive to enter AI without doing any formal training?

#artificialintelligence

TLDR: Depends how hard your problem is. In the strictest sense, you don't necessarily need formal training. I'm sure a smart enough person like Stephen Hawking (RIP) could figure pretty much anything out with enough years of diligent study with various online resources. However, for us mere mortals (no pun intended), it's hairier. AI is very vast, and depending on your case, you could need a lot of formal training or none at all.


Compiler-Level Matrix Multiplication Optimization for Deep Learning

arXiv.org Machine Learning

An important linear algebra routine, GEneral Matrix Multiplication (GEMM), is a fundamental operator in deep learning. Compilers need to translate these routines into low-level code optimized for specific hardware. Compiler-level optimization of GEMM has significant performance impact on training and executing deep learning models. However, most deep learning frameworks rely on hardware-specific operator libraries in which GEMM optimization has been mostly achieved by manual tuning, which restricts the performance on different target hardware. In this paper, we propose two novel algorithms for GEMM optimization based on the TVM framework, a lightweight Greedy Best First Search (G-BFS) method based on heuristic search, and a Neighborhood Actor Advantage Critic (N-A2C) method based on reinforcement learning. Experimental results show significant performance improvement of the proposed methods, in both the optimality of the solution and the cost of search in terms of time and fraction of the search space explored. Specifically, the proposed methods achieve 24% and 40% savings in GEMM computation time over state-of-the-art XGBoost and RNN methods, respectively, while exploring only 0.1% of the search space. The proposed approaches have potential to be applied to other operator-level optimizations.


Compiling Stochastic Constraint Programs to And-Or Decision Diagrams

arXiv.org Artificial Intelligence

Factored stochastic constraint programming (FSCP) is a formalism to represent multi-stage decision making problems under uncertainty. FSCP models support factorized probabilistic models and involve constraints over decision and random variables. These models have many applications in real-world problems. However, solving these problems requires evaluating the best course of action for each possible outcome of the random variables and hence is computationally challenging. FSCP problems often involve repeated subproblems which ideally should be solved once. In this paper we show how identifying and exploiting these identical subproblems can simplify solving them and leads to a compact representation of the solution. We compile an And-Or search tree to a compact decision diagram. Preliminary experiments show that our proposed method significantly improves the search efficiency by reducing the size of the problem and outperforms the existing methods.


A Time-Dependent TSP Formulation for the Design of an Active Debris Removal Mission using Simulated Annealing

arXiv.org Artificial Intelligence

This paper proposes a formulation of the Active Debris Removal (ADR) Mission Design problem as a modified Time-Dependent Traveling Salesman Problem (TDTSP). The TDTSP is a well-known combinatorial optimization problem, whose solution is the cheapest mono-cyclic tour connecting a number of non-stationary cities in a map. The problem is tackled with an optimization procedure based on Simulated Annealing, that efficiently exploits a natural encoding and a careful choice of mutation operators. The developed algorithm is used to simultaneously optimize the targets sequence and the rendezvous epochs of an impulsive ADR mission. Numerical results are presented for sets comprising up to 20 targets. INTRODUCTION The Traveling Salesman Problem (TSP) is a well-known combinatorial optimization problem, whose solution is the cheapest tour which allows a salesman to visit, only once, a number of cities in a map; the cost of each city-to-city transfer is, typically, the traveled distance or the fuel consumption. Active Debris Removal (ADR) missions can be seen as peculiar instances of the TDTSP, where an active (chaser) spacecraft is asked to visit, that is, to perform a rendezvous, with a certain number of targets (space debris), making the best use of the on-board propellant. Such kind of missions are increasing in popularity among space agencies all over the world, as the sustainability of the extra-atmospheric environment is becoming compromised by the huge amount of "space garbage" now orbiting Earth. A cost-competitive space program would involve the removal of several dozens of small debris with each single mission; such a complex scenario could became feasible only with the best possible use of the propellant on-board of the chaser spacecraft. As a consequence, a well-designed ADR mission would require the optimization of a multi-target rendezvous trajectory. A number of authors dealt with long term or time-free ADR missions aimed at removing a small number of debris from Sun synchronous orbits (at a rate of three to ten per year). These missions heavily rely on J 2 orbital perturbation for the alignment of the orbital planes of consecutive targets before starting the rendezvous maneuver, in order to reduce the mission cost.


Fermat's Library Some Studies In Machine Learning Using the Game of Checkers annotated/explained version.

#artificialintelligence

This is his seminal paper originally published in 1959 where Samuel sets out to build a program that can learn to play the game of checkers. Checkers is an extremely complex game - as a matter of fact the game has roughly 500 billion billion possible positions - that using a brute force only approach to solve it is not satisfactory. Samuel's program was based on Claude Shannon's minimax strategy to find the best move from a given current position. In this paper he describes how a machine could look ahead "by evaluating the resulting board positions much as a human player might do".


8 Timeless Job Search Strategies to Beat AI

#artificialintelligence

Despite all the talk about how A.I. is taking over industries, pushing people out of jobs, and reshaping the hiring process, I'm here to tell you that as long as "human" remains a central element of "human resources," you can rely on a few surefire job search tactics that reach people. Many career analysts and insiders claim the resume is dead, but you have permission to ignore them at least for one more year. Many companies still use screening technologies that hinge on parsing resumes, so a well-written, keyword-rich document is crucial to your career. You should also read their excellent articles. Read Jon Shields article where he offers 56 resume tips to guide you through the process.


Understanding and Robustifying Differentiable Architecture Search

arXiv.org Artificial Intelligence

Differentiable Architecture Search (DARTS) has attracted a lot of attention due to its simplicity and small search costs achieved by a continuous relaxation and an approximation of the resulting bi-level optimization problem. However, DARTS does not work robustly for new problems: we identify a wide range of search spaces for which DARTS yields degenerate architectures with very poor test performance. We study this failure mode and show that, while DARTS successfully minimizes validation loss, the found solutions generalize poorly when they coincide with high validation loss curvature in the space of architectures. We show that by adding one of various types of regularization we can robustify DARTS to find solutions with smaller Hessian spectrum and with better generalization properties. Based on these observations we propose several simple variations of DARTS that perform substantially more robustly in practice. Our observations are robust across five search spaces on three image classification tasks and also hold for the very different domains of disparity estimation (a dense regression task) and language modelling. We provide our implementation and scripts to facilitate reproducibility.


Deep Learning Assisted Heuristic Tree Search for the Container Pre-marshalling Problem

arXiv.org Artificial Intelligence

The container pre-marshalling problem (CPMP) is concerned with the re-ordering of containers in container terminals during off-peak times so that containers can be quickly retrieved when the port is busy. The problem has received significant attention in the literature and is addressed by a large number of exact and heuristic methods. Existing methods for the CPMP heavily rely on problem-specific components (e.g., proven lower bounds) that need to be developed by domain experts with knowledge of optimization techniques and a deep understanding of the problem at hand. With the goal to automate the costly and time-intensive design of heuristics for the CPMP, we propose a new method called Deep Learning Heuristic Tree Search (DLTS). It uses deep neural networks to learn solution strategies and lower bounds customized to the CPMP solely through analyzing existing (near-) optimal solutions to CPMP instances. The networks are then integrated into a tree search procedure to decide which branch to choose next and to prune the search tree. DLTS produces the highest quality heuristic solutions to the CPMP to date with gaps to optimality below 2% on real-world sized instances.


WSJ News Exclusive Amazon Changed Search Algorithm in Ways That Boost Its Own Products

#artificialintelligence

Amazon.com Inc. has adjusted its product-search system to more prominently feature listings that are more profitable for the company, said people who worked on the project--a move, contested internally, that could favor Amazon's own brands. Late last year, these people said, Amazon optimized the secret algorithm that ranks listings so that instead of showing customers mainly the most-relevant and best-selling listings when they search--as it had for more than a decade--the site also gives a boost to items that are more profitable...


Minimax Confidence Intervals for the Sliced Wasserstein Distance

arXiv.org Machine Learning

September 18, 2019 Abstract The Wasserstein distance has risen in popularity in the statistics and machine learning communities as a useful metric for comparing probability distributions. We study the problem of uncertainty quantification for the Sliced Wasserstein distance--an easily computable approximation of the Wasserstein distance. Specifically, we construct confidence intervals for the Sliced Wasserstein distance which have finite-sample validity under no assumptions or mild moment assumptions, and are adaptive in length to the smoothness of the underlying distributions. We also bound the minimax risk of estimating the Sliced Wasserstein distance, and show that the length of our proposed confidence intervals is minimax optimal over appropriate distribution classes. To motivate the choice of these classes, we also study minimax rates of estimating a distribution under the Sliced Wasserstein distance. These theoretical findings are complemented with a simulation study. 1 Introduction The Wasserstein distance is a metric between probability distributions which has received a surge of interest in statistics and machine learning (Panaretos and Zemel, 2018; Kolouri et al., 2017). This distance is a special case of the optimal transport problem (Villani, 2003), and measures the work required to couple one distribution with another. Specifically, let P ( X) denote the set of Borel probability measures supported on a set X R d, for some integer d 1, and let P r(X) denote the set of probability measures with finite r -th moment, for some r 1.