SPE
Story Generation and Aviation Incident Representation
This working note discusses the topic of story generation, with a view to identifying the knowledge required to understand aviation incident narratives (which have structural similarities to stories), following the premise that to understand aviation incidents, one should at least be able to generate examples of them. We give a brief overview of aviation incidents and their relation to stories, and then describe two of our earlier attempts (using `scripts' and `story grammars') at incident generation which did not evolve promisingly. Following this, we describe a simple incident generator which did work (at a `toy' level), using a `world simulation' approach. This generator is based on Meehan's TALE-SPIN story generator (1977). We conclude with a critique of the approach.
Distinguishing Question Subjectivity from Difficulty for Improved Crowdsourcing
Jin, Yuan, Carman, Mark, Zhu, Ye, Buntine, Wray
Their joint effects give rise to the variation in responses to the same question by different crowdworkers. This variation is low when the question is easy to answer and objective, and high when it is difficult and subjective. Unfortunately, current quality control methods for crowdsourcing consider only the question difficulty to account for the variation. As a result, these methods cannot distinguish workers' personal preferences for different correct answers of a partially subjective question from their ability/expertise to avoid objectively wrong answers for that question. To address this issue, we present a probabilistic model which (i) explicitly encodes question difficulty as a model parameter and (ii) implicitly encodes question subjectivity via latent preference factors for crowd-workers. We show that question subjectivity induces grouping of crowd-workers, revealed through clustering of their latent preferences. Moreover, we develop a quantitative measure of the subjectivity of a question. Experiments show that our model (1) improves the performance of both quality control for crowdsourced answers and next answer prediction for crowd-workers, and (2) can potentially provide coherent rankings of questions in terms of their difficulty and subjectivity, so that task providers can refine their designs of the crowdsourcing tasks, e.g. by removing highly subjective questions or inappropriately difficult questions.
Learning a SAT Solver from Single-Bit Supervision
Selsam, Daniel, Lamm, Matthew, Bรผnz, Benedikt, Liang, Percy, de Moura, Leonardo, Dill, David L.
We present NeuroSAT, a message passing neural network that learns to solve SAT problems after only being trained as a classifier to predict satisfiability. Although it is not competitive with state-of-the-art SAT solvers, NeuroSAT can solve problems that are substantially larger and more difficult than it ever saw during training by simply running for more iterations. Moreover, NeuroSAT generalizes to novel distributions; after training only on random SAT problems, at test time it can solve SAT problems encoding graph coloring, clique detection, dominating set, and vertex cover problems, all on a range of distributions over small random graphs.
An Improved Tabu Search Heuristic for Static Dial-A-Ride Problem
Ho, Songguang, Nagavarapu, Sarat Chandra, Pandi, Ramesh Ramasamy, Dauwels, Justin
Dial-A-Ride Problem (DARP) addresses the issue of doorto-door transportation service for the customers with high customer satisfaction. Now-a-days, transportation services have increasing need in our daily life, and it started to directly impact our environment as well as quality of living. According to a study conducted by University of British Columbia, the road pricing or pay-per-use is the most effective way to reduce emissions and traffic [1]. DARP has many applications ranging from taxi services to autonomous cargo and ground operations at the airports. DARP is an extension of pickup and delivery problem under the class of vehicle routing problem (VRP) [2]. It is a combinatorial optimization problem with an objective function to minimise the overall cost while satisfying a specific set of constraints such as time-window, maximum waiting time and maximum ride time to ensure high-quality customer service. In this problem, a set of customers makes a request for pickup and drop-off at certain locations within a predefined time-window. An approach to solve DARP based on dynamic programming has been proposed in [3], in which divide and conquer method is used to solve the problem.
Why Didn't Chatbots Live Up To Their Hype?
Why didn't chatbots become as prolific as they were predicted to be in 2017? originally appeared on Quora: the place to gain and share knowledge, empowering people to learn from others and better understand the world. Chatbots didn't live up to the hype for a couple of reasons. It would have been ...
TVM: End-to-End Optimization Stack for Deep Learning
Chen, Tianqi, Moreau, Thierry, Jiang, Ziheng, Shen, Haichen, Yan, Eddie, Wang, Leyuan, Hu, Yuwei, Ceze, Luis, Guestrin, Carlos, Krishnamurthy, Arvind
Scalable frameworks, such as TensorFlow, MXNet, Caffe, and PyTorch drive the current popularity and utility of deep learning. However, these frameworks are optimized for a narrow range of server-class GPUs and deploying workloads to other platforms such as mobile phones, embedded devices, and specialized accelerators (e.g., FPGAs, ASICs) requires laborious manual effort. We propose TVM, an end-to-end optimization stack that exposes graph-level and operator-level optimizations to provide performance portability to deep learning workloads across diverse hardware back-ends. We discuss the optimization challenges specific to deep learning that TVM solves: high-level operator fusion, low-level memory reuse across threads, mapping to arbitrary hardware primitives, and memory latency hiding. Experimental results demonstrate that TVM delivers performance across hardware back-ends that are competitive with state-of-the-art libraries for low-power CPU and server-class GPUs. We also demonstrate TVM's ability to target new hardware accelerator back-ends by targeting an FPGA-based generic deep learning accelerator. The compiler infrastructure is open sourced.
7 Types of Regression Techniques you should know
Linear and Logistic regressions are usually the first algorithms people learn in predictive modeling. Due to their popularity, a lot of analysts even end up thinking that they are the only form of regressions. The ones who are slightly more involved think that they are the most important amongst all forms of regression analysis. The truth is that there are innumerable forms of regressions, which can be performed. Each form has its own importance and a specific condition where they are best suited to apply.
HR Technology for 2018: More Intelligent than Ever
Almost every HR vendor I talk with claims to have artificial intelligence (AI)-based solutions, predictive analytics, chatbots or some other form of algorithmic solution to make HR better. As I've learned about all these products and started to see them in action, let me give you tips on what to look for. In the recruitment market, data is really driving our future. Thanks to the ubiquitous nature of social networks and dozens of intelligent sourcing and assessment tools, our research shows, AI is creating significant value. As you search for new recruiting tools (sourcing, candidate assessment, intelligent chatbots and mobile recruiting platforms), ask the vendor to show you how its AI works.
End-to-End Task-Completion Neural Dialogue Systems
Li, Xiujun, Chen, Yun-Nung, Li, Lihong, Gao, Jianfeng, Celikyilmaz, Asli
One of the major drawbacks of modularized task-completion dialogue systems is that each module is trained individually, which presents several challenges. For example, downstream modules are affected by earlier modules, and the performance of the entire system is not robust to the accumulated errors. This paper presents a novel end-to-end learning framework for task-completion dialogue systems to tackle such issues. Our neural dialogue system can directly interact with a structured database to assist users in accessing information and accomplishing certain tasks. The reinforcement learning based dialogue manager offers robust capabilities to handle noises caused by other components of the dialogue system. Our experiments in a movie-ticket booking domain show that our end-to-end system not only outperforms modularized dialogue system baselines for both objective and subjective evaluation, but also is robust to noises as demonstrated by several systematic experiments with different error granularity and rates specific to the language understanding module.
An Information-Theoretic Optimality Principle for Deep Reinforcement Learning
Leibfried, Felix, Grau-Moya, Jordi, Bou-Ammar, Haitham
We methodologically address the problem of Q-value overestimation in deep reinforcement learning to handle high-dimensional state spaces efficiently. By adapting concepts from information theory, we introduce an intrinsic penalty signal encouraging reduced Q-value estimates. The resultant algorithm encompasses a wide range of learning outcomes containing deep Q-networks as a special case. Different learning outcomes can be demonstrated by tuning a Lagrange multiplier accordingly. We furthermore propose a novel scheduling scheme for this Lagrange multiplier to ensure efficient and robust learning. In experiments on Atari games, our algorithm outperforms other algorithms (e.g.