Oceania
The real-time reactive surgical case sequencing problem
In this paper, the multiple operating room (OR) surgical case sequencing problem (SCSP) is addressed. The objective is to maximise total OR utilisation during standard opening hours. The work here is based on a case study of a large Australian public hospital with long surgical waiting lists and high levels of non-elective demand. Due to the complexity of the SCSP and the size of the instances considered herein, heuristic techniques are required to solve the problem. Constructive heuristics are presented based on both a modified block scheduling policy and an open scheduling policy. A number of real-time reactive strategies are presented that can be used to maintain schedule feasibility in the case of disruptions. Results of computational experiments show that the approach presented in this paper can be used to maintain schedule feasibility in real-time, whilst increasing OT utilisation and throughput, and reducing the waiting time of non-elective patients. The framework presented here is applicable to the real-life scheduling of OT departments, and recommendations have been provided regarding implementation of the approach.
The reactive multiple operating room surgical case sequencing problem
In this paper we consider the surgical case sequencing problem (SCSP) under stochastic conditions. In addition to implementing a robust surgical schedule, we investigate the use of a number of reactive strategies that can be used to maintain schedule feasibility. We present a mixed integer nonlinear programming (MINLP) model for the reactive multiple operating room (OR) SCSP that may be suitable for direct implementation on small problem instances. A machine scheduling perspective is considered and the model is equivalent to a resource-constrained parallel-machine scheduling problem with identical machines, machine eligibility restrictions, and machine and job release dates. The explicit objective of the model is to reduce OR idle time, although other common objectives (including time to surgery and overtime) are discussed. The work here is based on a case study of a large Australian public hospital with long surgical waiting lists and high non-elective demand. Results of computational experiments show that the reactive strategies presented in this paper can be used to reduce idle time without putting excessive pressure on surgeons.
ExpIt-OOS: Towards Learning from Planning in Imperfect Information Games
Kitchen, Andy, Benedetti, Michela
The current state of the art in playing many important perfect information games, including Chess and Go, combines planning and deep reinforcement learning with self-play. We extend this approach to imperfect information games and present ExIt-OOS, a novel approach to playing imperfect information games within the Expert Iteration framework and inspired by AlphaZero. We use Online Outcome Sampling, an online search algorithm for imperfect information games in place of MCTS. While training online, our neural strategy is used to improve the accuracy of playouts in OOS, allowing a learning and planning feedback loop for imperfect information games.
Learning a Policy for Opportunistic Active Learning
Padmakumar, Aishwarya, Stone, Peter, Mooney, Raymond J.
Active learning identifies data points to label that are expected to be the most useful in improving a supervised model. Opportunistic active learning incorporates active learning into interactive tasks that constrain possible queries during interactions. Prior work has shown that opportunistic active learning can be used to improve grounding of natural language descriptions in an interactive object retrieval task. In this work, we use reinforcement learning for such an object retrieval task, to learn a policy that effectively trades off task completion with model improvement that would benefit future tasks.
Wasserstein is all you need
Singh, Sidak Pal, Hug, Andreas, Dieuleveut, Aymeric, Jaggi, Martin
We propose a unified framework for building unsupervised representations of individual objects or entities (and their compositions), by associating with each object both a distributional as well as a point estimate (vector embedding). This is made possible by the use of optimal transport, which allows us to build these associated estimates while harnessing the underlying geometry of the ground space. Our method gives a novel perspective for building rich and powerful feature representations that simultaneously capture uncertainty (via a distributional estimate) and interpretability (with the optimal transport map). As a guiding example, we formulate unsupervised representations for text, in particular for sentence representation and entailment detection. Empirical results show strong advantages gained through the proposed framework. This approach can be used for any unsupervised or supervised problem (on text or other modalities) with a co-occurrence structure, such as any sequence data. The key tools underlying the framework are Wasserstein distances and Wasserstein barycenters (and, hence the title!).
Dropout with Tabu Strategy for Regularizing Deep Neural Networks
Ma, Zongjie, Sattar, Abdul, Zhou, Jun, Chen, Qingliang, Su, Kaile
Dropout has proven to be an effective technique for regularization and preventing the co-adaptation of neurons in deep neural networks (DNN). It randomly drops units with a probability $p$ during the training stage of DNN. Dropout also provides a way of approximately combining exponentially many different neural network architectures efficiently. In this work, we add a diversification strategy into dropout, which aims at generating more different neural network architectures in a proper times of iterations. The dropped units in last forward propagation will be marked. Then the selected units for dropping in the current FP will be kept if they have been marked in the last forward propagation. We only mark the units from the last forward propagation. We call this new technique Tabu Dropout. Tabu Dropout has no extra parameters compared with the standard Dropout and also it is computationally cheap. The experiments conducted on MNIST, Fashion-MNIST datasets show that Tabu Dropout improves the performance of the standard dropout.
Building a Robust Text Classifier on a Test-Time Budget
Parvez, Md Rizwan, Bolukbasi, Tolga, Chang, kai-Wei, Saligrama, Venkatesh
In this paper, we study a generic learning framework for building robust text classification model that achieves accuracy comparable to standard full models under test-time budget constraints. Our approach learns a selector to identify words that are relevant to the prediction tasks and only passes these words to the classifier for processing. The selector is trained jointly with the classifier and directly learns to incorporate with the classifier. We further propose a data aggregation scheme to improve the robustness of the classifier. Our learning framework is general and can be incorporated with any type of text classification model. On real-world data, we show that the proposed approach improves the performance of a given classifier and speeds up the model with a mere loss in accuracy performance.
Rule induction for global explanation of trained models
Sushil, Madhumita, Šuster, Simon, Daelemans, Walter
Understanding the behavior of a trained network and finding explanations for its outputs is important for improving the network's performance and generalization ability, and for ensuring trust in automated systems. Several approaches have previously been proposed to identify and visualize the most important features by analyzing a trained network. However, the relations between different features and classes are lost in most cases. We propose a technique to induce sets of if-then-else rules that capture these relations to globally explain the predictions of a network. We first calculate the importance of the features in the trained network. We then weigh the original inputs with these feature importance scores, simplify the transformed input space, and finally fit a rule induction model to explain the model predictions. We find that the output rule-sets can explain the predictions of a neural network trained for 4-class text classification from the 20 newsgroups dataset to a macro-averaged F-score of 0.80. We make the code available at https://github.com/
APRIL: Interactively Learning to Summarise by Combining Active Preference Learning and Reinforcement Learning
Gao, Yang, Meyer, Christian M., Gurevych, Iryna
We propose a method to perform automatic document summarisation without using reference summaries. Instead, our method interactively learns from users' preferences. The merit of preference-based interactive summarisation is that preferences are easier for users to provide than reference summaries. Existing preference-based interactive learning methods suffer from high sample complexity, i.e. they need to interact with the oracle for many rounds in order to converge. In this work, we propose a new objective function, which enables us to leverage active learning, preference learning and reinforcement learning techniques in order to reduce the sample complexity. Both simulation and real-user experiments suggest that our method significantly advances the state of the art. Our source code is freely available at https://github.com/UKPLab/
NZ Analytics Forum Auckland 11 Sep 2018 NEW FRONTIERS IN ANALYTICS – New Zealand Analytics Forum
The NZ Analytics Forum invites you to our event on New Frontiers in Analytics. The event is free of charge and is kindly sponsored by Westpac New Zealand. Registration: The event is free of charge, with capacity of 230 people. Caleb has worked in the Data Science team at Westpac New Zealand for 18 months. Prior to this he was a Data Journalist at NZ Herald.