Not enough data to create a plot.
Try a different view from the menu above.
Gomes, Carla
Scalable Relaxations of Sparse Packing Constraints: Optimal Biocontrol in Predator-Prey Networks
Bjorck, Johan (Cornell University) | Bai, Yiwei (Shanghai Jiao Tong University) | Wu, Xiaojian (Cornell University) | Xue, Yexiang (Cornell University) | Whitmore, Mark (Cornell University) | Gomes, Carla (Cornell University)
Cascades represent rapid changes in networks. A cascading phenomenon of ecological and economic impact is the spread of invasive species in geographic landscapes. The most promising management strategy is often biocontrol, which entails introducing a natural predator able to control the invading population, a setting that can be treated as two interacting cascades of predator and prey populations. We formulate and study a nonlinear problem of optimal biocontrol: optimally seeding the predator cascade over time to minimize the harmful prey population. Recurring budgets, which typically face conservation organizations, naturally leads to sparse constraints which make the problem amenable to approximation algorithms. Available methods based on continuous relaxations scale poorly, to remedy this we develop a novel and scalable randomized algorithm based on a width relaxation, applicable to a broad class of combinatorial optimization problems. We evaluate our contributions in the context of biocontrol for the insect pest Hemlock Wolly Adelgid (HWA) in eastern North America. Our algorithm outperforms competing methods in terms of scalability and solution quality and finds near-optimal strategies for the control of the HWA for fine-grained networks -- an important problem in computational sustainability.
Deep Multi-Species Embedding
Chen, Di, Xue, Yexiang, Chen, Shuo, Fink, Daniel, Gomes, Carla
Understanding how species are distributed across landscapes over time is a fundamental question in biodiversity research. Unfortunately, most species distribution models only target a single species at a time, despite strong ecological evidence that species are not independently distributed. We propose Deep Multi-Species Embedding (DMSE), which jointly embeds vectors corresponding to multiple species as well as vectors representing environmental covariates into a common high-dimensional feature space via a deep neural network. Applied to bird observational data from the citizen science project \textit{eBird}, we demonstrate how the DMSE model discovers inter-species relationships to outperform single-species distribution models (random forests and SVMs) as well as competing multi-label models. Additionally, we demonstrate the benefit of using a deep neural network to extract features within the embedding and show how they improve the predictive performance of species distribution modelling. An important domain contribution of the DMSE model is the ability to discover and describe species interactions while simultaneously learning the shared habitat preferences among species. As an additional contribution, we provide a graphical embedding of hundreds of bird species in the Northeast US.
Pattern Decomposition with Complex Combinatorial Constraints: Application to Materials Discovery
Ermon, Stefano, Bras, Ronan Le, Suram, Santosh K., Gregoire, John M., Gomes, Carla, Selman, Bart, van Dover, Robert B.
Identifying important components or factors in large amounts of noisy data is a key problem in machine learning and data mining. Motivated by a pattern decomposition problem in materials discovery, aimed at discovering new materials for renewable energy, e.g. for fuel and solar cells, we introduce CombiFD, a framework for factor based pattern decomposition that allows the incorporation of a-priori knowledge as constraints, including complex combinatorial constraints. In addition, we propose a new pattern decomposition algorithm, called AMIQO, based on solving a sequence of (mixed-integer) quadratic programs. Our approach considerably outperforms the state of the art on the materials discovery problem, scaling to larger datasets and recovering more precise and physically meaningful decompositions. We also show the effectiveness of our approach for enforcing background knowledge on other application domains.
Computational Sustainability: Editorial Introduction to the Summer and Fall Issues
Eaton, Eric (University of Pennsylvania) | Gomes, Carla (Cornell University) | Williams, Brian C. (Massachusetts Institute of Technology)
Computational sustainability problems, which exist in dynamic environments with high amounts of uncertainty, provide a variety of unique challenges to artificial intelligence research and the opportunity for significant impact upon our collective future. This editorial introduction provides an overview of artificial intelligence for computational sustainability, and introduces the special issue articles that appear in this issue and the previous issue of AI Magazine.
Computational Sustainability: Editorial Introduction to the Summer and Fall Issues
Eaton, Eric (University of Pennsylvania) | Gomes, Carla (Cornell University) | Williams, Brian C. (Massachusetts Institute of Technology)
Computational sustainability problems, which exist in dynamic environments with high amounts of uncertainty, provide a variety of unique challenges to artificial intelligence research and the opportunity for significant impact upon our collective future. This editorial introduction provides an overview of artificial intelligence for computational sustainability, and introduces the special issue articles that appear in this issue and the previous issue of AI Magazine.
Uncovering Hidden Structure through Parallel Problem Decomposition
Xue, Yexiang (Cornell University) | Ermon, Stefano (Cornell University) | Gomes, Carla (Cornell University) | Selman, Bart (Cornell University)
A key strategy for speeding up computation is to run in parallel on multiple cores. However, on hard combinatorial problems, exploiting parallelism has been surprisingly challenging. It appears that traditional divide-and-conquer strategies do not work well, due to the intricate non-local nature of the interactions between the problem variables. In this paper, we introduce a novel way in which parallelism can be used to exploit hidden structure of hard combinatorial problems. We demonstrate the success of this approach on minimal set basis problem, which has a wide range of applications in machine learning and system security, etc. We also show the effectiveness on a related application problem from materials discovery. In our approach, a large number of smaller sub-problems are identified and solved concurrently. We then aggregate the information from those solutions, and use this to initialize the search of a global, complete solver. We show that this strategy leads to a significant speed-up over a sequential approach. The strategy also greatly outperforms state-of-the-art incomplete solvers in terms of solution quality. Our work opens up a novel angle for using parallelism to solve hard combinatorial problems.
Designing Fast Absorbing Markov Chains
Ermon, Stefano (Cornell University) | Gomes, Carla (Cornell University) | Sabharwal, Ashish (IBM Watson Research Center) | Selman, Bart (Cornell University)
Markov Chains are a fundamental tool for the analysis of real world phenomena and randomized algorithms. Given a graph with some specified sink nodes and an initial probability distribution,we consider the problem of designing an absorbing Markov Chain that minimizes the time required to reach a sink node, by selecting transition probabilities subject to some natural regularity constraints. By exploiting the Markovian structure, we obtain closed form expressions for the objective function as well as its gradient, which can be thus evaluated efficiently without any simulation of the underlying process and fed to a gradient-based optimization package. For the special case of designing reversible Markov Chains, we show that global optimum can be efficiently computed by exploiting convexity. We demonstrate how our method can be used for the evaluation and design of local search methods tailored for certain domains.
Robust Network Design For Multispecies Conservation
Bras, Ronan Le (Cornell University) | Dilkina, Bistra (Cornell University) | Xue, Yexiang (Cornell University) | Gomes, Carla (Cornell University) | McKelvey, Kevin (US Forest Service) | Schwartz, Michael (US Forest Service) | Montgomery, Claire (Oregon State University)
Our work is motivated by an important network design application in computational sustainability concerning wildlife conservation. In the face of human development and climate change, it is important that conservation plans for protecting landscape connectivity exhibit certain level of robustness. While previous work has focused on conservation strategies that result in a connected network of habitat reserves, the robustness of the proposed solutions has not been taken into account. In order to address this important aspect, we formalize the problem as a node-weighted bi-criteria network design problem with connectivity requirements on the number of disjoint paths between pairs of nodes. While in most previous work on survivable network design the objective is to minimize the cost of the selected network, our goal is to optimize the quality of the selected paths within a specified budget, while meeting the connectivity requirements. We characterize the complexity of the problem under different restrictions. We provide a mixed-integer programming encoding that allows for finding solutions with optimality guarantees, as well as a hybrid local search method with better scaling behavior but no guarantees. We evaluate the typical-case performance of our approaches using a synthetic benchmark, and apply them to a large-scale real-world network design problem concerning the conservation of wolverine and lynx populations in the U.S. Rocky Mountains (Montana).
A Human/Computer Learning Network to Improve Biodiversity Conservation and Research
Kelling, Steve (Cornell University) | Gerbracht, Jeff (Cornell University) | Fink, Daniel (Cornell University) | Lagoze, Carl (Cornell University) | Wong, Weng-Keen (Oregon State University) | Yu, Jun (Oregon State University) | Damoulas, Theodoros (Cornell University) | Gomes, Carla (Cornell University)
In this paper we describe eBird, a citizen-science project that takes advantage of the human observational capacity to identify birds to species, which is then used to accurately represent patterns of bird occurrences across broad spatial and temporal extents. We call this a Human-Computer Learning Network, whose core is an active learning feedback loop between humans and machines that dramatically improves the quality of both, and thereby continually improves the effectiveness of the network as a whole. In this paper we explore how Human-Computer Learning Networks can leverage the contributions of a broad recruitment of human observers and processes their contributed data with Artificial Intelligence algorithms leading to a computational power that far exceeds the sum of the individual parts.
A Human/Computer Learning Network to Improve Biodiversity Conservation and Research
Kelling, Steve (Cornell University) | Gerbracht, Jeff (Cornell University) | Fink, Daniel (Cornell University) | Lagoze, Carl (Cornell University) | Wong, Weng-Keen (Oregon State University) | Yu, Jun (Oregon State University) | Damoulas, Theodoros (Cornell University) | Gomes, Carla (Cornell University)
In this paper we describe eBird, a citizen-science project that takes advantage of the human observational capacity to identify birds to species, which is then used to accurately represent patterns of bird occurrences across broad spatial and temporal extents. eBird employs artificial intelligence techniques such as machine learning to improve data quality by taking advantage of the synergies between human computation and mechanical computation. We call this a Human-Computer Learning Network, whose core is an active learning feedback loop between humans and machines that dramatically improves the quality of both, and thereby continually improves the effectiveness of the network as a whole. In this paper we explore how Human-Computer Learning Networks can leverage the contributions of a broad recruitment of human observers and processes their contributed data with Artificial Intelligence algorithms leading to a computational power that far exceeds the sum of the individual parts.