Country
Alleviating Label Switching with Optimal Transport
Monteiller, Pierre, Claici, Sebastian, Chien, Edward, Mirzazadeh, Farzaneh, Solomon, Justin, Yurochkin, Mikhail
Label switching is a phenomenon arising in mixture model posterior inference that prevents one from meaningfully assessing posterior statistics using standard Monte Carlo procedures. This issue arises due to invariance of the posterior under actions of a group; for example, permuting the ordering of mixture components has no effect on the likelihood. We propose a resolution to label switching that leverages machinery from optimal transport. Our algorithm efficiently computes posterior statistics in the quotient space of the symmetry group. We give conditions under which there is a meaningful solution to label switching and demonstrate advantages over alternative approaches on simulated and real data.
Graph Representation Learning via Multi-task Knowledge Distillation
Machine learning on graph structured data has attracted much research interest due to its ubiquity in real world data. However, how to efficiently represent graph data in a general way is still an open problem. Traditional methods use handcraft graph features in a tabular form but suffer from the defects of domain expertise requirement and information loss. Graph representation learning overcomes these defects by automatically learning the continuous representations from graph structures, but they require abundant training labels, which are often hard to fulfill for graph-level prediction problems. In this work, we demonstrate that, if available, the domain expertise used for designing handcraft graph features can improve the graph-level representation learning when training labels are scarce. Specifically, we proposed a multi-task knowledge distillation method. By incorporating network-theory-based graph metrics as auxiliary tasks, we show on both synthetic and real datasets that the proposed multi-task learning method can improve the prediction performance of the original learning task, especially when the training data size is small.
RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers
Wang, Bailin, Shin, Richard, Liu, Xiaodong, Polozov, Oleksandr, Richardson, Matthew
When translating natural language questions into SQL queries to answer questions from a database, contemporary semantic parsing models struggle to generalize to unseen database schemas. The generalization challenge lies in (a) encoding the database relations in an accessible way for the semantic parser, and (b) modeling alignment between database columns and their mentions in a given query. We present a unified framework, based on the relation-aware self-attention mechanism, to address schema encoding, schema linking, and feature representation within a text-to-SQL encoder. On the challenging Spider dataset this framework boosts the exact match accuracy to 53.7%, compared to 47.4% for the state-of-the-art model unaugmented with BERT em-beddings. In addition, we observe qualitative improvements in the model's understanding of schema linking and alignment.
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications
Zhang, Chao, Yang, Zichao, He, Xiaodong, Deng, Li
Deep learning has revolutionized speech recognition, image recognition, and natural language processing since 2010, each involving a single modality in the input signal. However, many applications in artificial intelligence involve more than one modality. It is therefore of broad interest to study the more difficult and complex problem of modeling and learning across multiple modalities. In this paper, a technical review of the models and learning methods for multimodal intelligence is provided. The main focus is the combination of vision and natural language, which has become an important area in both computer vision and natural language processing research communities. This review provides a comprehensive analysis of recent work on multimodal deep learning from three new angles - learning multimodal representations, the fusion of multimodal signals at various levels, and multimodal applications. On multimodal representation learning, we review the key concept of embedding, which unifies the multimodal signals into the same vector space and thus enables cross-modality signal processing. We also review the properties of the many types of embedding constructed and learned for general downstream tasks. On multimodal fusion, this review focuses on special architectures for the integration of the representation of unimodal signals for a particular task. On applications, selected areas of a broad interest in current literature are covered, including caption generation, text-to-image generation, and visual question answering. We believe this review can facilitate future studies in the emerging field of multimodal intelligence for the community.
Knowledge Guided Text Retrieval and Reading for Open Domain Question Answering
Min, Sewon, Chen, Danqi, Zettlemoyer, Luke, Hajishirzi, Hannaneh
This paper presents a general approach for open-domain question answering (QA) that models interactions between paragraphs using structural information from a knowledge base. We first describe how to construct a graph of passages from a large corpus, where the relations are either from the knowledge base or the internal structure of Wikipedia. We then introduce a reading comprehension model which takes this graph as an input, to better model relationships across pairs of paragraphs. This approach consistently outperforms competitive baselines in three open-domain QA datasets, WebQuestions, Natural Questions and TriviaQA, improving the pipeline-based state-of-the-art by 3--13%.
Liability Design for Autonomous Vehicles and Human-Driven Vehicles: A Hierarchical Game-Theoretic Approach
Di, Xuan, Chen, Xu, Talley, Eric
Autonomous vehicles (AVs) are inevitably entering our lives with potential benefits for improved traffic safety, mobility, and accessibility. However, AVs' benefits also introduce a serious potential challenge, in the form of complex interactions with human-driven vehicles (HVs). The emergence of AVs introduces uncertainty in the behavior of human actors and in the impact of the AV manufacturer on autonomous driving design. This paper thus aims to investigate how AVs affect road safety and to design socially optimal liability rules for AVs and human drivers. A unified game is developed, including a Nash game between human drivers, a Stackelberg game between the AV manufacturer and HVs, and a Stackelberg game between the law maker and other users. We also establish the existence and uniqueness of the equilibrium of the game. The game is then simulated with numerical examples to investigate the emergence of human drivers' moral hazard, the AV manufacturer's role in traffic safety, and the law maker's role in liability design. Our findings demonstrate that human drivers could develop moral hazard if they perceive their road environment has become safer and an optimal liability rule design is crucial to improve social welfare with advanced transportation technologies. More generally, the game-theoretic model developed in this paper provides an analytical tool to assist policy-makers in AV policymaking and hopefully mitigate uncertainty in the existing regulation landscape about AV technologies.
Could A Machine Learning ETF Be The Next Step For Forward Thinking Investors ETF Trends
The debate between active and passive portfolio management has been going on for years. Is it more profitable to try and select stocks yourself, constantly seeking to maximize edge by buying and selling based on market conditions, or is it more lucrative to have longer holding periods and less turnover, regardless of market fluctuations? Artificial Intelligence ETFs could be the answer. "Active management has been out there for a long time, and under performing. They haven't found a solution yet. And I think the technology that I run into is going to help the marketplace to do that," said Procure Holdings President Robert "Bob" Tull on CNBC.
Machine Learning Prediction of Mortality and Hospitalization in Heart Failure with Preserved Ejection Fraction
Objectives This study sought to develop models for predicting mortality and heart failure (HF) hospitalization for outpatients with HF with preserved ejection fraction (HFpEF) in the TOPCAT (Treatment of Preserved Cardiac Function Heart Failure with an Aldosterone Antagonist) trial. Background Although risk assessment models are available for patients with HF with reduced ejection fraction, few have assessed the risks of death and hospitalization in patients with HFpEF. Methods The following 5 methods: logistic regression with a forward selection of variables; logistic regression with a lasso regularization for variable selection; random forest (RF); gradient descent boosting; and support vector machine, were used to train models for assessing risks of mortality and HF hospitalization through 3 years of follow-up and were validated using 5-fold cross-validation. Model discrimination and calibration were estimated using receiver-operating characteristic curves and Brier scores, respectively. The top prediction variables were assessed by using the best performing models, using the incremental improvement of each variable in 5-fold cross-validation. Results The RF was the best performing model with a mean C-statistic of 0.72 (95% confidence interval [CI]: 0.69 to 0.75) for predicting mortality (Brier score: 0.17), and 0.76 (95% CI: 0.71 to 0.81) for HF hospitalization (Brier score: 0.19). Blood urea nitrogen levels, body mass index, and Kansas City Cardiomyopathy Questionnaire (KCCQ) subscale scores were strongly associated with mortality, whereas hemoglobin level, blood urea nitrogen, time since previous HF hospitalization, and KCCQ scores were the most significant predictors of HF hospitalization. Conclusions These models predict the risks of mortality and HF hospitalization in patients with HFpEF and emphasize the importance of health status data in determining prognosis.
Putin suggests drafting moral standards for human interaction with artificial intelligence
Moral standards of human interaction with artificial intelligence should be drawn up, Russian President Vladimir Putin said at the AI Journey conference in Moscow on Saturday. "Discussion is currently underway on social aspects and implications of the use of artificial intelligence. It is a very important issue," the Russian president said. "I suggest that the professional community and companies should contemplate drawing up a set of moral rules for interaction between humans and artificial intelligence," he said recalling that "human beings are the highest value." "Technology must not be invented for the sake of technology," he stressed. "Our main goal is sustainable and harmonious development, a higher life quality and new opportunities for citizens."
Lyft Opens Testing Facility for Self-Driving Cars, Adds Chrysler Minivans Digital Trends
Lyft is planning a significant expansion of its autonomous car testing program. The company is opening a new testing facility, adding vehicles to its fleet, and racking up more test miles. Like rival Uber, Lyft believes self-driving cars are the future of ridesharing. Lyft's self-driving cars are now driving four times as many miles per quarter in autonomous mode as they were six months ago, Luc Vincent, Lyft's executive vice president of autonomous driving, wrote in a blog post. The company currently gives rides in test vehicles to employees, and the number of routes where these rides are available has tripled in the past year, Vincent wrote.