Overview
Low-resource Learning with Knowledge Graphs: A Comprehensive Survey
Chen, Jiaoyan, Geng, Yuxia, Chen, Zhuo, Pan, Jeff Z., He, Yuan, Zhang, Wen, Horrocks, Ian, Chen, Huajun
Machine learning methods especially deep neural networks have achieved great success but many of them often rely on a number of labeled samples for training. In real-world applications, we often need to address sample shortage due to e.g., dynamic contexts with emerging prediction targets and costly sample annotation. Therefore, low-resource learning, which aims to learn robust prediction models with no enough resources (especially training samples), is now being widely investigated. Among all the low-resource learning studies, many prefer to utilize some auxiliary information in the form of Knowledge Graph (KG), which is becoming more and more popular for knowledge representation, to reduce the reliance on labeled samples. In this survey, we very comprehensively reviewed over $90$ papers about KG-aware research for two major low-resource learning settings -- zero-shot learning (ZSL) where new classes for prediction have never appeared in training, and few-shot learning (FSL) where new classes for prediction have only a small number of labeled samples that are available. We first introduced the KGs used in ZSL and FSL studies as well as the existing and potential KG construction solutions, and then systematically categorized and summarized KG-aware ZSL and FSL methods, dividing them into different paradigms such as the mapping-based, the data augmentation, the propagation-based and the optimization-based. We next presented different applications, including not only KG augmented tasks in Computer Vision and Natural Language Processing (e.g., image classification, text classification and knowledge extraction), but also tasks for KG curation (e.g., inductive KG completion), and some typical evaluation resources for each task. We eventually discussed some challenges and future directions on aspects such as new learning and reasoning paradigms, and the construction of high quality KGs.
Hot papers on arXiv from 2021
Reproduced under a CC BY 4.0 license. We've collated the most tweeted papers for each month that were uploaded onto arXiv during 2021. Results are powered by Arxiv Sanity Preserver. Abstract: Large-scale model training has been a playing ground for a limited few requiring complex model refactoring and access to prohibitively expensive GPU clusters. ZeRO-Offload changes the large model training landscape by making large model training accessible to nearly everyone.
Learning from Disagreement: A Survey
Uma, Alexandra N., Fornaciari, Tommaso, Hovy, Dirk, Paun, Silviu, Plank, Barbara, Poesio, Massimo
Many tasks in Natural Language Processing (NLP) and Computer Vision (CV) offer evidence that humans disagree, from objective tasks such as part-of-speech tagging to more subjective tasks such as classifying an image or deciding whether a proposition follows from certain premises. While most learning in artificial intelligence (AI) still relies on the assumption that a single (gold) interpretation exists for each item, a growing body of research aims to develop learning methods that do not rely on this assumption. In this survey, we review the evidence for disagreements on NLP and CV tasks, focusing on tasks for which substantial datasets containing this information have been created. We discuss the most popular approaches to training models from datasets containing multiple judgments potentially in disagreement. We systematically compare these different approaches by training them with each of the available datasets, considering several ways to evaluate the resulting models. Finally, we discuss the results in depth, focusing on four key research questions, and assess how the type of evaluation and the characteristics of a dataset determine the answers to these questions. Our results suggest, first of all, that even if we abandon the assumption of a gold standard, it is still essential to reach a consensus on how to evaluate models. This is because the relative performance of the various training methods is critically affected by the chosen form of evaluation. Secondly, we observed a strong dataset effect. With substantial datasets, providing many judgments by high-quality coders for each item, training directly with soft labels achieved better results than training from aggregated or even gold labels. This result holds for both hard and soft evaluation. But when the above conditions do not hold, leveraging both gold and soft labels generally achieved the best results in the hard evaluation. All datasets and models employed in this paper are freely available as supplementary materials.
A Brief History of Updates of Answer-Set Programs
Over the last couple of decades, there has been a considerable effort devoted to the problem of updating logic programs under the stable model semantics (a.k.a. answer-set programs) or, in other words, the problem of characterising the result of bringing up-to-date a logic program when the world it describes changes. Whereas the state-of-the-art approaches are guided by the same basic intuitions and aspirations as belief updates in the context of classical logic, they build upon fundamentally different principles and methods, which have prevented a unifying framework that could embrace both belief and rule updates. In this paper, we will overview some of the main approaches and results related to answer-set programming updates, while pointing out some of the main challenges that research in this topic has faced.
Explainable Artificial Intelligence Methods in Combating Pandemics: A Systematic Review
Giuste, Felipe, Shi, Wenqi, Zhu, Yuanda, Naren, Tarun, Isgut, Monica, Sha, Ying, Tong, Li, Gupte, Mitali, Wang, May D.
Despite the myriad peer-reviewed papers demonstrating novel Artificial Intelligence (AI)-based solutions to COVID-19 challenges during the pandemic, few have made significant clinical impact. The impact of artificial intelligence during the COVID-19 pandemic was greatly limited by lack of model transparency. This systematic review examines the use of Explainable Artificial Intelligence (XAI) during the pandemic and how its use could overcome barriers to real-world success. We find that successful use of XAI can improve model performance, instill trust in the end-user, and provide the value needed to affect user decision-making. We introduce the reader to common XAI techniques, their utility, and specific examples of their application. Evaluation of XAI results is also discussed as an important step to maximize the value of AI-based clinical decision support systems. We illustrate the classical, modern, and potential future trends of XAI to elucidate the evolution of novel XAI techniques. Finally, we provide a checklist of suggestions during the experimental design process supported by recent publications. Common challenges during the implementation of AI solutions are also addressed with specific examples of potential solutions. We hope this review may serve as a guide to improve the clinical impact of future AI-based solutions.
A Survey on Interpretable Reinforcement Learning
Glanois, Claire, Weng, Paul, Zimmer, Matthieu, Li, Dong, Yang, Tianpei, Hao, Jianye, Liu, Wulong
Although deep reinforcement learning has become a promising machine learning approach for sequential decision-making problems, it is still not mature enough for high-stake domains such as autonomous driving or medical applications. In such contexts, a learned policy needs for instance to be interpretable, so that it can be inspected before any deployment (e.g., for safety and verifiability reasons). This survey provides an overview of various approaches to achieve higher interpretability in reinforcement learning (RL). To that aim, we distinguish interpretability (as a property of a model) and explainability (as a post-hoc operation, with the intervention of a proxy) and discuss them in the context of RL with an emphasis on the former notion. In particular, we argue that interpretable RL may embrace different facets: interpretable inputs, interpretable (transition/reward) models, and interpretable decision-making. Based on this scheme, we summarize and analyze recent work related to interpretable RL with an emphasis on papers published in the past 10 years. We also discuss briefly some related research areas and point to some potential promising research directions.
A 2021 NLP Retrospective
Every company that derives value from language stands to benefit from NLP, the branch of machine learning that has the most transformative potential. Language is the lowest common denominator in almost all of our interactions, and the ways in which we can capture value from language has changed dramatically over the last three years. Recent advancements in NLP have outsized potential to accelerate business performance. It even has the promise of bringing trust and integrity back to our online interactions. Large incumbents have been the first to jump onboard, but the real promise lies in the next wave of NLP applications and tools that will translate the hype around artificial intelligence from ideology into reality. So, there you have it, these are my personal highlights of 2021 in NLP. I hope you enjoyed this summary and it'd be great to hear about your personal highlights from the past 12 months in NLP. Please comment on this blog post or reach out directly.
Global Big Data Conference
In the spirit of the last couple of years, we review developments in what we have identified as the key technology drivers for the 2020s in the world of databases, data management and AI. We are looking back at 2021, trying to identify patterns that will shape 2022. Today we pick up from where we started with part one of our review, to cover AI and knowledge graphs. In principle, we try to approach AI holistically. To take into account positives and negatives, from the shiny to the mundane, and from hardware to software.
A Survey on Text Classification: From Shallow to Deep Learning
Text classification is the most fundamental and essential task in natural language processing. The last decade has seen a surge of research in this area due to the unprecedented success of deep learning. Numerous methods, datasets, and evaluation metrics have been proposed in the literature, raising the need for a comprehensive and updated survey. This paper fills the gap by reviewing the state-of-the-art approaches from 1961 to 2021, focusing on models from traditional models to deep learning. We create a taxonomy for text classification according to the text involved and the models used for feature extraction and classification. We then discuss each of these categories in detail, dealing with both the technical developments and benchmark datasets that support tests of predictions. A comprehensive comparison between different techniques, as well as identifying the pros and cons of various evaluation metrics are also provided in this survey. Finally, we conclude by summarizing key implications, future research directions, and the challenges facing the research area.
The 100 Most Disruptive Companies to Watch In 2021
Disruptive technology is the technology that affects the normal operation of a market or an industry. Digital disruption entails established companies and start-ups alike enlisting new technologies in the fight to dislodge incumbents, protect entrenched positions, or to re-invent entire industries and business activities. And to remain disruptive in the market, it is really important to keep innovating. This is crucial because, innovations occur now and then in every industry, however, to be truly disruptive, and innovation must entirely transform a product or solution that historically was so complicated only a few could access it. On a minimum level, digital transformation enables an organization to address the needs of its customers more simply and directly. But through disruptive innovation, companies can offer a far better way to users of doing things that current incumbents simply cannot compete with. Artificial intelligence (AI), E-Commerce, cloud, social networking, Internet of Things, 5G, blockchain and other emerging technologies are being leveraged to blur the lines between industries, creating new business models and converging sectors. A company that disrupts its market is in a great position to take advantage of new opportunities. Sometimes offering something different can change the whole market for the better. Most of the top disruptive companies get this label by offering highly innovative products and services and here are 100 such top disruptive companies listed below. The company provides innovative, managed cloud services to help its customers succeed. With best-in-class service and technology, 403Tech protects companies against cybercrimes while enabling greater efficiency and productivity. Some of its popular services include desktop support, server support, wired and wireless networking, virus removal, data recovery, and backup and hosted cloud services. Aegeus Technologies aims to design and develop robotic technologies and solutions.