Expert Systems
Effective FAQ Retrieval and Question Matching With Unsupervised Knowledge Injection
Tseng, Wen-Ting, Lo, Tien-Hong, Hsu, Yung-Chang, Chen, Berlin
Frequently asked question (FAQ) retrieval, with the purpose of providing information on frequent questions or concerns, has far-reaching applications in many areas, where a collection of question-answer (Q-A) pairs compiled a priori can be employed to retrieve an appropriate answer in response to a user\u2019s query that is likely to reoccur frequently. To this end, predominant approaches to FAQ retrieval typically rank question-answer pairs by considering either the similarity between the query and a question (q-Q), the relevance between the query and the associated answer of a question (q-A), or combining the clues gathered from the q-Q similarity measure and the q-A relevance measure. In this paper, we extend this line of research by combining the clues gathered from the q-Q similarity measure and the q-A relevance measure and meanwhile injecting extra word interaction information, distilled from a generic (open domain) knowledge base, into a contextual language model for inferring the q-A relevance. Furthermore, we also explore to capitalize on domain-specific topically-relevant relations between words in an unsupervised manner, acting as a surrogate to the supervised domain-specific knowledge base information. As such, it enables the model to equip sentence representations with the knowledge about domain-specific and topically-relevant relations among words, thereby providing a better q-A relevance measure. We evaluate variants of our approach on a publicly-available Chinese FAQ dataset, and further apply and contextualize it to a large-scale question-matching task, which aims to search questions from a QA dataset that have a similar intent as an input query. Extensive experimental results on these two datasets confirm the promising performance of the proposed approach in relation to some state-of-the-art ones.
Learning to Deceive Knowledge Graph Augmented Models via Targeted Perturbation
Raman, Mrigank, Agarwal, Siddhant, Wang, Peifeng, Chan, Aaron, Wang, Hansen, Kim, Sungchul, Rossi, Ryan, Zhao, Handong, Lipka, Nedim, Ren, Xiang
Symbolic knowledge (e.g., entities, relations, and facts in a knowledge graph) has become an increasingly popular component of neural-symbolic models applied to machine learning tasks, such as question answering and recommender systems. Besides improving downstream performance, these symbolic structures (and their associated attention weights) are often used to help explain the model's predictions and provide "insights" to practitioners. In this paper, we question the faithfulness of such symbolic explanations. We demonstrate that, through a learned strategy (or even simple heuristics), one can produce deceptively perturbed symbolic structures which maintain the downstream performance of the original structure while significantly deviating from the original semantics. In particular, we train a reinforcement learning policy to manipulate relation types or edge connections in a knowledge graph, such that the resulting downstream performance is maximally preserved. Across multiple models and tasks, our approach drastically alters knowledge graphs with little to no drop in performance. These results raise doubts about the faithfulness of explanations provided by learned symbolic structures and the reliability of current neural-symbolic models in leveraging symbolic knowledge.
Expert Systems in Artificial Intelligence (AI) : Types, Uses and Advantages
Expert systems in Artificial Intelligence are a prominent domain for research in AI. It was initially introduced by researchers at the Stanford University, and were developed to solve complex problems in a particular domain. The following topics will be covered through this blog on Expert Systems in Artificial Intelligence. An Expert system is a domain in which Artificial Intelligence stimulates the behavior and judgement of a human or an organisation containing experts. It acquires relevant knowledge from its knowledge base, and interprets it as per the user's problem. The data in the knowledge base is essentially added by humans who are experts in a particular domain.
Chaining Techniques in Artificial Intelligence - Great Learning
We have created Artificial Intelligence as a way to amplify human intelligence and promote growth like never before. AI can help us solve numerous problems of varying complexities. One such type of problem is the case where one has to predict outcomes using the given pool of knowledge. Here, the knowledge base is given and using logical rules and reasoning, one has to predict the outcome. These problems are usually solved using Inference Engines, which utilize their two special modes: Backward Chaining and Forward Chaining.
Applications of AI in CAD Technology
A new feature to be found in modern CAD software releases is KBE (Knowledge Based Engineering) to support diagnosis, selection, and monitoring of tasks. KBE relies on capturing and storing experiential knowledge which includes proprietary design and manufacturing practices exercised during a product development cycle. KBE helps engineering companies to retain and preserve in-house knowledge and intellectual information. A related technology which could significantly augment problem solving capabilities in CAD software is AI (Artificial Intelligence), which was introduced in the mid-1980s. The purpose of AI is to learn and replicate human problem solving capabilities.
Axiom Learning and Belief Tracing for Transparent Decision Making in Robotics
A robot's ability to provide descriptions of its decisions and beliefs promotes effective collaboration with humans. Providing such transparency is particularly challenging in integrated robot systems that include knowledge-based reasoning methods and data-driven learning algorithms. Towards addressing this challenge, our architecture couples the complementary strengths of non-monotonic logical reasoning, deep learning, and decision-tree induction. During reasoning and learning, the architecture enables a robot to provide on-demand relational descriptions of its decisions, beliefs, and the outcomes of hypothetical actions. These capabilities are grounded and evaluated in the context of scene understanding tasks and planning tasks performed using simulated images and images from a physical robot manipulating tabletop objects.
Measuring Systematic Generalization in Neural Proof Generation with Transformers
Gontier, Nicolas, Sinha, Koustuv, Reddy, Siva, Pal, Christopher
We are interested in understanding how well Transformer language models (TLMs) can perform reasoning tasks when trained on knowledge encoded in the form of natural language. We investigate their systematic generalization abilities on a logical reasoning task in natural language, which involves reasoning over relationships between entities grounded in first-order logical proofs. Specifically, we perform soft theorem-proving by leveraging TLMs to generate natural language proofs. We test the generated proofs for logical consistency, along with the accuracy of the final inference. We observe length-generalization issues when evaluated on longer-than-trained sequences. However, we observe TLMs improve their generalization performance after being exposed to longer, exhaustive proofs. In addition, we discover that TLMs are able to generalize better using backward-chaining proofs compared to their forward-chaining counterparts, while they find it easier to generate forward chaining proofs. We observe that models that are not trained to generate proofs are better at generalizing to problems based on longer proofs. This suggests that Transformers have efficient internal reasoning strategies that are harder to interpret. These results highlight the systematic generalization behavior of TLMs in the context of logical reasoning, and we believe this work motivates deeper inspection of their underlying reasoning strategies.
Explainable Automated Fact-Checking for Public Health Claims
Kotonya, Neema, Toni, Francesca
Fact-checking is the task of verifying the veracity of claims by assessing their assertions against credible evidence. The vast majority of fact-checking studies focus exclusively on political claims. Very little research explores fact-checking for other topics, specifically subject matters for which expertise is required. We present the first study of explainable fact-checking for claims which require specific expertise. For our case study we choose the setting of public health. To support this case study we construct a new dataset PUBHEALTH of 11.8K claims accompanied by journalist crafted, gold standard explanations (i.e., judgments) to support the fact-check labels for claims. We explore two tasks: veracity prediction and explanation generation. We also define and evaluate, with humans and computationally, three coherence properties of explanation quality. Our results indicate that, by training on in-domain data, gains can be made in explainable, automated fact-checking for claims which require specific expertise.
Effective Distributed Representations for Academic Expert Search
Berger, Mark, Zavrel, Jakub, Groth, Paul
Expert search aims to find and rank experts based on a user's query. In academia, retrieving experts is an efficient way to navigate through a large amount of academic knowledge. Here, we study how different distributed representations of academic papers (i.e. embeddings) impact academic expert retrieval. We use the Microsoft Academic Graph dataset and experiment with different configurations of a document-centric voting model for retrieval. In particular, we explore the impact of the use of contextualized embeddings on search performance. We also present results for paper embeddings that incorporate citation information through retrofitting. Additionally, experiments are conducted using different techniques for assigning author weights based on author order. We observe that using contextual embeddings produced by a transformer model trained for sentence similarity tasks produces the most effective paper representations for document-centric expert retrieval. However, retrofitting the paper embeddings and using elaborate author contribution weighting strategies did not improve retrieval performance.
The Knowledge Graph for Macroeconomic Analysis with Alternative Big Data
Yang, Yucheng, Pang, Yue, Huang, Guanhua, E, Weinan
The current knowledge system of macroeconomics is built on interactions among a small number of variables, since traditional macroeconomic models can mostly handle a handful of inputs. Recent work using big data suggests that a much larger number of variables are active in driving the dynamics of the aggregate economy. In this paper, we introduce a knowledge graph (KG) that consists of not only linkages between traditional economic variables but also new alternative big data variables. We extract these new variables and the linkages by applying advanced natural language processing (NLP) tools on the massive textual data of academic literature and research reports. As one example of the potential applications, we use it as the prior knowledge to select variables for economic forecasting models in macroeconomics. Compared to statistical variable selection methods, KG-based methods achieve significantly higher forecasting accuracy, especially for long run forecasts.