Expert Systems
Tab2Know: Building a Knowledge Base from Tables in Scientific Papers
Kruit, Benno, He, Hongyu, Urbani, Jacopo
Tables in scientific papers contain a wealth of valuable knowledge for the scientific enterprise. To help the many of us who frequently consult this type of knowledge, we present Tab2Know, a new end-to-end system to build a Knowledge Base (KB) from tables in scientific papers. Tab2Know addresses the challenge of automatically interpreting the tables in papers and of disambiguating the entities that they contain. To solve these problems, we propose a pipeline that employs both statistical-based classifiers and logic-based reasoning. First, our pipeline applies weakly supervised classifiers to recognize the type of tables and columns, with the help of a data labeling system and an ontology specifically designed for our purpose. Then, logic-based reasoning is used to link equivalent entities (via sameAs links) in different tables. An empirical evaluation of our approach using a corpus of papers in the Computer Science domain has returned satisfactory performance. This suggests that ours is a promising step to create a large-scale KB of scientific knowledge.
Semantic Reasoning with Differentiable Graph Transformations
This paper introduces a differentiable semantic reasoner, where rules are presented as a relevant set of graph transformations. These rules can be written manually or inferred by a set of facts and goals presented as a training set. While the internal representation uses embeddings in a latent space, each rule can be expressed as a set of predicates conforming to a subset of Description Logic.
Why mechanical engineers should learn A.I.
There are some mechanical engineering fields in which AI is about to give a paradigm shift. AI used in Computer-Aided Design (CAD) generally works on knowledge-based systems. Design artefacts, rules, and problems in CAD are stored which later assist CAD designers. Merging of AI and CAD is done through Model-Based Reasoning (MBR). Many new releases of software packages are using knowledge-based systems.
L.A. County sees another sharp rise in coronavirus cases as mask rules set to take effect
Los Angeles County recorded more than 1,900 new coronavirus cases Friday, another major jump, as a mandatory mask restriction for inside public places takes effect Saturday night. Over the last week, L.A. County has reported an average of more than 1,000 new coronavirus cases a day -- a tally that, though merely a fraction of the sky-high counts seen during previous surges, is still six times as high as what the county was seeing in mid-June. Daily case numbers have jumped: 1,537 new cases were reported Thursday, and 1,902 more were added Friday. COVID-19 hospitalizations also doubled over that same time period, from 223 on June 15 to 462 on Thursday. More than 8,000 coronavirus-positive patients were hospitalized countywide during the darkest days of the winter wave.
Hyperparameter Optimization: Foundations, Algorithms, Best Practices and Open Challenges
Bischl, Bernd, Binder, Martin, Lang, Michel, Pielok, Tobias, Richter, Jakob, Coors, Stefan, Thomas, Janek, Ullmann, Theresa, Becker, Marc, Boulesteix, Anne-Laure, Deng, Difan, Lindauer, Marius
Most machine learning algorithms are configured by one or several hyperparameters that must be carefully chosen and often considerably impact performance. To avoid a time consuming and unreproducible manual trial-and-error process to find well-performing hyperparameter configurations, various automatic hyperparameter optimization (HPO) methods, e.g., based on resampling error estimation for supervised machine learning, can be employed. After introducing HPO from a general perspective, this paper reviews important HPO methods such as grid or random search, evolutionary algorithms, Bayesian optimization, Hyperband and racing. It gives practical recommendations regarding important choices to be made when conducting HPO, including the HPO algorithms themselves, performance evaluation, how to combine HPO with ML pipelines, runtime improvements, and parallelization.
Deep Metric Learning Model for Imbalanced Fault Diagnosis
Intelligent diagnosis method based on data-driven and deep learning is an attractive and meaningful field in recent years. However, in practical application scenarios, the imbalance of time-series fault is an urgent problem to be solved. This paper proposes a novel deep metric learning model, where imbalanced fault data and a quadruplet data pair design manner are considered. Based on such data pair, a quadruplet loss function which takes into account the inter-class distance and the intra-class data distribution are proposed. This quadruplet loss pays special attention to imbalanced sample pair. The reasonable combination of quadruplet loss and softmax loss function can reduce the impact of imbalance. Experiment results on two open-source datasets show that the proposed method can effectively and robustly improve the performance of imbalanced fault diagnosis.
Facebook Groups can now have dedicated topic 'experts'
Facebook is working on a new way to highlight authoritative information within Groups. The platform is starting to roll out a new "expert" label for group members who have expertise in an area related to the group's interests. With the change, which Facebook says is available to "select" Groups, an admin can invite a group member to be a group "expert." If the person accepts, then they'll get a badge next to their name similar to the way group moderators and admins are identified. Notably, being a group "expert" doesn't grant you extra control of group features, or higher visibility within a group.
A Classification of Artificial Intelligence Systems for Mathematics Education
Van Vaerenbergh, Steven, Pérez-Suay, Adrián
This chapter provides an overview of the different Artificial Intelligence (AI) systems that are being used in contemporary digital tools for Mathematics Education (ME). It is aimed at researchers in AI and Machine Learning (ML), for whom we shed some light on the specific technologies that are being used in educational applications; and at researchers in ME, for whom we clarify: i) what the possibilities of the current AI technologies are, ii) what is still out of reach and iii) what is to be expected in the near future. We start our analysis by establishing a high-level taxonomy of AI tools that are found as components in digital ME applications. Then, we describe in detail how these AI tools, and in particular ML, are being used in two key applications, specifically AI-based calculators and intelligent tutoring systems. We finish the chapter with a discussion about student modeling systems and their relationship to artificial general intelligence.
A Framework and Benchmarking Study for Counterfactual Generating Methods on Tabular Data
Mazzine, Raphael, Martens, David
Counterfactual explanations are viewed as an effective way to explain machine learning predictions. This interest is reflected by a relatively young literature with already dozens of algorithms aiming to generate such explanations. These algorithms are focused on finding how features can be modified to change the output classification. However, this rather general objective can be achieved in different ways, which brings about the need for a methodology to test and benchmark these algorithms. The contributions of this work are manifold: First, a large benchmarking study of 10 algorithmic approaches on 22 tabular datasets is performed, using 9 relevant evaluation metrics. Second, the introduction of a novel, first of its kind, framework to test counterfactual generation algorithms. Third, a set of objective metrics to evaluate and compare counterfactual results. And finally, insight from the benchmarking results that indicate which approaches obtain the best performance on what type of dataset. This benchmarking study and framework can help practitioners in determining which technique and building blocks most suit their context, and can help researchers in the design and evaluation of current and future counterfactual generation algorithms. Our findings show that, overall, there's no single best algorithm to generate counterfactual explanations as the performance highly depends on properties related to the dataset, model, score and factual point specificities.