AITopics | Overview

Collaborating Authors

Overview

Software Engineering Practices for Machine Learning

arXiv.org Artificial IntelligenceSep-3-2022

In the last couple of years we have witnessed an enormous increase of machine learning (ML) applications. More and more program functions are no longer written in code, but learnt from a huge amount of data samples using an ML algorithm. However, what is often overlooked is the complexity of managing the resulting ML models as well as bringing these into a real production system. In software engineering, we have spent decades on developing tools and methodologies to create, manage and assemble complex software modules. We present an overview of current techniques to manage complex software, and how this applies to ML models.

ml model, module, requirement, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/MC.2022.3160276

1906.10366

Country:

North America > United States (0.04)
Europe > France (0.04)
Europe > Belgium > Flanders > Flemish Brabant > Leuven (0.04)
Europe > Belgium > Flanders > East Flanders > Ghent (0.04)

Genre: Overview (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Classifying with Uncertain Data Envelopment Analysis

Garner, Casey, Holder, Allen

arXiv.org Artificial IntelligenceSep-2-2022

Classifications organize entities into categories that identify similarities within a category and discern dissimilarities among categories, and they powerfully classify information in support of analysis. We propose a new classification scheme premised on the reality of imperfect data. Our computational model uses uncertain data envelopment analysis to define a classification's proximity to equitable efficiency, which is an aggregate measure of intra-similarity within a classification's categories. Our classification process has two overriding computational challenges, those being a loss of convexity and a combinatorially explosive search space. We overcome the first by establishing lower and upper bounds on the proximity value, and then by searching this range with a first-order algorithm. We overcome the second by adapting the p-median problem to initiate our exploration, and by then employing an iterative neighborhood search to finalize a classification. We conclude by classifying the thirty stocks in the Dow Jones Industrial average into performant tiers and by classifying prostate treatments into clinically effectual categories.

category, classification, efficiency, (14 more...)

arXiv.org Artificial Intelligence

2209.01052

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.28)
North America > United States > New York (0.04)
Oceania > New Zealand (0.04)
(2 more...)

Genre:

Research Report (0.64)
Overview (0.46)

Industry: Health & Medicine > Therapeutic Area > Oncology (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.93)

Add feedback

INTERACTION: A Generative XAI Framework for Natural Language Inference Explanations

Yu, Jialin, Cristea, Alexandra I., Harit, Anoushka, Sun, Zhongtian, Aduragba, Olanrewaju Tahir, Shi, Lei, Moubayed, Noura Al

arXiv.org Artificial IntelligenceSep-2-2022

XAI with natural language processing aims to produce human-readable explanations as evidence for AI decision-making, which addresses explainability and transparency. However, from an HCI perspective, the current approaches only focus on delivering a single explanation, which fails to account for the diversity of human thoughts and experiences in language. This paper thus addresses this gap, by proposing a generative XAI framework, INTERACTION (explaIn aNd predicT thEn queRy with contextuAl CondiTional varIational autO-eNcoder). Our novel framework presents explanation in two steps: (step one) Explanation and Label Prediction; and (step two) Diverse Evidence Generation. We conduct intensive experiments with the Transformer architecture on a benchmark dataset, e-SNLI. Our method achieves competitive or better performance against state-of-the-art baseline models on explanation generation (up to 4.7% gain in BLEU) and prediction (up to 4.4% gain in accuracy) in step one; it can also generate multiple diverse explanations in step two.

architecture, arxiv preprint arxiv, explanation, (14 more...)

arXiv.org Artificial Intelligence

2209.01061

Country:

Europe > United Kingdom > England > Durham > Durham (0.04)
Asia > Middle East > Qatar > Ad-Dawhah > Doha (0.04)
Asia > Middle East > Jordan (0.04)

Genre:

Overview (1.00)
Research Report > Promising Solution (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (1.00)

Add feedback

Unsupervised EHR-based Phenotyping via Matrix and Tensor Decompositions

Becker, Florian, Smilde, Age K., Acar, Evrim

arXiv.org Machine LearningSep-1-2022

Computational phenotyping allows for unsupervised discovery of subgroups of patients as well as corresponding co-occurring medical conditions from electronic health records (EHR). Typically, EHR data contains demographic information, diagnoses and laboratory results. Discovering (novel) phenotypes has the potential to be of prognostic and therapeutic value. Providing medical practitioners with transparent and interpretable results is an important requirement and an essential part for advancing precision medicine. Low-rank data approximation methods such as matrix (e.g., non-negative matrix factorization) and tensor decompositions (e.g., CANDECOMP/PARAFAC) have demonstrated that they can provide such transparent and interpretable insights. Recent developments have adapted low-rank data approximation methods by incorporating different constraints and regularizations that facilitate interpretability further. In addition, they offer solutions for common challenges within EHR data such as high dimensionality, data sparsity and incompleteness. Especially extracting temporal phenotypes from longitudinal EHR has received much attention in recent years. In this paper, we provide a comprehensive review of low-rank approximation-based approaches for computational phenotyping. The existing literature is categorized into temporal vs. static phenotyping approaches based on matrix vs. tensor decompositions. Furthermore, we outline different approaches for the validation of phenotypes, i.e., the assessment of clinical significance.

artificial intelligence, deep learning, machine learning, (18 more...)

arXiv.org Machine Learning

doi: 10.1002/widm.1494

2209.00322

Country:

Africa > Senegal > Kolda Region > Kolda (0.05)
Asia > Middle East > Republic of Türkiye > Bingoel Province > Bingol (0.04)
Europe > Norway > Eastern Norway > Oslo (0.04)
(3 more...)

Genre:

Research Report > Experimental Study (1.00)
Overview (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

DLCSS: Dynamic Longest Common Subsequences

Bogdoll, Daniel, Rauch, Jonas, Zöllner, J. Marius

arXiv.org Artificial IntelligenceSep-1-2022

Autonomous driving is a key technology towards a brighter, more sustainable future. To enable such a future, it is necessary to utilize autonomous vehicles in shared mobility models. However, to evaluate, whether two or more route requests have the potential for a shared ride, is a compute-intensive task, if done by rerouting. In this work, we propose the Dynamic Longest Common Subsequences algorithm for fast and cost-efficient comparison of two routes for their compatibility, dynamically only incorporating parts of the routes which are suited for a shared trip. Based on this, one can also estimate, how many autonomous vehicles might be necessary to fulfill the local mobility demands. This can help providers to estimate the necessary fleet sizes, policymakers to better understand mobility patterns and cities to scale necessary infrastructure.

algorithm, artificial intelligence, meeting point, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/ICECCME55909.2022.9987849

2207.06061

Country:

Europe > Germany > North Rhine-Westphalia (0.04)
Europe > Germany > Baden-Württemberg > Karlsruhe Region > Karlsruhe (0.04)
Asia > Maldives (0.04)

Genre:

Overview (0.47)
Research Report (0.40)

Industry:

Transportation > Ground > Road (1.00)
Government (0.89)
Transportation > Passenger (0.72)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.78)

Add feedback

Learning with Differentiable Algorithms

Petersen, Felix

arXiv.org Artificial IntelligenceSep-1-2022

Classic algorithms and machine learning systems like neural networks are both abundant in everyday life. While classic computer science algorithms are suitable for precise execution of exactly defined tasks such as finding the shortest path in a large graph, neural networks allow learning from data to predict the most likely answer in more complex tasks such as image classification, which cannot be reduced to an exact algorithm. To get the best of both worlds, this thesis explores combining both concepts leading to more robust, better performing, more interpretable, more computationally efficient, and more data efficient architectures. The thesis formalizes the idea of algorithmic supervision, which allows a neural network to learn from or in conjunction with an algorithm. When integrating an algorithm into a neural architecture, it is important that the algorithm is differentiable such that the architecture can be trained end-to-end and gradients can be propagated back through the algorithm in a meaningful way. To make algorithms differentiable, this thesis proposes a general method for continuously relaxing algorithms by perturbing variables and approximating the expectation value in closed form, i.e., without sampling. In addition, this thesis proposes differentiable algorithms, such as differentiable sorting networks, differentiable renderers, and differentiable logic gate networks. Finally, this thesis presents alternative training strategies for learning with algorithms.

alternative optimization method, categorical probability distribution, conditional swap operation, (16 more...)

arXiv.org Artificial Intelligence

2209.00616

Country:

North America > United States > Texas > Travis County > Austin (0.04)
North America > United States > New York (0.04)
North America > United States > New Jersey > Atlantic County > Atlantic City (0.04)
(6 more...)

Genre:

Research Report > Experimental Study (1.00)
Overview (1.00)
Research Report > Promising Solution (0.92)
Research Report > New Finding (0.67)

Industry:

Health & Medicine (1.00)
Education > Educational Setting (0.45)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

An Empirical Study and Analysis of Learning Generalizable Manipulation Skill in the SAPIEN Simulator

Liu, Kun, Fu, Huiyuan, Zhang, Zheng, Yin, Huanpu

arXiv.org Artificial IntelligenceAug-31-2022

This paper provides a brief overview of our submission to the no interaction track of SAPIEN ManiSkill Challenge 2021. Our approach follows an end-to-end pipeline which mainly consists of two steps: we first extract the point cloud features of multiple objects; then we adopt these features to predict the action score of the robot simulators through a deep and wide transformer-based network. More specially, %to give guidance for future work, to open up avenues for exploitation of learning manipulation skill, we present an empirical study that includes a bag of tricks and abortive attempts. Finally, our method achieves a promising ranking on the leaderboard. All code of our solution is available at https://github.com/liu666666/bigfish\_codes.

generalizable policy learning, iteration, simulator, (10 more...)

arXiv.org Artificial Intelligence

2208.14646

Country:

Asia > China > Beijing > Beijing (0.05)
North America > United States > California > Los Angeles County > Long Beach (0.04)

Genre: Overview (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Probabilistic Deduction: an Approach to Probabilistic Structured Argumentation

Fan, Xiuyi

arXiv.org Artificial IntelligenceAug-31-2022

This paper introduces Probabilistic Deduction (PD) as an approach to probabilistic structured argumentation. A PD framework is composed of probabilistic rules (p-rules). As rules in classical structured argumentation frameworks, p-rules form deduction systems. In addition, p-rules also represent conditional probabilities that define joint probability distributions. With PD frameworks, one performs probabilistic reasoning by solving Rule-Probabilistic Satisfiability. At the same time, one can obtain an argumentative reading to the probabilistic reasoning with arguments and attacks. In this work, we introduce a probabilistic version of the Closed-World Assumption (P-CWA) and prove that our probabilistic approach coincides with the complete extension in classical argumentation under P-CWA and with maximum entropy reasoning. We present several approaches to compute the joint probability distribution from p-rules for achieving a practical proof theory for PD. PD provides a framework to unify probabilistic reasoning with argumentative reasoning. This is the first work in probabilistic structured argumentation where the joint distribution is not assumed form external sources.

argument, pd framework, probability, (15 more...)

arXiv.org Artificial Intelligence

2209.0021

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > New York > New York County > New York City (0.04)
Europe > Netherlands (0.04)
(2 more...)

Genre:

Research Report (0.81)
Overview (0.67)

Industry:

Law (0.46)
Education (0.45)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.34)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.34)

Add feedback

Council Post: A Primer To Robotic Process Automation

#artificialintelligenceAug-30-2022, 09:20:53 GMT

It's important to acknowledge that RPA is not new. It is an extension of the record-and-playback approach used in test automation. But where RPA differs from test automation is that while test automation is meant to check for any break in application functionality by automating regression tests, RPA is used in automating business processes. While people commonly say that test automation involves code and RPA doesn't, there are scenarios where RPA also involves some coding. Now, test automation has evolved over time, and there is a clear approach on how to get started with test automation and use it to the best extent possible to achieve the results that an organization wants from it.

automation, business process, test automation, (9 more...)

#artificialintelligence

Genre: Overview (0.40)

Technology: Information Technology > Artificial Intelligence > Robots (0.91)

Add feedback

Fault Detection for Non-Condensing Boilers using Simulated Building Automation System Sensor Data

Shohet, Rony, Kandil, Mohamed, Wang, Y., McArthur, J. J.

arXiv.org Artificial IntelligenceAug-30-2022

Building performance has been shown to degrade significantly after commissioning, resulting in increased energy consumption and associated greenhouse gas emissions. Continuous Commissioning using existing sensor networks and IoT devices has the potential to minimize this waste by continually identifying system degradation and re-tuning control strategies to adapt to real building performance. Due to its significant contribution to greenhouse gas emissions, the performance of gas boiler systems for building heating is critical. A review of boiler performance studies has been used to develop a set of common faults and degraded performance conditions, which have been integrated into a MATLAB/Simulink emulator. This resulted in a labeled dataset with approximately 10,000 simulations of steady-state performance for each of 14 non-condensing boilers. The collected data is used for training and testing fault classification using K-nearest neighbour, Decision tree, Random Forest, and Support Vector Machines. The results show that the Support Vector Machines method gave the best prediction accuracy, consistently exceeding 90%, and generalization across multiple boilers is not possible due to low classification accuracy.

artificial intelligence, boiler, machine learning, (19 more...)

arXiv.org Artificial Intelligence

doi: 10.1016/j.aei.2020.101176

2205.08418

Country: North America > United States (0.45)

Genre:

Overview (1.00)
Research Report > New Finding (0.34)

Industry:

Energy > Oil & Gas > Upstream (1.00)
Energy > Renewable (0.95)
Construction & Engineering > HVAC (0.72)
Energy > Power Industry (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.66)

Add feedback