AITopics

2303.10462

Country:

North America > United States (1.00)
Asia (1.00)
Europe > France (0.93)
Europe > United Kingdom > England (0.67)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Health & Medicine (1.00)
Energy > Oil & Gas > Upstream (1.00)
Education > Educational Setting (0.93)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

Kolahdouz-Rahimi, Shekoufeh, Lano, Kevin, Lin, Chenghua

Requirement Formalisation using Natural Language Processing and Machine Learning: A Systematic Review

arXiv.org Artificial IntelligenceMar-18-2023

Improvement of software development methodologies attracts developers to automatic Requirement Formalisation (RF) in the Requirement Engineering (RE) field. The potential advantages by applying Natural Language Processing (NLP) and Machine Learning (ML) in reducing the ambiguity and incompleteness of requirement written in natural languages is reported in different studies. The goal of this paper is to survey and classify existing work on NLP and ML for RF, identifying challenges in this domain and providing promising future research directions. To achieve this, we conducted a systematic literature review to outline the current state-of-the-art of NLP and ML techniques in RF by selecting 257 papers from common used libraries. The search result is filtered by defining inclusion and exclusion criteria and 47 relevant studies between 2012 and 2022 are selected. We found that heuristic NLP approaches are the most common NLP techniques used for automatic RF, primary operating on structured and semi-structured data. This study also revealed that Deep Learning (DL) technique are not widely used, instead classical ML techniques are predominant in the surveyed studies. More importantly, we identified the difficulty of comparing the performance of different approaches due to the lack of standard benchmark cases for RF.

artificial intelligence, machine learning, natural language, (16 more...)

2303.13365

Country:

Europe > United Kingdom > England > Greater London > London (0.04)
Europe > United Kingdom > England > South Yorkshire > Sheffield (0.04)
Asia > Middle East > Iran > Tehran Province > Tehran (0.04)

Genre:

Research Report (1.00)
Overview (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

arXiv.org Artificial IntelligenceMar-18-2023

Representation Bias in Data: A Survey on Identification and Resolution Techniques

Shahbazi, Nima, Lin, Yin, Asudeh, Abolfazl, Jagadish, H. V.

Data-driven algorithms are only as good as the data they work with, while data sets, especially social data, often fail to represent minorities adequately. Representation Bias in data can happen due to various reasons ranging from historical discrimination to selection and sampling biases in the data acquisition and preparation methods. Given that "bias in, bias out", one cannot expect AI-based solutions to have equitable outcomes for societal applications, without addressing issues such as representation bias. While there has been extensive study of fairness in machine learning models, including several review papers, bias in the data has been less studied. This paper reviews the literature on identifying and resolving representation bias as a feature of a data set, independent of how consumed later. The scope of this survey is bounded to structured (tabular) and unstructured (e.g., image, text, graph) data. It presents taxonomies to categorize the studied techniques based on multiple design dimensions and provides a side-by-side comparison of their properties. There is still a long way to fully address representation bias issues in data. The authors hope that this survey motivates researchers to approach these challenges in the future by observing existing work within their respective domains.

data mining, machine learning, pattern recognition, (18 more...)

doi: 10.1145/3588433

2203.11852

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
Africa > Eswatini > Manzini > Manzini (0.04)
North America > United States > Michigan (0.04)
(3 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.67)
Research Report > Experimental Study (0.45)

Industry:

Law (1.00)
Health & Medicine > Therapeutic Area (1.00)
Education > Educational Setting (1.00)
(2 more...)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(8 more...)

#artificialintelligenceMar-17-2023, 11:18:09 GMT

Artificial Intelligence Moving Towards Brighter Future in Healthcare Industry

Digital health is one of the emerging areas in the today's technologically advanced world. Penetration of artificial intelligence (AI) in the healthcare sector has many benefits including the discovery of diseases and development of new products. The data handling and analyzing the capacity of AI is at par as compared to humans. So, the effective utilization of AI would help in saving time and lives of patients. Moreover, it has benefits for the firms manufacturing new products and drugs in the healthcare domain.

artificial intelligence, brighter future, healthcare industry, (1 more...)

#artificialintelligence

Genre: Overview (0.49)

Industry: Health & Medicine > Health Care Providers & Services (0.60)

Technology: Information Technology > Artificial Intelligence (1.00)

Hu, Ruimeng, Laurière, Mathieu

Recent Developments in Machine Learning Methods for Stochastic Control and Games

Stochastic optimal control and games have found a wide range of applications, from finance and economics to social sciences, robotics and energy management. Many real-world applications involve complex models which have driven the development of sophisticated numerical methods. Recently, computational methods based on machine learning have been developed for stochastic control problems and games. We review such methods, with a focus on deep learning algorithms that have unlocked the possibility to solve such problems even when the dimension is high or when the structure is very complex, beyond what is feasible with traditional numerical methods. Here, we consider mostly the continuous time and continuous space setting. Many of the new approaches build on recent neural-network based methods for high-dimensional partial differential equations or backward stochastic differential equations, or on model-free reinforcement learning for Markov decision processes that have led to breakthrough results. In this paper we provide an introduction to these methods and summarize state-of-the-art works on machine learning for stochastic control and games.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

2303.10257

Country:

Europe (0.92)
North America > United States > California (0.45)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Information Technology (1.00)
Banking & Finance > Trading (1.00)
Energy > Oil & Gas (0.92)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.34)

Landgraf, Steven, Wursthorn, Kira, Hillemann, Markus, Ulrich, Markus

DUDES: Deep Uncertainty Distillation using Ensembles for Semantic Segmentation

Deep neural networks lack interpretability and tend to be overconfident, which poses a serious problem in safety-critical applications like autonomous driving, medical imaging, or machine vision tasks with high demands on reliability. Quantifying the predictive uncertainty is a promising endeavour to open up the use of deep neural networks for such applications. Unfortunately, current available methods are computationally expensive. In this work, we present a novel approach for efficient and reliable uncertainty estimation which we call Deep Uncertainty Distillation using Ensembles for Segmentation (DUDES). DUDES applies student-teacher distillation with a Deep Ensemble to accurately approximate predictive uncertainties with a single forward pass while maintaining simplicity and adaptability. Experimentally, DUDES accurately captures predictive uncertainties without sacrificing performance on the segmentation task and indicates impressive capabilities of identifying wrongly classified pixels and out-of-domain samples on the Cityscapes dataset. With DUDES, we manage to simultaneously simplify and outperform previous work on Deep Ensemble-based Uncertainty Distillation.

artificial intelligence, machine learning, student, (18 more...)

2303.09843

Country:

North America > United States > New York > New York County > New York City (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > Florida > Miami-Dade County > Miami (0.04)
(3 more...)

Genre:

Overview (0.66)
Research Report > Promising Solution (0.34)

Industry:

Information Technology (0.34)
Transportation > Ground > Road (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Provably Convergent Subgraph-wise Sampling for Fast GNN Training

Wang, Jie, Shi, Zhihao, Liang, Xize, Ji, Shuiwang, Li, Bin, Wu, Feng

Subgraph-wise sampling -- a promising class of mini-batch training techniques for graph neural networks (GNNs -- is critical for real-world applications. During the message passing (MP) in GNNs, subgraph-wise sampling methods discard messages outside the mini-batches in backward passes to avoid the well-known neighbor explosion problem, i.e., the exponentially increasing dependencies of nodes with the number of MP iterations. However, discarding messages may sacrifice the gradient estimation accuracy, posing significant challenges to their convergence analysis and convergence speeds. To address this challenge, we propose a novel subgraph-wise sampling method with a convergence guarantee, namely Local Message Compensation (LMC). To the best of our knowledge, LMC is the first subgraph-wise sampling method with provable convergence. The key idea is to retrieve the discarded messages in backward passes based on a message passing formulation of backward passes. By efficient and effective compensations for the discarded messages in both forward and backward passes, LMC computes accurate mini-batch gradients and thus accelerates convergence. Moreover, LMC is applicable to various MP-based GNN architectures, including convolutional GNNs (finite message passing iterations with different layers) and recurrent GNNs (infinite message passing iterations with a shared layer). Experiments on large-scale benchmarks demonstrate that LMC is significantly faster than state-of-the-art subgraph-wise sampling methods.

lmc4conv, machine learning, natural language, (21 more...)

2303.11081

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > United States > Texas > Brazos County > College Station (0.14)
Asia > China > Anhui Province > Hefei (0.04)
(5 more...)

Genre:

Research Report (0.63)
Personal (0.45)
Overview (0.45)

Industry:

Health & Medicine (0.45)
Education (0.45)
Government (0.34)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Communications (0.94)
(4 more...)

Boukhers, Zeyd, Asundi, Nagaraj Bahubali

Deep Author Name Disambiguation using DBLP Data

In the academic world, the number of scientists grows every year and so does the number of authors sharing the same names. Consequently, it challenging to assign newly published papers to their respective authors. Therefore, Author Name Ambiguity (ANA) is considered a critical open problem in digital libraries. This paper proposes an Author Name Disambiguation (AND) approach that links author names to their real-world entities by leveraging their co-authors and domain of research. To this end, we use data collected from the DBLP repository that contains more than 5 million bibliographic records authored by around 2.6 million co-authors. Our approach first groups authors who share the same last names and same first name initials. The author within each group is identified by capturing the relation with his/her co-authors and area of research, represented by the titles of the validated publications of the corresponding author. To this end, we train a neural network model that learns from the representations of the co-authors and titles. We validated the effectiveness of our approach by conducting extensive experiments on a large dataset.

data mining, machine learning, natural language, (19 more...)

2303.10067

Country:

Europe > Germany (0.04)
Asia > Japan > Honshū > Tōhoku > Miyagi Prefecture > Sendai (0.04)
Europe > Italy (0.04)

Genre:

Overview (0.93)
Research Report > New Finding (0.46)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(3 more...)

Litrico, Mattia, Del Bue, Alessio, Morerio, Pietro

Guiding Pseudo-labels with Uncertainty Estimation for Source-free Unsupervised Domain Adaptation

Standard Unsupervised Domain Adaptation (UDA) methods assume the availability of both source and target data during the adaptation. In this work, we investigate Source-free Unsupervised Domain Adaptation (SF-UDA), a specific case of UDA where a model is adapted to a target domain without access to source data. We propose a novel approach for the SF-UDA setting based on a loss reweighting strategy that brings robustness against the noise that inevitably affects the pseudo-labels. The classification loss is reweighted based on the reliability of the pseudo-labels that is measured by estimating their uncertainty. Guided by such reweighting strategy, the pseudo-labels are progressively refined by aggregating knowledge from neighbouring samples. Furthermore, a self-supervised contrastive framework is leveraged as a target space regulariser to enhance such knowledge aggregation. A novel negative pairs exclusion strategy is proposed to identify and exclude negative pairs made of samples sharing the same class, even in presence of some noise in the pseudo-labels. Our method outperforms previous methods on three major benchmarks by a large margin. We set the new SF-UDA state-of-the-art on VisDA-C and DomainNet with a performance gain of +1.8% on both benchmarks and on PACS with +12.3% in the single-source setting and +6.6% in multi-target adaptation. Additional analyses demonstrate that the proposed approach is robust to the noise, which results in significantly more accurate pseudo-labels compared to state-of-the-art approaches.

artificial intelligence, deep learning, machine learning, (15 more...)

2303.0377

Country:

North America > United States (0.05)
Asia > Middle East > Jordan (0.04)

Genre:

Research Report > Promising Solution (0.54)
Overview > Innovation (0.54)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Amadini, Roberto, Gabbrielli, Maurizio, Liu, Tong, Mauro, Jacopo

On the Evaluation of (Meta-)solver Approaches

Journal of Artificial Intelligence ResearchMar-17-2023

Meta-solver approaches exploit many individual solvers to potentially build a better solver. To assess the performance of meta-solvers, one can adopt the metrics typically used for individual solvers (e.g., runtime or solution quality) or employ more specific evaluation metrics (e.g., by measuring how close the meta-solver gets to its virtual best performance). In this paper, based on some recently published works, we provide an overview of different performance metrics for evaluating (meta-)solvers by exposing their strengths and weaknesses.

individual solver, scenario, solver, (15 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.1.14102

AI Access Foundation

14102

Journal of Artificial Intelligence Research

Country:

North America > United States > District of Columbia > Washington (0.04)
North America > United States > Arizona > Maricopa County > Phoenix (0.04)
Europe > Middle East > Cyprus > Nicosia > Nicosia (0.04)
(5 more...)

Genre: Overview (0.54)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)