Goto

Collaborating Authors

 South America


iiot machinelearning_2021-10-29_03-56-37.xlsx

#artificialintelligence

The graph represents a network of 3,441 Twitter users whose tweets in the requested range contained "iiot machinelearning", or who were replied to or mentioned in those tweets. The network was obtained from the NodeXL Graph Server on Friday, 29 October 2021 at 11:01 UTC. The requested start date was Friday, 29 October 2021 at 00:01 UTC and the maximum number of tweets (going backward in time) was 7,500. The tweets in the network were tweeted over the 2-day, 15-hour, 36-minute period from Tuesday, 26 October 2021 at 08:24 UTC to Friday, 29 October 2021 at 00:00 UTC. Additional tweets that were mentioned in this data set were also collected from prior time periods.


Transparency of Deep Neural Networks for Medical Image Analysis: A Review of Interpretability Methods

arXiv.org Artificial Intelligence

Artificial Intelligence has emerged as a useful aid in numerous clinical applications for diagnosis and treatment decisions. Deep neural networks have shown same or better performance than clinicians in many tasks owing to the rapid increase in the available data and computational power. In order to conform to the principles of trustworthy AI, it is essential that the AI system be transparent, robust, fair and ensure accountability. Current deep neural solutions are referred to as black-boxes due to a lack of understanding of the specifics concerning the decision making process. Therefore, there is a need to ensure interpretability of deep neural networks before they can be incorporated in the routine clinical workflow. In this narrative review, we utilized systematic keyword searches and domain expertise to identify nine different types of interpretability methods that have been used for understanding deep learning models for medical image analysis applications based on the type of generated explanations and technical similarities. Furthermore, we report the progress made towards evaluating the explanations produced by various interpretability methods. Finally we discuss limitations, provide guidelines for using interpretability methods and future directions concerning the interpretability of deep neural networks for medical imaging analysis.


JEDAI Explains Decision-Making AI

arXiv.org Artificial Intelligence

This paper presents JEDAI, an AI system designed for outreach and educational efforts aimed at non-AI experts. JEDAI features a novel synthesis of research ideas from integrated task and motion planning and explainable AI. JEDAI helps users create high-level, intuitive plans while ensuring that they will be executable by the robot. It also provides users customized explanations about errors and helps improve their understanding of AI planning as well as the limits and capabilities of the underlying robot system.


Oh, This Game Set in Latin America Has a Coup? How Original

WIRED

For quite some time, I've felt a deep unease playing shooting games set in the modern world. While I'm always delighted to have 11-year-olds pulverize me in Fortnite, or to drop into a zombie-infested city for make-believe fun, when it comes to more realistic shooters I get hung up on the details. For games in the Call of Duty or Tom Clancy franchises, these details usually entail an express ride through a soul-crushing wheel of stereotypes and a kaleidoscope of ahistorical musings extracted from a fictional mashup of the Cold War and the war on drugs. Likewise, as a historian of Latin America and someone who grew up in a Mexican-American community on the US–Mexico border, the genre's ongoing obsession with depicting everything south of my hometown as simultaneously exotic, corrupt, and tyrannical is tedious at best and enraging at worst. So when the reviews for Far Cry 6 started trickling into cyberspace, I wasn't surprised to read that the it rehashed all of the worst stereotypes we've come to expect from video games set in Latin America.


Adversarial Robustness with Semi-Infinite Constrained Learning

arXiv.org Machine Learning

Despite strong performance in numerous applications, the fragility of deep learning to input perturbations has raised serious questions about its use in safety-critical domains. While adversarial training can mitigate this issue in practice, state-of-the-art methods are increasingly application-dependent, heuristic in nature, and suffer from fundamental trade-offs between nominal performance and robustness. Moreover, the problem of finding worst-case perturbations is non-convex and underparameterized, both of which engender a non-favorable optimization landscape. Thus, there is a gap between the theory and practice of adversarial training, particularly with respect to when and why adversarial training works. In this paper, we take a constrained learning approach to address these questions and to provide a theoretical foundation for robust learning. In particular, we leverage semi-infinite optimization and non-convex duality theory to show that adversarial training is equivalent to a statistical problem over perturbation distributions, which we characterize completely. Notably, we show that a myriad of previous robust training techniques can be recovered for particular, sub-optimal choices of these distributions. Using these insights, we then propose a hybrid Langevin Monte Carlo approach of which several common algorithms (e.g., PGD) are special cases. Finally, we show that our approach can mitigate the trade-off between nominal and robust performance, yielding state-of-the-art results on MNIST and CIFAR-10. Our code is available at: https://github.com/arobey1/advbench.


Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method

arXiv.org Artificial Intelligence

Transformers are expensive to train due to the quadratic time and space complexity in the self-attention mechanism. On the other hand, although kernel machines suffer from the same computation bottleneck in pairwise dot products, several approximation schemes have been successfully incorporated to considerably reduce their computational cost without sacrificing too much accuracy. In this work, we leverage the computation methods for kernel machines to alleviate the high computational cost and introduce Skyformer, which replaces the softmax structure with a Gaussian kernel to stabilize the model training and adapts the Nystr\"om method to a non-positive semidefinite matrix to accelerate the computation. We further conduct theoretical analysis by showing that the matrix approximation error of our proposed method is small in the spectral norm. Experiments on Long Range Arena benchmark show that the proposed method is sufficient in getting comparable or even better performance than the full self-attention while requiring fewer computation resources.


Australian fintech Completes Major US Acquisition

#artificialintelligence

Remitter, a fintech platform that improves bill payment rates by communicating to consumers digitally, has just finalised its acquisition of US-based Mercantile Adjustment Bureau, following a successful pre-IPO USD $12m cap raise led by Canaccord Genuity. The raise was cornerstoned by Allium Capital and Casey Capital. Remitter is a white-label communications platform, founded in Australia, which uses AI to optimise customer engagement and enhance the recovery of accounts receivables. Currently many organisations face challenges in collecting bill payments on time, with 46% of customers paying late according to Aite Group research. Remitter entered the US market in 2020, following two years of development including compliance across states and territories and collaboration with clients to ensure an optimal feature set. "Entering the US market involved ensuring the Remitter platform was compliant in all 52 states, each with its own different laws and regulations around payments.


Two-sided fairness in rankings via Lorenz dominance

arXiv.org Artificial Intelligence

We consider the problem of generating rankings that are fair towards both users and item producers in recommender systems. We address both usual recommendation (e.g., of music or movies) and reciprocal recommendation (e.g., dating). Following concepts of distributive justice in welfare economics, our notion of fairness aims at increasing the utility of the worse-off individuals, which we formalize using the criterion of Lorenz efficiency. It guarantees that rankings are Pareto efficient, and that they maximally redistribute utility from better-off to worse-off, at a given level of overall utility. We propose to generate rankings by maximizing concave welfare functions, and develop an efficient inference procedure based on the Frank-Wolfe algorithm. We prove that unlike existing approaches based on fairness constraints, our approach always produces fair rankings. Our experiments also show that it increases the utility of the worse-off at lower costs in terms of overall utility.


Human Activity Recognition using Attribute-Based Neural Networks and Context Information

arXiv.org Artificial Intelligence

We consider human activity recognition (HAR) from wearable sensor data in manual-work processes, like warehouse order-picking. Such structured domains can often be partitioned into distinct process steps, e.g., packaging or transporting. Each process step can have a different prior distribution over activity classes, e.g., standing or walking, and different system dynamics. Here, we show how such context information can be integrated systematically into a deep neural network-based HAR system. Specifically, we propose a hybrid architecture that combines a deep neural network-that estimates high-level movement descriptors, attributes, from the raw-sensor data-and a shallow classifier, which predicts activity classes from the estimated attributes and (optional) context information, like the currently executed process step. We empirically show that our proposed architecture increases HAR performance, compared to state-of-the-art methods. Additionally, we show that HAR performance can be further increased when information about process steps is incorporated, even when that information is only partially correct.


Concept and Attribute Reduction Based on Rectangle Theory of Formal Concept

arXiv.org Artificial Intelligence

Based on rectangle theory of formal concept and set covering theory, the concept reduction preserving binary relations is investigated in this paper. It is known that there are three types of formal concepts: core concepts, relative necessary concepts and unnecessary concepts. First, we present the new judgment results for relative necessary concepts and unnecessary concepts. Second, we derive the bounds for both the maximum number of relative necessary concepts and the maximum number of unnecessary concepts and it is a difficult problem as either in concept reduction preserving binary relations or attribute reduction of decision formal contexts, the computation of formal contexts from formal concepts is a challenging problem. Third, based on rectangle theory of formal concept, a fast algorithm for reducing attributes while preserving the extensions for a set of formal concepts is proposed using the extension bit-array technique, which allows multiple context cells to be processed by a single 32-bit or 64-bit operator. Technically, the new algorithm could store both formal context and extent of a concept as bit-arrays, and we can use bit-operations to process set operations "or" as well as "and". One more merit is that the new algorithm does not need to consider other concepts in the concept lattice, thus the algorithm is explicit to understand and fast. Experiments demonstrate that the new algorithm is effective in the computation of attribute reductions.