AITopics | Oceania

Collaborating Authors

Oceania

Towards Fair Deep Clustering With Multi-State Protected Variables

arXiv.org Machine LearningJan-28-2019

Fair clustering under the disparate impact doctrine requires that population of each protected group should be approximately equal in every cluster. Previous work investigated a difficult-to-scale pre-processing step for $k$-center and $k$-median style algorithms for the special case of this problem when the number of protected groups is two. In this work, we consider a more general and practical setting where there can be many protected groups. To this end, we propose Deep Fair Clustering, which learns a discriminative but fair cluster assignment function. The experimental results on three public datasets with different types of protected attribute show that our approach can steadily improve the degree of fairness while only having minor loss in terms of clustering quality.

dataset, fair deep clustering, fairness, (13 more...)

arXiv.org Machine Learning

1901.10053

Country:

Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Oregon > Multnomah County > Portland (0.04)
North America > United States > New York > Richmond County > New York City (0.04)
(12 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.69)

Add feedback

Will big data make everyone healthier?

#artificialintelligenceJan-27-2019, 06:39:23 GMT

New Zealand's first ever health conference looking at artificial intelligence was organised to give the health system "a shake-up". Dr Marise Stuart, an organiser of the Hack Aotearoa conference, said New Zealand's heath system needed to "pick up its game" in the use of artificial intelligence (AI). Last week's conference explored how predictive data, smart technologies and robotics could improve the health of New Zealanders. "There's a junior doctors' strike going on now and a lot of that could be prevented by having good IT because junior doctors spend so much of their time typing. That's just one small snag in the health system that AI could hugely have an effect on."

artificial intelligence, data mining, machine learning, (17 more...)

#artificialintelligence

Country: Oceania > New Zealand (0.47)

Industry:

Health & Medicine > Health Care Providers & Services (1.00)
Health & Medicine > Therapeutic Area > Ophthalmology/Optometry (0.53)

Technology:

Information Technology > Data Science > Data Mining > Big Data (0.42)
Information Technology > Artificial Intelligence > Machine Learning (0.32)

Add feedback

Example and Feature importance-based Explanations for Black-box Machine Learning Models

Adhikari, Ajaya, Tax, D. M. J, Satta, Riccardo, Fath, Matthias

arXiv.org Artificial IntelligenceJan-27-2019

As machine learning models become more accurate, they typically become more complex and uninterpretable by humans. The black-box character of these models holds back its acceptance in practice, especially in high-risk domains where the consequences of failure could be catastrophic such as health-care or defense. Providing understandable and useful explanations behind ML models or predictions can increase the trust of the user. Example-based reasoning, which entails leveraging previous experience with analogous tasks to make a decision, is a well known strategy for problem solving and justification. This work presents a new explanation extraction method called LEAFAGE, for a prediction made by any black-box ML model. The explanation consists of the visualization of similar examples from the training set and the importance of each feature. Moreover, these explanations are contrastive which aims to take the expectations of the user into account. LEAFAGE is evaluated in terms of fidelity to the underlying black-box model and usefulness to the user. The results showed that LEAFAGE performs overall better than the current state-of-the-art method LIME in terms of fidelity, on ML models with non-linear decision boundary. A user-study was conducted which focused on revealing the differences between example-based and feature importance-based explanations. It showed that example-based explanations performed significantly better than feature importance-based explanation, in terms of perceived transparency, information sufficiency, competence and confidence. Counter-intuitively, when the gained knowledge of the participants was tested, it showed that they learned less about the black-box model after seeing a feature importance-based explanation than seeing no explanation at all. The participants found feature importance-based explanation vague and hard to generalize it to other instances.

explanation, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

1812.09044

Country:

North America > United States > Iowa > Story County > Ames (0.04)
Europe > Netherlands > South Holland > Delft (0.04)
Oceania (0.04)
(3 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Transportation > Air (1.00)
Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.46)

Add feedback

Automatic end-to-end De-identification: Is high accuracy the only metric?

Yogarajan, Vithya, Pfahringer, Bernhard, Mayo, Michael

arXiv.org Machine LearningJan-27-2019

De-identification of electronic health records (EHR) is a vital step towards advancing health informatics research and maximising the use of available data. It is a two-step process where step one is the identification of protected health information (PHI), and step two is replacing such PHI with surrogates. Despite the recent advances in automatic de-identification of EHR, significant obstacles remain if the abundant health data available are to be used to the full potential. Accuracy in de-identification could be considered a necessary, but not sufficient condition for the use of EHR without individual patient consent. We present here a comprehensive review of the progress to date, both the impressive successes in achieving high accuracy and the significant risks and challenges that remain. To best of our knowledge, this is the first paper to present a complete picture of end-to-end automatic de-identification. We review 18 recently published automatic de-identification systems -designed to de-identify EHR in the form of free text- to show the advancements made in improving the overall accuracy of the system, and in identifying individual PHI. We argue that despite the improvements in accuracy there remain challenges in surrogate generation and replacements of identified PHIs, and the risks posed to patient protection and privacy.

de-identification system, f-measure, uzuner, (14 more...)

arXiv.org Machine Learning

1901.10583

Country:

North America > United States (0.68)
Oceania > New Zealand > North Island > Waikato > Hamilton (0.04)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Health Care Technology > Medical Record (1.00)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Add feedback

Stiffness: A New Perspective on Generalization in Neural Networks

Fort, Stanislav, Nowak, Paweł Krzysztof, Narayanan, Srini

arXiv.org Machine LearningJan-27-2019

We investigate neural network training and generalization using the concept of stiffness. We measure how stiff a network is by looking at how a small gradient step on one example affects the loss on another example. In particular, we study how stiffness varies with 1) class membership, 2) distance between data points (in the input space as well as in latent spaces), 3) training iteration, and 4) learning rate. We empirically study the evolution of stiffness on MNIST, FASHION MNIST, CIFAR-10 and CIFAR-100 using fully-connected and convolutional neural networks. Our results demonstrate that stiffness is a useful concept for diagnosing and characterizing generalization. We observe that small learning rates lead to initial learning of more specific features that do not translate well to improvements on inputs from all classes, whereas high learning rates initially benefit all classes at once. We measure stiffness as a function of distance between data points and observe that higher learning rates induce positive correlation between changes in loss further apart, pointing towards a regularization effect of learning rate. When training on CIFAR-100, the stiffness matrix exhibits a coarse-grained behavior suggestive of the model's awareness of super-class membership.

gradient update, neural network, stiffness, (14 more...)

arXiv.org Machine Learning

1901.09491

Country:

Oceania > Australia > New South Wales > Sydney (0.14)
Europe > Switzerland > Zürich > Zürich (0.14)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Genre: Research Report > New Finding (0.87)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

Disentangling in Variational Autoencoders with Natural Clustering

Antoran, Javier, Miguel, Antonio

arXiv.org Machine LearningJan-27-2019

Learning representations that disentangle the underlying factors of variability in data is an intuitive precursor to AI with human-like reasoning. Consequently, it has been the object of many efforts of the machine learning community. This work takes a step further in this direction by addressing the scenario where generative factors present a multimodal distribution due to the existence of class distinction in the data. We formulate a lower bound on the joint distribution of inputs and class labels and present N-VAE, a model which is capable of separating factors of variation which are exclusive to certain classes from factors that are shared among classes. This model implements the natural clustering prior through the use of a class-conditioned latent space and a shared latent space. We show its usefulness for detecting and disentangling class-dependent generative factors as well as for generating rich artificial samples.

latent variable, n-vae, representation, (16 more...)

arXiv.org Machine Learning

1901.09415

Country:

Europe > Spain > Aragón > Zaragoza Province > Zaragoza (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Google Testing Out Coffee Delivery By Drones In Australia

#artificialintelligenceJan-26-2019, 06:56:38 GMT

Is drone delivery the future of deliveries? We know that some companies are already working on such technology, and a recent video published by the Wall Street Journal has revealed that over in the state of Canberra, Australia, Google is already testing out and making deliveries by autonomous drones. Through an app that users can download, they will be able to select the various food items that they want, and the order will be sent to the restaurant who will then prepare the food, package it, and attach it to the drone who will then fly to the customer. From what we can tell, customers will be able to choose their delivery location, meaning that say you're in the park, you can just drop a pin on your location and it'll be able to deliver to you. This is versus more conventional delivery services which usually require you to have a fixed address like a home or an office.

australia, coffee delivery, delivery, (4 more...)

#artificialintelligence

Country: Oceania > Australia > Australian Capital Territory > Canberra (0.28)

Industry: Consumer Products & Services (0.61)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)

Add feedback

Efficient Toxicity Prediction via Simple Features Using Shallow Neural Networks and Decision Trees

Karim, Abdul, Mishra, Avinash, Newton, M A Hakim, Sattar, Abdul

arXiv.org Machine LearningJan-26-2019

Toxicity prediction of chemical compounds is a grand challenge. Lately, it achieved significant progress in accuracy but using a huge set of features, implementing a complex blackbox technique such as a deep neural network, and exploiting enormous computational resources. In this paper, we strongly argue for the models and methods that are simple in machine learning characteristics, efficient in computing resource usage, and powerful to achieve very high accuracy levels. To demonstrate this, we develop a single task-based chemical toxicity prediction framework using only 2D features that are less compute intensive. We effectively use a decision tree to obtain an optimum number of features from a collection of thousands of them. We use a shallow neural network and jointly optimize it with decision tree taking both network parameters and input features into account. Our model needs only a minute on a single CPU for its training while existing methods using deep neural networks need about 10 min on NVidia Tesla K40 GPU. However, we obtain similar or better performance on several toxicity benchmark tasks. We also develop a cumulative feature ranking method which enables us to identify features that can help chemists perform prescreening of toxic compounds effectively.

auc-roc, neural network, optimization, (15 more...)

arXiv.org Machine Learning

1901.0924

Country:

Oceania > Australia (0.04)
North America > United States > Michigan (0.04)
Europe > Hungary (0.04)
Asia > India > NCT > New Delhi (0.04)

Genre:

Research Report > New Finding (0.68)
Research Report > Experimental Study (0.67)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Distributed Convolutional Dictionary Learning (DiCoDiLe): Pattern Discovery in Large Images and Signals

Moreau, Thomas, Gramfort, Alexandre

arXiv.org Machine LearningJan-26-2019

Convolutional dictionary learning (CDL) estimates shift invariant basis adapted to multidimensional data. CDL has proven useful for image denoising or inpainting, as well as for pattern discovery on multivariate signals. As estimated patterns can be positioned anywhere in signals or images, optimization techniques face the difficulty of working in extremely high dimensions with millions of pixels or time samples, contrarily to standard patch-based dictionary learning. To address this optimization problem, this work proposes a distributed and asynchronous algorithm, employing locally greedy coordinate descent and an asynchronous locking mechanism that does not require a central server. This algorithm can be used to distribute the computation on a number of workers which scales linearly with the encoded signal's size. Experiments confirm the scaling properties which allows us to learn patterns on large scales images from the Hubble Space Telescope.

algorithm, complexity, dicodile-z, (11 more...)

arXiv.org Machine Learning

1901.09235

Country:

Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > Texas > Dallas County > Dallas (0.04)
North America > United States > Oregon > Multnomah County > Portland (0.04)
(6 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

Add feedback

How does Disagreement Help Generalization against Label Corruption?

Yu, Xingrui, Han, Bo, Yao, Jiangchao, Niu, Gang, Tsang, Ivor W., Sugiyama, Masashi

arXiv.org Machine LearningJan-26-2019

Learning with noisy labels is one of the hottest problems in weakly-supervised learning. Based on memorization effects of deep neural networks, training on small-loss instances becomes very promising for handling noisy labels. This fosters the state-of-the-art approach "Co-teaching" that cross-trains two deep neural networks using the small-loss trick. However, with the increase of epochs, two networks converge to a consensus and Co-teaching reduces to the self-training MentorNet. To tackle this issue, we propose a robust learning paradigm called Co-teaching+, which bridges the "Update by Disagreement" strategy with the original Co-teaching. First, two networks feed forward and predict all data, but keep prediction disagreement data only. Then, among such disagreement data, each network selects its small-loss data, but back propagates the small-loss data from its peer network and updates its own parameters. Empirical results on benchmark datasets demonstrate that Co-teaching+ is much superior to many state-of-the-art methods in the robustness of trained models.

co-teaching, epoch, noisy label, (13 more...)

arXiv.org Machine Learning

1901.04215

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Oceania > Australia > New South Wales > Sydney (0.04)

Genre: Research Report > Promising Solution (0.54)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback