AITopics | Oceania

Collaborating Authors

Oceania

Multi-Fusion Chinese WordNet (MCW) : Compound of Machine Learning and Manual Correction

arXiv.org Artificial IntelligenceFeb-5-2020

Princeton WordNet (PWN) is a lexicon-semantic network based on cognitive linguistics, which promotes the development of natural language processing. Based on PWN, five Chinese wordnets have been developed to solve the problems of syntax and semantics. They include: Northeastern University Chinese WordNet (NEW), Sinica Bilingual Ontological WordNet (BOW), Southeast University Chinese WordNet (SEW), Taiwan University Chinese WordNet (CWN), Chinese Open WordNet (COW). By using them, we found that these word networks have low accuracy and coverage, and cannot completely portray the semantic network of PWN. So we decided to make a new Chinese wordnet called Multi-Fusion Chinese Wordnet (MCW) to make up those shortcomings. The key idea is to extend the SEW with the help of Oxford bilingual dictionary and Xinhua bilingual dictionary, and then correct it. More specifically, we used machine learning and manual adjustment in our corrections. Two standards were formulated to help our work. We conducted experiments on three tasks including relatedness calculation, word similarity and word sense disambiguation for the comparison of lemma's accuracy, at the same time, coverage also was compared. The results indicate that MCW can benefit from coverage and accuracy via our method. However, it still has room for improvement, especially with lemmas. In the future, we will continue to enhance the accuracy of MCW and expand the concepts in it.

artificial intelligence, natural language, text processing, (17 more...)

arXiv.org Artificial Intelligence

2002.01761

Country:

Asia > Taiwan (0.24)
Asia > China > Shanghai > Shanghai (0.05)
Oceania > Tonga (0.04)
(6 more...)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)

Add feedback

Robust Boosting for Regression Problems

Ju, Xiaomeng, Salibián-Barrera, Matías

arXiv.org Machine LearningFeb-5-2020

The gradient boosting algorithm constructs a regression estimator using a linear combination of simple "base learners". In order to obtain a robust non-parametric regression estimator that is scalable to high dimensional problems we propose a robust boosting algorithm based on a two-stage approach, similar to what is done for robust linear regression: we first minimize a robust residual scale estimator, and then improve its efficiency by optimizing a bounded loss function. Unlike previous proposals, our algorithm does not need to compute an ad-hoc residual scale estimator in each step. Since our loss functions are typically non-convex, we propose initializing our algorithm with an $L_1$ regression tree, which is fast to compute. We also introduce a robust variable importance metric for variable selection that is calculated via a permutation procedure. Through simulated and real data experiments, we compare our method against gradient boosting with squared loss and other robust boosting methods in the literature. With clean data, our method works equally well as gradient boosting with the squared loss. With symmetric and asymmetrically contaminated data, we show that our proposed method outperforms in terms of prediction error and variable selection accuracy.

estimator, gradient, loss function, (17 more...)

arXiv.org Machine Learning

2002.02054

Country:

Europe > Austria > Vienna (0.14)
Oceania > Australia > Tasmania (0.04)
North America > Canada > British Columbia (0.04)
(2 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.67)

Add feedback

Semiparametric Bayesian Forecasting of Spatial Earthquake Occurrences

Kolev, Aleksandar A., Ross, Gordon J.

arXiv.org Machine LearningFeb-5-2020

Self-exciting Hawkes processes are used to model events which cluster in time and space, and have been widely studied in seismology under the name of the Epidemic Type Aftershock Sequence (ETAS) model. In the ETAS framework, the occurrence of the mainshock earthquakes in a geographical region is assumed to follow an inhomogeneous spatial point process, and aftershock events are then modelled via a separate triggering kernel. Most previous studies of the ETAS model have relied on point estimates of the model parameters due to the complexity of the likelihood function, and the difficulty in estimating an appropriate mainshock distribution. In order to take estimation uncertainty into account, we instead propose a fully Bayesian formulation of the ETAS model which uses a nonparametric Dirichlet process mixture prior to capture the spatial mainshock process. Direct inference for the resulting model is problematic due to the strong correlation of the parameters for the mainshock and triggering processes, so we instead use an auxiliary latent variable routine to perform efficient inference.

catalog, earthquake, eta model, (17 more...)

arXiv.org Machine Learning

2002.01706

Country:

North America > United States > California (0.14)
Europe > Italy (0.05)
Europe > Romania (0.04)
(9 more...)

Genre: Research Report (0.50)

Industry:

Law Enforcement & Public Safety (1.00)
Government > Regional Government (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

Temporal-adaptive Hierarchical Reinforcement Learning

Zhou, Wen-Ji, Yu, Yang

arXiv.org Artificial IntelligenceFeb-5-2020

Hierarchical reinforcement learning (HRL) helps address large-scale and sparse reward issues in reinforcement learning. In HRL, the policy model has an inner representation structured in levels. With this structure, the reinforcement learning task is expected to be decomposed into corresponding levels with sub-tasks, and thus the learning can be more efficient. In HRL, although it is intuitive that a high-level policy only needs to make macro decisions in a low frequency, the exact frequency is hard to be simply determined. Previous HRL approaches often employed a fixed-time skip strategy or learn a terminal condition without taking account of the context, which, however, not only requires manual adjustments but also sacrifices some decision granularity. In this paper, we propose the \emph{temporal-adaptive hierarchical policy learning} (TEMPLE) structure, which uses a temporal gate to adaptively control the high-level policy decision frequency. We train the TEMPLE structure with PPO and test its performance in a range of environments including 2-D rooms, Mujoco tasks, and Atari games. The results show that the TEMPLE structure can lead to improved performance in these environments with a sequential adaptive high-level control.

internal action, reinforcement, temple, (15 more...)

arXiv.org Artificial Intelligence

2002.0208

Country:

North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Oceania > Australia (0.04)
North America > United States > New York (0.04)
(8 more...)

Genre: Research Report > New Finding (0.34)

Industry: Leisure & Entertainment > Games > Computer Games (0.55)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.33)

Add feedback

How to ensure artificial intelligence benefits society: A conversation with Stuart Russell and James Manyika

#artificialintelligenceFeb-4-2020, 17:33:25 GMT

Stuart Russell, a leading artificial-intelligence (AI) researcher at the University of California, Berkeley, and author of the book Human Compatible (Penguin Random House, October 2019), sits down with McKinsey Global Institute chairman James Manyika to discuss our future as AI transforms our world. In this broad conversation, they explore the immense benefits ahead and what our role will be as AI becomes more pervasive. They also delve into potential challenges we may face with our current approach to AI, and how we can redefine AI to ensure it helps humanity achieve its full potential. James Manyika: When you look at the AI field today and you see all these announcements and breakthroughs, what excites you the most? Stuart Russell: With today's technology, delivering high-quality education to everybody on Earth is just the beginning. Even fairly simple AI tutoring tools have been shown to be very effective. So that can only get better if we figure out how to roll it out to the people who really need it.

james manyika, objective, stuart russell, (11 more...)

#artificialintelligence

Country:

North America > United States > California > Alameda County > Berkeley (0.25)
Oceania > Australia (0.05)
North America > United States > California > San Francisco County > San Francisco (0.04)

Genre: Personal > Interview (0.40)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

New Zealand to use AI to determine deforestation levels « Carbon Pulse

#artificialintelligenceFeb-4-2020, 12:19:26 GMT

New Zealand will be among the first in the world to use an artificial intelligence programme to calculate deforestation levels, which will be used when reporting emissions data to the UNFCCC as well as under its domestic emissions trading scheme. You must be logged in to post a comment.

carbon pulse, new zealand, use ai

#artificialintelligence

Country:

Oceania > New Zealand (0.88)
Asia (0.18)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

Linearly Constrained Neural Networks

Hendriks, Johannes, Jidling, Carl, Wills, Adrian, Schön, Thomas

arXiv.org Machine LearningFeb-4-2020

We present an approach to designing neural network based models that will explicitly satisfy known linear constraints. To achieve this, the target function is modelled as a linear transformation of an underlying function. This transformation is chosen such that any prediction of the target function is guaranteed to satisfy the constraints and can be determined from known physics or, more generally, by following a constructive procedure that was previously presented for Gaussian processes. The approach is demonstrated on simulated and real-data examples.

constraint, neural network, strain field, (16 more...)

arXiv.org Machine Learning

2002.016

Country:

Europe > Sweden > Uppsala County > Uppsala (0.04)
Oceania > Australia > New South Wales > Callaghan (0.04)
North America > United States > New Jersey (0.04)
(2 more...)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Bayesian Networks in Healthcare: Distribution by Medical Condition

McLachlan, Scott, Dube, Kudakwashe, Hitman, Graham A, Fenton, Norman E, Kyrimi, Evangelia

arXiv.org Artificial IntelligenceFeb-4-2020

Bayesian networks (BNs) have received increasing research attention that is not matched by adoption in practice and yet have potential to significantly benefit healthcare. Hitherto, research works have not investigated the types of medical conditions being modelled with BNs, nor whether any differences exist in how and why they are applied to different conditions. This research seeks to identify and quantify the range of medical conditions for which healthcare-related BN models have been proposed, and the differences in approach between the most common medical conditions to which they have been applied. We found that almost two-thirds of all healthcare BNs are focused on four conditions: cardiac, cancer, psychological and lung disorders. We believe that a lack of understanding regarding how BNs work and what they are capable of exists, and that it is only with greater understanding and promotion that we may ever realise the full potential of BNs to effect positive change in daily healthcare practice.

bayesian network, international conference, medical condition, (14 more...)

arXiv.org Artificial Intelligence

2002.00224

Country:

Europe > United Kingdom > England > Greater London > London (0.04)
Europe > Latvia > Riga Municipality > Riga (0.04)
Oceania > New Zealand (0.04)
Asia > India (0.04)

Genre: Research Report > Experimental Study (0.46)

Industry:

Health & Medicine > Therapeutic Area > Pulmonary/Respiratory Diseases (1.00)
Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Therapeutic Area > Neurology (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

On Positive-Unlabeled Classification in GAN

Guo, Tianyu, Xu, Chang, Huang, Jiajun, Wang, Yunhe, Shi, Boxin, Xu, Chao, Tao, Dacheng

arXiv.org Machine LearningFeb-4-2020

This paper defines a positive and unlabeled classification problem for standard GANs, which then leads to a novel technique to stabilize the training of the discriminator in GANs. Traditionally, real data are taken as positive while generated data are negative. This positive-negative classification criterion was kept fixed all through the learning process of the discriminator without considering the gradually improved quality of generated data, even if they could be more realistic than real data at times. In contrast, it is more reasonable to treat the generated data as unlabeled, which could be positive or negative according to their quality. The discriminator is thus a classifier for this positive and unlabeled classification problem, and we derive a new Positive-Unlabeled GAN (PUGAN). We theoretically discuss the global optimality the proposed model will achieve and the equivalent optimization goal. Empirically, we find that PUGAN can achieve comparable or even better performance than those sophisticated discriminator stabilization methods.

algorithm, dataset, gan, (15 more...)

arXiv.org Machine Learning

2002.01136

Country:

Oceania > Australia > New South Wales > Sydney (0.04)
Asia > China (0.04)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Newton-ADMM: A Distributed GPU-Accelerated Optimizer for Multiclass Classification Problems

Fang, Chih-Hao, Kylasa, Sudhir B, Roosta, Fred, Mahoney, Michael W., Grama, Ananth

arXiv.org Machine LearningFeb-4-2020

First-order optimization methods, such as stochastic gradient descent (SGD) and its variants, are widely used in machine learning applications due to their simplicity and low per-iteration costs. However, they often require larger numbers of iterations, with associated communication costs in distributed environments. In contrast, Newton-type methods, while having higher per-iteration costs, typically require a significantly smaller number of iterations, which directly translates to reduced communication costs. In this paper, we present a novel distributed optimizer for classification problems, which integrates a GPU-accelerated Newton-type solver with the global consensus formulation of Alternating Direction of Method Multipliers (ADMM). By leveraging the communication efficiency of ADMM, GPU-accelerated inexact-Newton solver, and an effective spectral penalty parameter selection strategy, we show that our proposed method (i) yields better generalization performance on several classification problems; (ii) significantly outperforms state-of-the-art methods in distributed time to solution; and (iii) offers better scaling on large distributed platforms.

dataset, newton-admm, solver, (15 more...)

arXiv.org Machine Learning

1807.07132

Country:

North America > United States > Indiana > Tippecanoe County > Lafayette (0.04)
Oceania > Australia > Queensland (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.55)

Add feedback