AITopics | Asia

Collaborating Authors

Asia

Optimizing Hybrid Spreading in Metapopulations

Zhang, Changwang, Zhou, Shi, Miller, Joel C., Cox, Ingemar J., Chain, Benjamin M.

arXiv.org Artificial IntelligenceMar-31-2015

Epidemic spreading phenomena are ubiquitous in nature and society. Examples include the spreading of diseases, information, and computer viruses. Epidemics can spread by local spreading, where infected nodes can only infect a limited set of direct target nodes and global spreading, where an infected node can infect every other node. In reality, many epidemics spread using a hybrid mixture of both types of spreading. In this study we develop a theoretical framework for studying hybrid epidemics, and examine the optimum balance between spreading mechanisms in terms of achieving the maximum outbreak size. We show the existence of critically hybrid epidemics where neither spreading mechanism alone can cause a noticeable spread but a combination of the two spreading mechanisms would produce an enormous outbreak. Our results provide new strategies for maximising beneficial epidemics and estimating the worst outcome of damaging hybrid epidemics.

epidemic, node, subpopulation, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1038/srep09924

1409.7291

Country:

South America > Brazil (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > New York (0.04)
(4 more...)

Genre: Research Report > New Finding (0.68)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.94)
Health & Medicine > Epidemiology (0.68)
Information Technology > Security & Privacy (0.67)

Technology:

Information Technology > Artificial Intelligence (0.68)
Information Technology > Security & Privacy (0.49)
Information Technology > Communications > Networks (0.47)

Add feedback

Thompson Sampling for Learning Parameterized Markov Decision Processes

Gopalan, Aditya, Mannor, Shie

arXiv.org Machine LearningMar-31-2015

We consider reinforcement learning in parameterized Markov Decision Processes (MDPs), where the parameterization may induce correlation across transition probabilities or rewards. Consequently, observing a particular state transition might yield useful information about other, unobserved, parts of the MDP. We present a version of Thompson sampling for parameterized reinforcement learning problems, and derive a frequentist regret bound for priors over general parameter spaces. The result shows that the number of instants where suboptimal actions are chosen scales logarithmically with time, with high probability. It holds for prior distributions that put significant probability near the true model, without any additional, specific closed-form structure such as conjugate or product-form priors. The constant factor in the logarithmic scaling encodes the information complexity of learning the MDP in terms of the Kullback-Leibler geometry of the parameter space.

artificial intelligence, bayesian inference, machine learning, (18 more...)

arXiv.org Machine Learning

1406.7498

Country: Asia (0.46)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

Nonparametric Relational Topic Models through Dependent Gamma Processes

Xuan, Junyu, Lu, Jie, Zhang, Guangquan, Da Xu, Richard Yi, Luo, Xiangfeng

arXiv.org Machine LearningMar-30-2015

Traditional Relational Topic Models provide a way to discover the hidden topics from a document network. Many theoretical and practical tasks, such as dimensional reduction, document clustering, link prediction, benefit from this revealed knowledge. However, existing relational topic models are based on an assumption that the number of hidden topics is known in advance, and this is impractical in many real-world applications. Therefore, in order to relax this assumption, we propose a nonparametric relational topic model in this paper. Instead of using fixed-dimensional probability distributions in its generative model, we use stochastic processes. Specifically, a gamma process is assigned to each document, which represents the topic interest of this document. Although this method provides an elegant solution, it brings additional challenges when mathematically modeling the inherent network structure of typical document network, i.e., two spatially closer documents tend to have more similar topics. Furthermore, we require that the topics are shared by all the documents. In order to resolve these challenges, we use a subsampling strategy to assign each document a different gamma process from the global gamma process, and the subsampling probabilities of documents are assigned with a Markov Random Field constraint that inherits the document network structure. Through the designed posterior inference algorithm, we can discover the hidden topics and its number simultaneously. Experimental results on both synthetic and real-world network datasets demonstrate the capabilities of learning the hidden topics and, more importantly, the number of topics.

data mining, machine learning, natural language, (18 more...)

arXiv.org Machine Learning

1503.08542

Country:

Asia > China (0.68)
North America > United States > New York (0.14)

Genre: Research Report (0.82)

Industry: Education (0.93)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)
(2 more...)

Add feedback

A simple coding for cross-domain matching with dimension reduction via spectral graph embedding

Shimodaira, Hidetoshi

arXiv.org Machine LearningMar-29-2015

Data vectors are obtained from multiple domains. They are feature vectors of images or vector representations of words. Domains may have different numbers of data vectors with different dimensions. These data vectors from multiple domains are projected to a common space by linear transformations in order to search closely related vectors across domains. We would like to find projection matrices to minimize distances between closely related data vectors. This formulation of cross-domain matching is regarded as an extension of the spectral graph embedding to multi-domain setting, and it includes several multivariate analysis methods of statistics such as multiset canonical correlation analysis, correspondence analysis, and principal component analysis. Similar approaches are very popular recently in pattern recognition and vision. In this paper, instead of proposing a novel method, we will introduce an embarrassingly simple idea of coding the data vectors for explaining all the above mentioned approaches. A data vector is concatenated with zero vectors from all other domains to make an augmented vector. The cross-domain matching is solved by applying the single-domain version of spectral graph embedding to these augmented vectors of all the domains. An interesting connection to the classical associative memory model of neural networks is also discussed by noticing a coding for association. A cross-validation method for choosing the dimension of the common space and a regularization parameter will be discussed in an illustrative numerical example.

artificial intelligence, machine learning, vector, (15 more...)

arXiv.org Machine Learning

1412.838

Country: Asia > Japan (0.14)

Genre: Research Report (0.70)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Inferring Team Task Plans from Human Meetings: A Generative Modeling Approach with Logic-Based Prior

Kim, Been, Chacha, Caleb M., Shah, Julie A.

Journal of Artificial Intelligence ResearchMar-27-2015

We aim to reduce the burden of programming and deploying autonomous systems to work in concert with people in time-critical domains such as military field operations and disaster response. Deployment plans for these operations are frequently negotiated on-the-fly by teams of human planners. A human operator then translates the agreed-upon plan into machine instructions for the robots. We present an algorithm that reduces this translation burden by inferring the final plan from a processed form of the human team's planning conversation. Our hybrid approach combines probabilistic generative modeling with logical plan validation used to compute a highly structured prior over possible plans, enabling us to overcome the challenge of performing inference over a large solution space with only a small amount of noisy data from the team planning session. We validate the algorithm through human subject experimentations and show that it is able to infer a human team's final plan with 86% accuracy on average. We also describe a robot demonstration in which two people plan and execute a first-response collaborative task with a PR2 robot. To the best of our knowledge, this is the first work to integrate a logical planning technique within a generative model to perform plan inference.

final plan, predicate, utterance, (16 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.4496

AI Access Foundation

10932

Journal of Artificial Intelligence Research

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
Asia > Middle East > Jordan (0.04)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.04)
North America > United States > Florida (0.04)

Genre: Research Report (0.93)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.68)
Government > Military (0.67)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling > Plan Recognition (1.00)
(3 more...)

Add feedback

Bayesian Cross Validation and WAIC for Predictive Prior Design in Regular Asymptotic Theory

Watanabe, Sumio

arXiv.org Machine LearningMar-27-2015

Prior design is one of the most important problems in both statistics and machine learning. The cross validation (CV) and the widely applicable information criterion (WAIC) are predictive measures of the Bayesian estimation, however, it has been difficult to apply them to find the optimal prior because their mathematical properties in prior evaluation have been unknown and the region of the hyperparameters is too wide to be examined. In this paper, we derive a new formula by which the theoretical relation among CV, WAIC, and the generalization loss is clarified and the optimal hyperparameter can be directly found. By the formula, three facts are clarified about predictive prior design. Firstly, CV and WAIC have the same second order asymptotic expansion, hence they are asymptotically equivalent to each other as the optimizer of the hyperparameter. Secondly, the hyperparameter which minimizes CV or WAIC makes the average generalization loss to be minimized asymptotically but does not the random generalization loss. And lastly, by using the mathematical relation between priors, the variances of the optimized hyperparameters by CV and WAIC are made smaller with small computational costs. Also we show that the optimized hyperparameter by DIC or the marginal likelihood does not minimize the average or random generalization loss in general.

artificial intelligence, machine learning, waic, (19 more...)

arXiv.org Machine Learning

1503.0797

Country: Asia > Japan (0.28)

Genre: Research Report (0.50)

Add feedback

Variational Optimization of Annealing Schedules

Kiwaki, Taichi

arXiv.org Machine LearningMar-27-2015

Annealed importance sampling (AIS) is a common algorithm to estimate partition functions of useful stochastic models. One important problem for obtaining accurate AIS estimates is the selection of an annealing schedule. Conventionally, an annealing schedule is often determined heuristically or is simply set as a linearly increasing sequence. In this paper, we propose an algorithm for the optimal schedule by deriving a functional that dominates the AIS estimation error and by numerically minimizing this functional. We experimentally demonstrate that the proposed algorithm mostly outperforms conventional scheduling schemes with large quantization numbers.

artificial intelligence, machine learning, varopt0, (17 more...)

arXiv.org Machine Learning

1502.05313

Country: Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.47)

Add feedback

Bayesian Reconstruction of Missing Observations

Kataoka, Shun, Yasuda, Muneki, Tanaka, Kazuyuki

arXiv.org Machine LearningMar-26-2015

We focus on an interpolation method referred to Bayesian reconstruction in this paper. Whereas in standard interpolation methods missing data are interpolated deterministically, in Bayesian reconstruction, missing data are interpolated probabilistically using a Bayesian treatment. In this paper, we address the framework of Bayesian reconstruction and its application to the traffic data reconstruction problem in the field of traffic engineering. In the latter part of this paper, we describe the evaluation of the statistical performance of our Bayesian traffic reconstruction model using a statistical mechanical approach and clarify its statistical behavior.

artificial intelligence, bayesian inference, machine learning, (17 more...)

arXiv.org Machine Learning

1404.5793

Country: Asia > Japan > Honshū > Tōhoku (0.15)

Genre: Research Report (0.64)

Industry: Transportation > Ground > Road (0.36)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.97)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.97)

Add feedback

On Gridless Sparse Methods for Line Spectral Estimation From Complete and Incomplete Data

Yang, Zai, Xie, Lihua

arXiv.org Machine LearningMar-26-2015

Abstract--This paper is concerned about sparse, continuous frequency estimation in line spectral estimation, and focused on developing gridless sparse methods which overcome grid mismatches and correspond to limiting scenarios of existing grid-based approaches, e.g., We generalize AST (atomic-norm soft thresholding) to the case of nonconsecutively sampled data (incomplete data) inspired by recent atomic norm based techniques. We present a gridless version of SPICE (gridless SPICE, or GLS), which is applicable to both complete and incomplete data without the knowledge of noise level. We further prove the equivalence between GLS and atomic norm-based techniques under different assumptions of noise. Moreover, we extend GLS to a systematic framework consisting of model order selection and robust frequency estimation, and present feasible algorithms for AST and GLS. Numerical simulations are provided to validate our theoretical analysis and demonstrate performance of our methods compared to existing ones. Spectral analysis of signals [1] is a major problem in statistical signal processing. In this paper we are concerned about the line spectral estimation problem which has wide applications in communications, radar, sonar, seismology, astronomy and so on. C is the measurement noise. The sinusoid numberK M, usually referred to as the model order, is typically unknown in practice. Following from [2], the case when the signal is observed on [M ] is referred to as the complete data case while the other case when only samples on Ω [M ] are available is called the incomplete data case (or missing data case), in which the samples on the complementary set of Ω, Ω, [M ]\ Ω, are called missing data. Manuscript November 2013; accepted by IEEE Transactions on Signal Processing March 2015. The authors are with the School of Electrical and Electronic Engineering, Nanyang Technological University, 639798, Singapore (email: { yangzai, elhxie } @ntu.edu.sg). Frequency estimation and model order selection are two important topics in line spectral estimation. 's can be obtained by a simple least-squares method according to (1). This paper is mainly focused on frequency estimation but we also incorporate existing model order selection tools in our methods. Many methods have been proposed for frequency estimation. Common classical methods include periodogram (or beamforming), nonlinear least squares (NLS) and MUSIC but often have limitations (see the review in [1]). For example, the periodogram suffers from leakage problems and have difficulties in resolving closely separated frequencies [1]. It is worth noting that the recent iterative adaptive approach (IAA) [4], [5] reduces the leakage of periodogram.

artificial intelligence, bayesian inference, machine learning, (15 more...)

arXiv.org Machine Learning

1407.249

Country: Asia > Singapore (0.24)

Genre: Research Report (0.40)

Industry: Energy (0.34)

Technology:

Information Technology > Data Science (0.87)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.47)
(2 more...)

Add feedback

Stable Feature Selection from Brain sMRI

Xin, Bo, Hu, Lingjing, Wang, Yizhou, Gao, Wen

arXiv.org Machine LearningMar-25-2015

Neuroimage analysis usually involves learning thousands or even millions of variables using only a limited number of samples. In this regard, sparse models, e.g. the lasso, are applied to select the optimal features and achieve high diagnosis accuracy. The lasso, however, usually results in independent unstable features. Stability, a manifest of reproducibility of statistical results subject to reasonable perturbations to data and the model (Yu 2013), is an important focus in statistics, especially in the analysis of high dimensional data. In this paper, we explore a nonnegative generalized fused lasso model for stable feature selection in the diagnosis of Alzheimer's disease. In addition to sparsity, our model incorporates two important pathological priors: the spatial cohesion of lesion voxels and the positive correlation between the features and the disease labels. To optimize the model, we propose an efficient algorithm by proving a novel link between total variation and fast network flow algorithms via conic duality. Experiments show that the proposed nonnegative model performs much better in exploring the intrinsic structure of data via selecting stable features compared with other state-of-the-arts.

algorithm, artificial intelligence, machine learning, (18 more...)

arXiv.org Machine Learning

1503.07508

Country: Asia > China (0.14)

Genre: Research Report (0.65)

Industry: Health & Medicine > Therapeutic Area > Neurology > Alzheimer's Disease (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

Add feedback