AITopics

1106.36

Country:

Europe > United Kingdom > England (0.28)
North America > United States > Massachusetts > Middlesex County (0.14)

Genre: Research Report (1.00)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)
Information Technology > Artificial Intelligence > Natural Language (0.87)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.68)

Rostamizadeh, Afshin, Agarwal, Alekh, Bartlett, Peter

Online and Batch Learning Algorithms for Data with Missing Features

arXiv.org Machine LearningJun-16-2011

We introduce new online and batch algorithms that are robust to data with missing features, a situation that arises in many practical applications. In the online setup, we allow for the comparison hypothesis to change as a function of the subset of features that is observed on any given round, extending the standard setting where the comparison hypothesis is fixed throughout. In the batch setup, we present a convex relation of a non-convex problem to jointly estimate an imputation function, used to fill in the values of missing features, along with the classification hypothesis. We prove regret bounds in the online setting and Rademacher complexity bounds for the batch i.i.d. setting. The algorithms are tested on several UCI datasets, showing superior performance over baselines.

algorithm, artificial intelligence, machine learning, (14 more...)

arXiv.org Machine Learning

1104.0729

Country: North America (0.28)

Genre: Research Report (0.64)

Industry: Education (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Kursa, Miron B., Komsta, Łukasz, Rudnicki, Witold R.

Random forest models of the retention constants in the thin layer chromatography

arXiv.org Artificial IntelligenceJun-16-2011

In the current study we examine an application of the machine learning methods to model the retention constants in the thin layer chromatography (TLC). This problem can be described with hundreds or even thousands of descriptors relevant to various molecular properties, most of them redundant and not relevant for the retention constant prediction. Hence we employed feature selection to significantly reduce the number of attributes. Additionally we have tested application of the bagging procedure to the feature selection. The random forest regression models were built using selected variables. The resulting models have better correlation with the experimental data than the reference models obtained with linear regression. The cross-validation confirms robustness of the models.

algorithm, artificial intelligence, machine learning, (14 more...)

1106.3361

Country: Europe > Poland (0.29)

Genre: Research Report > New Finding (0.48)

Industry: Materials > Chemicals > Commodity Chemicals (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.54)

Rampone, Salvatore, Russo, Ciro

A fuzzified BRAIN algorithm for learning DNF from incomplete data

arXiv.org Artificial IntelligenceJun-16-2011

Aim of this paper is to address the problem of learning Boolean functions from training data with missing values. We present an extension of the BRAIN algorithm, called U-BRAIN (Uncertainty-managing Batch Relevance-based Artificial INtelligence), conceived for learning DNF Boolean formulas from partial truth tables, possibly with uncertain values or missing bits. Such an algorithm is obtained from BRAIN by introducing fuzzy sets in order to manage uncertainty. In the case where no missing bits are present, the algorithm reduces to the original BRAIN.

artificial intelligence, inductive learning, machine learning, (18 more...)

doi: 10.1285/i20705948v5n2p256

1002.4014

Country:

Europe (0.28)
North America > United States (0.28)

Genre: Research Report (0.64)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Cho, Yoon-Sik, Steeg, Greg Ver, Galstyan, Aram

Co-evolution of Selection and Influence in Social Networks

arXiv.org Machine LearningJun-14-2011

Many networks are complex dynamical systems, where both attributes of nodes and topology of the network (link structure) can change with time. We propose a model of co-evolving networks where both node at- tributes and network structure evolve under mutual influence. Specifically, we consider a mixed membership stochastic blockmodel, where the probability of observing a link between two nodes depends on their current membership vectors, while those membership vectors themselves evolve in the presence of a link between the nodes. Thus, the network is shaped by the interaction of stochastic processes describing the nodes, while the processes themselves are influenced by the changing network structure. We derive an efficient variational inference procedure for our model, and validate the model on both synthetic and real-world data.

artificial intelligence, machine learning, senator, (17 more...)

arXiv.org Machine Learning

1106.2788

Country: North America > United States (1.00)

Genre: Research Report (0.64)

Industry: Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.68)

Dong, Xiaowen, Frossard, Pascal, Vandergheynst, Pierre, Nefedov, Nikolai

Clustering with Multi-Layer Graphs: A Spectral Perspective

arXiv.org Machine LearningJun-11-2011

Observational data usually comes with a multimodal nature, which means that it can be naturally represented by a multi-layer graph whose layers share the same set of vertices (users) with different edges (pairwise relationships). In this paper, we address the problem of combining different layers of the multi-layer graph for improved clustering of the vertices compared to using layers independently. We propose two novel methods, which are based on joint matrix factorization and graph regularization framework respectively, to efficiently combine the spectrum of the multiple graph layers, namely the eigenvectors of the graph Laplacian matrices. In each case, the resulting combination, which we call a "joint spectrum" of multiple graphs, is used for clustering the vertices. We evaluate our approaches by simulations with several real world social network datasets. Results demonstrate the superior or competitive performance of the proposed methods over state-of-the-art technique and common baseline methods, such as co-regularization and summation of information from individual graphs.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Machine Learning

doi: 10.1109/TSP.2012.2212886

1106.2233

Country: Europe > Switzerland (0.15)

Genre:

Research Report > Promising Solution (0.54)
Research Report > New Finding (0.48)

Industry: Information Technology (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.95)
Information Technology > Artificial Intelligence > Natural Language (0.93)
Information Technology > Communications (0.90)

Efficient Solution Algorithms for Factored MDPs

Guestrin, C., Koller, D., Parr, R., Venkataraman, S.

This paper addresses the problem of planning under uncertainty in large Markov Decision Processes (MDPs). Factored MDPs represent a complex state space using state variables and the transition model using a dynamic Bayesian network. This representation often allows an exponential reduction in the representation size of structured MDPs, but the complexity of exact solution algorithms for such MDPs can grow exponentially in the representation size. In this paper, we present two approximate solution algorithms that exploit structure in factored MDPs. Both use an approximate value function represented as a linear combination of basis functions, where each basis function involves only a small subset of the domain variables. A key contribution of this paper is that it shows how the basic operations of both algorithms can be performed efficiently in closed form, by exploiting both additive and context-specific structure in a factored MDP. A central element of our algorithms is a novel linear program decomposition technique, analogous to variable elimination in Bayesian networks, which reduces an exponentially large LP to a provably equivalent, polynomial-sized one. One algorithm uses approximate linear programming, and the second approximate dynamic programming. Our dynamic programming algorithm is novel in that it uses an approximation based on max-norm, a technique that more directly minimizes the terms that appear in error bounds for approximate MDP algorithms. We provide experimental results on problems with over 10^40 states, demonstrating a promising indication of the scalability of our approach, and compare our algorithm to an existing state-of-the-art approach, showing, in some problems, exponential gains in computation time.

algorithm, artificial intelligence, machine learning, (15 more...)

doi: 10.1613/jair.1000

1106.1822

Country:

Europe (1.00)
North America > United States > California (0.67)
North America > United States > Massachusetts (0.45)

Genre: Research Report > New Finding (0.45)

Industry: Government > Military (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Barzilay, R., Elhadad, N.

Inferring Strategies for Sentence Ordering in Multidocument News Summarization

The problem of organizing information for multidocument summarization so that the generated summary is coherent has received relatively little attention. While sentence ordering for single document summarization can be determined from the ordering of sentences in the input article, this is not the case for multidocument summarization where summary sentences may be drawn from different input articles. In this paper, we propose a methodology for studying the properties of ordering information in the news genre and describe experiments done on a corpus of multiple acceptable orderings we developed for the task. Based on these experiments, we implemented a strategy for ordering information that combines constraints from chronological order of events and topical relatedness. Evaluation of our augmented algorithm shows a significant improvement of the ordering over two baseline strategies.

artificial intelligence, information, natural language, (19 more...)

doi: 10.1613/jair.991

1106.182

Country:

North America > United States (1.00)
Europe (0.93)
Asia (0.67)

Genre:

Research Report > Experimental Study (0.46)
Personal > Obituary (0.46)

Industry:

Transportation (1.00)
Law > Criminal Law (0.68)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.67)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)

Accelerating Reinforcement Learning by Composing Solutions of Automatically Identified Subtasks

Drummond, C.

This paper discusses a system that accelerates reinforcement learning by using transfer from related tasks. Without such transfer, even if two tasks are very similar at some abstract level, an extensive re-learning effort is required. The system achieves much of its power by transferring parts of previously learned solutions rather than a single complete solution. The system exploits strong features in the multi-dimensional function produced by reinforcement learning in solving a particular task. These features are stable and easy to recognize early in the learning process. They generate a partitioning of the state space and thus the function. The partition is represented as a graph. This is used to index and compose functions stored in a case base to form a close approximation to the solution of the new task. Experiments demonstrate that function composition often produces more than an order of magnitude increase in learning rate compared to a basic reinforcement learning algorithm.

machine learning, reinforcement learning, snake, (14 more...)

doi: 10.1613/jair.904

1106.1796

Country:

North America > Canada (0.46)
Europe > United Kingdom (0.28)

Genre: Research Report (1.00)

Industry: Education (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Discovery of a missing disease spreader

Maeno, Yoshiharu

No sooner had a new year begun in 2003 than citizens were seized with panic in Guangdong in south China. Hundreds were suffered from a pneumonia-like strange disease, some of which had been dead. Both Chinese government and Chinese media remained silent all the time as to the risk of a possible epidemic. No one in the rest of the world knew there was any real cause for alarm. But in March, local outbreaks of a mysterious disease were reported in Hong Kong and Southeast Asian countries. The World Health Organization(WHO) issued a global alert. Even then, health authorities could not reveal where the disease had come from. This story at the onset of the Severe Acute Respiratory Syndrome (SARS) outbreak poses an interesting question. Is it possible to discover the presence of a missing disease spreader from the surveillance records on the cases in other regions?

artificial intelligence, bioinformatics, machine learning, (20 more...)

doi: 10.1016/j.physa.2011.05.005

1006.2322

Country: Asia > China > Guangdong Province (0.48)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Pulmonary/Respiratory Diseases (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Communications (0.69)
Information Technology > Biomedical Informatics (0.68)