AITopics | Country

Collaborating Authors

Country

Learning the Structure of Deep Sparse Graphical Models

Adams, Ryan Prescott, Wallach, Hanna M., Ghahramani, Zoubin

arXiv.org Machine LearningAug-19-2010

Deep belief networks are a powerful way to model complex probability distributions. However, learning the structure of a belief network, particularly one with hidden units, is difficult. The Indian buffet process has been used as a nonparametric Bayesian prior on the directed structure of a belief network with a single infinitely wide hidden layer. In this paper, we introduce the cascading Indian buffet process (CIBP), which provides a nonparametric prior on the structure of a layered, directed belief network that is unbounded in both depth and width, yet allows tractable inference. We use the CIBP prior with the nonlinear Gaussian belief network so each unit can additionally vary its behavior between discrete and continuous representations. We provide Markov chain Monte Carlo algorithms for inference in these belief networks and explore the structures learned on several image data sets.

artificial intelligence, belief network, machine learning, (15 more...)

arXiv.org Machine Learning

1001.016

Country: North America > Canada > Ontario > Toronto (0.28)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

Modeling Spammer Behavior: Na\"ive Bayes vs. Artificial Neural Networks

Islam, Md. Saiful, Khaled, Shah Mostafa, Farhan, Khalid, Rahman, Md. Abdur, Rahman, Joy

arXiv.org Artificial IntelligenceAug-19-2010

Addressing the problem of spam emails in the Internet, this paper presents a comparative study on Na\"ive Bayes and Artificial Neural Networks (ANN) based modeling of spammer behavior. Keyword-based spam email filtering techniques fall short to model spammer behavior as the spammer constantly changes tactics to circumvent these filters. The evasive tactics that the spammer uses are themselves patterns that can be modeled to combat spam. It has been observed that both Na\"ive Bayes and ANN are best suitable for modeling spammer common patterns. Experimental results demonstrate that both of them achieve a promising detection rate of around 92%, which is considerably an improvement of performance compared to the keyword-based contemporary filtering approaches.

artificial intelligence, machine learning, spam filtering, (15 more...)

arXiv.org Artificial Intelligence

1008.3282

Country:

Asia > Bangladesh (0.15)
Oceania > Australia (0.15)
Oceania > New Zealand (0.14)
North America > United States (0.14)

Genre: Research Report > New Finding (0.35)

Technology:

Information Technology > Security & Privacy > Spam Filtering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.75)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.49)

Add feedback

A unifying view for performance measures in multi-class prediction

Jurman, Giuseppe, Furlanello, Cesare

arXiv.org Machine LearningAug-17-2010

In the last few years, many different performance measures have been introduced to overcome the weakness of the most natural metric, the Accuracy. Among them, Matthews Correlation Coefficient has recently gained popularity among researchers not only in machine learning but also in several application fields such as bioinformatics. Nonetheless, further novel functions are being proposed in literature. We show that Confusion Entropy, a recently introduced classifier performance measure for multi-class problems, has a strong (monotone) relation with the multi-class generalization of a classical metric, the Matthews Correlation Coefficient. Computational evidence in support of the claim is provided, together with an outline of the theoretical explanation.

artificial intelligence, confusion matrix, machine learning, (12 more...)

arXiv.org Machine Learning

doi: 10.1371/journal.pone.0041882

1008.2908

Country: Europe (0.28)

Genre: Research Report (0.50)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

A Minimum Relative Entropy Principle for Learning and Acting

Ortega, P. A., Braun, D. A.

Journal of Artificial Intelligence ResearchAug-16-2010

This paper proposes a method to construct an adaptive agent that is universal with respect to a given class of experts, where each expert is designed specifically for a particular environment. This adaptive control problem is formalized as the problem of minimizing the relative entropy of the adaptive agent from the expert that is most suitable for the unknown environment. If the agent is a passive observer, then the optimal solution is the well-known Bayesian predictor. However, if the agent is active, then its past actions need to be treated as causal interventions on the I/O stream rather than normal probability conditions. Here it is shown that the solution to this new variational problem is given by a stochastic controller called the Bayesian control rule, which implements adaptive behavior as a mixture of experts. Furthermore, it is shown that under mild assumptions, the Bayesian control rule converges to the control law of the most suitable expert.

agent, bayesian control rule, operation mode, (12 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.3062

AI Access Foundation

10659

Journal of Artificial Intelligence Research

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.46)
Europe > Poland (0.04)
South America > Chile (0.04)
(2 more...)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.93)

Add feedback

PMOG: The projected mixture of Gaussians model with application to blind source separation

Pendse, Gautam V.

arXiv.org Artificial IntelligenceAug-16-2010

We extend the mixtures of Gaussians (MOG) model to the projected mixture of Gaussians (PMOG) model. In the PMOG model, we assume that q dimensional input data points z_i are projected by a q dimensional vector w into 1-D variables u_i. The projected variables u_i are assumed to follow a 1-D MOG model. In the PMOG model, we maximize the likelihood of observing u_i to find both the model parameters for the 1-D MOG as well as the projection vector w. First, we derive an EM algorithm for estimating the PMOG model. Next, we show how the PMOG model can be applied to the problem of blind source separation (BSS). In contrast to conventional BSS where an objective function based on an approximation to differential entropy is minimized, PMOG based BSS simply minimizes the differential entropy of projected sources by fitting a flexible MOG model in the projected 1-D space while simultaneously optimizing the projection vector w. The advantage of PMOG over conventional BSS algorithms is the more flexible fitting of non-Gaussian source densities without assuming near-Gaussianity (as in conventional BSS) and still retaining computational feasibility.

artificial intelligence, differential entropy, machine learning, (16 more...)

arXiv.org Artificial Intelligence

1008.2743

Country: North America > United States (0.45)

Genre: Research Report (1.00)

Industry: Health & Medicine (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Epistemic irrelevance in credal nets: the case of imprecise Markov trees

de Cooman, Gert, Hermans, Filip, Antonucci, Alessandro, Zaffalon, Marco

arXiv.org Artificial IntelligenceAug-15-2010

We focus on credal nets, which are graphical models that generalise Bayesian nets to imprecise probability. We replace the notion of strong independence commonly used in credal nets with the weaker notion of epistemic irrelevance, which is arguably more suited for a behavioural theory of probability. Focusing on directed trees, we show how to combine the given local uncertainty models in the nodes of the graph into a global model, and we use this to construct and justify an exact message-passing algorithm that computes updated beliefs for a variable in the tree. The algorithm, which is linear in the number of nodes, is formulated entirely in terms of coherent lower previsions, and is shown to satisfy a number of rationality requirements. We supply examples of the algorithm's operation, and report an application to on-line character recognition that illustrates the advantages of our approach for prediction. We comment on the perspectives, opened by the availability, for the first time, of a truly efficient algorithm based on epistemic irrelevance.

artificial intelligence, independence, machine learning, (17 more...)

arXiv.org Artificial Intelligence

1008.2514

Country:

North America > United States (0.28)
Europe (0.28)

Genre: Research Report (0.81)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

Faithfulness in Chain Graphs: The Gaussian Case

Peña, Jose M.

arXiv.org Artificial IntelligenceAug-13-2010

Previously, it has been proven that for any undirected graph there exists a regular Gaussian distribution that is faithful to it (Lněnička & Matúš, 2007, Corollary 3). A stronger result has been proven for acyclic directed graphs: In certain measure-theoretic sense, almost all the regular Gaussian distributions that factorize with respect to an acyclic directed graph are faithful to it (Spirtes et al., 1993, Theorem 3.2). Therefore, this paper extends the latter result to chain graphs. It is worth mentioning that we have recently proved in (Peña, 2009) a result analogous to the one in this paper but for strictly positive discrete probability distributions with arbitrary prescribed sample space. It is also worth noticing that a result analogous to the one in this paper has been proven in (Levitz et al., 2001, Theorem 6.1) under the

artificial intelligence, machine learning, regular gaussian distribution, (15 more...)

arXiv.org Artificial Intelligence

1008.2277

Country: Europe (0.46)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.46)

Add feedback

Large Scale Variational Inference and Experimental Design for Sparse Generalized Linear Models

Seeger, Matthias W., Nickisch, Hannes

arXiv.org Machine LearningAug-12-2010

Many problems of low-level computer vision and image processing, such as denoising, deconvolution, tomographic reconstruction or super-resolution, can be addressed by maximizing the posterior distribution of a sparse linear model (SLM). We show how higher-order Bayesian decision-making problems, such as optimizing image acquisition in magnetic resonance scanners, can be addressed by querying the SLM posterior covariance, unrelated to the density's mode. We propose a scalable algorithmic framework, with which SLM posteriors over full, high-resolution images can be approximated for the first time, solving a variational optimization problem which is convex iff posterior mode finding is convex. These methods successfully drive the optimization of sampling trajectories for real-world magnetic resonance imaging through Bayesian experimental design, which has not been attempted before. Our methodology provides new insight into similarities and differences between sparse reconstruction and approximate Bayesian inference, and has important implications for compressive sensing of real-world images.

artificial intelligence, bayesian inference, machine learning, (17 more...)

arXiv.org Machine Learning

0810.0901

Country: Europe > Germany (0.67)

Genre: Research Report > New Finding (0.67)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (0.48)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.93)

Add feedback

RDFViewS: A Storage Tuning Wizard for RDF Applications

Goasdoué, François, Karanasos, Konstantinos, Leblay, Julien, Manolescu, Ioana

arXiv.org Artificial IntelligenceAug-12-2010

In recent years, the significant growth of RDF data used in numerous applications has made its efficient and scalable manipulation an important issue. In this paper, we present RDFViewS, a system capable of choosing the most suitable views to materialize, in order to minimize the query response time for a specific SPARQL query workload, while taking into account the view maintenance cost and storage space constraints. Our system employs practical algorithms and heuristics to navigate through the search space of potential view configurations, and exploits the possibly available semantic information - expressed via an RDF Schema - to ensure the completeness of the query evaluation.

artificial intelligence, query, rdfviews, (14 more...)

arXiv.org Artificial Intelligence

1008.2186

Country: North America > Canada > Ontario (0.15)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)

Add feedback

The effect of discrete vs. continuous-valued ratings on reputation and ranking systems

Medo, Matus, Wakeling, Joseph Rushton

arXiv.org Artificial IntelligenceAug-12-2010

When users rate objects, a sophisticated algorithm that takes into account ability or reputation may produce a fairer or more accurate aggregation of ratings than the straightforward arithmetic average. Recently a number of authors have proposed different co-determination algorithms where estimates of user and object reputation are refined iteratively together, permitting accurate measures of both to be derived directly from the rating data. However, simulations demonstrating these methods' efficacy assumed a continuum of rating values, consistent with typical physical modelling practice, whereas in most actual rating systems only a limited range of discrete values (such as a 5-star system) is employed. We perform a comparative test of several co-determination algorithms with different scales of discrete ratings and show that this seemingly minor modification in fact has a significant impact on algorithms' performance. Paradoxically, where rating resolution is low, increased noise in users' ratings may even improve the overall performance of the system.

algorithm, artificial intelligence, machine learning, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1209/0295-5075/91/48004

1001.3745

Country: Europe (0.93)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback