Goto

Collaborating Authors

 Industry


Efficient independent component analysis

arXiv.org Machine Learning

Independent component analysis (ICA) has been widely used for blind source separation in many fields such as brain imaging analysis, signal processing and telecommunication. Many statistical techniques based on M-estimates have been proposed for estimating the mixing matrix. Recently, several nonparametric methods have been developed, but in-depth analysis of asymptotic efficiency has not been available. We analyze ICA using semiparametric theories and propose a straightforward estimate based on the efficient score function by using B-spline approximations. The estimate is asymptotically efficient under moderate conditions and exhibits better performance than standard ICA methods in a variety of simulations.


A Leaf Recognition Algorithm for Plant Classification Using Probabilistic Neural Network

arXiv.org Artificial Intelligence

In this paper, we employ Probabilistic Neural Network (PNN) with image and data processing techniques to implement a general purpose automated leaf recognition algorithm. 12 leaf features are extracted and orthogonalized into 5 principal variables which consist the input vector of the PNN. The PNN is trained by 1800 leaves to classify 32 kinds of plants with an accuracy greater than 90%. Compared with other approaches, our algorithm is an accurate artificial intelligence approach which is fast in execution and easy in implementation.


Learning Probabilistic Models of Word Sense Disambiguation

arXiv.org Artificial Intelligence

This dissertation presents several new methods of supervised and unsupervised learning of word sense disambiguation models. The supervised methods focus on performing model searches through a space of probabilistic models, and the unsupervised methods rely on the use of Gibbs Sampling and the Expectation Maximization (EM) algorithm. In both the supervised and unsupervised case, the Naive Bayesian model is found to perform well. An explanation for this success is presented in terms of learning rates and bias-variance decompositions.


Practical Approach to Knowledge-based Question Answering with Natural Language Understanding and Advanced Reasoning

arXiv.org Artificial Intelligence

This research hypothesized that a practical approach in the form of a solution framework known as Natural Language Understanding and Reasoning for Intelligence (NaLURI), which combines full-discourse natural language understanding, powerful representation formalism capable of exploiting ontological information and reasoning approach with advanced features, will solve the following problems without compromising practicality factors: 1) restriction on the nature of question and response, and 2) limitation to scale across domains and to real-life natural language text.


Learning Symbolic Models of Stochastic Domains

Journal of Artificial Intelligence Research

In this article, we work towards the goal of developing agents that can learn to act in complex worlds. We develop a probabilistic, relational planning rule representation that compactly models noisy, nondeterministic action effects, and show how such rules can be effectively learned. Through experiments in simple planning domains and a 3D simulated blocks world with realistic physics, we demonstrate that this learning algorithm allows agents to effectively model world dynamics.


Semantic Matchmaking as Non-Monotonic Reasoning: A Description Logic Approach

Journal of Artificial Intelligence Research

Matchmaking arises when supply and demand meet in an electronic marketplace, or when agents search for a web service to perform some task, or even when recruiting agencies match curricula and job profiles. In such open environments, the objective of a matchmaking process is to discover best available offers to a given request. We address the problem of matchmaking from a knowledge representation perspective, with a formalization based on Description Logics. We devise Concept Abduction and Concept Contraction as non-monotonic inferences in Description Logics suitable for modeling matchmaking in a logical framework, and prove some related complexity results. We also present reasonable algorithms for semantic matchmaking based on the devised inferences, and prove that they obey to some commonsense properties. Finally, we report on the implementation of the proposed matchmaking framework, which has been used both as a mediator in e-marketplaces and for semantic web services discovery.


The Cyborg Astrobiologist: Porting from a wearable computer to the Astrobiology Phone-cam

arXiv.org Artificial Intelligence

Planetary exploration by autonomous robotic systems cannot be carried out successfully unless significant testing of the underlying computer vision algorithms is performed. In our previous work, we have demonstrated the use of a wearable computer system, the Cyborg Astrobiologist, capable of testing computer-vision algorithms as part of semi-autonomous exploration systems at remote geological and astrobiological field sites (McGuire et al., 2004, 2005). In that work, we showed that the exploration system, which was based upon newly-developed'uncommon maps' and previously-developed'interest maps' (Rae et al., 1999; McGuire et al., 2002), could viably and robustly be utilized during remote field missions to localize interesting geochemical or hydrological features. Our system carries out the navigation process using the lower end of the spectral resolution, making use of three colour imagery to distinguish between regions of unusual colour. Navigation using higher spectral resolution spectrometry, for example, navigation based on mineralogical differences, will yield more interesting results but this is beyond the scope of the current work.


Model Selection Through Sparse Maximum Likelihood Estimation

arXiv.org Artificial Intelligence

We consider the problem of estimating the parameters of a Gaussian or binary distribution in such a way that the resulting undirected graphical model is sparse. Our approach is to solve a maximum likelihood problem with an added l_1-norm penalty term. The problem as formulated is convex but the memory requirements and complexity of existing interior point methods are prohibitive for problems with more than tens of nodes. We present two new algorithms for solving problems with at least a thousand nodes in the Gaussian case. Our first algorithm uses block coordinate descent, and can be interpreted as recursive l_1-norm penalized regression. Our second algorithm, based on Nesterov's first order method, yields a complexity estimate with a better dependence on problem size than existing interior point methods. Using a log determinant relaxation of the log partition function (Wainwright & Jordan (2006)), we show that these same algorithms can be used to solve an approximate sparse maximum likelihood problem for the binary case. We test our algorithms on synthetic data, as well as on gene expression and senate voting records data.


The Role of Time in the Creation of Knowledge

arXiv.org Artificial Intelligence

ABSTRACT In this paper I assume that in humans the creation of knowledge depends on a discrete time, or stage, sequential decision-making process subjected to a stochastic, information transmitting environment. For each time-stage, this environment randomly transmits Shannon type info rmation-packets to the decision-maker, who examines each of them for relevancy and then determines his optimal choices. Using this set of relevant information-packets, the decision-maker adapts, over time, to the stochastic nature of his environment, and optimizes the subjective expected rate-of-growth of knowledge. The decision-maker's optimal actions, lead to a decision function that involves, over time, his view of the subjective entropy of the environmental process and other important parameters at each time-stage of the process. Using this model of human behavior, one could create psychometri c experiments using computer simulation and real decision-makers, to play programmed games to measure the resulting human performance. KEYWORDS decision-making, dynamic programming, entropy, epistemology, information theory, knowledge, adaptive, event-time based sequential process, subjective probability Scientists seek to understand the experience of our environment. Some build hypothetical, mathematical models that reflect our view of reality as they adumbrate the laws of nature, enabling them to conduct experiments leading to the validation of a hypothesis as they reach out for even more truths about nature.


A Robust Linguistic Platform for Efficient and Domain specific Web Content Analysis

arXiv.org Artificial Intelligence

Web semantic access in specific domains calls for specialized search engines with enhanced semantic querying and indexing capacities, which pertain both to information retrieval (IR) and to information extraction (IE). A rich linguistic analysis is required either to identify the relevant semantic units to index and weight them according to linguistic specific statistical distribution, or as the basis of an information extraction process. Recent developments make Natural Language Processing (NLP) techniques reliable enough to process large collections of documents and to enrich them with semantic annotations. This paper focuses on the design and the development of a text processing platform, Ogmios, which has been developed in the ALVIS project. The Ogmios platform exploits existing NLP modules and resources, which may be tuned to specific domains and produces linguistically annotated documents. We show how the three constraints of genericity, domain semantic awareness and performance can be handled all together.