AITopics | Africa

Collaborating Authors

Africa

Inconsistency of Pitman-Yor process mixtures for the number of components

Miller, Jeffrey W., Harrison, Matthew T.

arXiv.org Machine LearningAug-30-2013

In population genetics, determining the "population structure" is an important step in the analysis of sampled data. As an illustrative example, consider the impala, a species of antelope in southern Africa. Impalas are divided into two subspecies: the common impala occupying much of the eastern half of the region, and the black-faced impala inhabiting a small area in the west. While common impalas are abundant, the number of black-faced impalas has been decimated by drought, poaching, and declining resources due to human and livestock expansion. To assist conservation efforts, Lorenzen, Arctander and Siegismund (2006) collected samples from 216 impalas, and analyzed the genetic variation between/within the two subspecies. A key part of their analysis consisted of inferring the population structure -- that is, partitioning the data into distinct populations, and in particular, determining how many such populations there are. To infer the impala population structure, Lorenzen et al. employed a widely-used tool called Structure (Pritchard, Stephens and Donnelly, 2000) which, in the simplest version, models the data as a finite mixture, with each component in the mixture corresponding to a dis-Supported in part by NSF grant DMS-1007593 and DARPA contract FA8650-11-1-715.

artificial intelligence, exponential family, machine learning, (17 more...)

arXiv.org Machine Learning

1309.0024

Country:

North America > United States (0.66)
Africa (0.54)

Genre: Research Report > New Finding (0.46)

Industry: Health & Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.46)

Add feedback

Generative, Fully Bayesian, Gaussian, Openset Pattern Classifier

Brummer, Niko

arXiv.org Machine LearningJul-24-2013

This report works out the details of a closed-form, fully Bayesian, multiclass, openset, generative pattern classifier using multivariate Gaussian likelihoods, with conjugate priors. The generative model has a common within-class covariance, which is proportional to the between-class covariance in the conjugate prior. The scalar proportionality constant is the only plugin parameter. All other model parameters are intergated out in closed form. An expression is given for the model evidence, which can be used to make plugin estimates for the proportionality constant. Pattern recognition is done via the predictive likeihoods of classes for which training data is available, as well as a predicitve likelihood for any as yet unseen class.

artificial intelligence, machine learning, predictive distribution, (14 more...)

arXiv.org Machine Learning

1307.6143

Country: Africa (0.14)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Topic Segmentation and Labeling in Asynchronous Conversations

Joty, S., Carenini, G., Ng, R. T.

Journal of Artificial Intelligence ResearchJul-22-2013

Topic segmentation and labeling is often considered a prerequisite for higher-level conversation analysis and has been shown to be useful in many Natural Language Processing (NLP) applications. We present two new corpora of email and blog conversations annotated with topics, and evaluate annotator reliability for the segmentation and labeling tasks in these asynchronous conversations. We propose a complete computational framework for topic segmentation and labeling in asynchronous conversations. Our approach extends state-of-the-art methods by considering a fine-grained structure of an asynchronous conversation, along with other conversational features by applying recent graph-based methods for NLP. For topic segmentation, we propose two novel unsupervised models that exploit the fine-grained conversational structure, and a novel graph-theoretic supervised model that combines lexical, conversational and topic features. For topic labeling, we propose two novel (unsupervised) random walk models that respectively capture conversation specific clues from two different sources: the leading sentences and the fine-grained conversational structure. Empirical evaluation shows that the segmentation and the labeling performed by our best models beat the state-of-the-art, and are highly correlated with human annotations.

annotation, asynchronous conversation, segmentation, (15 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.3940

AI Access Foundation

10826

Journal of Artificial Intelligence Research

Country:

Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > District of Columbia > Washington (0.04)
(29 more...)

Genre:

Research Report > New Finding (1.00)
Workflow (0.92)
Research Report > Promising Solution (0.87)
Research Report > Experimental Study (0.68)

Industry:

Leisure & Entertainment > Games (0.46)
Media > News (0.45)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.67)
(2 more...)

Add feedback

Probabilistic inverse reinforcement learning in unknown environments

Tossou, Aristide C. Y., Dimitrakakis, Christos

arXiv.org Machine LearningJul-14-2013

We consider the problem of learning by demonstration from agents acting in unknown stochastic Markov environments or games. Our aim is to estimate agent preferences in order to construct improved policies for the same task that the agents are trying to solve. To do so, we extend previous probabilistic approaches for inverse reinforcement learning in known MDPs to the case of unknown dynamics or opponents. We do this by deriving two simplified probabilistic models of the demonstrator's policy and utility. For tractability, we use maximum a posteriori estimation rather than full Bayesian inference. Under a flat prior, this results in a convex optimisation problem. We find that the resulting algorithms are highly competitive against a variety of other methods for inverse reinforcement learning that do have knowledge of the dynamics.

algorithm, demonstration, reward function, (13 more...)

arXiv.org Machine Learning

1307.3785

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > San Mateo County > Menlo Park (0.04)
(4 more...)

Genre: Research Report (1.00)

Industry: Leisure & Entertainment > Games (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.66)

Add feedback

An Extended GHKM Algorithm for Inducing Lambda-SCFG

Li, Peng (Tsinghua University, China) | Liu, Yang | Sun, Maosong

AAAI ConferencesJul-9-2013

Semantic parsing, which aims at mapping a natural language (NL) sentence into its formal meaning representation (e.g., logical form), has received increasing attention in recent years. While synchronous context-free grammar (SCFG) augmented with lambda calculus (lambda-SCFG) provides an effective mechanism for semantic parsing, how to learn such lambda-SCFG rules still remains a challenge because of the difficulty in determining the correspondence between NL sentences and logical forms. To alleviate this structural divergence problem, we extend the GHKM algorithm, which is a state-of-the-art algorithm for learning synchronous grammars in statistical machine translation, to induce lambda-SCFG from pairs of NL sentences and logical forms. By treating logical forms as trees, we reformulate the theory behind GHKM that gives formal semantics to the alignment between NL words and logical form tokens. Experiments on the GEOQUERY dataset show that our semantic parser achieves an F-measure of 90.2%, the best result published to date.

artificial intelligence, logical form, natural language, (17 more...)

AAAI Conferences

Twenty-Seventh AAAI Conference on Artificial Intelligence

Country:

North America > United States > Alaska (0.04)
Asia > Singapore (0.04)
Asia > China > Beijing > Beijing (0.04)
Africa > Middle East > Egypt > Giza Governorate > Giza (0.04)

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)

Add feedback

Interdependent Multi-Issue Negotiation for Energy Exchange in Remote Communities

Alam, Muddasser (University of Southampton) | Rogers, Alex ( University of Southampton ) | Ramchurn, Sarvapali D (University of Southampton)

AAAI ConferencesJul-9-2013

We present a novel negotiation protocol to facilitate energy exchange between off-grid homes that are equipped with renewable energy generation and electricity storage. Our protocol imposes restrictions over negotiation such that it reduces the complex interdependent multi-issue negotiation to one where agents have a strategy profile in subgame perfect Nash equilibrium. We show that our negotiation protocol is tractable, concurrent, scalable and leads to Pareto-optimal outcomes in a decentralised manner. We empirically evaluate our protocol and show that, in this instance, a society of agents can (i) improve the overall utilities by 14% and (ii) reduce their overall use of the batteries by 37%.

agent, artificial intelligence, game theory, (18 more...)

AAAI Conferences

Twenty-Seventh AAAI Conference on Artificial Intelligence

Country:

North America > United States > New York (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > United Kingdom > England > Hampshire > Southampton (0.04)
(3 more...)

Industry:

Energy > Energy Storage (1.00)
Energy > Renewable > Wind (0.46)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Add feedback

Conference Committees

Hamilton, Carol (Association for the Advancement of Artificial Intelligence)

AAAI ConferencesJul-9-2013

List of conference committees for AAAI-13, IAAI-13, and the EAAI-13 symposium.

artificial intelligence, university, usa, (17 more...)

AAAI Conferences

Twenty-Seventh AAAI Conference on Artificial Intelligence

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.32)
North America > United States > California > Los Angeles County > Los Angeles (0.15)
Europe > Switzerland > Zürich > Zürich (0.15)
(135 more...)

Industry:

Education > Educational Setting > Higher Education (0.95)
Information Technology (0.95)
Health & Medicine (0.93)
(3 more...)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

Commonsense Reasoning and Large Network Analysis: A Computational Study of ConceptNet 4

Diochnos, Dimitrios I.

arXiv.org Artificial IntelligenceJun-23-2013

Our aim is to compute the minimal data-set implied by the assertions of the English language, extract it from the database, and store it in files of our own format. Towards this direction we read the table of assertions (conceptnet assertion) and keep the entries that have their language id set to en. According to Table A.1 in Appendix A, every assertion is associated with entries from the database tables conceptnet concept (Table A.2), conceptnet relation (Table A.3), nl frequency (Table A.4), conceptnet frame (Table A.5), conceptnet surfaceform (Table A.6), and conceptnet rawassertion (Table A.7). Through conceptnet rawassertion the assertions are also associated with the actual sentences which are located in the table corpus sentence (Table A.6). Moreover, we do not need any other table from the database, as the important entries from all the above tables are contained in among these tables. It turns out that reading once the assertions and then all the entries referenced from the assertions in the English language is not enough to produce a minimal consistent data-set. Section 1.1 explains why, and gives a high-level overview of the process that we follow in order to compute the closure of the data-set implied by the assertions of the English language. However, before we describe these reasons we mention which fields we are going to keep from each table of the original ConceptNet 4 database.

artificial intelligence, survey article, weakly connected component, (17 more...)

arXiv.org Artificial Intelligence

1304.5863

Country:

South America (1.00)
Oceania (1.00)
North America > United States (1.00)
(3 more...)

Genre:

Research Report (0.63)
Overview (0.46)

Industry:

Media (1.00)
Leisure & Entertainment > Sports (1.00)
Health & Medicine > Consumer Health (1.00)
(6 more...)

Technology:

Information Technology > Communications > Networks (0.86)
Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (0.40)

Add feedback

Random Drift Particle Swarm Optimization

Sun, Jun, Wu, Xiaojun, Palade, Vasile, Fang, Wei, Shi, Yuhui

arXiv.org Artificial IntelligenceJun-12-2013

The random drift particle swarm optimization (RDPSO) algorithm, inspired by the free electron model in metal conductors placed in an external electric field, is presented, systematically analyzed and empirically studied in this paper. The free electron model considers that electrons have both a thermal and a drift motion in a conductor that is placed in an external electric field. The motivation of the RDPSO algorithm is described first, and the velocity equation of the particle is designed by simulating the thermal motion as well as the drift motion of the electrons, both of which lead the electrons to a location with minimum potential energy in the external electric field. Then, a comprehensive analysis of the algorithm is made, in order to provide a deep insight into how the RDPSO algorithm works. It involves a theoretical analysis and the simulation of the stochastic dynamical behavior of a single particle in the RDPSO algorithm. The search behavior of the algorithm itself is also investigated in detail, by analyzing the interaction between the particles. Some variants of the RDPSO algorithm are proposed by incorporating different random velocity components with different neighborhood topologies. Finally, empirical studies on the RDPSO algorithm are performed by using a set of benchmark functions from the CEC2005 benchmark suite. Based on the theoretical analysis of the particle's behavior, two methods of controlling the algorithmic parameters are employed, followed by an experimental analysis on how to select the parameter values, in order to obtain a good overall performance of the RDPSO algorithm and its variants in real-world applications. A further performance comparison between the RDPSO algorithms and other variants of PSO is made to prove the efficiency of the RDPSO algorithms.

evolutionary algorithm, machine learning, particle, (17 more...)

arXiv.org Artificial Intelligence

1306.2863

Country:

Europe > United Kingdom (0.46)
Africa (0.28)
Asia (0.27)

Genre: Research Report > Experimental Study (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)

Add feedback

Towards more accurate clustering method by using dynamic time warping

Ghanem, Khadoudja

arXiv.org Machine LearningApr-12-2013

An intrinsic problem of classifiers based on machine learning (ML) methods is that their learning time grows as the size and complexity of the training dataset increases. For this reason, it is important to have efficient computational methods and algorithms that can be applied on large datasets, such that it is still possible to complete the machine learning tasks in reasonable time. In this context, we present in this paper a more accurate simple process to speed up ML methods. An unsupervised clustering algorithm is combined with Expectation, Maximization (EM) algorithm to develop an efficient Hidden Markov Model (HMM) training. The idea of the proposed process consists of two steps. In the first step, training instances with similar inputs are clustered and a weight factor which represents the frequency of these instances is assigned to each representative cluster. Dynamic Time Warping technique is used as a dissimilarity function to cluster similar examples. In the second step, all formulas in the classical HMM training algorithm (EM) associated with the number of training instances are modified to include the weight factor in appropriate terms. This process significantly accelerates HMM training while maintaining the same initial, transition and emission probabilities matrixes as those obtained with the classical HMM training algorithm. Accordingly, the classification accuracy is preserved. Depending on the size of the training set, speedups of up to 2200 times is possible when the size is about 100.000 instances. The proposed approach is not limited to training HMMs, but it can be employed for a large variety of MLs methods.

algorithm, mining & knowledge management process, sequence, (14 more...)

arXiv.org Machine Learning

doi: 10.5121/ijdkp.2013.3207

1304.3745

Country:

North America > United States (0.04)
North America > Canada (0.04)
Europe > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.04)
(7 more...)

Genre: Research Report (0.82)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Add feedback