AITopics | Country

Collaborating Authors

Country

Networks and Natural Language Processing

Radev, Dragomir R. (University of Michigan) | Mihalcea, Rada (University of North Texas)

AI MagazineSep-15-2008

Over the last few years, a number of areas of natural language processing have begun applying graph-based techniques. These include, among others, text summarization, syntactic parsing, word-sense disambiguation, ontology construction, sentiment and subjectivity analysis, and text clustering. In this paper, we present some of the most successful graph-based representations and algorithms used in language processing and try to explain how and why they work.

artificial intelligence, natural language, text processing, (16 more...)

AI Magazine

Country:

North America > United States (1.00)
Europe (1.00)
Asia > Middle East > Iraq (0.48)

Industry:

Government > Military (0.93)
Government > Regional Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.67)

Add feedback

Introduction to the Special Issue on AI and Networks

Jardins, Marie des (University of Maryland) | Gaston, Matthew E. (Viz) | Radev, Dragomir R. (University of Michigan)

AI MagazineSep-15-2008

This introduction to AI Magazine's Special Issueon Networks and AI summarizes the seven articles in thespecial issue by characterizing the nature of thenetworks that are the focus of each of the papers.A short tutorial on graph theory and network structuresis included for those less familiar with the topic.

artificial intelligence, natural language, node, (17 more...)

AI Magazine

Country: North America > United States > Maryland (0.30)

Genre: Collection > Journal > Special Issue (0.30)

Industry: Leisure & Entertainment (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.97)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.47)

Add feedback

Approximation of the Two-Part MDL Code

Adriaans, Pieter, Vitanyi, Paul

arXiv.org Artificial IntelligenceSep-15-2008

In machine learning pure applications of MDL are rare, partially because of the difficulties one encounters trying to define an adequate model code and data-to-model code, and partially because of the operational difficulties that are poorly understood. We analyze aspects of both the power and the perils of MDL precisely and formally. Let us first resurrect a familiar problem from our childhood to illustrate some of the issues involved. The process of solving a jigsaw puzzle involves an incremental reduction of entropy, and this serves to illustrate the analogous features of the learning problems which are the main issues of this work. Initially, when the pieces come out of the box they have a completely random ordering. Gradually we combine pieces, thus reducing the entropy and increasing the order until the puzzle is solved. In this last stage we have found a maximal ordering. Suppose that Alice and Bob both start to solve two versions of the same puzzle, but that they follow different strategies. Initially, Alice sorts all pieces according to color, and Bob starts by sorting the pieces according to shape.

algorithm, artificial intelligence, machine learning, (18 more...)

arXiv.org Artificial Intelligence

cs/0612095

Country: Europe > Netherlands (0.28)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Normalized Information Distance

Vitanyi, Paul M. B., Balbach, Frank J., Cilibrasi, Rudi L., Li, Ming

arXiv.org Artificial IntelligenceSep-15-2008

The normalized information distance is a universal distance measure for objects of all kinds. It is based on Kolmogorov complexity and thus uncomputable, but there are ways to utilize it. First, compression algorithms can be used to approximate the Kolmogorov complexity if the objects have a string representation. Second, for names and abstract concepts, page count statistics from the World Wide Web can be used. These practical realizations of the normalized information distance can then be applied to machine learning tasks, expecially clustering, to perform feature-free and parameter-free data mining. This chapter discusses the theoretical foundations of the normalized information distance and both practical realizations. It presents numerous examples of successful real-world applications based on these distance measures, ranging from bioinformatics to music clustering to machine translation.

data mining, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

0809.2553

Country:

Europe (0.67)
North America > United States > Washington > King County (0.28)

Genre: Research Report (0.40)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Government (0.67)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.46)

Add feedback

Randomized Distributed Configuration Management of Wireless Networks: Multi-layer Markov Random Fields and Near-Optimality

Jeon, Sung-eok, Ji, Chunayi

arXiv.org Artificial IntelligenceSep-11-2008

Distributed configuration management is imperative for wireless infrastructureless networks where each node adjusts locally its physical and logical configuration through information exchange with neighbors. Two issues remain open. The first is the optimality. The second is the complexity. We study these issues through modeling, analysis, and randomized distributed algorithms. Modeling defines the optimality. We first derive a global probabilistic model for a network configuration which characterizes jointly the statistical spatial dependence of a physical- and a logical-configuration. We then show that a local model which approximates the global model is a two-layer Markov Random Field or a random bond model. The complexity of the local model is the communication range among nodes. The local model is near-optimal when the approximation error to the global model is within a given error bound. We analyze the trade-off between an approximation error and complexity, and derive sufficient conditions on the near-optimality of the local model. We validate the model, the analysis and the randomized distributed algorithms also through simulation.

algorithm, configuration, local model, (16 more...)

arXiv.org Artificial Intelligence

0809.1916

Country:

North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
North America > United States > New York > Rensselaer County > Troy (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
(5 more...)

Genre: Research Report (0.81)

Industry: Telecommunications > Networks (0.89)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.61)

Add feedback

Grammar-Based Random Walkers in Semantic Networks

Rodriguez, Marko A.

arXiv.org Artificial IntelligenceSep-10-2008

Semantic networks qualify the meaning of an edge relating any two vertices. Determining which vertices are most "central" in a semantic network is difficult because one relationship type may be deemed subjectively more important than another. For this reason, research into semantic network metrics has focused primarily on context-based rankings (i.e. user prescribed contexts). Moreover, many of the current semantic network metrics rank semantic associations (i.e. directed paths between two vertices) and not the vertices themselves. This article presents a framework for calculating semantically meaningful primary eigenvector-based metrics such as eigenvector centrality and PageRank in semantic networks using a modified version of the random walker model of Markov chain analysis. Random walkers, in the context of this article, are constrained by a grammar, where the grammar is a user defined data structure that determines the meaning of the final vertex ranking. The ideas in this article are presented within the context of the Resource Description Framework (RDF) of the Semantic Web initiative.

artificial intelligence, rwr, semantic web, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1016/j.knosys.2008.03.030

0803.4355

Country:

Europe (1.00)
North America > United States (0.93)

Genre: Research Report (0.40)

Technology:

Information Technology > Communications > Web > Semantic Web (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)

Add feedback

Agent-based Ecological Model Calibration - on the Edge of a New Approach

Pereira, Antonio, Duarte, Pedro, Reis, Luis Paulo

arXiv.org Artificial IntelligenceSep-9-2008

- In every mathematical model, parameters regulate the behaviour of equations describing temporal and spatial changes of model state variables and their interactions. Generally, there is some uncertainty associated with each parameter. Model calibration is performed by comparing observed with predicted data and it is a crucial phase in the modelling process. It's an iterative and interactive task in which, after each simulation, the "modeller" analyses the results and performs changes on one or more equation's parameters trying to tune the model. This "tuning" procedure is a hard and "tedious" work requiring a good understanding of the effects of different parameters over the available variables. Automatic calibration procedures, based on systematic and exhaustive generation of parameter vectors and using several convergence methods, are available but they require a large number of model runs and are, therefore, not applicable to very complex ecosystem models demanding large computational times.

artificial intelligence, calibration, simulation, (14 more...)

arXiv.org Artificial Intelligence

0809.1686

Country:

North America (0.28)
Europe > Portugal (0.14)
Asia > China (0.14)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Add feedback

Predictive Hypothesis Identification

Hutter, Marcus

arXiv.org Machine LearningSep-8-2008

While statistics focusses on hypothesis testing and on estimating (properties of) the true sampling distribution, in machine learning the performance of learning algorithms on future data is the primary issue. In this paper we bridge the gap with a general principle (PHI) that identifies hypotheses with best predictive performance. This includes predictive point and interval estimation, simple and composite hypothesis testing, (mixture) model selection, and others as special cases. For concrete instantiations we will recover well-known methods, variations thereof, and new ones. PHI nicely justifies, reconciles, and blends (a reparametrization invariant variation of) MAP, ML, MDL, and moment estimation. One particular feature of PHI is that it can genuinely deal with nested hypotheses.

estimation, hypothesis, prediction, (15 more...)

arXiv.org Machine Learning

0809.1270

Country:

Oceania > Australia > Australian Capital Territory > Canberra (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
Europe > Poland (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.49)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.46)

Add feedback

Necessary and Sufficient Conditions for Success of the Nuclear Norm Heuristic for Rank Minimization

Recht, Benjamin, Xu, Weiyu, Hassibi, Babak

arXiv.org Machine LearningSep-7-2008

Minimizing the rank of a matrix subject to constraints is a challenging problem that arises in many applications in control theory, machine learning, and discrete geometry. This class of optimization problems, known as rank minimization, is NP-HARD, and for most practical problems there are no efficient algorithms that yield exact solutions. A popular heuristic algorithm replaces the rank function with the nuclear norm--equal to the sum of the singular values--of the decision variable. In this paper, we provide a necessary and sufficient condition that quantifies when this heuristic successfully finds the minimum rank solution of a linear constraint set. We additionally provide a probability distribution over instances of the affine rank minimization problem such that instances sampled from this distribution satisfy our conditions for success with overwhelming probability provided the number of constraints is appropriately large. Finally, we give empirical evidence that these probabilistic bounds provide accurate predictions of the heuristic's performance in non-asymptotic scenarios.

artificial intelligence, machine learning, optimization problem, (18 more...)

arXiv.org Machine Learning

doi: 10.1109/CDC.2008.4739332

0809.1260

Country:

Europe (0.46)
North America > United States > California (0.14)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)

Add feedback

Variable Neighborhood Search for the University Lecturer-Student Assignment Problem

Geiger, Martin Josef, Wenger, Wolf

arXiv.org Artificial IntelligenceSep-5-2008

The paper presents a study of local search heuristics in general and variable neighborhood search in particular for the resolution of an assignment problem studied in the practical work of universities. Here, students have to be assigned to scientific topics which are proposed and supported by members of staff. The problem involves the optimization under given preferences of students which may be expressed when applying for certain topics. It is possible to observe that variable neighborhood search leads to superior results for the tested problem instances. One instance is taken from an actual case, while others have been generated based on the real world data to support the analysis with a deeper analysis. An extension of the problem has been formulated by integrating a second objective function that simultaneously balances the workload of the members of staff while maximizing utility of the students. The algorithmic approach has been prototypically implemented in a computer system. One important aspect in this context is the application of the research work to problems of other scientific institutions, and therefore the provision of decision support functionalities.

artificial intelligence, optimization problem, student, (11 more...)

arXiv.org Artificial Intelligence

0809.1077

Country: Europe > Germany (0.14)

Genre: Research Report (0.40)

Industry: Education > Educational Setting > Higher Education (0.78)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)

Add feedback