AITopics | Europe

Systematic Evaluation of Convergence Criteria in Iterative Training for NLP

Brent, Patricia (Oak Ridge National Laboratory) | Green, Nathan David (North Carolina State University) | Breimyer, Paul (North Carolina State University) | Krishnamurthy, Ramya (Oak Ridge National Laboratory) | Samatova, Nagiza F. (North Carolina State University)

AAAI Conferences | May-21-2009

Natural Language Processing (NLP) tasks, such as Named Entity Recognition (NER), involve an iterative process of model optimization to identify different types of words or semantic entities. This optimization to achieve a more precise model becomes computationally difficult as the number of iterations increases. The small datasets available for training typically limit the models. Adding iterations on such sets to further optimize the model can often cause over-fitting, which generally leads to reduced performance. Therefore, the choice of convergence criteria is a critical step in robust and accurate model building. We evaluate different convergence criteria in terms of their robustness, stopping threshold selection, and independence from the training data size and entity type. The underlying framework employs limited-memory Broyden-Fletcher-Goldfarb-Shanno (L-BFGS) parameter optimization in the context of Conditional Random Fields (CRF). This paper presents a convergence criterion for robust training irrespective of semantic types and data sizes, with a two-orders-of-magnitude reduction in stopping threshold for improved model accuracy and faster convergence. Additionally, we examine convergence with active learning to further reduce the training data and training time.
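
As a rough illustration of what a convergence criterion of this kind can look like, the sketch below stops an iterative optimizer once the average relative improvement of the objective over the last few iterations falls below a threshold. This is a generic criterion and is not claimed to be the one the paper proposes; the function name `has_converged`, the `window` size, and the `threshold` values are assumptions made for the example.

```python
def has_converged(objective_history, window=10, threshold=1e-4):
    """Return True when the mean relative improvement over the last
    `window` iterations drops below `threshold`.

    objective_history: objective values (e.g. negative log-likelihood),
    one per completed iteration, oldest first.
    """
    if len(objective_history) <= window:
        return False  # not enough history to judge yet
    old = objective_history[-1 - window]
    new = objective_history[-1]
    # Guard against division by zero when the objective reaches 0.
    denom = max(abs(new), 1e-12)
    # Average relative change per iteration over the window.
    rate = abs(old - new) / (window * denom)
    return rate < threshold


# Toy usage: an objective that decays geometrically towards 1.0.
history = []
value = 100.0
stopped_at = None
for iteration in range(1000):
    value = 1.0 + (value - 1.0) * 0.5  # stand-in for one optimizer step
    history.append(value)
    if has_converged(history, window=5, threshold=1e-6):
        stopped_at = iteration
        break
```

Averaging over a window rather than comparing only the last two iterations makes the test less sensitive to a single flat step, and the threshold then controls the trade-off between training time and how close the objective gets to its limit.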

active learning, convergence, convergence criteria, (14 more...)

AAAI Conferences

Twenty-Second International FLAIRS Conference

Country:

North America > United States > North Carolina (0.05)
North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

FLAIRS-22 Conference Committees

Lane, H. Chad (USC/ICT) | Guesgen, Hans W. (Massey University)

AAAI Conferences | May-21-2009

Conference and special track committees responsible for the 2009 FLAIRS conference.

state university, university, usa, (14 more...)

AAAI Conferences

Twenty-Second International FLAIRS Conference

Country:

North America > United States > California (0.21)
Europe > United Kingdom > England > Nottinghamshire > Nottingham (0.15)
North America > Canada > Ontario > Toronto (0.14)
(63 more...)

Industry:

Government (0.73)
Education > Educational Setting > Higher Education (0.54)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.95)
Information Technology > Artificial Intelligence > Natural Language (0.69)

Florida AI Research Society

Lane, H. Chad (USC/ICT) | Guesgen, Hans W. (Massey University)

AAAI Conferences | May-21-2009

List of officers of the FLAIRS society.

florida ai research society, university, usa

AAAI Conferences

Twenty-Second International FLAIRS Conference

Country:

North America > United States > North Carolina (0.21)
North America > United States > New York (0.21)
Europe > Germany > Brandenburg > Potsdam (0.21)

Industry: Education > Educational Setting > Higher Education (0.47)

Technology: Information Technology > Artificial Intelligence (0.57)

Optimistic Simulated Exploration as an Incentive for Real Exploration

Danihelka, Ivo

arXiv.org Artificial Intelligence | May-20-2009

Many reinforcement learning exploration techniques are overly optimistic and try to explore every state. Such exploration is impossible in environments with an unlimited number of states. I propose to use simulated exploration with an optimistic model to discover promising paths for real exploration. This reduces the need for real exploration.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

arXiv.org Artificial Intelligence

0903.2972

Country: Europe > Czechia (0.15)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.52)

Interpretations of the Web of Data

Rodriguez, Marko A.

arXiv.org Artificial Intelligence | May-20-2009

The emerging Web of Data utilizes the web infrastructure to represent and interrelate data. The foundational standards of the Web of Data include the Uniform Resource Identifier (URI) and the Resource Description Framework (RDF). URIs are used to identify resources and RDF is used to relate resources. While RDF has been posited as a logic language designed specifically for knowledge representation and reasoning, it is more generally useful if it can conveniently support other models of computing. In order to realize the Web of Data as a general-purpose medium for storing and processing the world's data, it is necessary to separate RDF from its logic language legacy and frame it simply as a data model. Moreover, there is significant advantage in seeing the Semantic Web as a particular interpretation of the Web of Data that is focused specifically on knowledge representation and reasoning. By doing so, other interpretations of the Web of Data are exposed that realize RDF in different capacities and in support of different computing models.
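
To make the "RDF as a plain data model" reading concrete, the minimal sketch below treats a graph as nothing more than a set of (subject, predicate, object) triples and queries it by pattern matching and one-step traversal, with no inference involved. All URIs are hypothetical (they use the reserved example.org domain), and the helper names `match` and `neighbours` are assumptions for the example, not part of any RDF library.

```python
# A tiny in-memory triple store: RDF viewed purely as a data model.
# All URIs below are hypothetical, under the reserved example.org domain.
EX = "http://example.org/"

triples = {
    (EX + "bob", EX + "knows", EX + "alice"),
    (EX + "bob", EX + "authorOf", EX + "paper1"),
    (EX + "alice", EX + "authorOf", EX + "paper2"),
}


def match(store, s=None, p=None, o=None):
    """Return all triples matching the pattern; None acts as a wildcard."""
    return sorted(
        t for t in store
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    )


def neighbours(store, node):
    """Graph-style traversal: every resource reachable from `node` in one step."""
    return sorted({o for (_, _, o) in match(store, s=node)})


authored = match(triples, p=EX + "authorOf")   # relational-style lookup
reachable = neighbours(triples, EX + "bob")    # graph-style traversal
```

The same triples serve a lookup-style query and a graph traversal without any logical entailment, which is the sense in which the data model can be separated from a particular reasoning interpretation.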

artificial intelligence, lanl, semantic web, (16 more...)

arXiv.org Artificial Intelligence

0905.3378

Country:

North America > United States (1.00)
Europe (1.00)

Genre: Research Report (0.64)

Technology:

Information Technology > Communications > Web > Semantic Web (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)

A Note on the Complexity of the Satisfiability Problem for Graded Modal Logics

Kazakov, Yevgeny | Pratt-Hartmann, Ian

arXiv.org Artificial Intelligence | May-19-2009

Graded modal logic is the formal language obtained from ordinary (propositional) modal logic by endowing its modal operators with cardinality constraints. Under the familiar possible-worlds semantics, these augmented modal operators receive interpretations such as "It is true at no fewer than 15 accessible worlds that...", or "It is true at no more than 2 accessible worlds that...". We investigate the complexity of satisfiability for this language over some familiar classes of frames. This problem is more challenging than its ordinary modal logic counterpart, especially in the case of transitive frames, where graded modal logic lacks the tree-model property. We obtain tight complexity bounds for the problem of determining the satisfiability of a given graded modal logic formula over the classes of frames characterized by any combination of reflexivity, seriality, symmetry, transitivity and the Euclidean property.
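
The cardinality readings quoted in the abstract correspond to the usual truth conditions for graded modalities. Writing $\Diamond_{\geq n}$ and $\Diamond_{\leq n}$ for the graded operators (this notation is an assumption; the paper may use different symbols), for a model $M = (W, R, V)$ and a world $w \in W$:

```latex
\begin{align*}
M, w \models \Diamond_{\geq n}\,\varphi
  &\iff \bigl|\{\, v \in W : (w, v) \in R \text{ and } M, v \models \varphi \,\}\bigr| \geq n,\\
M, w \models \Diamond_{\leq n}\,\varphi
  &\iff \bigl|\{\, v \in W : (w, v) \in R \text{ and } M, v \models \varphi \,\}\bigr| \leq n.
\end{align*}
```

With $n = 1$, $\Diamond_{\geq 1}$ is the ordinary diamond of modal logic, so the two quoted examples are the cases $\Diamond_{\geq 15}$ and $\Diamond_{\leq 2}$.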

artificial intelligence, logic & formal reasoning, modal logic, (15 more...)

arXiv.org Artificial Intelligence

0905.3108

Country: Europe > United Kingdom > England (0.28)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)

The Role of Self-Forensics in Vehicle Crash Investigations and Event Reconstruction

Mokhov, Serguei A.

arXiv.org Artificial Intelligence | May-14-2009

This paper further introduces and formalizes a novel concept of self-forensics for automotive vehicles, specified in the Forensic Lucid language. We argue that self-forensics, with the forensics taken out of the cybercrime domain, is applicable to "self-dissection" of intelligent vehicles and hardware systems for automated incident and anomaly analysis and event reconstruction by the software, with or without the aid of engineering teams, in a variety of forensic scenarios. We propose a formal design, requirements, and specification of self-forensics-enabled units (similar to black boxes) in vehicles that will help the investigation of incidents as well as automated reasoning and verification of theories, along with event reconstruction in a formal model. We argue that such an analysis helps improve the safety of passengers and their vehicles, as the airline industry does for planes.

autonomic computing, logic & formal reasoning, specification, (18 more...)

arXiv.org Artificial Intelligence

0905.2449

Country:

North America > United States (1.00)
Europe (1.00)

Genre: Research Report (1.00)

Industry:

Transportation > Air (1.00)
Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications (1.00)
Information Technology > Architecture > Autonomic Computing (0.96)
(2 more...)

Quantified Multimodal Logics in Simple Type Theory

Benzmueller, Christoph | Paulson, Lawrence C.

arXiv.org Artificial Intelligence | May-14-2009

We present a straightforward embedding of quantified multimodal logic in simple type theory and prove its soundness and completeness. Modal operators are replaced by quantification over a type of possible worlds. We present simple experiments, using existing higher-order theorem provers, to demonstrate that the embedding allows automated proofs of statements in these logics, as well as of their meta-properties.
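
The idea of replacing modal operators by quantification over a type of possible worlds can be illustrated as follows. This is a sketch of how such an embedding is commonly written, with the symbols chosen here as assumptions rather than taken from the paper: $\iota$ is a base type of worlds, $o$ the type of truth values, a modal formula is a predicate of type $\iota \to o$, and each accessibility relation $r$ has type $\iota \to \iota \to o$.

```latex
\begin{align*}
\neg\,\varphi        &:= \lambda w.\; \neg(\varphi\, w)\\
\varphi \lor \psi    &:= \lambda w.\; (\varphi\, w) \lor (\psi\, w)\\
\Box_r\, \varphi     &:= \lambda w.\; \forall v.\; (r\, w\, v) \Rightarrow (\varphi\, v)\\
\mathrm{valid}\;\varphi &:= \forall w.\; \varphi\, w
\end{align*}
```

Under these definitions a multimodal statement becomes an ordinary higher-order formula, which is what lets an existing higher-order prover work on it directly.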

artificial intelligence, logic, logic & formal reasoning, (15 more...)

arXiv.org Artificial Intelligence

0905.2435

Country:

Europe > Germany (0.47)
North America > United States (0.46)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)

Effect of Tuned Parameters on a LSA MCQ Answering Model

Lifchitz, Alain | Jhean-Larose, Sandra | Denhière, Guy

arXiv.org Artificial Intelligence | May-14-2009

This paper presents the current state of a work in progress whose objective is to better understand the effects of the factors that significantly influence the performance of Latent Semantic Analysis (LSA). A difficult task, answering (French) biology Multiple Choice Questions, is used to test the semantic properties of the truncated singular space and to study the relative influence of the main parameters. Dedicated software has been designed to fine-tune the LSA semantic space for the Multiple Choice Questions task. With optimal parameters, the performance of our simple model is, quite surprisingly, equal or superior to that of 7th and 8th grade students. This indicates that the semantic spaces were quite good despite their low dimensions and the small sizes of the training data sets. In addition, we present an original entropy global weighting of the answers' terms for each question, which was necessary to achieve the model's success.

machine learning, multiple choice question, natural language, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.3758/BRM.41.4.1201

0811.0146

Country: Europe > France > Île-de-France (0.14)

Genre:

Research Report (0.82)
Questionnaire & Opinion Survey (0.81)

Industry: Education > Educational Setting (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Multi-Instance Learning by Treating Instances As Non-I.I.D. Samples

Zhou, Zhi-Hua | Sun, Yu-Yin | Li, Yu-Feng

arXiv.org Artificial Intelligence | May-13-2009

Multi-instance learning attempts to learn from a training set consisting of labeled bags, each containing many unlabeled instances. Previous studies typically treat the instances in the bags as independently and identically distributed. However, the instances in a bag are rarely independent, and therefore better performance can be expected if the instances are treated in a non-i.i.d. way that exploits the relations among instances. In this paper, we propose a simple yet effective multi-instance learning method, which regards each bag as a graph and uses a specific kernel to distinguish the graphs by considering the features of the nodes as well as the features of the edges that convey some relations among instances. The effectiveness of the proposed method is validated by experiments.
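
The bag-as-graph idea can be sketched in a simplified form. In the example below each bag becomes a graph whose nodes are instances, two instances are linked when they lie within a distance `radius` of each other, and the link structure is used only to down-weight instances that sit in dense clusters. This is an illustrative simplification, not a reproduction of the kernel in the paper; the function names, the Gaussian instance kernel, and the `gamma` and `radius` values are all assumptions.

```python
import math


def gaussian(x, y, gamma=0.5):
    """Gaussian (RBF) kernel between two feature vectors."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, y)))


def affinity_weights(bag, radius=1.5):
    """Treat a bag as a graph: instances are nodes, and two nodes are
    linked when their Euclidean distance is below `radius`.
    Each instance gets weight 1 / (number of linked nodes, itself
    included), so instances in dense clusters count less individually."""
    weights = []
    for x in bag:
        degree = sum(1 for y in bag if math.dist(x, y) < radius)
        weights.append(1.0 / degree)
    return weights


def bag_kernel(bag_a, bag_b, gamma=0.5, radius=1.5):
    """Weighted average of instance-level kernels between two bags."""
    wa = affinity_weights(bag_a, radius)
    wb = affinity_weights(bag_b, radius)
    total = sum(
        u * v * gaussian(x, y, gamma)
        for x, u in zip(bag_a, wa)
        for y, v in zip(bag_b, wb)
    )
    return total / (sum(wa) * sum(wb))


# Toy usage: two nearby instances versus one distant instance.
bag_1 = [(0.0, 0.0), (0.1, 0.0)]
bag_2 = [(5.0, 5.0)]
similarity = bag_kernel(bag_1, bag_2)
```

Because the weights depend on each instance's neighbourhood inside its own bag, two bags with the same instances but different cluster structure are compared differently, which is the effect that an i.i.d. treatment of instances cannot capture.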

artificial intelligence, learning, machine learning, (14 more...)

arXiv.org Artificial Intelligence

0807.1997

Country:

Europe (1.00)
Asia (0.68)
North America > United States > Nebraska (0.28)

Genre: Research Report > Experimental Study (0.46)

Industry: Information Technology (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.46)
