AITopics | Law

Techniques such as probabilistic topic models and latent-semantic indexing have been shown to be broadly useful at automatically extracting the topical or semantic content of documents, or more generally for dimension-reduction of sparse count data. These types of models and algorithms can be viewed as generating an abstraction from the words in a document to a lower-dimensional latent variable representation that captures what the document is generally about beyond the specific words it contains. In this paper we propose a new probabilistic model that tempers this approach by representing each document as a combination of (a) a background distribution over common words, (b) a mixture distribution over general topics, and (c) a distribution over words that are treated as being specific to that document. We illustrate how this model can be used for information retrieval by matching documents both at a general topic level and at a specific word level, providing an advantage over techniques that only match documents at a general level (such as topic models or latent-sematic indexing) or that only match documents at the specific word level (such as TF-IDF).

background distribution, query, topic model, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Orange County > Irvine (0.14)
Asia > Middle East > Jordan (0.05)
North America > United States > New York > New York County > New York City (0.04)
(4 more...)

Genre: Research Report (0.68)

Industry:

Government > Voting & Elections (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Law (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.93)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.89)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.67)

Add feedback

Modeling General and Specific Aspects of Documents with a Probabilistic Topic Model

Chemudugunta, Chaitanya, Smyth, Padhraic, Steyvers, Mark

Neural Information Processing SystemsDec-31-2007

Techniques such as probabilistic topic models and latent-semantic indexing have been shown to be broadly useful at automatically extracting the topical or semantic content of documents, or more generally for dimension-reduction of sparse count data. These types of models and algorithms can be viewed as generating an abstraction from the words in a document to a lower-dimensional latent variable representation that captures what the document is generally about beyond the specific words it contains. In this paper we propose a new probabilistic model that tempers this approach by representing each document as a combination of (a) a background distribution over common words, (b) a mixture distribution over general topics, and (c) a distribution over words that are treated as being specific to that document. We illustrate how this model can be used for information retrieval by matching documents both at a general topic level and at a specific word level, providing an advantage over techniques that only match documents at a general level (such as topic models or latent-sematic indexing) or that only match documents at the specific word level (such as TF-IDF).

background distribution, query, topic model, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Orange County > Irvine (0.14)
Asia > Middle East > Jordan (0.05)
North America > United States > New York > New York County > New York City (0.04)
(4 more...)

Genre: Research Report (0.68)

Industry:

Government > Voting & Elections (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Law (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.93)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.89)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.67)

Add feedback

Modeling General and Specific Aspects of Documents with a Probabilistic Topic Model

Chemudugunta, Chaitanya, Smyth, Padhraic, Steyvers, Mark

Neural Information Processing SystemsDec-31-2007

Approaches such as LSI and LDA have both been shown to be useful for "object matching" in their

machine learning, natural language, topic model, (17 more...)

Neural Information Processing Systems

Country: North America > United States > California > Orange County > Irvine (0.14)

Genre: Research Report (0.68)

Industry:

Government > Voting & Elections (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Law (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.69)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.53)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.47)

Add feedback

Knowware: the third star after Hardware and Software

Lu, Ruqian

arXiv.org Artificial IntelligenceNov-27-2007

This book proposes to separate knowledge from software and to make it a commodity that is called knowware. The architecture, representation and function of Knowware are discussed. The principles of knowware engineering and its three life cycle models: furnace model, crystallization model and spiral model are proposed and analyzed. Techniques of software/knowware co-engineering are introduced. A software component whose knowledge is replaced by knowware is called mixware. An object and component oriented development schema of mixware is introduced. In particular, the tower model and ladder model for mixware development are proposed and discussed. Finally, knowledge service and knowware based Web service are introduced and compared with Web service. In summary, knowware, software and hardware should be considered as three equally important underpinnings of IT industry. Ruqian Lu is a professor of computer science of the Institute of Mathematics, Academy of Mathematics and System Sciences. He is a fellow of Chinese Academy of Sciences. His research interests include artificial intelligence, knowledge engineering and knowledge based software engineering. He has published more than 100 papers and 10 books. He has won two first class awards from the Academia Sinica and a National second class prize from the Ministry of Science and Technology. He has also won the sixth Hua Loo-keng Mathematics Prize.

data mining, knowledge management, natural language, (19 more...)

arXiv.org Artificial Intelligence

0711.4309

Country:

Asia > Japan (0.04)
North America > United States > New York (0.04)
North America > United States > Hawaii (0.04)
(8 more...)

Genre:

Instructional Material (0.93)
Personal (0.85)
Research Report (0.63)

Industry:

Media (1.00)
Leisure & Entertainment (1.00)
Law > Intellectual Property & Technology Law (1.00)
(5 more...)

Technology:

Information Technology > Software Engineering (1.00)
Information Technology > Knowledge Management > Knowledge Engineering (1.00)
Information Technology > Data Science > Data Mining (1.00)
(5 more...)

Add feedback

Practical Approach to Knowledge-based Question Answering with Natural Language Understanding and Advanced Reasoning

Wong, Wilson

arXiv.org Artificial IntelligenceJul-24-2007

This research hypothesized that a practical approach in the form of a solution framework known as Natural Language Understanding and Reasoning for Intelligence (NaLURI), which combines full-discourse natural language understanding, powerful representation formalism capable of exploiting ontological information and reasoning approach with advanced features, will solve the following problems without compromising practicality factors: 1) restriction on the nature of question and response, and 2) limitation to scale across domains and to real-life natural language text.

information retrieval, natural language, question answering, (21 more...)

arXiv.org Artificial Intelligence

0707.3559

Country:

Asia > Thailand (0.13)
Asia > South Korea (0.13)
Asia > Malaysia > Kuala Lumpur > Kuala Lumpur (0.04)
(21 more...)

Genre: Research Report > Promising Solution (0.45)

Industry:

Leisure & Entertainment (1.00)
Law > Litigation (1.00)
Law > Intellectual Property & Technology Law (1.00)
(5 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
(6 more...)

Add feedback

Reports on the 2006 AAAI Fall Symposia

Bongard, Joshua, Brock, Derek, Collins, Samuel G., Duraiswami, Ramani, Finin, Tim, Harrison, Ian, Honavar, Vasant, Hornby, Gregory S., Jonsson, Ari, Kassoff, Mike, Kortenkamp, David, Kumar, Sanjeev, Murray, Ken, Rudnicky, Alexander I., Trajkovski, Goran

AI MagazineMar-15-2007

The American Association for Artificial Intelligence was pleased to present the AAAI 2006 Fall Symposium Series, held Friday through Sunday, October 13-15, at the Hyatt Regency Crystal City in Washington, DC. Seven symposia were held. The titles were (1) Aurally Informed Performance: Integrating Ma- chine Listening and Auditory Presentation in Robotic Systems; (2) Capturing and Using Patterns for Evidence Detection; (3) Developmental Systems; (4) Integrating Reasoning into Everyday Applications; (5) Interaction and Emergent Phenomena in Societies of Agents; (6) Semantic Web for Collaborative Knowledge Acquisition; and (7) Spacecraft Autonomy: Using AI to Expand Human Space Exploration.

artificial intelligence, machine learning, natural language, (16 more...)

AI Magazine

Country:

North America > United States > District of Columbia > Washington (0.25)
North America > United States > California > Santa Clara County > Palo Alto (0.14)
North America > United States > California > Orange County > Irvine (0.14)
(7 more...)

Industry:

Law (1.00)
Government (0.71)
Education > Educational Setting (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.94)

Add feedback

Calendar of Events

AAAI,

AI MagazineMar-15-2007

(MDAI 2007).

artificial intelligence, email, university, (14 more...)

AI Magazine

Country:

Europe (1.00)
Asia (1.00)
North America > United States > Tennessee > Davidson County > Nashville (0.15)
(2 more...)

Industry:

Law (0.95)
Education > Educational Setting > Higher Education (0.95)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.95)

Add feedback

Calendar of Events

AAAI,

AI MagazineDec-15-2006

(MDAI 2007).

artificial intelligence, email, university, (16 more...)

AI Magazine

Country:

North America > Canada (0.99)
Europe > United Kingdom (0.71)
North America > United States > California > Santa Clara County > Stanford (0.14)

Industry:

Law (0.95)
Education > Educational Setting > Higher Education (0.95)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.95)

Add feedback

Learning Sentence-internal Temporal Relations

Lapata, M., Lascarides, A.

Journal of Artificial Intelligence ResearchSep-28-2006

In this paper we propose a data intensive approach for inferring sentence-internal temporal relations. Temporal inference is relevant for practical NLP applications which either extract or synthesize temporal information (e.g., summarisation, question answering). Our method bypasses the need for manual coding by exploiting the presence of markers like ``after", which overtly signal a temporal relation. We first show that models trained on main and subordinate clauses connected with a temporal marker achieve good performance on a pseudo-disambiguation task simulating temporal inference (during testing the temporal marker is treated as unseen and the models must select the right marker from a set of possible candidates). Secondly, we assess whether the proposed approach holds promise for the semi-automatic creation of temporal annotations. Specifically, we use a model trained on noisy and approximate data (i.e., main and subordinate clauses) to predict intra-sentential relations present in TimeBank, a corpus annotated rich temporal information. Our experiments compare and contrast several probabilistic models differing in their feature space, linguistic assumptions and data requirements. We evaluate performance against gold standard corpora and also against human subjects.

machine learning, natural language, relation, (24 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.2015

AI Access Foundation

10467

Journal of Artificial Intelligence Research

Country:

North America > Canada (0.46)
North America > United States > California (0.28)
Europe > France (0.14)
(8 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Law (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Energy > Oil & Gas (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.67)
(2 more...)

Add feedback

Representing Conversations for Scalable Overhearing

Gutnik, G., Kaminka, G. A.

Journal of Artificial Intelligence ResearchMar-16-2006

Open distributed multi-agent systems are gaining interest in the academic community and in industry. In such open settings, agents are often coordinated using standardized agent conversation protocols. The representation of such protocols (for analysis, validation, monitoring, etc) is an important aspect of multi-agent applications. Recently, Petri nets have been shown to be an interesting approach to such representation, and radically different approaches using Petri nets have been proposed. However, their relative strengths and weaknesses have not been examined. Moreover, their scalability and suitability for different tasks have not been addressed. This paper addresses both these challenges. First, we analyze existing Petri net representations in terms of their scalability and appropriateness for overhearing, an important task in monitoring open multi-agent systems. Then, building on the insights gained, we introduce a novel representation using Colored Petri nets that explicitly represent legal joint conversation states and messages. This representation approach offers significant improvements in scalability and is particularly suitable for overhearing. Furthermore, we show that this new representation offers a comprehensive coverage of all conversation features of FIPA conversation standards. We also present a procedure for transforming AUML conversation protocol diagrams (a standard human-readable representation), to our Colored Petri net representation.

agent, communicative act, representation, (17 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.1829

AI Access Foundation

10446

Journal of Artificial Intelligence Research

Country: