Chu-Carroll, Jennifer
LLM-ARC: Enhancing LLMs with an Automated Reasoning Critic
Kalyanpur, Aditya, Saravanakumar, Kailash, Barres, Victor, Chu-Carroll, Jennifer, Melville, David, Ferrucci, David
We introduce LLM-ARC, a neuro-symbolic framework designed to enhance the logical reasoning capabilities of Large Language Models (LLMs) by combining them with an Automated Reasoning Critic (ARC). LLM-ARC employs an Actor-Critic method in which the LLM Actor generates declarative logic programs along with tests for semantic correctness, while the Automated Reasoning Critic evaluates the code, runs the tests, and provides feedback on test failures for iterative refinement. Implemented using Answer Set Programming (ASP), LLM-ARC achieves a new state-of-the-art accuracy of 88.32% on the FOLIO benchmark, which tests complex logical reasoning capabilities. Our experiments demonstrate significant improvements over LLM-only baselines, highlighting the importance of logic test generation and iterative self-refinement. We achieve our best result using a fully automated self-supervised training loop in which the Actor is trained on end-to-end dialog traces with Critic feedback. We discuss potential enhancements and provide a detailed error analysis, showcasing the robustness and efficacy of LLM-ARC for complex natural language reasoning tasks.
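A minimal sketch of the Actor-Critic refinement loop described above, not the paper's implementation: the generate_program LLM call is a hypothetical placeholder, and the Critic here only checks satisfiability with the clingo ASP solver rather than running the Actor-generated semantic tests.

    import clingo  # Python bindings for an ASP solver, used here as the Critic's backend

    def asp_critic(program: str):
        """Critic step: ground and solve the candidate ASP program, returning
        (ok, feedback). A real critic would also run the Actor's semantic tests."""
        ctl = clingo.Control(["0"])                # enumerate all answer sets
        try:
            ctl.add("base", [], program)
            ctl.ground([("base", [])])
        except RuntimeError as err:                # syntax/grounding errors become feedback
            return False, f"grounding error: {err}"
        result = ctl.solve()
        ok = bool(result.satisfiable)
        return ok, "" if ok else "program is unsatisfiable"

    def llm_arc_loop(problem: str, generate_program, max_rounds: int = 3) -> str:
        """Actor-Critic refinement: the Actor (an LLM call, hypothetical here) drafts
        an ASP program; the Critic executes it and feeds errors back for the next draft."""
        program, feedback = "", ""
        for _ in range(max_rounds):
            program = generate_program(problem, feedback)
            ok, feedback = asp_critic(program)
            if ok:
                break
        return program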
Beyond LLMs: Advancing the Landscape of Complex Reasoning
Chu-Carroll, Jennifer, Beck, Andrew, Burnham, Greg, Melville, David OS, Nachman, David, Özcan, A. Erdem, Ferrucci, David
Since the advent of Large Language Models a few years ago, they have often been considered the de facto solution for many AI problems. However, in addition to the many deficiencies of LLMs that prevent their broad industry adoption, such as reliability, cost, and speed, there is a whole class of common real-world problems on which Large Language Models perform poorly, namely, constraint satisfaction and optimization problems. These problems are ubiquitous and current solutions are highly specialized and expensive to implement. At Elemental Cognition, we developed our EC AI platform, which takes a neuro-symbolic approach to solving constraint satisfaction and optimization problems. The platform employs, at its core, a precise and high-performance logical reasoning engine, and it leverages LLMs for knowledge acquisition and user interaction. The platform supports developers in specifying application logic in natural and concise language while generating application user interfaces to interact with users effectively. We evaluated LLMs against systems built on the EC AI platform in three domains and found the EC AI systems to significantly outperform LLMs on constructing valid and optimal solutions, on validating proposed solutions, and on repairing invalid solutions.
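To illustrate the class of problems involved (a generic toy solver, not the EC AI platform's reasoning engine), here is a minimal backtracking constraint-satisfaction sketch for a small scheduling problem:

    def backtrack_schedule(meetings, slots, conflicts, assignment=None):
        """Tiny backtracking CSP solver: assign a slot to each meeting so that
        no two conflicting meetings share a slot. Returns a dict or None."""
        assignment = assignment or {}
        if len(assignment) == len(meetings):
            return assignment
        meeting = next(m for m in meetings if m not in assignment)
        for slot in slots:
            if all(assignment.get(other) != slot for other in conflicts.get(meeting, [])):
                result = backtrack_schedule(meetings, slots, conflicts,
                                            {**assignment, meeting: slot})
                if result is not None:
                    return result
        return None

    # Example: three meetings, two slots, A conflicts with both B and C.
    conflicts = {"A": ["B", "C"], "B": ["A"], "C": ["A"]}
    print(backtrack_schedule(["A", "B", "C"], ["9am", "10am"], conflicts))
    # e.g. {'A': '9am', 'B': '10am', 'C': '10am'}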
Open-Domain Frame Semantic Parsing Using Transformers
Kalyanpur, Aditya, Biran, Or, Breloff, Tom, Chu-Carroll, Jennifer, Diertani, Ariel, Rambow, Owen, Sammons, Mark
Frame semantic parsing is a complex problem that includes multiple underlying subtasks. Recent approaches have employed joint learning of subtasks (such as predicate and argument detection) and multi-task learning of related tasks (such as syntactic and semantic parsing). In this paper, we explore multi-task learning of all subtasks with transformer-based models. We show that a purely generative encoder-decoder architecture handily beats the previous state of the art in FrameNet 1.7 parsing, and that a mixed decoding multi-task approach achieves even better performance. Finally, we show that the multi-task model also outperforms recent state-of-the-art systems for PropBank SRL parsing on the CoNLL 2012 benchmark.
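A minimal sketch of the purely generative formulation, using a stock Hugging Face T5 checkpoint; the linearized frame output shown in the comment is an assumed format, and the actual model would be fine-tuned on FrameNet 1.7 rather than used off the shelf.

    from transformers import T5ForConditionalGeneration, T5TokenizerFast

    # Stock t5-base stands in here; a frame parser would be fine-tuned so that the
    # decoder emits a linearized frame structure for the input sentence.
    tokenizer = T5TokenizerFast.from_pretrained("t5-base")
    model = T5ForConditionalGeneration.from_pretrained("t5-base")

    sentence = "parse frames: She handed the report to her manager."
    inputs = tokenizer(sentence, return_tensors="pt")
    output_ids = model.generate(**inputs, max_length=64)
    # A fine-tuned model might decode to something like:
    # "Giving(Donor=She, Theme=the report, Recipient=her manager)"
    print(tokenizer.decode(output_ids[0], skip_special_tokens=True))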
GLUCOSE: GeneraLized and COntextualized Story Explanations
Mostafazadeh, Nasrin, Kalyanpur, Aditya, Moon, Lori, Buchanan, David, Berkowitz, Lauren, Biran, Or, Chu-Carroll, Jennifer
When humans read or listen, they make implicit commonsense inferences that frame their understanding of what happened and why. As a step toward AI systems that can build similar mental models, we introduce GLUCOSE, a large-scale dataset of implicit commonsense causal knowledge, encoded as causal mini-theories about the world, each grounded in a narrative context. To construct GLUCOSE, we drew on cognitive psychology to identify ten dimensions of causal explanation, focusing on events, states, motivations, and emotions. Each GLUCOSE entry includes a story-specific causal statement paired with an inference rule generalized from the statement. This paper details two concrete contributions: First, we present our platform for effectively crowdsourcing GLUCOSE data at scale, which uses semi-structured templates to elicit causal explanations. Using this platform, we collected 440K specific statements and general rules that capture implicit commonsense knowledge about everyday situations. Second, we show that existing knowledge resources and pretrained language models do not include or readily predict GLUCOSE's rich inferential content. However, when state-of-the-art neural models are trained on this knowledge, they can start to make commonsense inferences on unseen stories that match humans' mental models.
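A sketch of what one GLUCOSE entry might look like as a data structure; the field names and example values are illustrative, following the dataset's pairing of a story-grounded causal statement with its generalized rule along one of the ten dimensions.

    from dataclasses import dataclass

    @dataclass
    class GlucoseEntry:
        """One GLUCOSE annotation: a causal statement grounded in a specific story,
        paired with the general rule it instantiates, for one of the ten dimensions."""
        story_id: str
        dimension: int              # 1-10, covering events, states, motivations, emotions
        specific_statement: str     # grounded in the story's sentences
        general_rule: str           # the generalized, reusable mini-theory

    entry = GlucoseEntry(
        story_id="story_0421",
        dimension=1,
        specific_statement="Gage skated on the sidewalk >Causes/Enables> Gage fell",
        general_rule="Someone_A skates on a hard surface >Causes/Enables> Someone_A falls",
    )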
To Test Machine Comprehension, Start by Defining Comprehension
Dunietz, Jesse, Burnham, Gregory, Bharadwaj, Akash, Rambow, Owen, Chu-Carroll, Jennifer, Ferrucci, David
Many tasks aim to measure machine reading comprehension (MRC), often focusing on question types presumed to be difficult. Rarely, however, do task designers start by considering what systems should in fact comprehend. In this paper we make two key contributions. First, we argue that existing approaches do not adequately define comprehension; they are too unsystematic about what content is tested. Second, we present a detailed definition of comprehension -- a "Template of Understanding" -- for a widely useful class of texts, namely short narratives. We then conduct an experiment that strongly suggests existing systems are not up to the task of narrative understanding as we define it.
WatsonPaths: Scenario-Based Question Answering and Inference over Unstructured Information
Lally, Adam (Information Technology and Services) | Bagchi, Sugato (IBM Research) | Barborak, Michael A. (IBM T. J. Watson Research Center) | Buchanan, David W. (IBM T. J. Watson Research Center) | Chu-Carroll, Jennifer (IBM Research) | Ferrucci, David A. (Bridgewater) | Glass, Michael R. (IBM Research) | Kalyanpur, Aditya (IBM T. J. Watson Research Center) | Mueller, Erik T. (Capital One) | Murdock, J. William (IBM T. J. Watson Research Center) | Patwardhan, Siddharth (IBM T. J. Watson Research Center) | Prager, John M. (IBM T. J. Watson Research Center)
We present WatsonPaths, a novel system that can answer scenario-based questions. These include medical questions that present a patient summary and ask for the most likely diagnosis or most appropriate treatment. WatsonPaths builds on the IBM Watson question answering system. WatsonPaths breaks down the input scenario into individual pieces of information, asks relevant subquestions of Watson to conclude new information, and represents these results in a graphical model. Probabilistic inference is performed over the graph to conclude the answer. On a set of medical test preparation questions, WatsonPaths shows a significant improvement in accuracy over multiple baselines.
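A toy sketch of the graph-based aggregation idea, not the actual WatsonPaths inference: confidences from subquestions that support each candidate diagnosis are combined with a simple noisy-OR, and the made-up numbers are placeholders.

    from math import prod

    def noisy_or(confidences):
        """Combine independent supporting-evidence confidences into one belief."""
        return 1.0 - prod(1.0 - c for c in confidences)

    # Toy assertion graph: each candidate diagnosis is supported by edges whose
    # confidences came from answers to relevant subquestions.
    support = {
        "diabetes mellitus": [0.7, 0.4],
        "diabetes insipidus": [0.3],
    }
    scores = {diagnosis: noisy_or(conf) for diagnosis, conf in support.items()}
    print(max(scores, key=scores.get), scores)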
Leveraging Wikipedia Characteristics for Search and Candidate Generation in Question Answering
Chu-Carroll, Jennifer (IBM T. J. Watson Research Center) | Fan, James (IBM T. J. Watson Research Center)
Most existing Question Answering (QA) systems adopt a type-and-generate approach to candidate generation that relies on a pre-defined domain ontology. This paper describes a type independent search and candidate generation paradigm for QA that leverages Wikipedia characteristics. This approach is particularly useful for adapting QA systems to domains where reliable answer type identification and type-based answer extraction are not available. We present a three-pronged search approach motivated by relations an answer-justifying title-oriented document may have with the question/answer pair. We further show how Wikipedia metadata such as anchor texts and redirects can be utilized to effectively extract candidate answers from search results without a type ontology. Our experimental results show that our strategies obtained high binary recall in both search and candidate generation on TREC questions, a domain that has mature answer type extraction technology, as well as on Jeopardy! questions, a domain without such technology. Our high-recall search and candidate generation approach has also led to high overall QA performance in Watson, our end-to-end system.
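A toy sketch of metadata-driven candidate extraction in the spirit of the approach described, with made-up redirect and anchor-text dictionaries standing in for Wikipedia metadata; no answer-type ontology is involved.

    # Map surface strings seen in retrieved passages to canonical Wikipedia titles
    # via redirects and anchor texts (toy dictionaries for illustration only).
    redirects = {"Big Blue": "IBM"}                        # redirect page -> target title
    anchors = {"the computer giant": "IBM", "IBM": "IBM"}  # anchor text -> linked title

    def candidates_from_passage(passage: str):
        surface_to_title = {**redirects, **anchors}
        return {title for surface, title in surface_to_title.items() if surface in passage}

    print(candidates_from_passage("Big Blue, the computer giant, was founded in 1911."))
    # {'IBM'}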
Building Watson: An Overview of the DeepQA Project
Ferrucci, David (IBM T. J. Watson Research Center) | Brown, Eric (IBM T. J. Watson Research Center) | Chu-Carroll, Jennifer (IBM T. J. Watson Research Center) | Fan, James (IBM T. J. Watson Research Center) | Gondek, David (IBM T. J. Watson Research Center) | Kalyanpur, Aditya A. (IBM T. J. Watson Research Center) | Lally, Adam (IBM T. J. Watson Research Center) | Murdock, J. William (IBM T. J. Watson Research Center) | Nyberg, Eric (Carnegie Mellon University) | Prager, John (IBM T. J. Watson Research Center) | Schlaefer, Nico (Carnegie Mellon University) | Welty, Chris (IBM T. J. Watson Research Center)
IBM Research undertook a challenge to build a computer system that could compete at the human champion level in real time on the American TV quiz show Jeopardy! The extent of the challenge includes fielding a real-time automatic contestant on the show, not merely a laboratory exercise. The Jeopardy! Challenge helped us address requirements that led to the design of the DeepQA architecture and the implementation of Watson. After 3 years of intense research and development by a core team of about 20 researchers, Watson is performing at human expert levels in terms of precision, confidence, and speed at the Jeopardy! quiz show. Our results strongly suggest that DeepQA is an effective and extensible architecture that may be used as a foundation for combining, deploying, evaluating, and advancing a wide range of algorithmic techniques to rapidly advance the field of QA.
The AAAI Spring Symposia
Green, Nancy, Chu-Carroll, Jennifer, Kortenkamp, David, Schultz, Alan, Coen, Michael H., Radev, Dragomir R., Hovy, Eduard, Haddawy, Peter, Hanks, Steve, Freuder, Eugene, Ortiz, Charlie, Sen, Sandip
The Association for the Advancement of Artificial Intelligence, in cooperation with Stanford University's Department of Computer Science, held the 1998 Spring Symposium Series on 23 to 25 March at Stanford University. The topics of the eight symposia were (1) Applying Machine Learning to Discourse Processing, (2) Integrating Robotic Research: Taking the Next Leap, (3) Intelligent Environments, (4) Intelligent Text Summarization, (5) Interactive and Mixed-Initiative Decision-Theoretic Systems, (6) Multimodal Reasoning, (7) Prospects for a Common-Sense Theory of Causation, and (8) Satisficing Models.