AITopics | document and query

Building extractive QA system using Haystack, OpenAI and Pinecone

#artificialintelligenceJan-2-2023, 04:50:05 GMT

Closed book Abstractive: These systems do not have access to external data store. They store information internally in the model parameters. ChatGPT and other large language models are part of this category. Unlike open book systems, these system do not have access to the latest information.

document store, information, qa system, (14 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.40)

Add feedback

Ontology-Based Query Expansion with Latently Related Named Entities for Semantic Text Search

Ngo, Vuong M., Cao, Tru H.

arXiv.org Artificial IntelligenceJul-15-2018

Traditional information retrieval systems represent documents and queries by keyword sets. However, the content of a document or a query is mainly defined by both keywords and named entities occurring in it. Named entities have ontological features, namely, their aliases, classes, and identifiers, which are hidden from their textual appearance. Besides, the meaning of a query may imply latent named entities that are related to the apparent ones in the query. We propose an ontology-based generalized vector space model to semantic text search. It exploits ontological features of named entities and their latently related ones to reveal the semantics of documents and queries. We also propose a framework to combine different ontologies to take their complementary advantages for semantic annotation and searching.

artificial intelligence, natural language, text processing, (18 more...)

arXiv.org Artificial Intelligence

1807.05579

Country:

Asia > Vietnam > Hồ Chí Minh City > Hồ Chí Minh City (0.15)
North America > United States > Indiana (0.05)
North America > United States > Texas (0.04)
(6 more...)

Genre: Research Report (0.40)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Automobiles & Trucks > Manufacturer (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval > Query Processing (0.51)

Add feedback

Dependent Gated Reading for Cloze-Style Question Answering

Ghaeini, Reza, Fern, Xiaoli Z., Shahbazi, Hamed, Tadepalli, Prasad

arXiv.org Artificial IntelligenceMay-26-2018

We present a novel deep learning architecture to address the cloze-style question answering task. Existing approaches employ reading mechanisms that do not fully exploit the interdependency between the document and the query. In this paper, we propose a novel \emph{dependent gated reading} bidirectional GRU network (DGR) to efficiently model the relationship between the document and the query during encoding and decision making. Our evaluation shows that DGR obtains highly competitive performance on well-known machine comprehension benchmarks such as the Children's Book Test (CBT-NE and CBT-CN) and Who DiD What (WDW, Strict and Relaxed). Finally, we extensively analyze and validate our model by ablation and attention studies.

machine learning, natural language, question answering, (17 more...)

arXiv.org Artificial Intelligence

1805.10528

Country:

North America > United States (0.68)
North America > Canada (0.46)

Genre: Research Report (1.00)

Industry: Media (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.84)

Add feedback

An introduction to representation learning

#artificialintelligenceSep-12-2017, 14:15:38 GMT

Although many companies today possess massive amounts of data, the vast majority of that data is often unstructured and unlabeled. In fact, the amount of data that is appropriately labeled for a specific business need is typically quite small (possibly even zero), and acquiring new labels is usually a slow, expensive endeavor. As a result, algorithms that can extract features from unlabeled data to improve the performance of data-limited tasks are quite valuable. Most machine learning practitioners are first exposed to feature extraction techniques through unsupervised learning. In unsupervised learning, an algorithm attempts to discover the latent features that describe a data set's "structure" under certain (either explicit or implicit) assumptions.

artificial intelligence, machine learning, natural language, (20 more...)

#artificialintelligence

Country:

Europe > Italy (0.05)
Europe > France (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.79)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.60)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.50)

Add feedback

Mean Field Approach to a Probabilistic Model in Information Retrieval

Wu, Bin, Wong, K., Bodoff, David

Neural Information Processing SystemsDec-31-2003

We study an explicit parametric model of documents, queries, and relevancy assessment for Information Retrieval (IR). Mean-field methods are applied to analyze the model and derive efficient practical algorithms to estimate the parameters in the problem. The hyperparameters are estimated by a fast approximate leave-one-out cross-validation procedure based on the cavity method. The algorithm is further evaluated on several benchmark databases by comparing with standard algorithms in IR.

document and query, estimation, hyperparameter, (12 more...)

Neural Information Processing Systems

Country:

Asia > China > Hong Kong (0.06)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
North America > United States > New York (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.73)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.52)

Add feedback

Mean Field Approach to a Probabilistic Model in Information Retrieval

Wu, Bin, Wong, K., Bodoff, David

Neural Information Processing SystemsDec-31-2003

We study an explicit parametric model of documents, queries, and relevancy assessment for Information Retrieval (IR). Mean-field methods are applied to analyze the model and derive efficient practical algorithms to estimate the parameters in the problem. The hyperparameters are estimated by a fast approximate leave-one-out cross-validation procedure based on the cavity method. The algorithm is further evaluated on several benchmark databases by comparing with standard algorithms in IR.

document and query, estimation, hyperparameter, (12 more...)

Neural Information Processing Systems

Country:

Asia > China > Hong Kong (0.06)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
North America > United States > New York (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.73)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.52)

Add feedback

Mean Field Approach to a Probabilistic Model in Information Retrieval

Wu, Bin, Wong, K., Bodoff, David

Neural Information Processing SystemsDec-31-2003

We study an explicit parametric model of documents, queries, and relevancy assessmentfor Information Retrieval (IR). Mean-field methods are applied to analyze the model and derive efficient practical algorithms to estimate the parameters in the problem. The hyperparameters are estimated bya fast approximate leave-one-out cross-validation procedure based on the cavity method. The algorithm is further evaluated on several benchmark databases by comparing with standard algorithms in IR.

information retrieval, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Country: North America > United States (0.15)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.73)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.52)

Add feedback

Restructuring Sparse High Dimensional Data for Effective Retrieval

Jr., Charles Lee Isbell, Viola, Paul A.

Neural Information Processing SystemsDec-31-1999

The task in text retrieval is to find the subset of a collection of documents relevant to a user's information request, usually expressed as a set of words. Classically, documents and queries are represented as vectors of word counts. In its simplest form, relevance is defined to be the dot product between a document and a query vector-a measure of the number of common terms. A central difficulty in text retrieval is that the presence or absence of a word is not sufficient to determine relevance to a query. Linear dimensionality reduction has been proposed as a technique for extracting underlying structure from the document collection.

algorithm, axis, query, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Tennessee (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Africa > South Africa (0.04)
Africa > Ethiopia (0.04)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

Add feedback

Restructuring Sparse High Dimensional Data for Effective Retrieval

Jr., Charles Lee Isbell, Viola, Paul A.

Neural Information Processing SystemsDec-31-1999

The task in text retrieval is to find the subset of a collection of documents relevant to a user's information request, usually expressed as a set of words. Classically, documents and queries are represented as vectors of word counts. In its simplest form, relevance is defined to be the dot product between a document and a query vector-a measure of the number of common terms. A central difficulty in text retrieval is that the presence or absence of a word is not sufficient to determine relevance to a query. Linear dimensionality reduction has been proposed as a technique for extracting underlying structure from the document collection.

algorithm, axis, query, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Tennessee (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Africa > South Africa (0.04)
Africa > Ethiopia (0.04)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

Add feedback

Restructuring Sparse High Dimensional Data for Effective Retrieval

Jr., Charles Lee Isbell, Viola, Paul A.

Neural Information Processing SystemsDec-31-1999

The task in text retrieval is to find the subset of a collection of documents relevant to a user's information request, usually expressed as a set of words. Classically, documents and queries are represented as vectors of word counts. In its simplest form, relevance is defined to be the dot product between a document and a query vector-a measure of the number of common terms. A central difficulty in text retrieval is that the presence or absence of a word is not sufficient to determine relevance to a query. Linear dimensionality reduction has been proposed as a technique forextracting underlying structure from the document collection.

artificial intelligence, machine learning, natural language, (15 more...)

Neural Information Processing Systems

Country: