Discovering Latent Information By Spreading Activation Algorithm For Document Retrieval
–arXiv.org Artificial Intelligence
Syntactic search relies on keywords contained in a query to find suitable documents. So, documents that do not contain the keywords but contain information related to the query are not retrieved. Spreading activation is an algorithm for finding latent information in a query by exploiting relations between nodes in an associative network or semantic network. However, the classical spreading activation algorithm uses all relations of a node in the network that will add unsuitable information into the query. In this paper, we propose a novel approach for semantic text search, called query-oriented-constrained spreading activation that only uses relations relating to the content of the query to find really related information. Experiments on a benchmark dataset show that, in terms of the MAP measure, our search engine is 18.9% and 43.8% respectively better than the syntactic search and the search using the classical constrained spreading activation. NTRODUCTION With rapid development of the Word Wide Web and e-societies, information retrieval (IR) has many challenges in exploiting those rich and huge information resources. Whereas, the keyword based IR has many limitations in finding suitable documents for user's queries. Semantic search improves search precision and recall by understanding user's intent and the contextual meaning of terms in documents and queries.
arXiv.org Artificial Intelligence
Jul-29-2018
- Country:
- North America > United States
- New York (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- Asia
- Southeast Asia (0.05)
- Thailand
- Chiang Mai > Chiang Mai (0.05)
- Bangkok > Bangkok (0.05)
- Phuket > Phuket (0.04)
- Middle East > Israel
- Jerusalem District > Jerusalem (0.04)
- Indonesia > Java
- North America > United States
- Genre:
- Research Report > Promising Solution (0.34)
- Technology:
- Information Technology
- Information Management > Search (1.00)
- Artificial Intelligence
- Systems & Languages > Programming Languages (1.00)
- Representation & Reasoning > Ontologies (1.00)
- Natural Language > Information Retrieval (1.00)
- Cognitive Science > Problem Solving (1.00)
- Machine Learning > Performance Analysis
- Accuracy (0.34)
- Information Technology