AITopics | real-world knowledge

real-world knowledge

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

CoreEval: Automatically Building Contamination-Resilient Datasets with Real-World Knowledge toward Reliable LLM Evaluation

Zhao, Jingqian, Wang, Bingbing, Tu, Geng, Zhang, Yice, Wang, Qianlong, Liang, Bin, Li, Jing, Xu, Ruifeng

arXiv.org Artificial Intelligence, Nov-25-2025

Data contamination poses a significant challenge to the fairness of LLM evaluations in natural language processing tasks by inadvertently exposing models to test data during training. Current studies attempt to mitigate this issue by modifying existing datasets or generating new ones from freshly collected information. However, these methods fall short of ensuring contamination-resilient evaluation, as they fail to fully eliminate pre-existing knowledge from models or preserve the semantic complexity of the original datasets. To address these limitations, we propose CoreEval, a Contamination-resilient Evaluation strategy for automatically updating data with real-world knowledge. This approach begins by extracting entity relationships from the original data and leveraging the GDELT database to retrieve relevant, up-to-date knowledge. The retrieved knowledge is then recontextualized and integrated with the original data, which is refined and restructured to ensure semantic coherence and enhanced task relevance. Ultimately, a robust data reflection mechanism is employed to iteratively verify and refine labels, ensuring consistency between the updated and original datasets. Extensive experiments on updated datasets validate the robustness of CoreEval, demonstrating its effectiveness in mitigating performance overestimation caused by data contamination.

large language model, machine learning, natural language, (19 more...)

doi: 10.18653/v1/2025.acl-long.1085

arXiv: 2511.18889

Country: Asia > China (0.46)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic

Tewel, Yoad, Shalev, Yoav, Schwartz, Idan, Wolf, Lior

arXiv.org Artificial Intelligence, Nov-29-2021

Recent text-to-image matching models apply contrastive learning to large corpora of uncurated pairs of images and sentences. While such models can provide a powerful score for matching and subsequent zero-shot tasks, they are not capable of generating a caption for a given image. In this work, we repurpose such models to generate descriptive text for an image at inference time, without any further training or tuning step. This is done by combining the visual-semantic model with a large language model, benefiting from the knowledge in both web-scale models. The resulting captions are much less restrictive than those obtained by supervised captioning methods. Moreover, as a zero-shot learning method, it is extremely flexible, and we demonstrate its ability to perform image arithmetic in which the inputs can be either images or text and the output is a sentence. This enables novel high-level vision capabilities such as comparing two images or solving visual analogy tests.
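The idea of arithmetic over a shared image/text embedding space can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the three-dimensional vectors are hand-written rather than produced by any real model, the names `combine` and `best_caption` are hypothetical, and ranking a fixed list of candidate sentences stands in for the text generation step the abstract describes.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def combine(plus, minus):
    """Add the `plus` embeddings and subtract the `minus` embeddings."""
    out = [0.0] * len(plus[0])
    for vec in plus:
        out = [o + x for o, x in zip(out, vec)]
    for vec in minus:
        out = [o - x for o, x in zip(out, vec)]
    return out


def best_caption(target, candidates):
    """Return the candidate sentence whose embedding is closest to `target`."""
    return max(candidates, key=lambda text: cosine(target, candidates[text]))


# Hand-made toy embeddings (axes: royalty, male, female); NOT real model output.
king, man, woman = [1.0, 1.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]
candidates = {
    "a queen": [1.0, 0.0, 1.0],
    "a king": [1.0, 1.0, 0.0],
    "a woman": [0.0, 0.0, 1.0],
}

# Inputs could be image or text embeddings; the output is a sentence.
target = combine(plus=[king, woman], minus=[man])
result = best_caption(target, candidates)
```

In this toy setup `result` is the sentence whose vector best matches `king - man + woman`; with real embeddings the same arithmetic would be applied to encoder outputs instead of hand-written vectors.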

arithmetic, caption, knowledge, (15 more...)

arXiv: 2111.14447

Country:

Europe > Germany (0.28)
Asia > China (0.28)
Oceania > Australia > Australian Capital Territory > Canberra (0.04)
(8 more...)

Genre: Research Report (1.00)

Industry: Government > Regional Government > Europe Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Machines that answer back

AITopics Original Links, Jan-18-2017, 11:21:20 GMT

"Press one if you are calling to check your balance, press two to set up an appointment." Such automated telephone services have been annoying callers looking for simple help for several years now. On the telephone, people want to deal with a human being, not a recorded voice putting them through hoops (see article). Yet that does not have to be the case. Consider the success that some companies have had with services that respond automatically to inquiries sent by e-mail.

answer back, artificial intelligence, software, (7 more...)

Country:

North America > United States > California > Santa Clara County > Mountain View (0.06)
North America > United States > California > San Francisco County > San Francisco (0.06)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.34)

NETL: A system for representing and using real-world knowledge

Fahlman, S. E.

Classics, Feb-1-1979

This report describes a knowledge-base system in which the information is stored in a network of small parallel processing elements, node and link units, which are controlled by an external serial computer. This network is similar to the semantic network system of Quillian, but is much more tightly controlled. Such a network can perform certain critical deductions and searches very quickly; it avoids many of the problems of current systems, which must use complex heuristics to limit and guide their searches. It is argued (with examples) that the key operation in a knowledge-base system is the intersection of large explicit and semi-explicit sets. The parallel network system does this in a small, essentially constant number of cycles; a serial machine takes time proportional to the size of the sets, except in special cases.
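The contrast between the two intersection strategies can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the report's design: the `Node` class, the marker names `M1` and `M2`, and the example sets are all hypothetical, and the "parallel" steps are simulated with ordinary loops because Python runs them serially.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One hypothetical processing element holding a few marker bits."""
    name: str
    markers: set = field(default_factory=set)


def marker_intersection(nodes, set_a, set_b):
    """Intersect two sets by marking members, then polling every node once.

    On parallel hardware each of the three steps below would be a single
    broadcast cycle; the loops here only simulate that behavior.
    """
    for node in nodes.values():
        node.markers.clear()
    for name in set_a:  # cycle 1: members of A set marker "M1"
        nodes[name].markers.add("M1")
    for name in set_b:  # cycle 2: members of B set marker "M2"
        nodes[name].markers.add("M2")
    # cycle 3: every node checks its own markers and reports if both are set
    return {n.name for n in nodes.values() if {"M1", "M2"} <= n.markers}


def serial_intersection(set_a, set_b):
    """Serial baseline: work grows with the size of the smaller set."""
    small, large = sorted((set(set_a), set(set_b)), key=len)
    return {item for item in small if item in large}


# Hypothetical example data.
nodes = {name: Node(name) for name in ["clyde", "dumbo", "tweety", "rex"]}
gray_things = {"clyde", "dumbo", "rex"}
elephants = {"clyde", "dumbo"}
common = marker_intersection(nodes, gray_things, elephants)
```

Both functions return the same set; the difference is that the marker version needs a fixed number of broadcast steps regardless of set size when every node can act at once, whereas the serial version must visit each element of at least one set.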

artificial intelligence, knowledge-base system, real-world knowledge, (2 more...)

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.40)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.95)