AITopics | epq

Collaborating Authors

epq

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

RTRLfactorstheerrorgradientacrosstimeandspaceas dE dWpq |t = X

Neural Information Processing SystemsFeb-9-2026, 17:16:53 GMT

Threshold adaptation is not used here in order to focus on capacity of the temporal credit propagation mechanism. As an example in our network, variableWpq can impact statesp,t directly through Eq. It keeps track of all the paths thatzj,t can affectWpq (for every j,p,q). With these definitions, we call theW-chainWi1,...,is 1 = First, we link modulatory weights to type-specific GPCR efficacies, which means they are type-specific,i.e. To what extent they are linked in the brain is unclear.

artificial intelligence, epq, machine learning, (19 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Exclusively Penalized Q-learning for Offline Reinforcement Learning

Neural Information Processing SystemsOct-10-2025, 16:54:18 GMT

Reinforcement learning (RL) is gaining significant attention for solving complex Markov decision process (MDP) tasks.

dataset, learning, penalty, (14 more...)

Neural Information Processing Systems

Country:

Europe > Italy > Lazio > Rome (0.04)
Asia > South Korea > Ulsan > Ulsan (0.04)

Genre: Research Report > Experimental Study (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Exclusively Penalized Q-learning for Offline Reinforcement Learning

Yeom, Junghyuk, Jo, Yonghyeon, Kim, Jungmo, Lee, Sanghyeon, Han, Seungyul

arXiv.org Artificial IntelligenceMay-22-2024

Constraint-based offline reinforcement learning (RL) involves policy constraints or imposing penalties on the value function to mitigate overestimation errors caused by distributional shift. This paper focuses on a limitation in existing offline RL methods with penalized value function, indicating the potential for underestimation bias due to unnecessary bias introduced in the value function. To address this concern, we propose Exclusively Penalized Q-learning (EPQ), which reduces estimation bias in the value function by selectively penalizing states that are prone to inducing estimation errors. Numerical results show that our method significantly reduces underestimation bias and improves performance in various offline control tasks compared to other offline RL methods.

dataset, penalty, q-function, (15 more...)

arXiv.org Artificial Intelligence

2405.14082

Country: Asia > South Korea > Ulsan > Ulsan (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.66)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

ForecastTKGQuestions: A Benchmark for Temporal Question Answering and Forecasting over Temporal Knowledge Graphs

Ding, Zifeng, Li, Zongyue, Qi, Ruoxia, Wu, Jingpei, He, Bailan, Ma, Yunpu, Meng, Zhao, Chen, Shuo, Liao, Ruotong, Han, Zhen, Tresp, Volker

arXiv.org Artificial IntelligenceJul-18-2023

Question answering over temporal knowledge graphs (TKGQA) has recently found increasing interest. TKGQA requires temporal reasoning techniques to extract the relevant information from temporal knowledge bases. The only existing TKGQA dataset, i.e., CronQuestions, consists of temporal questions based on the facts from a fixed time period, where a temporal knowledge graph (TKG) spanning the same period can be fully used for answer inference, allowing the TKGQA models to use even the future knowledge to answer the questions based on the past facts. In real-world scenarios, however, it is also common that given the knowledge until now, we wish the TKGQA systems to answer the questions asking about the future. As humans constantly seek plans for the future, building TKGQA systems for answering such forecasting questions is important. Nevertheless, this has still been unexplored in previous research. In this paper, we propose a novel task: forecasting question answering over temporal knowledge graphs. We also propose a large-scale TKGQA benchmark dataset, i.e., ForecastTKGQuestions, for this task. It includes three types of questions, i.e., entity prediction, yes-no, and fact reasoning questions. For every forecasting question in our dataset, QA models can only have access to the TKG information before the timestamp annotated in the given question for answer inference. We find that the state-of-the-art TKGQA methods perform poorly on forecasting questions, and they are unable to answer yes-no questions and fact reasoning questions. To this end, we propose ForecastTKGQA, a TKGQA model that employs a TKG forecasting module for future inference, to answer all three types of questions. Experimental results show that ForecastTKGQA outperforms recent TKGQA methods on the entity prediction questions, and it also shows great effectiveness in answering the other two types of questions.

natural language, question answering, temporal reasoning, (15 more...)

arXiv.org Artificial Intelligence

2208.06501

Country: