epq
- Information Technology > Artificial Intelligence > Robots (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
RTRLfactorstheerrorgradientacrosstimeandspaceas dE dWpq |t = X
Threshold adaptation is not used here in order to focus on capacity of the temporal credit propagation mechanism. As an example in our network, variableWpq can impact statesp,t directly through Eq. It keeps track of all the paths thatzj,t can affectWpq (for every j,p,q). With these definitions, we call theW-chainWi1,...,is 1 = First, we link modulatory weights to type-specific GPCR efficacies, which means they are type-specific,i.e. To what extent they are linked in the brain is unclear.
Exclusively Penalized Q-learning for Offline Reinforcement Learning
Yeom, Junghyuk, Jo, Yonghyeon, Kim, Jungmo, Lee, Sanghyeon, Han, Seungyul
Constraint-based offline reinforcement learning (RL) involves policy constraints or imposing penalties on the value function to mitigate overestimation errors caused by distributional shift. This paper focuses on a limitation in existing offline RL methods with penalized value function, indicating the potential for underestimation bias due to unnecessary bias introduced in the value function. To address this concern, we propose Exclusively Penalized Q-learning (EPQ), which reduces estimation bias in the value function by selectively penalizing states that are prone to inducing estimation errors. Numerical results show that our method significantly reduces underestimation bias and improves performance in various offline control tasks compared to other offline RL methods.
- Research Report > Experimental Study (1.00)
- Research Report > New Finding (0.66)
ForecastTKGQuestions: A Benchmark for Temporal Question Answering and Forecasting over Temporal Knowledge Graphs
Ding, Zifeng, Li, Zongyue, Qi, Ruoxia, Wu, Jingpei, He, Bailan, Ma, Yunpu, Meng, Zhao, Chen, Shuo, Liao, Ruotong, Han, Zhen, Tresp, Volker
Question answering over temporal knowledge graphs (TKGQA) has recently found increasing interest. TKGQA requires temporal reasoning techniques to extract the relevant information from temporal knowledge bases. The only existing TKGQA dataset, i.e., CronQuestions, consists of temporal questions based on the facts from a fixed time period, where a temporal knowledge graph (TKG) spanning the same period can be fully used for answer inference, allowing the TKGQA models to use even the future knowledge to answer the questions based on the past facts. In real-world scenarios, however, it is also common that given the knowledge until now, we wish the TKGQA systems to answer the questions asking about the future. As humans constantly seek plans for the future, building TKGQA systems for answering such forecasting questions is important. Nevertheless, this has still been unexplored in previous research. In this paper, we propose a novel task: forecasting question answering over temporal knowledge graphs. We also propose a large-scale TKGQA benchmark dataset, i.e., ForecastTKGQuestions, for this task. It includes three types of questions, i.e., entity prediction, yes-no, and fact reasoning questions. For every forecasting question in our dataset, QA models can only have access to the TKG information before the timestamp annotated in the given question for answer inference. We find that the state-of-the-art TKGQA methods perform poorly on forecasting questions, and they are unable to answer yes-no questions and fact reasoning questions. To this end, we propose ForecastTKGQA, a TKGQA model that employs a TKG forecasting module for future inference, to answer all three types of questions. Experimental results show that ForecastTKGQA outperforms recent TKGQA methods on the entity prediction questions, and it also shows great effectiveness in answering the other two types of questions.
- Europe > Switzerland > Zürich > Zürich (0.14)
- South America > Argentina (0.14)
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- (29 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Temporal Reasoning (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.91)