Plotting





Belief-State Query Policies for User-Aligned POMDPs

Neural Information Processing Systems

Planning in real-world settings often entails addressing partial observability while aligning with users' requirements. We present a novel framework for expressing users' constraints and preferences about agent behavior in a partially observable setting using parameterized belief-state query (BSQ) policies in the setting of goaloriented partially observable Markov decision processes (gPOMDPs). We present the first formal analysis of such constraints and prove that while the expected cost function of a parameterized BSQ policy w.r.t its parameters is not convex, it is piecewise constant and yields an implicit discrete parameter search space that is finite for finite horizons. This theoretical result leads to novel algorithms that optimize gPOMDP agent behavior with guaranteed user alignment. Analysis proves that our algorithms converge to the optimal user-aligned behavior in the limit. Empirical results show that parameterized BSQ policies provide a computationally feasible approach for user-aligned planning in partially observable settings.



A Theory of Optimistically Universal Online Learnability for General Concept Classes

Neural Information Processing Systems

We provide a full characterization of the concept classes that are optimistically universally online learnable with {0, 1} labels. The notion of optimistically universal online learning was defined in [Hanneke, 2021] in order to understand learnability under minimal assumptions. In this paper, following the philosophy behind that work, we investigate two questions, namely, for every concept class: (1) What are the minimal assumptions on the data process admitting online learnability?


Appendix A Optimal Path Search

Neural Information Processing Systems

For the optimal route search we want to find the shortest path in a modified graph. We adapt the standard Dijkstra's algorithm for the same. A priority queue is maintained for all the unvisited nodes. Each unvisited node is mapped to the minimum cost to reach that node from the source node, using the visited ones. In each iteration the minimum element is popped from the queue and it's neighbours are updated using the negative log likelihoods found from the model and added to the queue (lines 7-11).


MLR: Robust & Reliable Route Recommendation on Road Networks

Neural Information Processing Systems

Predicting the most likely route from a source location to a destination is a core functionality in mapping services. Although the problem has been studied in the literature, two key limitations remain to be addressed. First, our study reveals that a significant portion of the routes recommended by existing methods fail to reach the destination. Second, existing techniques are transductive in nature; hence, they fail to recommend routes if unseen roads are encountered at inference time.