accrual
Learning Soft Constraints From Constrained Expert Demonstrations
Gaurav, Ashish, Rezaee, Kasra, Liu, Guiliang, Poupart, Pascal
Inverse reinforcement learning (IRL) methods assume that the expert data is generated by an agent optimizing some reward function. However, in many settings, the agent may optimize a reward function subject to some constraints, where the constraints induce behaviors that may be otherwise difficult to express with just a reward function. We consider the setting where the reward function is given, and the constraints are unknown, and propose a method that is able to recover these constraints satisfactorily from the expert data. While previous work has focused on recovering hard constraints, our method can recover cumulative soft constraints that the agent satisfies on average per episode. In IRL fashion, our method solves this problem by adjusting the constraint function iteratively through a constrained optimization procedure, until the agent behavior matches the expert behavior. We demonstrate our approach on synthetic environments, robotics environments and real world highway driving scenarios.
Admissibility in Strength-based Argumentation: Complexity and Algorithms (Extended Version with Proofs)
Bacquey, Yohann, Mailly, Jean-Guy, Moraitis, Pavlos, Rossit, Julien
Recently, Strength-based Argumentation Frameworks (StrAFs) have been proposed to model situations where some quantitative strength is associated with arguments. In this setting, the notion of accrual corresponds to sets of arguments that collectively attack an argument. Some semantics have already been defined, which are sensitive to the existence of accruals that collectively defeat their target, while their individual elements cannot. However, until now, only the surface of this framework and semantics have been studied. Indeed, the existing literature focuses on the adaptation of the stable semantics to StrAFs. In this paper, we push forward the study and investigate the adaptation of admissibility-based semantics. Especially, we show that the strong admissibility defined in the literature does not satisfy a desirable property, namely Dung's fundamental lemma. We therefore propose an alternative definition that induces semantics that behave as expected. We then study computational issues for these new semantics, in particular we show that complexity of reasoning is similar to the complexity of the corresponding decision problems for standard argumentation frameworks in almost all cases. We then propose a translation in pseudo-Boolean constraints for computing (strong and weak) extensions. We conclude with an experimental evaluation of our approach which shows in particular that it scales up well for solving the problem of providing one extension as well as enumerating them all.
5 Business Processes Machine Learning Is Revolutionizing
The complexity facing business has never been greater. Information pours into databases at unprecedented speed and from sources unimaginable even just a few years ago. Information on customer sentiment, employee performance, market movements, work-in-progress status, financial positions, project completion, and countless other sources is about to be dwarfed by the data generated by the Internet of Things (IoT). There's gold in this data, but most businesses don't have the tools to extract it. In their 2016 Big Data Dilemma report, members of the U.K. House of Commons Science and Technology Committee wrote: "Despite data-driven companies being 10% more productive than those that do not operationalize their data, most companies estimate they are analyzing just 12% of their data."
Bayesian Inference for Radar Imagery Based Surveillance
We are interested in creating an automated or semi-automated system with the capability of taking a set of radar imagery, collection parameters and a priori map and other tactical data, and producing likely interpretations of the possible military situations given the available evidence. This paper is concerned with the problem of the interpretation and computation of certainty or belief in the conclusions reached by such a system. For example, if we consider the problem of confirming or denying the presence of a battalion in a given area, we should include in our decision making process the prior likelihood of military presence based on tactical objectives, the evidence of military vehicles in radar image data, the spatial and tactical clustering and patterns of the vehicles extracted from the imagery, etc. Furthermore, if the user of the system has particular interests such as knowing specific deployments, location of battalion headquarters, etc., then these interests should also be responded to
On the Accrual of Arguments in Defeasible Logic Programming
Lucero, Mauro Javier Gómez (Universidad Nacional del Sur (UNS)) | Chesñevar, Carlos Iván (Universidad Nacional del Sur (UNS)) | Simari, Guillermo Ricardo (Universidad Nacional del Sur (UNS))
Recently, the notion of accrual of arguments has received some attention from the argumentation community. Three principles for argument accrual have been identified as necessary to hold in argumentation frameworks. In this paper we propose an approach to model the accrual of arguments in the context of Defeasible Logic Programming, a logic programming approach to argumentation which has proven to be successful for many real-world applications. We will analyze the above mentioned principles in the context of our proposal, studying other interesting properties.