xtt 2
10 Supplementary Material for the paper LeadCache Regret Optimal Caching in Networks by and
Following Cohen and Hazan [2015] we derive a general expression for the regret upper bound applicable to any linear reward function under an anytime FTPL policy. This is accomplished in the following steps. First, we extend the argument of Cohen and Hazan [2015] to the anytime setting. Then, we specialize this bound to our problem setting. Recall the notations used in the paper - the aggregate file-request sequence from all users is denoted by {xt}t 1 and the virtual cache configuration sequence is denoted by {zt}t 1. Define the cumulative requests up to time tas: Xt = Furthermore, since the max function 14 is convex, we may interchange the expectation and gradient to obtain Φηt(Xt) =E(zt) [Bertsekas, 1973, Proposition 2.2]. Plugging in the expression of the inner product from Eqn. (25) in expression (26), we obtain: Bounding the term (a): Next, to upper bound the expected regret, we control term (a) in inequality (28).
On ALSV Rules Formulation and Inference
Nalepa, Grzegorz Jacek (AGH University of Science and Technology) | Ligeza, Antoni (AGH University of Science and Technology)
In this paper knowledge representation and inference issues for rule-based systems are discussed. The paper deals with improving the logical calculus of Set Attributive Logic founding an expressive rule language XTT2. Representation extensions are introduced, and practical inference rules provided. The original includes an extended state specification, as well as interpreter design. xamples of rule analysis are given. Visual design tool HQed assuring rule quality is also presented.