Online Structured Prediction via Coactive Learning

Jun-27-2012–arXiv.org Artificial Intelligence

We propose Coactive Learning as a model of interaction between a learning system and a human user, where both have the common goal of providing results of maximum utility to the user. At each step, the system (e.g. search engine) receives a context (e.g. query) and predicts an object (e.g. ranking). The user responds by correcting the system if necessary, providing a slightly improved -- but not necessarily optimal -- object as feedback. We argue that such feedback can often be inferred from observable user behavior, for example, from clicks in web-search. Evaluating predictions by their cardinal utility to the user, we propose efficient learning algorithms that have ${\cal O}(\frac{1}{\sqrt{T}})$ average regret, even though the learning algorithm never observes cardinal utility values as in conventional online learning. We demonstrate the applicability of our model and learning algorithms on a movie recommendation task, as well as ranking for web-search.

information retrieval, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

Jun-27-2012

arXiv.org PDF

Add feedback

Country:
- Europe > United Kingdom (0.28)

Genre:
- Research Report > New Finding (0.68)

Industry:
- Media > Film (0.35)
- Education > Educational Setting (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Personal Assistant Systems (0.69)
  - Natural Language > Information Retrieval (0.67)
  - Machine Learning
    - Inductive Learning (0.83)
    - Supervised Learning (0.65)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found