Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook

Lin, Baihan

arXiv.org Artificial Intelligence 

As two cornerstones of modern day technologies, speech processing and natural language processing (NLP) are innately sequence learning problems to extract information from these linguistic or speech signals and provide insights into interactive systems to communicate in human understandable languages. The sequential and interactive nature of these problems can make them well-suited into the algorithmic framework of reinforcement learning (RL). In a reinforcement learning setting, an agent interacts with an environment through observations and actions, and based on the reward feedback attributed by the underlying reward function of this environment, the agent learns how to perform the task of interest through trials and errors. While the successful applications of reinforcement learning have been highlighted by a wide range of surveys in many real-world engineering domains such as robotics [1], vision [2], finance [3], healthcare [4], linguistics [5], and energy management [6], there have not been one for the rich community of both the speech and language domains. This is the first survey that emphasizes the synergy among the growing fields of the speech processing, natural language processing and the reinforcement learning. We aim to fill this gap by adopting a complete, timely and classical view of the reinforcement learning problems and their connections to speech and language processing.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found