Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook

Oct-19-2023–arXiv.org Artificial Intelligence

As two cornerstones of modern day technologies, speech processing and natural language processing (NLP) are innately sequence learning problems to extract information from these linguistic or speech signals and provide insights into interactive systems to communicate in human understandable languages. The sequential and interactive nature of these problems can make them well-suited into the algorithmic framework of reinforcement learning (RL). In a reinforcement learning setting, an agent interacts with an environment through observations and actions, and based on the reward feedback attributed by the underlying reward function of this environment, the agent learns how to perform the task of interest through trials and errors. While the successful applications of reinforcement learning have been highlighted by a wide range of surveys in many real-world engineering domains such as robotics [1], vision [2], finance [3], healthcare [4], linguistics [5], and energy management [6], there have not been one for the rich community of both the speech and language domains. This is the first survey that emphasizes the synergy among the growing fields of the speech processing, natural language processing and the reinforcement learning. We aim to fill this gap by adopting a complete, timely and classical view of the reinforcement learning problems and their connections to speech and language processing.

reinforcement, reinforcement learning and bandit, speech and language processing, (7 more...)

arXiv.org Artificial Intelligence

Oct-19-2023

arXiv.org PDF

Add feedback

Country:
- Oceania > New Zealand
  - North Island > Auckland Region > Auckland (0.04)
- North America
  - Mexico (0.04)
  - United States
    - New York > New York County
      - New York City (0.04)
    - Massachusetts > Middlesex County
      - Cambridge (0.04)
    - Illinois > Cook County
      - Chicago (0.04)
- Europe > United Kingdom
  - Scotland > City of Edinburgh > Edinburgh (0.04)
- Asia > Japan
  - Honshū > Kantō > Kanagawa Prefecture > Yokohama (0.04)

Genre:
- Research Report (1.00)
- Overview (1.00)
- Instructional Material > Course Syllabus & Notes (1.00)

Industry:
- Leisure & Entertainment (1.00)
- Energy (1.00)
- Health & Medicine > Therapeutic Area
  - Neurology (0.45)
- Education
  - Educational Setting > Online (0.67)
  - Focused Education > Special Education (0.45)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (0.92)
  - Machine Learning
    - Reinforcement Learning (1.00)
    - Neural Networks > Deep Learning (1.00)
    - Learning Graphical Models
      - Undirected Networks > Markov Models (1.00)
      - Directed Networks > Bayesian Learning (0.67)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found