Training a Goal-Oriented Chatbot with Deep Reinforcement Learning -- Part III
The dialogue state tracker or just state tracker (ST) in a goal-oriented dialogue system has the primary job of preparing the state for the agent. As we discussed in the previous part the agent needs a useful state to be able to make a good choice on what action to take. The ST updates its internal history of the dialogue by collecting both user and agent actions as they are taken. It also keeps track of all inform slots that have been contained in any agent and user actions thus far in the current episode. The state used by the agent is a numpy array made of information from the current history and the current informs of the ST. In addition, whenever the agent wishes to inform a slot to the user the ST queries the database for a value that works given its current informs.
Sep-15-2019, 16:19:07 GMT
- Technology: