Google open-sources datasets for AI assistants with human-level understanding
Both datasets are being shared by Google AI researchers to supply the training material necessary to model natural language systems that achieve human-level performance. Google researchers call CCPE a new way to collect voice data. It includes 500 dialogues with people about their movie preferences -- 10,000 in total, across 12,000 utterances. Movie preferences were chosen as a topic because of the value of metadata such as the names of actors and directors. "We do not restrict the workers to detailed scripts or to a small knowledge base and hence we observe that our dataset contains more realistic and diverse conversations in comparison to existing datasets," a paper published covering CCPE reads.
Sep-8-2019, 02:53:48 GMT