Information Retrieval for ZeroSpeech 2021: The Submission by University of Wroclaw
Chorowski, Jan, Ciesielski, Grzegorz, Dzikowski, Jarosław, Łańcucki, Adrian, Marxer, Ricard, Opala, Mateusz, Pusz, Piotr, Rychlikowski, Paweł, Stypułkowski, Michał
–arXiv.org Artificial Intelligence
We build on the In this paper we present our submission which tries to address unsupervised representations of speech proposed by the organizers all four tasks. We extend the baseline solution in several as a baseline, derived from CPC and clustered with the k-directions: we refine the intermediate representations, extracted means algorithm. We demonstrate that simple methods of refining with CPC, to directly improve the ABX scores. We show that those representations can narrow the gap, or even improve such representations can be used to perform simple fuzzy lookups upon the solutions which use a high computational budget. The in a large dataset, and even extract some common patterns results lead to the conclusion that the CPC-derived representations that serve as pseudo-words. Our approach to the semantic word are still too noisy for training language models, but stable similarity task is also based on pseudo-words.
arXiv.org Artificial Intelligence
Jun-22-2021
- Country:
- Oceania > Australia
- North America > United States
- Maryland > Baltimore (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Europe
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- Poland > Lower Silesia Province
- Wroclaw (0.40)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Spain > Valencian Community
- Genre:
- Research Report (0.64)
- Technology: