Aligned Contrastive Predictive Coding

Chorowski, Jan, Ciesielski, Grzegorz, Dzikowski, Jarosław, Łańcucki, Adrian, Marxer, Ricard, Opala, Mateusz, Pusz, Piotr, Rychlikowski, Paweł, Stypułkowski, Michał

Jun-22-2021–arXiv.org Artificial Intelligence

We investigate the possibility of forcing a self-supervised model trained using a contrastive predictive loss, to extract slowly varying latent representations. Rather than producing individual predictions for each of the future representations, the model emits a sequence of predictions shorter than the sequence of upcoming representations to which they will be aligned. In this way, the prediction network solves a simpler task of predicting the next symbols, but not their exact timing, while the encoding network is trained to produce piece-wise constant latent codes. We evaluate the model on a speech coding task and demonstrate that the proposed Aligned Contrastive Predictive Figure 1: ACPC architecture. The encoder maps chunks of input Coding (ACPC) leads to higher linear phone prediction accuracy data into a latent space and the autoregressive model predicts and lower ABX error rates, while being slightly faster to K upcoming latent vectors. They are aligned using DTW to the train due to the reduced number of prediction heads.

acpc, prediction, representation, (12 more...)

arXiv.org Artificial Intelligence

Jun-22-2021

arXiv.org PDF

Add feedback

Country:
- North America
  - United States > New York (0.04)
  - Canada > British Columbia
    - Metro Vancouver Regional District > Vancouver (0.04)
- Europe
  - Poland > Lower Silesia Province
    - Wroclaw (0.04)
  - France > Provence-Alpes-Côte d'Azur
    - Bouches-du-Rhône > Marseille (0.04)

Genre:
- Research Report (0.40)

Industry:
- Law > Litigation (0.41)

Technology:
- Information Technology > Artificial Intelligence
  - Speech (1.00)
  - Natural Language (1.00)
  - Machine Learning
    - Neural Networks > Deep Learning (0.47)
    - Performance Analysis > Accuracy (0.35)