See, Hear, Explore: curiosity via audio-visual association

Jun-29-2021, 12:26:11 GMT–AIHub

To compute audio features, we take an audio clip spanning 4 time steps (th of a second for these 60 frame per second environments) and apply a Fast Fourier Transform (FFT). The FFT output is downsampled using max pooling to a 512-dimensional feature vector, which is used as input to the discriminator along with a 512-dimensional visual feature vector.

agent, discriminator, exploration, (17 more...)

AIHub

Jun-29-2021, 12:26:11 GMT

News Web Page

Add feedback

Industry:
- Leisure & Entertainment > Games > Computer Games (0.49)

Technology:
- Information Technology
  - Data Science > Data Quality
    - Data Transformation (0.54)
  - Artificial Intelligence > Machine Learning
    - Supervised Learning (0.71)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found