GANterpretations

Nov-6-2020–arXiv.org Artificial Intelligence

Since the introduction of Generative Adversarial Networks (GANs) [Goodfellow et al., 2014], there has been a steady stream of both technical advances (e.g., Arjovsky et al. [2017]) and creative uses of these generative models (e.g., [Karras et al., 2019, Zhu et al., 2017, Jin et al., 2017]). In this work we propose an approach for using GANs to automatically generate videos to accompany audio recordings by aligning the video to spectral properties of the recording. This allows musicians to explore new forms of multi-modal creative expression, where a musical performance can induce an AI-generated musical video that is guided by said performance, and it provides a medium for creating a visual narrative to follow a storyline (similar to what was proposed by Frosst and Kereliuk [2019]). GANs generate images from points in a latent space. When trained properly, these latent spaces are learned in a structured manner, where nearby points generate similar images. For our work we make use of the BigGAN family of models [Brock et al., 2019], which are class-conditional generative models.
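The idea of aligning a walk through latent space to spectral properties of audio can be illustrated with a small sketch. This is not the authors' implementation: it assumes spectral flux (the summed positive change in magnitude spectrum between consecutive frames) as the spectral property, linear interpolation between two latent points, and a toy 8-dimensional latent vector. The names `spectral_flux` and `latent_path` are hypothetical, and no generator is included; in practice each vector in the path would be passed, together with a class label, to a class-conditional generator such as BigGAN to render one video frame.

```python
import cmath
import math
import random


def magnitude_spectrum(frame):
    """Naive DFT magnitudes for one frame (adequate for short illustrative frames)."""
    n = len(frame)
    return [
        abs(sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)))
        for k in range(n // 2 + 1)
    ]


def spectral_flux(samples, frame_size=64):
    """Per-frame spectral change: summed positive magnitude differences
    between consecutive non-overlapping frames. The first frame gets 0."""
    frames = [
        samples[i:i + frame_size]
        for i in range(0, len(samples) - frame_size + 1, frame_size)
    ]
    spectra = [magnitude_spectrum(f) for f in frames]
    flux = [0.0]
    for prev, cur in zip(spectra, spectra[1:]):
        flux.append(sum(max(c - p, 0.0) for p, c in zip(prev, cur)))
    return flux


def latent_path(z_start, z_end, flux):
    """Return one latent vector per audio frame. Progress from z_start to
    z_end follows cumulative flux, so the path moves fastest where the
    audio changes most (an assumption of this sketch)."""
    if not flux:
        return []
    cumulative, running = [], 0.0
    for f in flux:
        running += f
        cumulative.append(running)
    total = cumulative[-1]
    path = []
    for c in cumulative:
        alpha = c / total if total > 0 else 0.0
        path.append([(1 - alpha) * a + alpha * b for a, b in zip(z_start, z_end)])
    return path


# Toy input: a tone that jumps in pitch halfway through the recording.
random.seed(0)
z_a = [random.gauss(0, 1) for _ in range(8)]  # toy latent size, assumed
z_b = [random.gauss(0, 1) for _ in range(8)]
samples = [
    math.sin(2 * math.pi * (4 if t < 128 else 12) * t / 64) for t in range(256)
]
flux = spectral_flux(samples)
path = latent_path(z_a, z_b, flux)
```

Because nearby latent points generate similar images, frames with little spectral change yield nearly identical latent vectors, while the pitch jump in the toy signal moves the path most of the way from `z_a` to `z_b` in a single step.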

Country:
- North America > Canada (0.15)
- Oceania > Australia (0.15)

Genre:
- Research Report (0.40)

Industry:
- Leisure & Entertainment (0.59)
- Media > Music (0.59)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.57)
