AITopics | pianoroll

Collaborating Authors

pianoroll

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Mamba-Diffusion Model with Learnable Wavelet for Controllable Symbolic Music Generation

Zhang, Jincheng, Fazekas, György, Saitis, Charalampos

arXiv.org Artificial IntelligenceMay-7-2025

Mamba-Diffusion Model with Learnable Wavelet for Controllable Symbolic Music Generation 1 st Jincheng Zhang Centre for Digital Music Queen Mary University of London London, UK jincheng.zhang@qmul.ac.uk 2 nd Gy orgy Fazekas Centre for Digital Music Queen Mary University of London London, UK george.fazekas@qmul.ac.uk 3 rd Charalampos Saitis Centre for Digital Music Queen Mary University of London London, UK c.saitis@qmul.ac.uk Abstract --The recent surge in the popularity of diffusion models for image synthesis has attracted new attention to their potential for generation tasks in other domains. However, their applications to symbolic music generation remain largely under-explored because symbolic music is typically represented as sequences of discrete events and standard diffusion models are not well-suited for discrete data. We represent symbolic music as image-like pi-anorolls, facilitating the use of diffusion models for the generation of symbolic music. Moreover, this study introduces a novel diffusion model that incorporates our proposed Transformer-Mamba block and learnable wavelet transform. Classifier-free guidance is utilised to generate symbolic music with target chords. Our evaluation shows that our method achieves compelling results in terms of music quality and controllability, outperforming the strong baseline in pianoroll generation. Index T erms --symbolic music generation, deep learning, diffusion models, wavelet transform, Mamba I.

artificial intelligence, diffusion model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2505.03314

Country: Europe > United Kingdom > England > Greater London > London (0.65)

Genre: Research Report (0.82)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback

Text Conditioned Symbolic Drumbeat Generation using Latent Diffusion Models

Jajoria, Pushkar, McDermott, James

arXiv.org Artificial IntelligenceAug-5-2024

This study introduces a text-conditioned approach to generating drumbeats with Latent Diffusion Models (LDMs). It uses informative conditioning text extracted from training data filenames. By pretraining a text and drumbeat encoder through contrastive learning within a multimodal network, aligned following CLIP, we align the modalities of text and music closely. Additionally, we examine an alternative text encoder based on multihot text encodings. Inspired by musics multi-resolution nature, we propose a novel LSTM variant, MultiResolutionLSTM, designed to operate at various resolutions independently. In common with recent LDMs in the image space, it speeds up the generation process by running diffusion in a latent space provided by a pretrained unconditional autoencoder. We demonstrate the originality and variety of the generated drumbeats by measuring distance (both over binary pianorolls and in the latent space) versus the training dataset and among the generated drumbeats. We also assess the generated drumbeats through a listening test focused on questions of quality, aptness for the prompt text, and novelty. We show that the generated drumbeats are novel and apt to the prompt text, and comparable in quality to those created by human musicians.

dataset, drumbeat, encoder, (16 more...)

arXiv.org Artificial Intelligence

2408.02711

Country: Europe > Italy > Campania > Naples (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Structure-informed Positional Encoding for Music Generation

Agarwal, Manvi, Wang, Changhong, Richard, Gaël

arXiv.org Artificial IntelligenceFeb-28-2024

Music generated by deep learning methods often suffers from a lack of coherence and long-term organization. Yet, multi-scale hierarchical structure is a distinctive feature of music signals. To leverage this information, we propose a structure-informed positional encoding framework for music generation with Transformers. We design three variants in terms of absolute, relative and non-stationary positional information. We comprehensively test them on two symbolic music generation tasks: next-timestep prediction and accompaniment generation. As a comparison, we choose multiple baselines from the literature and demonstrate the merits of our methods using several musically-motivated evaluation metrics. In particular, our methods improve the melodic and structural consistency of the generated pieces.

baseline, positional encoding, transformer, (15 more...)

arXiv.org Artificial Intelligence

2402.13301

Country: Europe > France > Île-de-France > Paris > Paris (0.04)

Genre: Research Report (0.50)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Combinatorial music generation model with song structure graph analysis

Go, Seonghyeon, Lee, Kyogu

arXiv.org Artificial IntelligenceDec-23-2023

In this work, we propose a symbolic music generation model with the song structure graph analysis network. We construct a graph that uses information such as note sequence and instrument as node features, while the correlation between note sequences acts as the edge feature. We trained a Graph Neural Network to obtain node representation in the graph, then we use node representation as input of Unet to generate CONLON pianoroll image latent. The outcomes of our experimental results show that the proposed model can generate a comprehensive form of music. Our approach represents a promising and innovative method for symbolic music generation and holds potential applications in various fields in Music Information Retreival, including music composition, music classification, and music inpainting systems.

music, music generation, musical pattern, (12 more...)

arXiv.org Artificial Intelligence

2312.154

Country:

Asia > South Korea > Seoul > Seoul (0.05)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.91)
Information Technology > Data Science > Data Mining (0.84)

Add feedback

Graph-based Polyphonic Multitrack Music Generation

Cosenza, Emanuele, Valenti, Andrea, Bacciu, Davide

arXiv.org Artificial IntelligenceJul-27-2023

Graphs can be leveraged to model polyphonic multitrack symbolic music, where notes, chords and entire sections may be linked at different levels of the musical hierarchy by tonal and rhythmic relationships. Nonetheless, there is a lack of works that consider graph representations in the context of deep learning systems for music generation. This paper bridges this gap by introducing a novel graph representation for music and a deep Variational Autoencoder that generates the structure and the content of musical graphs separately, one after the other, with a hierarchical architecture that matches the structural priors of music. By separating the structure and content of musical graphs, it is possible to condition generation by specifying which instruments are played at certain times. This opens the door to a new form of human-computer interaction in the context of music co-creation. After training the model on existing MIDI datasets, the experiments show that the model is able to generate appealing short and long musical sequences and to realistically interpolate between them, producing music that is tonally and rhythmically consistent. Finally, the visualization of the embeddings shows that the model is able to organize its latent space in accordance with known musical concepts.

artificial intelligence, machine learning, representation, (17 more...)

arXiv.org Artificial Intelligence

2307.14928

Genre: Research Report (0.40)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Music Embedding: A Tool for Incorporating Music Theory into Computational Music Applications

HekmatiAthar, SeyyedPooya, Anwar, Mohd

arXiv.org Artificial IntelligenceApr-24-2021

Advancements in the digital technologies have enabled researchers to develop a variety of Computational Music applications. Such applications are required to capture, process, and generate data related to music. Therefore, it is important to digitally represent music in a music theoretic and concise manner. Existing approaches for representing music are ineffective in terms of utilizing music theory. In this paper, we address the disjoint of music theory and computational music by developing an opensource representation tool based on music theory. Through the wide range of use cases, we run an analysis on the classical music pieces to show the usefulness of the developed music embedding.

music, music theory, representation, (16 more...)

arXiv.org Artificial Intelligence

2104.1188

Country:

North America > United States > North Carolina > Guilford County > Greensboro (0.05)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre: Research Report (0.40)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.94)

Add feedback

Creating a music genre model with your own data in AWS DeepComposer Amazon Web Services

#artificialintelligenceJun-1-2020, 14:56:26 GMT

AWS DeepComposer is an educational AWS service that teaches generative AI and uses Generative Adversarial Networks (GANs) to transform a melody that you provide into a completely original song. With AWS DeepComposer, you can use one of the pre-trained music genre models (such as Jazz, Rock, Pop, Symphony, or Jonathan-Coulton) or train your own. As a part of training your custom music genre model, you store your music data files in NumPy objects. This post accompanies the training steps in Lab 2 – Train a custom GAN model on GitHub and demonstrates how to convert your MIDI files to the proper training format for AWS DeepComposer. For this use case, you use your own MIDI files to train a Reggae music genre model.

artificial intelligence, instrument, machine learning, (17 more...)

#artificialintelligence

Country: North America > Jamaica (0.05)

Industry:

Retail > Online (0.40)
Information Technology > Services (0.40)
Leisure & Entertainment (0.31)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.56)

Add feedback

Learning a Latent Space of Style-Aware Symbolic Music Representations by Adversarial Autoencoders

Valenti, Andrea, Carta, Antonio, Bacciu, Davide

arXiv.org Machine LearningJan-15-2020

We address the challenging open problem of learning an effective latent space for symbolic music data in generative music modeling. We focus on leveraging adversarial regularization as a flexible and natural mean to imbue variational autoencoders with context information concerning music genre and style. Through the paper, we show how Gaussian mixtures taking into account music metadata information can be used as an effective prior for the autoencoder latent space, introducing the first Music Adversarial Autoencoder (MusAE). The empirical analysis on a large scale benchmark shows that our model has a higher reconstruction accuracy than state-of-the-art models based on standard variational autoencoders. It is also able to create realistic interpolations between two musical sequences, smoothly changing the dynamics of the different tracks. Experiments show that the model can organise its latent space accordingly to low-level properties of the musical pieces, as well as to embed into the latent variables the high-level genre information injected from the prior distribution to increase its overall performance. This allows us to perform changes to the generated pieces in a principled way.

information, latent space, pianoroll, (16 more...)

arXiv.org Machine Learning

2001.05494

Country: Europe > Italy > Tuscany > Pisa Province > Pisa (0.04)

Genre: Research Report (1.00)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback