AITopics | guitar

Collaborating Authors

guitar

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Paul McCartney on playing guitar with Paul Mescal: 'He knew it better than I did!'

BBC NewsMay-27-2026, 23:13:45 GMT

Paul McCartney on playing guitar with Paul Mescal: 'He knew it better than I did!' Hey, I know you! exclaims Paul McCartney, gripping my hand as we walk into his office in central London. And while I'm realistic enough to know he doesn't really hold treasured memories of our previous encounters, I'm impressed by his ability to defuse the tension of Meeting A Beatle. We gather in Soho at lunchtime. Instead of Wild Honey Pie or Savoy Truffle, McCartney has opted for a simple bagel (topping: a terrifying blend of Marmite and hummus), which he prepared in a kitchenette next to his assistant's desk. As he eats, he scans a printed list of film titles - mainly vintage comedies - looking for something to play at his family movie night.

artificial intelligence, beatle, mccartney, (15 more...)

BBC News

Country:

Europe > United Kingdom (0.48)
North America > United States (0.29)

Industry:

Media > Music (1.00)
Media > Film (1.00)
Leisure & Entertainment (1.00)

Technology: Information Technology > Artificial Intelligence (0.69)

Add feedback

Taylor Swift files to trademark voice and image after AI concerns

BBC NewsApr-27-2026, 17:02:55 GMT

Taylor Swift has applied to trademark her voice and appearance in an apparent attempt to protect herself from artificial intelligence impersonations. The pop superstar has lodged three trademark applications in the US - one using a photo of herself on stage during her Eras Tour, and the other two being audio clips of her introducing herself while promoting her last album. AI-generated versions of Swift have cropped up in various ways in recent years - from explicit images to a fake election ad in which she appeared to urge people to vote for Donald Trump. The move comes after actor Matthew McConaughey became the first celebrity to use trademark rules to attempt to protect his voice and image from AI misuse earlier this year . Trademark applications are a relatively new way for celebrities to combat the growing issue of AI rip-offs.

artificial intelligence, business technology health culture art, swift, (10 more...)

BBC News

Country: North America > United States (0.90)

Industry:

Leisure & Entertainment (1.00)
Law > Intellectual Property & Technology Law (1.00)

Technology: Information Technology > Artificial Intelligence (0.53)

Add feedback

6f2268bd1d3d3ebaabb04d6b5d099425-Supplemental.pdf

Neural Information Processing SystemsFeb-8-2026, 20:29:21 GMT

epoch, resnet3d-18, xdc, (16 more...)

Neural Information Processing Systems

Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Industry: Leisure & Entertainment > Sports (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

A Graph Engine for Guitar Chord-Tone Soloing Education

Keating, Matthew, Casey, Michael

arXiv.org Artificial IntelligenceOct-23-2025

We present a graph-based engine for computing chord tone soloing suggestions for guitar students. Chord tone soloing is a fundamental practice for improvising over a chord progression, where the instrumentalist uses only the notes contained in the current chord. This practice is a building block for all advanced jazz guitar theory but is difficult to learn and practice. First, we discuss methods for generating chord-tone arpeggios. Next, we construct a weighted graph where each node represents a chord tone arpeggio for a chord in the progression. Then, we calculate the edge weight between each consecutive chord's nodes in terms of optimal transition tones. We then find the shortest path through this graph and reconstruct a chord-tone soloing line. Finally, we discuss a user-friendly system to handle input and output to this engine for guitar students to practice chord tone soloing.

artificial intelligence, chord tone, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2510.19666

Genre: Research Report (0.40)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Human Computer Interaction (0.68)

Add feedback

A Machine Learning Approach for MIDI to Guitar Tablature Conversion

Kaliakatsos-Papakostas, Maximos, Bastas, Gregoris, Makris, Dimos, Herremans, Dorien, Katsouros, Vassilis, Maragos, Petros

arXiv.org Artificial IntelligenceOct-15-2025

Guitar tablature transcription consists in deducing the string and the fret number on which each note should be played to reproduce the actual musical part. This assignment should lead to playable string-fret combinations throughout the entire track and, in general, preserve parsimonious motion between successive combinations. Throughout the history of guitar playing, specific chord fingerings have been developed across different musical styles that facilitate common idiomatic voicing combinations and motion between them. This paper presents a method for assigning guitar tablature notation to a given MIDI-based musical part (possibly consisting of multiple polyphonic tracks), i.e. no information about guitar-idiomatic expressional characteristics is involved (e.g. bending etc.) The current strategy is based on machine learning and requires a basic assumption about how much fingers can stretch on a fretboard; only standard 6-string guitar tuning is examined. The proposed method also examines the transcription of music pieces that was not meant to be played or could not possibly be played by a guitar (e.g. potentially a symphonic orchestra part), employing a rudimentary method for augmenting musical information and training/testing the system with artificial data. The results present interesting aspects about what the system can achieve when trained on the initial and augmented dataset, showing that the training with augmented data improves the performance even in simple, e.g. monophonic, cases. Results also indicate weaknesses and lead to useful conclusions about possible improvements.

evolutionary algorithm, machine learning, tablature, (20 more...)

arXiv.org Artificial Intelligence

doi: 10.5281/zenodo.6822204

2510.10619

Country: Europe > Greece (0.14)

Genre: Research Report (0.50)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.68)

Add feedback

GOAT: A Large Dataset of Paired Guitar Audio Recordings and Tablatures

Loth, Jackson, Sarmento, Pedro, Sarkar, Saurjya, Guo, Zixun, Barthet, Mathieu, Sandler, Mark

arXiv.org Artificial IntelligenceSep-30-2025

In recent years, the guitar has received increased attention from the music information retrieval (MIR) community driven by the challenges posed by its diverse playing techniques and sonic characteristics. Mainly fueled by deep learning approaches, progress has been limited by the scarcity and limited annotations of datasets. To address this, we present the Guitar On Audio and Tablatures (GOAT) dataset, comprising 5.9 hours of unique high-quality direct input audio recordings of electric guitars from a variety of different guitars and players. We also present an effective data augmentation strategy using guitar amplifiers which delivers near-unlimited tonal variety, of which we provide a starting 29.5 hours of audio. Each recording is annotated using guitar tablatures, a guitar-specific symbolic format supporting string and fret numbers, as well as numerous playing techniques. For this we utilise both the Guitar Pro format, a software for tablature playback and editing, and a text-like token encoding. Furthermore, we present competitive results using GOAT for MIDI transcription and preliminary results for a novel approach to automatic guitar tablature transcription. We hope that GOAT opens up the possibilities to train novel models on a wide variety of guitar-related MIR tasks, from synthesis to transcription to playing technique detection.

artificial intelligence, deep learning, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2509.22655

Country: Asia > Japan (0.28)

Genre: Research Report > Promising Solution (0.68)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

SemanticControl: A Training-Free Approach for Handling Loosely Aligned Visual Conditions in ControlNet

Joung, Woosung, Chae, Daewon, Kim, Jinkyu

arXiv.org Artificial IntelligenceSep-29-2025

ControlNet has enabled detailed spatial control in text-to-image diffusion models by incorporating additional visual conditions such as depth or edge maps. However, its effectiveness heavily depends on the availability of visual conditions that are precisely aligned with the generation goal specified by text prompt-a requirement that often fails in practice, especially for uncommon or imaginative scenes. For example, generating an image of a cat cooking in a specific pose may be infeasible due to the lack of suitable visual conditions. In contrast, structurally similar cues can often be found in more common settings-for instance, poses of humans cooking are widely available and can serve as rough visual guides. Unfortunately, existing ControlNet models struggle to use such loosely aligned visual conditions, often resulting in low text fidelity or visual artifacts. To address this limitation, we propose SemanticControl, a training-free method for effectively leveraging misaligned but semantically relevant visual conditions. Our approach adaptively suppresses the influence of the visual condition where it conflicts with the prompt, while strengthening guidance from the text. The key idea is to first run an auxiliary denoising process using a surrogate prompt aligned with the visual condition (e.g., "a human playing guitar" for a human pose condition) to extract informative attention masks, and then utilize these masks during the denoising of the actual target prompt (e.g., cat playing guitar). Experimental results demonstrate that our method improves performance under loosely aligned conditions across various conditions, including depth maps, edge maps, and human skeletons, outperforming existing baselines. Our code is available at https://mung3477.github.io/semantic-control.

artificial intelligence, machine learning, visual condition, (15 more...)

arXiv.org Artificial Intelligence

2509.21938

Country: North America > United States > Michigan (0.28)

Genre: Research Report > New Finding (0.88)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

Think Before You Segment: An Object-aware Reasoning Agent for Referring Audio-Visual Segmentation

Zhou, Jinxing, Zhou, Yanghao, Han, Mingfei, Wang, Tong, Chang, Xiaojun, Cholakkal, Hisham, Anwer, Rao Muhammad

arXiv.org Artificial IntelligenceAug-7-2025

Referring Audio-Visual Segmentation (Ref-AVS) aims to segment target objects in audible videos based on given reference expressions. Prior works typically rely on learning latent embeddings via multimodal fusion to prompt a tunable SAM/SAM2 decoder for segmentation, which requires strong pixel-level supervision and lacks interpretability. From a novel perspective of explicit reference understanding, we propose TGS-Agent, which decomposes the task into a Think-Ground-Segment process, mimicking the human reasoning procedure by first identifying the referred object through multimodal analysis, followed by coarse-grained grounding and precise segmentation. To this end, we first propose Ref-Thinker, a multimodal language model capable of reasoning over textual, visual, and auditory cues. We construct an instruction-tuning dataset with explicit object-aware think-answer chains for Ref-Thinker fine-tuning. The object description inferred by Ref-Thinker is used as an explicit prompt for Grounding-DINO and SAM2, which perform grounding and segmentation without relying on pixel-level supervision. Additionally, we introduce R\textsuperscript{2}-AVSBench, a new benchmark with linguistically diverse and reasoning-intensive references for better evaluating model generalization. Our approach achieves state-of-the-art results on both standard Ref-AVSBench and proposed R\textsuperscript{2}-AVSBench. Code will be available at https://github.com/jasongief/TGS-Agent.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2508.04418

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
(2 more...)

Add feedback

Music Source Restoration

Zang, Yongyi, Dai, Zheqi, Plumbley, Mark D., Kong, Qiuqiang

arXiv.org Artificial IntelligenceMay-29-2025

We introduce Music Source Restoration (MSR), a novel task addressing the gap between idealized source separation and real-world music production. Current Music Source Separation (MSS) approaches assume mixtures are simple sums of sources, ignoring signal degradations employed during music production like equalization, compression, and reverb. MSR models mixtures as degraded sums of individually degraded sources, with the goal of recovering original, undegraded signals. Due to the lack of data for MSR, we present RawStems, a dataset annotation of 578 songs with unprocessed source signals organized into 8 primary and 17 secondary instrument groups, totaling 354.13 hours. To the best of our knowledge, RawStems is the first dataset that contains unprocessed music stems with hierarchical categories. We consider spectral filtering, dynamic range compression, harmonic distortion, reverb and lossy codec as possible degradations, and establish U-Former as a baseline method, demonstrating the feasibility of MSR on our dataset. We release the RawStems dataset annotations, degradation simulation pipeline, training code and pre-trained models to be publicly available.

artificial intelligence, deep learning, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2505.21827

Country: North America (0.28)

Genre: Research Report (1.00)

Industry:

Leisure & Entertainment (0.93)
Media > Music (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback