wavenet
- North America > United States (0.14)
- Europe > France > Île-de-France > Paris > Paris (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- Media > Music (1.00)
- Leisure & Entertainment (1.00)
- North America > Canada (0.04)
- Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
- Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.05)
- North America > United States (0.04)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Deep Voice 2: Multi-Speaker Neural Text-to-Speech
Andrew Gibiansky, Sercan Arik, Gregory Diamos, John Miller, Kainan Peng, Wei Ping, Jonathan Raiman, Yanqi Zhou
We introduce a technique for augmenting neural text-to-speech (TTS) with low-dimensional trainable speaker embeddings to generate different voices from a single model. As a starting point, we show improvements over the two state-of-the-art approaches for single-speaker neural TTS: Deep V oice 1 and Tacotron.
- North America > United States > California > Santa Clara County > Sunnyvale (0.04)
- North America > United States > California > Los Angeles County > Long Beach (0.04)
- Asia (0.04)
- North America > United States (0.14)
- Europe > France > Île-de-France > Paris > Paris (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- Media > Music (1.00)
- Leisure & Entertainment (1.00)
Parametric Neural Amp Modeling with Active Learning
Grötschla, Florian, Jiao, Longxiang, Lanzendörfer, Luca A., Wattenhofer, Roger
ABSTRACT We introduce PANAMA, an active learning framework to train parametric guitar amp models end-to-end using a combination of an LSTM model and a WaveNet-like architecture. With PANAMA, one can create a virtual amp by recording samples that are determined through an ensemble-based active learning strategy to minimize the amount of datapoints needed (i.e., amp knob settings). Our strategy uses gradient-based optimization to maximize the disagreement among ensemble models, in order to identify the most informative dat-apoints. MUSHRA listening tests reveal that, with 75 data-points, our models are able to match the perceptual quality of NAM, the leading open-source non-parametric amp modeler. Index T erms-- neural amp modeling, active learning 1. INTRODUCTION In recent years, data-driven guitar amp modeling has become increasingly popular.
- North America > Panama (0.45)
- Europe > Switzerland > Zürich > Zürich (0.04)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
- Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
We thank all the reviewers for their valuable comments
We thank all the reviewers for their valuable comments. We would like to clarify that, 'When the model was trained without the mel-spectrogram loss, the training process We also think that applying the L1/L2 loss gives no disadvantage in one-to-one mapping as our work. We will clarify the details of the experiments in Section 3. Table 1: Mean Opinion Scores. All models were trained up to 500k steps. MOS evaluation results are shown in [Table 1].