AITopics | reverb

Collaborating Authors

reverb

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Reverb: Open-Source ASR and Diarization from Rev

Bhandari, Nishchal, Chen, Danny, Fernández, Miguel Ángel del Río, Delworth, Natalie, Fox, Jennifer Drexler, Jetté, Migüel, McNamara, Quinten, Miller, Corey, Novotný, Ondřej, Profant, Ján, Qin, Nan, Ratajczak, Martin, Robichaud, Jean-Philippe

arXiv.org Artificial IntelligenceOct-4-2024

Today, we are open-sourcing our core speech recognition and diarization models for non-commercial use. We are releasing both a full production pipeline for developers as well as pared-down research models for experimentation. Rev hopes that these releases will spur research and innovation in the fast-moving domain of voice technology. The speech recognition models released today outperform all existing open source speech recognition models across a variety of long-form speech recognition domains.

open-source asr and diarization, rev, speech recognition, (10 more...)

arXiv.org Artificial Intelligence

2410.0393

Genre: Research Report (0.51)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.96)

Add feedback

GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models

Wang, Hanjing, Sit, Man-Kit, He, Congjie, Wen, Ying, Zhang, Weinan, Wang, Jun, Yang, Yaodong, Mai, Luo

arXiv.org Artificial IntelligenceOct-8-2023

This paper introduces a distributed, GPU-centric experience replay system, GEAR, designed to perform scalable reinforcement learning (RL) with large sequence models (such as transformers). With such models, existing systems such as Reverb face considerable bottlenecks in memory, computation, and communication. GEAR, however, optimizes memory efficiency by enabling the memory resources on GPU servers (including host memory and device memory) to manage trajectory data. Furthermore, it facilitates decentralized GPU devices to expedite various trajectory selection strategies, circumventing computational bottlenecks. GEAR is equipped with GPU kernels capable of collecting trajectories using zero-copy access to host memory, along with remote-directed-memory access over InfiniBand, improving communication efficiency. Cluster experiments have shown that GEAR can achieve performance levels up to 6x greater than Reverb when training state-of-the-art large RL models. GEAR is open-sourced at https://github.com/bigrl-team/gear.

selection, server, trajectory, (12 more...)

arXiv.org Artificial Intelligence

2310.05205

Country:

Asia > China > Shanghai > Shanghai (0.04)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Leisure & Entertainment > Games > Computer Games (0.46)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Graphics (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
(3 more...)

Add feedback

Unsupervised vocal dereverberation with diffusion-based generative models

Saito, Koichi, Murata, Naoki, Uesaka, Toshimitsu, Lai, Chieh-Hsin, Takida, Yuhta, Fukui, Takao, Mitsufuji, Yuki

arXiv.org Artificial IntelligenceNov-8-2022

Removing reverb from reverberant music is a necessary technique to clean up audio for downstream music manipulations. Reverberation of music contains two categories, natural reverb, and artificial reverb. Artificial reverb has a wider diversity than natural reverb due to its various parameter setups and reverberation types. However, recent supervised dereverberation methods may fail because they rely on sufficiently diverse and numerous pairs of reverberant observations and retrieved data for training in order to be generalizable to unseen observations during inference. To resolve these problems, we propose an unsupervised method that can remove a general kind of artificial reverb for music without requiring pairs of data for training. The proposed method is based on diffusion models, where it initializes the unknown reverberation operator with a conventional signal processing technique and simultaneously refines the estimate with the help of diffusion models. We show through objective and perceptual evaluations that our method outperforms the current leading vocal dereverberation benchmarks.

diffusion model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2211.04124

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Asia > South Korea > Seoul > Seoul (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Information Technology > Artificial Intelligence > Natural Language > Generation (0.41)

Add feedback

Impulse Response -- data augmentation for audio deep learning

#artificialintelligenceAug-22-2021, 01:40:11 GMT

In recent years, deep learning for audio has come a long way with models beating traditional signal processing techniques in many of the downstream tasks. However, many such solutions are trained on "homogeneous" datasets -- datasets where there is little variability in the recording conditions (noise, accent, language, etc.). Many such models do not perform very well (especially audio conversion/synthesis tasks) when used on real world "audio events" which can contain short burst, environment noises, background speakers, poor microphones, etc. While there are many techniques address them, here we concern ourselves with data augmentation with impulse responses, which at times can be really powerful since it simulates different recording environments. An impulse response of a dynamic system describes how it reacts when presented with a brief input signal called the impulse.

audio deep learning, data augmentation, deep learning, (9 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.63)

Add feedback

Reverb: A Framework For Experience Replay

Cassirer, Albin, Barth-Maron, Gabriel, Brevdo, Eugene, Ramos, Sabela, Boyd, Toby, Sottiaux, Thibault, Kroiss, Manuel

arXiv.org Artificial IntelligenceFeb-9-2021

A central component of training in Reinforcement Learning (RL) is Experience: the data used for training. The mechanisms used to generate and consume this data have an important effect on the performance of RL algorithms. In this paper, we introduce Reverb: an efficient, extensible, and easy to use system designed specifically for experience replay in RL. Reverb is designed to work efficiently in distributed configurations with up to thousands of concurrent clients. The flexible API provides users with the tools to easily and accurately configure the replay buffer. It includes strategies for selecting and removing elements from the buffer, as well as options for controlling the ratio between sampled and inserted elements. This paper presents the core design of Reverb, gives examples of how it can be applied, and provides empirical results of Reverb's performance characteristics.

experience replay, reverb, server, (13 more...)

arXiv.org Artificial Intelligence

2102.04736

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
North America > United States > California (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Cooper FX Arcades review: Plumbing the depths of lo-fi guitar effects

EngadgetOct-7-2020, 12:00:43 GMT

Let's get one thing out of the way right up front: Yes, the main conceit of the $329 Cooper FX Arcades is a little gimmicky. It's a guitar pedal into which you stick cards to apply different effects, kinda like a game console. But while the somewhat novel approach to building a multi-effects unit may have helped Arcades garner attention, this pedal is no mere gimmick. A post shared by Tom Majeski (@cooper.fx) Tom Majeski of Cooper FX is not the first person to take this approach. Line 6 had its ToneCore line of pedals in the mid'aughts, Elta had the Console and TipTop Audio sells the Z-DSP. But Z-DSP is a eurorack module, not a guitar pedal.

artificial intelligence, generation loss, pedal, (12 more...)

Engadget

Genre: Overview (0.34)

Technology: Information Technology > Artificial Intelligence (0.34)

Add feedback

Real Time Speech Enhancement in the Waveform Domain

Defossez, Alexandre, Synnaeve, Gabriel, Adi, Yossi

arXiv.org Machine LearningSep-6-2020

We present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities. We perform evaluations on several standard benchmarks, both using objective metrics and human judgements. The proposed model matches state-of-the-art performance of both causal and non causal methods while working directly on the raw waveform.

machine learning, natural language, real time system, (17 more...)

arXiv.org Machine Learning

2006.12847

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Speech (0.96)
Information Technology > Architecture > Real Time Systems (0.72)
(2 more...)

Add feedback

sonible's AI Powered smart:reverb Delivers A Custom Reverb For Every Input Signal

#artificialintelligenceAug-15-2020, 03:15:06 GMT

Often you have to be careful whether the AI addition is only a marketing phrase to sell the plugin better or if it's really useful for the musician. There is a Synthesizer plugin on the market where the former has been confirmed. They already showed this with their intelligent EQ plugins. With their new reverb "smart:reverb", they continue this idea and also use their AI. In this case, the technology is used to creating custom-tailored reverb by adjusting its processing to the individual characteristics of the input material.

artificial intelligence, plugin, reverb, (5 more...)

#artificialintelligence

Country: Europe > Austria (0.07)

Technology: Information Technology > Artificial Intelligence (0.74)

Add feedback

Reverb: a framework for experience replay

#artificialintelligenceJul-8-2020, 17:13:31 GMT

The use of experience plays a key role in reinforcement learning (RL). How best to use this data is one of the central problems of this field. As RL agents have advanced over recent years, taking on bigger and more complex problems (Atari, Go, StarCraft, Dota), the generated data has grown in both size and complexity. To cope with this complexity many RL systems split the learning problem into two distinct parts: experience producers (actors) and experience consumers (learners) — allowing these different parts to run in parallel. Often a data storage system lies at the intersection between these two components. The question of how to efficiently store and transport the data is itself a challenging engineering problem.

deep learning, experience replay, reinforcement learning, (3 more...)

#artificialintelligence

Industry: Education > Focused Education > Special Education (0.32)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

Add feedback