AITopics | non-parallel data

Collaborating Authors

non-parallel data

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

51ae7d9db3423ae96cd6afeb01529819-Paper-Conference.pdf

Neural Information Processing SystemsFeb-8-2026, 23:27:57 GMT

aclanthology, dataset, reward function, (14 more...)

Neural Information Processing Systems

Country:

North America > Canada > Alberta (0.14)
Europe > Romania > Sud-Est Development Region > Tulcea County > Tulcea (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(3 more...)

Add feedback

Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion

Neural Information Processing SystemsDec-25-2025, 17:48:18 GMT

End-to-end models for raw audio generation are a challenge, specially if they have to work with non-parallel data, which is a desirable setup in many situations. Voice conversion, in which a model has to impersonate a speaker in a recording, is one of those situations. In this paper, we propose Blow, a single-scale normalizing flow using hypernetwork conditioning to perform many-to-many voice conversion between raw audio. Blow is trained end-to-end, with non-parallel data, on a frame-by-frame basis using a single speaker identifier. We show that Blow compares favorably to existing flow-based architectures and other competitive baselines, obtaining equal or better performance in both objective and subjective evaluations. We further assess the impact of its main components with an ablation study, and quantify a number of properties such as the necessary amount of training data or the preference for source or target speakers.

name change, non-parallel raw-audio voice conversion, single-scale hyperconditioned flow, (2 more...)

Neural Information Processing Systems

Country: South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.08)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.41)

Add feedback

51ae7d9db3423ae96cd6afeb01529819-Paper-Conference.pdf

Neural Information Processing SystemsAug-14-2025, 20:20:55 GMT

aclanthology, dataset, reward function, (14 more...)

Neural Information Processing Systems

Country:

North America > Canada > Alberta (0.14)
Europe > Romania > Sud-Est Development Region > Tulcea County > Tulcea (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(3 more...)

Add feedback

Transfer the linguistic representations from TTS to accent conversion with non-parallel data

Chen, Xi, Pei, Jiakun, Xue, Liumeng, Zhang, Mingyang

arXiv.org Artificial IntelligenceJan-7-2024

Accent conversion aims to convert the accent of a source speech to a target accent, meanwhile preserving the speaker's identity. This paper introduces a novel non-autoregressive framework for accent conversion that learns accent-agnostic linguistic representations and employs them to convert the accent in the source speech. Specifically, the proposed system aligns speech representations with linguistic representations obtained from Text-to-Speech (TTS) systems, enabling training of the accent voice conversion model on non-parallel data. Furthermore, we investigate the effectiveness of a pretraining strategy on native data and different acoustic features within our proposed framework. We conduct a comprehensive evaluation using both subjective and objective metrics to assess the performance of our approach. The evaluation results highlight the benefits of the pretraining strategy and the incorporation of richer semantic features, resulting in significantly enhanced audio quality and intelligibility.

accent conversion, representation, speech, (14 more...)

arXiv.org Artificial Intelligence

2401.03538

Country:

Asia > China > Guangdong Province > Shenzhen (0.05)
Asia > Japan > Honshū > Tōhoku > Iwate Prefecture > Morioka (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.69)

Add feedback

Teacher Forcing Recovers Reward Functions for Text Generation

Hao, Yongchang, Liu, Yuxin, Mou, Lili

arXiv.org Artificial IntelligenceJan-18-2023

Reinforcement learning (RL) has been widely used in text generation to alleviate the exposure bias issue or to utilize non-parallel datasets. The reward function plays an important role in making RL training successful. However, previous reward functions are typically task-specific and sparse, restricting the use of RL. In our work, we propose a task-agnostic approach that derives a step-wise reward function directly from a model trained with teacher forcing. We additionally propose a simple modification to stabilize the RL training on non-parallel datasets with our induced reward function. Empirical results show that our method outperforms self-training and reward regression methods on several text generation tasks, confirming the effectiveness of our reward function.

machine learning, natural language, reinforcement learning, (17 more...)

arXiv.org Artificial Intelligence

2210.08708

Country:

North America > Canada > Alberta (0.14)
Oceania > Australia (0.04)
Europe > Romania > Sud-Est Development Region > Tulcea County > Tulcea (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.90)
(2 more...)

Add feedback

Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion

Serrà, Joan, Pascual, Santiago, Perales, Carlos Segura

Neural Information Processing SystemsMar-18-2020, 23:17:14 GMT

non-parallel data, non-parallel raw-audio voice conversion, single-scale hyperconditioned flow

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.31)

Add feedback

Bilingual Lexicon Induction from Non-Parallel Data with Minimal Supervision

Zhang, Meng (Tsinghua University) | Peng, Haoruo (University of Illinois, Urbana-Champaign) | Liu, Yang (Tsinghua University) | Luan, Huanbo (Tsinghua University) | Sun, Maosong (Tsinghua University)

AAAI ConferencesFeb-14-2017

Building bilingual lexica from non-parallel data is a long-standing natural language processing research problem that could benefit thousands of resource-scarce languages which lack parallel data. Recent advances of continuous word representations have opened up new possibilities for this task, e.g. by establishing cross-lingual mapping between word embeddings via a seed lexicon. The method is however unreliable when there are only a limited number of seeds, which is a reasonable setting for resource-scarce languages. We tackle the limitation by introducing a novel matching mechanism into bilingual word representation learning. It captures extra translation pairs exposed by the seeds to incrementally improve the bilingual word embeddings. In our experiments, we find the matching mechanism to substantially improve the quality of the bilingual vector space, which in turn allows us to induce better bilingual lexica with seeds as few as 10.

artificial intelligence, machine learning, natural language, (16 more...)

AAAI Conferences

Thirty-First AAAI Conference on Artificial Intelligence

Country: Asia > China (0.28)

Genre: Research Report > New Finding (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.68)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.47)

Add feedback