Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations
–Neural Information Processing Systems
We present a neural analysis and synthesis (NANSY) framework that can manipulate voice, pitch, and speed of an arbitrary speech signal. Most of the previous works have focused on using information bottleneck to disentangle analysis features for controllable synthesis, which usually results in poor reconstruction quality.
Neural Information Processing Systems
Nov-14-2025, 23:07:37 GMT
- Country:
- Asia
- Europe > Germany
- Berlin (0.04)
- North America > United States
- Minnesota > Hennepin County > Minneapolis (0.14)
- South America > Chile
- Genre:
- Research Report (0.68)
- Industry:
- Information Technology > Security & Privacy (0.46)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning > Neural Networks (1.00)
- Natural Language (1.00)
- Speech > Speech Recognition (0.68)
- Information Technology > Artificial Intelligence