Goto

Collaborating Authors

 speech


Trump's new world order has become real and Europe is having to adjust fast

BBC News

Trump's new world order has become real and Europe is having to adjust fast Downtown Munich is best-known for chic shops and flashy fast cars but right now its streets are bedecked with posters advertising next generation drones. Europe's security under construction boasts the slogan on an eye-catching set of sleek black-and-white photographs, festooned across a scaffolding-clad church on one of this town's best known pedestrian boulevards. Such an unapologetic public display of military muscle would have been unimaginable here just a few years ago, but the world outside Germany is changing fast, and taking this country with it. The southern region of Bavaria has become Germany's leading defence technology hub, focusing on AI, drones and aerospace. People here, like most other Europeans, say they feel increasingly exposed - squeezed between an expansionist Russia and an economically aggressive China to the east, and an increasingly unpredictable, former best pal, the United States, to the west.


Words Without Consequence

The Atlantic - Technology

What does it mean to have speech without a speaker? For the first time, speech has been decoupled from consequence. We now live alongside AI systems that converse knowledgeably and persuasively--deploying claims about the world, explanations, advice, encouragement, apologies, and promises--while bearing no vulnerability for what they say. Millions of people already rely on chatbots powered by large language models, and have integrated these synthetic interlocutors into their personal and professional lives. An LLM's words shape our beliefs, decisions, and actions, yet no speaker stands behind them. This dynamic is already familiar in everyday use. A chatbot gets something wrong. When corrected, it apologizes and changes its answer.



World's rules-based order 'no longer exists', Germany's Merz warns

BBC News

The rules-based world order no longer exists, the German Chancellor has warned at a major security summit. Opening the annual Munich Security Conference, Friedrich Merz told other world leaders that our freedom is not guaranteed in an era of big power politics, and that Europeans must be ready to make sacrifice. He also admitted that a deep divide has opened between Europe and the United States. The conference is taking place on the backdrop of US President Donald Trump threatening Denmark's sovereignty over Greenland by pledging to annex the Arctic territory and his tariffs on imports from European nations. US Secretary of State Marco Rubio, who was listening to Merz and will deliver his own speech on Saturday, earlier spoke of a new era in geopolitics.



Neural Dubber: Dubbing for Videos According to Scripts

Neural Information Processing Systems

Dubbing is a post-production process of re-recording actors' dialogues, which is extensively used in filmmaking and video production. It is usually performed manually by professional voice actors who read lines with proper prosody, and in synchronization with the pre-recorded videos. In this work, we propose Neural Dubber, the first neural network model to solve a novel automatic video dubbing (AVD) task: synthesizing human speech synchronized with the given video from the text. Neural Dubber is a multi-modal text-to-speech (TTS) model that utilizes the lip movement in the video to control the prosody of the generated speech. Furthermore, an image-based speaker embedding (ISE) module is developed for the multi-speaker setting, which enables Neural Dubber to generate speech with a reasonable timbre according to the speaker's face. Experiments on the chemistry lecture single-speaker dataset and LRS2 multi-speaker dataset show that Neural Dubber can generate speech audios on par with state-of-the-art TTS models in terms of speech quality. Most importantly, both qualitative and quantitative evaluations show that Neural Dubber can control the prosody of synthesized speech by the video, and generate high-fidelity speech temporally synchronized with the video.



Bill Maher roasts Billie Eilish's anti-ICE Grammys speech: 'Knowledge' matters

FOX News

This material may not be published, broadcast, rewritten, or redistributed. Quotes displayed in real-time or delayed by at least 15 minutes. Market data provided by Factset . Powered and implemented by FactSet Digital Solutions . Mutual Fund and ETF data provided by LSEG .


Mistral's New Ultra-Fast Translation Model Gives Big AI Labs a Run for Their Money

WIRED

Mistral's New Ultra-Fast Translation Model Gives Big AI Labs a Run for Their Money "Too many GPUs makes you lazy," says the French startup's vice president of science operations, as the company carves out a different path than the major US AI companies. Mistral AI has released a new family of AI models that it claims will clear the path to seamless conversation between people speaking different languages . On Wednesday, the Paris-based AI lab released two new speech-to-text models: Voxtral Mini Transcribe V2 and Voxtral Realtime. The former is built to transcribe audio files in large batches and the latter for nearly real-time transcription, within 200 milliseconds; both can translate between 13 languages. Voxtral Realtime is freely available under an open source license.


AI wearable helps stroke survivors speak again

FOX News

Revoice wearable device helps stroke survivors with dysarthria communicate naturally. Cambridge researchers developed throat-sensing technology achieving 4.2% word error rates.