lip sync
Style-Preserving Lip Sync via Audio-Aware Style Reference
Weizhi Zhong, Jichang Li, Yinqi Cai, Liang Lin, Guanbin Li
Audio-driven lip sync has recently drawn significant attention due to its widespread application in the multimedia domain. Individuals exhibit distinct lip shapes when speaking the same utterance, owing to their unique speaking styles, which poses a notable challenge for audio-driven lip sync. Earlier methods for this task often bypassed the modeling of personalized speaking styles, resulting in sub-optimal lip sync that conforms only to generic styles. Recent lip sync techniques attempt to guide the lip sync for arbitrary audio by aggregating information from a style reference video, yet they cannot preserve speaking styles well because their style aggregation is inaccurate. This work proposes an audio-aware style reference scheme that effectively leverages the relationships between the input audio and the reference audio from the style reference video to address style-preserving audio-driven lip sync. Specifically, we first develop a Transformer-based model that predicts lip motion corresponding to the input audio, augmented by style information aggregated through cross-attention layers from the style reference video. Afterwards, to render the lip motion into realistic talking-face video, we devise a conditional latent diffusion model that integrates lip motion through modulated convolutional layers and fuses reference facial images via spatial cross-attention layers. Extensive experiments validate the efficacy of the proposed approach in achieving precise lip sync, preserving speaking styles, and generating high-fidelity, realistic talking-face videos.
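The style-aggregation step the abstract describes — queries derived from the input audio attending over features of the style reference — can be sketched as a single cross-attention layer. This is a minimal illustration, not the paper's implementation: the function names, feature dimensions, and the use of plain NumPy are all assumptions for clarity.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax over the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(audio_feats, style_feats):
    """Aggregate style information for each audio frame.

    audio_feats: (T_audio, d) features of the input audio (queries).
    style_feats: (T_style, d) features from the style reference (keys/values).
    Returns a (T_audio, d) style vector per audio frame: a weighted
    average of reference features, weighted by audio-reference similarity.
    """
    d = audio_feats.shape[-1]
    scores = audio_feats @ style_feats.T / np.sqrt(d)   # (T_audio, T_style)
    weights = softmax(scores, axis=-1)                  # rows sum to 1
    return weights @ style_feats                        # (T_audio, d)
```

In the actual model, learned projection matrices would map audio and reference features into query/key/value spaces before this step; the sketch omits them to show only the aggregation mechanism itself.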
A deep learning technique to generate real-time lip sync for live 2-D animation
Live 2-D animation is a fairly new and powerful form of communication that allows human performers to control cartoon characters in real time while interacting and improvising with other actors or members of an audience. Recent examples include Stephen Colbert interviewing cartoon guests on The Late Show, Homer answering live phone-in questions from viewers during a segment of The Simpsons, Archer talking to a live audience at ComicCon, and the stars of Disney's Star vs. The Forces of Evil and My Little Pony hosting live chat sessions with fans via YouTube or Facebook Live. Producing realistic and effective live 2-D animations requires the use of interactive systems that can automatically transform human performances into animations in real time. A key aspect of these systems is attaining a good lip sync, which essentially means that the mouths of animated characters move appropriately when speaking, mimicking the movements observed in the mouths of performers.
- Media > Television (0.55)
- Leisure & Entertainment (0.55)
Canny AI: Imagine world leaders singing
Deep Learning is really starting to establish itself as a major new tool in visual effects. The tools are still in their infancy, but they are changing the way visual effects can be approached. Instead of a pipeline consisting of modelling, texturing, lighting and rendering, these new approaches hallucinate, or plausibly create, imagery based on training data sets. Machine Learning, the superset of Deep Learning, and similar approaches have had great success in image classification, image recognition and image synthesis. At fxguide we covered Synthesia in the UK, a company born out of research first published as Face2Face.
- Asia > South Korea (0.31)
- Europe > United Kingdom (0.25)
- North America > United States (0.15)
- Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.05)
- Media (1.00)
- Leisure & Entertainment (0.96)
- Government > Regional Government > Asia Government (0.30)
- Government > Military > Army (0.30)
Big Mouth Billy Bass is back! New $40 version of hit toy works with Amazon's Alexa smart speaker
It is one of the most irritating toys ever made - and has been given a hi-tech makeover. The original Big Mouth Billy Bass infuriated many with its incessant flapping and singing. Now, it can lip sync to anything Alexa says, and even dance along to music.
'Deep fakes': Sorting fact from fiction in the fake-Obama video era
It always starts with porn. What first revealed the internet's power to distribute information? Porn has historically been a reliable canary in the coal mine, so the "deep fakes" video Vice found in late 2017 has lawmakers paying attention. Using free machine-learning platforms, people on Reddit superimposed the face of Wonder Woman's Gal Gadot on a porn actress's body in a creepy, almost-convincing sex video. Researchers use "Real-time Face Capture" on Russian President Vladimir Putin.
- North America > United States (0.99)
- Asia > Russia (0.58)