博彩评级网-博彩网_百家乐投资_全讯网新2

position: EnglishChannel  > News> Upload a Photo, Get a Video

Upload a Photo, Get a Video

Source: Science and Technology Daily | 2025-06-11 11:29:11 | Author: LI LInxu

The rapid developments in AI have unlocked new possibilities for digital representation. With the help of AI models, you can now achieve a remarkable feat: bringing characters to life with just an image and an audio clip.

Jointly developed by Tencent Hunyuan and Tencent Music, the newly released HunyuanVideo-Avatar, a multimodal diffusion transformer-based model, is capable of simultaneously generating dynamic, emotion-controllable, and multi-character dialogue videos. This capability supports head-and-shoulder, half-body, and full-body views, encompassing multiple styles, species, and even dual-character scenes.

To put it simply, you just upload a photo and a voice clip, and the model figures out the context, emotion and lip movements to create a realistic animated video.

For instance, if you upload an image of a woman sitting on a beach with a guitar, along with a piece of lyrical music,  the model understands the scene as "a woman playing the guitar and singing a lyrical song by the sea," and subsequently generates a video of the woman performing the song.

The model provides video creators with highly consistent and dynamic video generation capabilities. Its versatility can unlock a myriad of applications in fields like entertainment, media, e-commerce, advertising and education.

It has already been applied in multiple scenarios within Tencent Music, such as AI companions for music listening, long-form audio podcasts, and music videos (MVs).

For example, on the app QQ Music, when users listen to songs by "AI Leehom" (a fully AI-driven singer created by Tencent Music and Team Leehom), a lively and adorable AI Leehom image synchronizes its singing in real-time on the player.

On WeSing, a popular karaoke singing app, users can upload their images to generate personalized MVs of themselves singing.

In subject consistency and audio-video synchronization, the HunyuanVideo-Avatar shows top-tier industry performance. For video dynamics and natural body movements, it exceeds open-source solutions and rivals closed-source ones.

Currently, the model supports audio uploads of up to 14 seconds for video generation, with more capabilities to be released and open-sourced in the future.

Editor:李林旭

Top News

Energy Cooperation Gets New Direction

?Chinese President Xi Jinping sent a congratulatory message to the 7th China-Russia Energy Business Forum in Beijing on November 25, sparking enthusiastic responses from various sectors in both countries.

WEEKLY REVIEW (Dec.3-10)

Liang Wenfeng, founder and CEO of the Chinese AI firm DeepSeek, and "deep diver" Chinese geoscientist Du Mengran are on the annual "Nature's 10" list, which highlights 10 people at the heart of some of the biggest science stories of 2025.

抱歉,您使用的瀏覽器版本過低或開啟了瀏覽器兼容模式,這會影響您正常瀏覽本網頁

您可以進行以下操作:

1.將瀏覽器切換回極速模式

2.點擊下面圖標升級或更換您的瀏覽器

3.暫不升級,繼續瀏覽

繼續瀏覽
上海百家乐的玩法技巧和规则| bet365体育投注提款要几天| 百家乐平台网| 佳豪国际娱乐| 怎么玩百家乐网上赌博| 沧源| 利都百家乐国际娱乐平台| 菲律宾百家乐官网试玩| 金木棉百家乐网络破解| 百家乐官网娱乐城官方网| 太阳城伞| 喜来登百家乐官网的玩法技巧和规则| 利记国际娱乐| 百家乐路单显示程序| 百家乐官网平台有什么优势 | 利津县| 大发888下载 大发888游戏平台| 百家乐官网专用| 百家乐官网路单资料| A8百家乐官网赌场娱乐网规则| 高尔夫百家乐的玩法技巧和规则| 真人百家乐官网赌注| 棋牌评测网xjqppc| 百家乐扑克多少张| 澳门百家乐官网有限公司| 视频棋牌游戏大厅| 百家乐最好的投注方法| 足球.百家乐官网投注网出租| 荥经县| 百家乐官网游戏机路法| 易发国际| 大发888网页版| 百家乐遥控洗牌器| 在线百家乐有些一| 千亿娱百家乐官网的玩法技巧和规则| 百家乐官网的规则博彩正网 | 百家乐官网桌子黑色| 澳门百家乐官网玩大小| 百家乐德州扑克发牌盒| 百家乐和的几率| 太阳城网站|