一级大片免费_成人免费观看在线_国产一区二区三区精品久久久无广告_久久99精品久久久久久青青91_com.黄_久久久久久久国产免费看

position: EnglishChannel  > News> Chinese AI Model Emu3 Handles Text, Image, Video Seamlessly

Chinese AI Model Emu3 Handles Text, Image, Video Seamlessly

Source: Science and Technology Daily | 2024-10-24 17:41:46 | Author: Gong Qian

Emu3 text-to-image cases. (COURTESY PHOTO)

By GONG Qian

On October 21, the Beijing Academy of Artificial Intelligence (BAAI), a Chinese non-profit organization engaged in AI R&D, released Emu3, a multimodal AI model that seamlessly integrates text, image, and video modalities into a single, unified framework.

The BAAI research team said Emu3 is expected to be used in scenario applications such as robot brains, autonomous driving, multimodal dialogue and inference.

Emu3, based solely on next-token prediction, proves that next-token prediction can be a powerful paradigm for multimodal models.

The existing multimodal AI models are mostly designed for specific tasks. Each has its corresponding architecture and methods. For instance, in the field of video generation, many developers use the diffusion in time (DiT) architecture, as referenced by Sora. Other models such as Stable Diffusion are used for text-to-image synthesis, Sora for text-to-video conversion, and GPT-4V for image-to-text generation.

In contrast to these models, which have a combination of isolated skills rather than an inherently unified ability, Emu3, eliminates the need for diffusion or compositional approaches. By tokenizing images, text, and videos into a discrete space, BAAI has developed a single transformer from scratch.

Emu3 outperforms several well-established task-specific models in both generation and perception tasks, surpassing flagship models such as SDXL and LLaVA.

In September, BAAI open-sourced the key technologies and models of Emu3 including the chat model and generation model after supervised fine-tuning.

Emu3 has been receiving rave reviews from overseas developers. "For researchers, a new opportunity has emerged to explore multimodality through a unified architecture, eliminating the need to combine complex diffusion models with large language models. This approach is akin to the transformative impact of transformers in vision-related tasks," AI consultant Muhammad Umair said on social media platform Meta.

While next-token prediction is considered a promising path towards artificial general intelligence, it struggled to excel in multimodal tasks, which were dominated by diffusion models such as Stable Diffusion and compositional approaches like CLIP combined with large language models.

Raphael Mansuy, co-founder of QuantaLogic, an AI agent platform, thinks that Em3 has significant implications for Al development. Mansuy wrote on X that Em3's success suggests several key insights: Next-token prediction as a viable path to general multimodal Al; potential for simplified and more scalable model architectures; challenge to the dominance of diffusion and compositional approaches.

Editor:GONG Qian

Top News

China Focus: China takes firm countermeasures against U.S. tariff bullying

China has taken swift, firm countermeasures following the latest U.S. tariff hike on Chinese imports, in a move to safeguard its legitimate rights and interests.

2025 ZGC Forum: Gala for Global Sci-tech Cooperation

The 2025 Zhongguancun Forum Annual Conference (2025 ZGC Forum), with a focus on new quality productive forces, concluded on March 31, with significant results and promotion of international sci-tech cooperation.

抱歉,您使用的瀏覽器版本過(guò)低或開(kāi)啟了瀏覽器兼容模式,這會(huì)影響您正常瀏覽本網(wǎng)頁(yè)

您可以進(jìn)行以下操作:

1.將瀏覽器切換回極速模式

2.點(diǎn)擊下面圖標(biāo)升級(jí)或更換您的瀏覽器

3.暫不升級(jí),繼續(xù)瀏覽

繼續(xù)瀏覽
主站蜘蛛池模板: 久久久影片 | 大地资源网在线高清 | 久久久精品有限公司 | 国产日产欧美一区二区三区 | 九色成人免费视频 | 国产日韩精品一区二区三区在线 | 欧美日韩亚洲国内综合网 | 999久久久久久久久 理论片一级片 | 国产精品一区二区三区四 | 国产精品嫩草69影院 | 我有一个朋友在线观看 | 一级做a爰全过程免费视频毛片 | 成人成人成人在线视频 | 姑娘第6集在线观看免费播放 | 国产天堂第一区 | 亚洲午夜久久久久久久 | 777色狠狠一区二区三区 | 可以免费观看的av | 国产精品自拍区 | 成人91 | 日韩夜夜操 | 好看的大地资源最新大地资源 | 国产精品v欧美精品 | 国产欧美日韩中文 | 国产视频网站一区二区三区 | 欧美精彩视频一区二区三区 | 日本韩国一区 | 黑人精品欧美一区二区蜜桃 | 一区观看| 520av视频| 一区二区高清视频在线观看 | 99热九九这里只有精品10 | 亚洲九九爱 | av高清一区二区三区 | 亲含舔丰满湿插 | 日韩xxxxxxxxx| 天天好逼综合 | 麻豆第一区mv免费观看网站 | 性欧美另类 | av免费网站在线 | 91亚洲精品久久 |