Chinese AI Model Emu3 Handles Text, Image, Video Seamlessly

Source: Science and Technology Daily | 2024-12-17 15:44:35 | Author: Gong Qian

On October 21, the Beijing Academy of Artificial Intelligence (BAAI), a Chinese non-profit organization engaged in AI R&D, released Emu3, a multimodal AI model that seamlessly integrates text, image, and video modalities into a single, unified framework.

The BAAI research team said Emu3 is expected to be used in applications such as robot brains, autonomous driving, multimodal dialogue and inference.

Emu3, based solely on next-token prediction, shows that this approach can be a powerful paradigm for multimodal models.

Existing multimodal AI models are mostly designed for specific tasks, each with its own architecture and methods. For instance, in the field of video generation, many developers use the Diffusion Transformer (DiT) architecture, as used by Sora. Stable Diffusion is used for text-to-image synthesis, Sora for text-to-video generation, and GPT-4V for image-to-text generation.

In contrast to these models, which combine isolated skills rather than offering an inherently unified ability, Emu3 eliminates the need for diffusion or compositional approaches. By tokenizing images, text, and videos into a discrete space, BAAI has trained a single transformer from scratch.
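The idea of placing text and images in one discrete token space and training on next-token prediction can be sketched roughly as follows. This is a toy illustration, not Emu3's actual tokenizer or training code: the vocabulary sizes, the `BOI`/`EOI` marker tokens, and the helper names are all hypothetical, chosen only to show how different modalities might share a single sequence.

```python
# Toy illustration only: hypothetical vocabulary sizes and special tokens,
# not Emu3's actual tokenizer or vocabulary layout.
TEXT_VOCAB_SIZE = 1000       # assumed size of the text vocabulary
VISION_CODEBOOK_SIZE = 512   # assumed size of the visual codebook
BOI = TEXT_VOCAB_SIZE + VISION_CODEBOOK_SIZE  # hypothetical "begin of image" marker
EOI = BOI + 1                                 # hypothetical "end of image" marker


def vision_to_shared(codes):
    """Shift visual codebook indices so they do not collide with text IDs."""
    return [TEXT_VOCAB_SIZE + c for c in codes]


def build_sequence(text_ids, image_codes):
    """Concatenate text tokens and image tokens into one flat sequence."""
    return text_ids + [BOI] + vision_to_shared(image_codes) + [EOI]


def next_token_pairs(sequence):
    """Return (context, target) pairs: each token is predicted from its prefix."""
    return [(sequence[:i], sequence[i]) for i in range(1, len(sequence))]


text_ids = [5, 42, 7]          # pretend these encode a caption
image_codes = [3, 511, 0, 8]   # pretend these are quantized image patches
sequence = build_sequence(text_ids, image_codes)
pairs = next_token_pairs(sequence)
print(sequence)
print(pairs[3])
```

In this sketch a single model would see the same kind of training target whether the next token belongs to the caption or to the image, which is the sense in which one transformer can cover several modalities.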

Emu3 outperforms several well-established task-specific models in both generation and perception tasks, surpassing flagship models such as SDXL and LLaVA.

In September, BAAI open-sourced the key technologies and models of Emu3 including the chat model and generation model after supervised fine-tuning.

Emu3 has been receiving rave reviews from overseas developers. "For researchers, a new opportunity has emerged to explore multimodality through a unified architecture, eliminating the need to combine complex diffusion models with large language models. This approach is akin to the transformative impact of transformers in vision-related tasks," AI consultant Muhammad Umair said on social media platform Meta.

While next-token prediction is considered a promising path towards artificial general intelligence, it has struggled to excel in multimodal tasks, which have been dominated by diffusion models such as Stable Diffusion and compositional approaches like CLIP combined with large language models.

Raphael Mansuy, co-founder of QuantaLogic, an AI agent platform, thinks that Emu3 has significant implications for AI development. Mansuy wrote on X that Emu3's success suggests several key insights: next-token prediction as a viable path to general multimodal AI; the potential for simplified and more scalable model architectures; and a challenge to the dominance of diffusion and compositional approaches.

Editor: GONG Qian
