Fellowship: Visual ChatGPT, Talking, Drawing and Editing with Visual Foundation Models

  Рет қаралды 215

Launchpad

Launchpad

10 ай бұрын

#arxiv #artificialintelligence #datascience #machinelearning #deeplearning #conversationalAI #VisualChatGPT
Link to paper: arxiv.org/pdf/2303.04671.pdf
Paper by: Chenfei Wu, Shengming Yin, Weizhen Qi, Xiaodong Wang, Zecheng Tang, Nan Duan from Microsoft Research Asia.
Presentation by Fellowship.ai team: www.fellowship.ai/
Fellowship.ai is brought to you by Launchpad.ai: www.launchpad.ai/
Launchpad brings cutting-edge technologies and AI applications to organizations, to learn more about our products and services check: www.launchpad.ai/ai-developme...
Abstract: This paper introduces Visual ChatGPT, an innovative approach enhancing the capabilities of ChatGPT to process language and images. The system incorporates Visual Foundation Models, allowing users to interact with ChatGPT through complex visual questions or visual editing instructions that require the collaboration of multiple AI models over multiple steps. It also allows users to provide feedback and ask for corrected results. Experiments reveal that Visual ChatGPT enhances the understanding of the visual roles of ChatGPT with the help of Visual Foundation Models/
Code and demo are available at github.com/microsoft/visual-c....

Пікірлер
Generative AI in a Nutshell - how to survive and thrive in the age of AI
17:57
Escape From Spike With Herobrine and Entity
00:27
Garri Creative
Рет қаралды 13 МЛН
Genial gadget para almacenar y lavar lentes de Let's GLOW
00:26
Let's GLOW! Spanish
Рет қаралды 7 МЛН
Help Herobrine Escape From Spike
00:28
Garri Creative
Рет қаралды 45 МЛН
Jeff Dean (Google): Exciting Trends in Machine Learning
1:12:30
Rice Ken Kennedy Institute
Рет қаралды 162 М.
What Happened To Google Search?
14:05
Enrico Tartarotti
Рет қаралды 3 МЛН
How ChatGPT Works Technically For Beginners
33:11
Kurdiez
Рет қаралды 1 МЛН
Neil deGrasse Tyson Explains The Three-Body Problem
11:45
StarTalk
Рет қаралды 2,1 МЛН
I Made a Graph of Wikipedia... This Is What I Found
19:44
adumb
Рет қаралды 1,6 МЛН
What's the future for generative AI? - The Turing Lectures with Mike Wooldridge
1:00:59
What's Really Happening At CERN
17:41
Cleo Abram
Рет қаралды 384 М.
How we teach computers to understand pictures | Fei Fei Li
18:03
respect 35
0:40
BD Social Media
Рет қаралды 22 МЛН
ОЦЕНКА (смешное видео, приколы, поржать, школьный юмор)
0:59
Натурал Альбертович
Рет қаралды 1,8 МЛН