How to Make Small Language Models Work. Yejin Choi Presents at Data + AI Summit 2024

  Рет қаралды 3,263

Databricks

Databricks

11 күн бұрын

Speaker: Yejin Choi, Professor and MacArthur Fellow at the University of Washington, and Senior Research Director for Commonsense AI at AI2

Пікірлер: 6
@user-wr4yl7tx3w
@user-wr4yl7tx3w 8 күн бұрын
anyone has the link to the paper on archive?
@BeginnerAlchemist
@BeginnerAlchemist 6 күн бұрын
Impossible Distillation for Paraphrasing and Summarization: How to Make High-quality Lemonade out of Small, Low-quality Models
@BeginnerAlchemist
@BeginnerAlchemist 6 күн бұрын
I have a question: why we try to research Small-LM just to avoid using GPUs? If we want to save the money for training, we can do the research for how to make GPU or model more effectively, not to avoid using higher techs.
@DamaruM
@DamaruM 5 күн бұрын
GPU= power consumption
@tulikabose5120
@tulikabose5120 2 күн бұрын
It's not just for GPUs...Small-LM has its own market for on-device or on-edge processing, where there are concerns of privacy and customers would not want their data to go to clouds, and secondly in many industrial use-cases where internet and cloud access isn't accessible due to the remote nature of the use-case, and model inference needs to be done on device...The demand for SLMs is increasing in such use cases...Many big tech companies are not just working on LLMs but also on SLMs under the hood as both of them have to co-exist to cater to different user requirements.
@BeginnerAlchemist
@BeginnerAlchemist 2 күн бұрын
@@tulikabose5120 Thank you, I see. It is useful for small devices with limited calculation hardware and the privacy. That's true. So many LLM need a huge data to train and it should collect people's private info to become stronger. That's hated by most of people.
Always be more smart #shorts
00:32
Jin and Hattie
Рет қаралды 29 МЛН
The day of the sea 🌊 🤣❤️ #demariki
00:22
Demariki
Рет қаралды 78 МЛН
СНЕЖКИ ЛЕТОМ?? #shorts
00:30
Паша Осадчий
Рет қаралды 8 МЛН
Мы никогда не были так напуганы!
00:15
Аришнев
Рет қаралды 1,6 МЛН
Generative AI in a Nutshell - how to survive and thrive in the age of AI
17:57
What Is an AI Anyway? | Mustafa Suleyman | TED
22:02
TED
Рет қаралды 1,1 МЛН
Fine-tuning Large Language Models (LLMs) | w/ Example Code
28:18
Shaw Talebi
Рет қаралды 256 М.
Is AGI Just a Fantasy?
41:26
Machine Learning Street Talk
Рет қаралды 37 М.
The Evolution of Delta Lake from Data + AI Summit 2024
16:07
Databricks
Рет қаралды 1,7 М.
Data + AI Summit 2024 - Keynote Day 2 - Full
2:15:38
Databricks
Рет қаралды 11 М.
Andrew Ng On AI Agentic Workflows And Their Potential For Driving AI Progress
30:54
💅🏻Айфон vs Андроид🤮
0:20
Бутылочка
Рет қаралды 690 М.