[ACL 2024 - Outstanding Paper] Speech Translation with Speech Foundation Models and LLMs

  30 views

Sara Papi

1 day ago

The paper won both an Outstanding Paper Award and a Senior Area Chair Award, and was selected for both oral and poster presentation at ACL 2024 (top 8%).
- Title: Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?
- Authors: Marco Gaido, Sara Papi, Matteo Negri, Luisa Bentivogli
- Abstract:
The field of natural language processing (NLP) has recently witnessed a transformative shift with the emergence of foundation models, particularly Large Language Models (LLMs) that have revolutionized text-based NLP. This paradigm has extended to other modalities, including speech, where researchers are actively exploring the combination of Speech Foundation Models (SFMs) and LLMs into single, unified models capable of addressing multimodal tasks. Among such tasks, this paper focuses on speech-to-text translation (ST). By examining the published papers on the topic, we propose a unified view of the architectural solutions and training strategies presented so far, highlighting similarities and differences among them. Based on this examination, we not only organize the lessons learned but also show how diverse settings and evaluation approaches hinder the identification of the best-performing solution for each architectural building block and training choice. Lastly, we outline recommendations for future works on the topic aimed at better understanding the strengths and weaknesses of the SFM+LLM solutions for ST.
- Paper: aclanthology.o...
