Emerging architectures for LLM applications

  Рет қаралды 50,660

Superwise

Superwise

Күн бұрын

Everything from training models from scratch and fine-tuning open-source models to using hosted APIs, with a particular emphasis on the design pattern of in-context learning.
Key topics we'll cover during the session include:
- Data preprocessing and embedding, focusing on the role of contextual data, embeddings, and vector databases in creating effective LLM applications.
- Strategies for prompt construction and retrieval, which are becoming increasingly complex and critical for product differentiation.
- Prompt execution and inference, analyzing the leading language model providers, their models, and tools used for logging, tracking, and evaluation of LLM outputs.
- Hosting solutions for LLMs, comparing the common solutions and emerging tools for easier and more efficient hosting of LLM applications.
Whether you're a seasoned AI professional, a developer beginning your journey with LLMs, or simply an enthusiast interested in the applications of AI, this webinar offers valuable insights that can help you navigate the rapidly evolving landscape of LLMs.
Follow along with the slides here go.superwise.a...

Пікірлер: 33
@MattHabermehl
@MattHabermehl Жыл бұрын
4k views and only 2 comments. This is the best KZbin video I've seen by far on these strategies. Great content - thank you so much for sharing your expertise!
@investigativeinterviewing4617
@investigativeinterviewing4617 Жыл бұрын
This is one of the best webinars I have seen on this topic. Great slides and presenters!
@vakman9497
@vakman9497 11 ай бұрын
I was very pleased to see how well everything was broken down! I was also shook to see a lot of the architecture strategies were things we were already implementing at our company so I'm happy to see we are on the right track 😅
@maria-wh3km
@maria-wh3km Ай бұрын
it was awesome, thanks guys, keep up the good work.
@dr-maybe
@dr-maybe Жыл бұрын
Very interesting, thanks for sharing
@vikassalaria24
@vikassalaria24 Жыл бұрын
Really great presentation.Keep up the good work
@williampourmajidi4710
@williampourmajidi4710 Жыл бұрын
🎯 Key Takeaways for quick navigation: 00:00 📚 Introduction to the topic of emerging architectures for LLM applications. 01:54 🧐 Why focus on LLM architectures. 04:02 📊 Audience poll on LLM use cases. 05:17 🧠 Retrieval Augmented Generation (RAG) as a design pattern. 08:05 💡 Advanced techniques in RAG and architectural considerations. 14:40 📦 Orchestration and addressing complex tasks with LLMs. 23:53 🧩 LLMs in Intermediate Summarization 26:43 📊 Monitoring in LLM Architecture 32:04 🛠️ LLM Agents and Tools 39:05 🔄 Improving LLM Inference Speed 49:26 🛡️ OpenAI's ChatGPT and its relevance in the field, 50:12 🌐 Evolution of ChatGPT and the AI landscape, 51:09 💼 OpenAI's models and their resource allocation, 52:16 🏢 Factors influencing model choice: Engineering, economy, and legal considerations, Made with HARPA AI
@vichitravirdwivedi
@vichitravirdwivedi 7 ай бұрын
crazy
@todd-alex
@todd-alex Жыл бұрын
Very informative. Several layers of LLM architectures need to be simplified like this. Maybe a standard for XAI should be developed based on a simplified architectural stack like this for LLMs.
@IsraelDavid-z8g
@IsraelDavid-z8g Жыл бұрын
Wonderful video, learns a lot, thanks. This vieo was great! Thank you so much..
@hidroman1993
@hidroman1993 Жыл бұрын
So informative, looking forward to seeing more
@mayurpatilprince2936
@mayurpatilprince2936 11 ай бұрын
Informative video ... Waiting for next video :)
@_rjlynch
@_rjlynch 11 ай бұрын
Very informative, thanks!
@billykotsos4642
@billykotsos4642 11 ай бұрын
Great talk !
@vladimirobellini6128
@vladimirobellini6128 8 ай бұрын
great ideas txs!
@afederici75
@afederici75 Жыл бұрын
This vieo was great! Thank you so much.
@HodgeLukeCEO
@HodgeLukeCEO Жыл бұрын
Can you make the slides available? I have an issue seeing them and following along.
@superwiseai
@superwiseai Жыл бұрын
No problem here you go - go.superwise.ai/hubfs/PDF%20assets/LLM%20Architectures_8.8.2023.pdf
@RiazLaghari
@RiazLaghari 7 ай бұрын
Great!
@VaibhavPatil-rx7pc
@VaibhavPatil-rx7pc Жыл бұрын
Excellent detailed information thanks, please share slide details,
@superwiseai
@superwiseai Жыл бұрын
Thank you! You can access the slides here - go.superwise.ai/hubfs/PDF%20assets/LLM%20Architectures_8.8.2023.pdf
@GigaFro
@GigaFro Жыл бұрын
Can someone provide an example of how one might introduce time as a factor in the embedding?
@serkanserttop1
@serkanserttop1 Жыл бұрын
It would be in a meta field that you use to filter results, not in the vector embeddings itself.
@Aidev7876
@Aidev7876 Жыл бұрын
Honestly. Not huge value for 55 minutes,,,
@k.8597
@k.8597 Жыл бұрын
these videos seldom are.. lol.
@chirusikar
@chirusikar 10 ай бұрын
Total gibberish in this video
@MengGe-s8l
@MengGe-s8l Жыл бұрын
Wonderful video, learns a lot, thanks
@sunnychopper6663
@sunnychopper6663 Жыл бұрын
Really informative video. It will be interesting to see how different layers are formed throughout the coming months. Given the complexities of RAG, it'd be interesting to see hosted solutions that can offer competitive pricing on a RAG engine.
@MMABeijing
@MMABeijing Жыл бұрын
That was very nice, thank you all
@zhw7635
@zhw7635 Жыл бұрын
Nice to see these topics covered, these come up as soon as I was attempting to implement something with llms
@salahuddeenilyasu4018
@salahuddeenilyasu4018 Жыл бұрын
I am curious to know what you are trying to implement.
Unraveling prompt engineering
1:07:19
Superwise
Рет қаралды 1,8 М.
[Webinar] LLMs for Evaluating LLMs
49:07
Arthur
Рет қаралды 10 М.
Миллионер | 1 - серия
34:31
Million Show
Рет қаралды 1,5 МЛН
Prompt Engineering, RAG, and Fine-tuning: Benefits and When to Use
15:21
AI Pioneer Shows The Power of AI AGENTS - "The Future Is Agentic"
23:47
Building Production-Ready RAG Applications: Jerry Liu
18:35
AI Engineer
Рет қаралды 314 М.
How to build Multimodal Retrieval-Augmented Generation (RAG) with Gemini
34:22
Google for Developers
Рет қаралды 61 М.
What are AI Agents?
12:29
IBM Technology
Рет қаралды 475 М.
Cognitive Architectures for Language Agents
57:16
LangChain
Рет қаралды 10 М.
Create fine-tuned models with NO-CODE for Ollama & LMStudio!
21:52
Tim Carambat
Рет қаралды 31 М.
GraphRAG: LLM-Derived Knowledge Graphs for RAG
15:40
Alex Chao
Рет қаралды 114 М.
😱ЭТО СМАРТФОНЫ SAMSUNG!
1:00
Thebox - о технике и гаджетах
Рет қаралды 1,9 МЛН
D3 XIAOMI SU7 MAX
14:25
smotraTV
Рет қаралды 591 М.
Mac USB
0:59
Alina Saito / 斎藤アリーナ
Рет қаралды 21 МЛН