Llama 3 RAG Demo with DSPy Optimization, Ollama, and Weaviate!

  Views: 16,418

Connor Shorten



Comments: 26
@cipritom 8 months ago
Is it just me, or is at least the last part a digital avatar?
@LaHoraMaker 8 months ago
Dear Connor, that was the fastest release ever!
@dianaliu7543 8 months ago
This is so different from RAG using GPT. Lots to learn
@catchychazz 8 months ago
Great video, Connor. Have you tested out if SAMMO is better than DSPy for production?
@larsbell1569 8 months ago
I thought SAMMO was primarily prompt templating? Does it have some new features we are not aware of?
@catchychazz 8 months ago
@larsbell1569 No, you're right. I was working on the assumption that in production, you'd be using the most capable language models. Rather than risk running token-heavy pre-compiled DSPy prompts, a simpler prompt that automatically adds few-shot examples only for specific user inputs or triggered events might be a more cost-efficient solution.
@Tarun_Mamidi 8 months ago
Is llama really OSS if we don’t know how or what it is trained on?
@tobkin 8 months ago
Two questions:
- Why use gpt-4 instead of gpt-4-turbo for the teleprompter?
- What are you using to make your pointer act like that?
@VincentFulco 8 months ago
Would love to see an interface to groq please!
@charismaowojoameh7681 7 months ago
I tried the implementation, but I keep getting the error "model not found".
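The comment does not say where "model not found" came from. If it was raised by Ollama, one common cause is that the requested model tag has not been pulled locally. A minimal sketch for checking this, assuming a default local Ollama server at `http://localhost:11434` and its `/api/tags` endpoint for listing local models; the helper names are hypothetical and not part of any library:

```python
import json
from typing import List
from urllib.request import urlopen

# Assumption: Ollama is running locally on its default port.
OLLAMA_TAGS_URL = "http://localhost:11434/api/tags"


def fetch_tags(url: str = OLLAMA_TAGS_URL, timeout: float = 5.0) -> dict:
    """Fetch the JSON listing of locally available models from Ollama."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


def local_model_names(tags_payload: dict) -> List[str]:
    """Extract model names such as 'llama3:latest' from the payload."""
    return [model["name"] for model in tags_payload.get("models", [])]


def is_model_available(tags_payload: dict, wanted: str) -> bool:
    """Check for a model, treating a name without a tag as '<name>:latest'."""
    if ":" not in wanted:
        wanted = wanted + ":latest"
    return wanted in local_model_names(tags_payload)


if __name__ == "__main__":
    # Requires a running Ollama server; raises URLError otherwise.
    payload = fetch_tags()
    print("llama3 available:", is_model_available(payload, "llama3"))
```

If the check prints `False`, running `ollama pull llama3` in a terminal downloads the model.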
@koljanos 8 months ago
Hey, what version of weaviate-client are you using?
@connor-shorten 8 months ago
Hey! I am using v4 and the latest version of dspy-ai. Can you please share any error messages as an Issue on Weaviate recipes? It might be easier to help debug there than in YouTube comments.
@koljanos 8 months ago
All right, will check the issues!
@RichardHamnett 7 months ago
The recipe is gone
@connor-shorten 7 months ago
Hey Richard! Sorry, we refactored recipes! The links are now fixed!
@human_agi 8 months ago
How do I install Ollama?
@420_gunna 8 months ago
this mf cookin
@PeterWilliams97 8 months ago
I ran your notebook and got the following error.

print(RAG()("What is binary quantization?").answer)

AttributeError Traceback (most recent call last)
Cell In[7], line 1
----> 1 print(RAG()("What is binary quantization?").answer)

File ~/code/vector_search/weaviate/recipes/.wenv/lib/python3.11/site-packages/dspy/primitives/program.py:26, in Module.__call__(self, *args, **kwargs)
     25 def __call__(self, *args, **kwargs):
---> 26     return self.forward(*args, **kwargs)

Cell In[6], line 16
     15 def forward(self, question):
---> 16     context = self.retrieve(question).passages
     17     pred = self.generate_answer(context=context, question=question).answer
     18     return dspy.Prediction(context=context, answer=pred, question=question)

File ~/code/vector_search/weaviate/recipes/.wenv/lib/python3.11/site-packages/dspy/retrieve/retrieve.py:30, in Retrieve.__call__(self, *args, **kwargs)
     29 def __call__(self, *args, **kwargs):
---> 30     return self.forward(*args, **kwargs)

File ~/code/vector_search/weaviate/recipes/.wenv/lib/python3.11/site-packages/dspy/retrieve/retrieve.py:39, in Retrieve.forward(self, query_or_queries, k)
     36 # print(queries)
     37 # TODO: Consider removing any quote-like markers that surround the query too.
     38 k = k if k is not None else self.k
---> 39 passages = dsp.retrieveEnsemble(queries, k=k)
     40 return Prediction(passages=passages)
...
     79 .do()
     81 results = results["data"]["Get"][self._weaviate_collection_name]
     82 parsed_results = [result[self._weaviate_collection_text_key] for result in results]

AttributeError: 'WeaviateClient' object has no attribute 'query'
@connor-shorten 8 months ago
Hey Peter! Apologies, we have upgraded the WeaviateRM to use the Weaviate v4 client. Can you please try upgrading dspy with `!pip install dspy-ai --upgrade`?
@connor-shorten 8 months ago
Can you please share any error messages as an Issue on Weaviate recipes? It might be easier to help debug there than in YouTube comments.
@LyuboslavPetrov 8 months ago
@connorshorten6311 Please do update the video with accurate setup instructions. I have been fighting to get this running (DSPy + Weaviate + Ollama) for the past 2-3 hours, to no avail. I tried multiple weaviate-client/server combinations, ran through Docker and standalone, reconfigured, and upgraded and downgraded dspy-ai. I went through more help pages than I can count. I am tired, but I would still like to play with this set of technologies. Thanks
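The `AttributeError` reported in this thread is consistent with a client-version mismatch: the v4 `WeaviateClient` has no `query` attribute, which the v3-style call chain ending in `.do()` relies on. A minimal sketch for checking what is installed before trying further combinations, assuming the PyPI package names `dspy-ai` and `weaviate-client`; the helper names are hypothetical and not part of either library:

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a package, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def major_version(version: str) -> int:
    """Parse the leading major number from a version string like '4.5.4'."""
    return int(version.split(".")[0])


def client_generation(version: str) -> str:
    """Label a weaviate-client version as the v4 API or the older v3-style API."""
    return "v4" if major_version(version) >= 4 else "v3"


if __name__ == "__main__":
    for package in ("dspy-ai", "weaviate-client"):
        print(package, installed_version(package) or "not installed")

    weaviate_version = installed_version("weaviate-client")
    if weaviate_version is not None:
        print("weaviate-client API generation:", client_generation(weaviate_version))
```

If the client reports `v4` while the traceback still shows `client.query`-style code inside dspy, the installed dspy-ai likely predates the WeaviateRM upgrade mentioned above, and `pip install dspy-ai --upgrade` is the suggested fix.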
@MrjbushM 8 months ago
Thank you.
@SolSay 5 months ago
this guy looks like AI)
@leeme179 8 months ago
maybe just me but the blur/smooth filter to the face cam makes me suspect that face cam is AI generated 🤣🤣🤣🤣🤣🤣
@dixon1e 8 months ago
Uncanny valley feel.