Llama 3 RAG Demo with DSPy Optimization, Ollama, and Weaviate!

Views: 16,418

Connor Shorten

Days ago

Comments: 26

@cipritom 8 months ago

Is it just me, or is at least the last part a digital avatar?

@LaHoraMaker 8 months ago

Dear Connor, that was the fastest release ever!

@dianaliu7543 8 months ago

This is so different from RAG using GPT. Lots to learn

@catchychazz 8 months ago

Great video, Connor. Have you tested out if SAMMO is better than DSPy for production?

@larsbell1569 8 months ago

I thought SAMMO was primarily prompt templating? Does it have some new features we are not aware of?

@catchychazz 8 months ago

@larsbell1569 No, you're right. I was working on the assumption that in production you'd be using the most capable language models. Given the risk of running token-heavy pre-compiled DSPy prompts, a simpler prompt that adds few-shot examples only for specific user inputs or triggered events might be a more cost-efficient solution.
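The trade-off described in this reply can be sketched in plain Python. This is a minimal illustration only, not SAMMO's or DSPy's API: the trigger keywords, the demo question/answer pairs, and the function names `needs_demos` and `build_prompt` are all hypothetical.

```python
# Hypothetical sketch: attach few-shot demos only when a trigger fires,
# so routine questions pay for a short prompt. Not SAMMO or DSPy code.

FEW_SHOT_DEMOS = [
    ("What is HNSW?", "A graph-based index for approximate nearest-neighbor search."),
    ("What is BM25?", "A keyword-based ranking function used in lexical search."),
]

# Illustrative trigger words; a real system might use a classifier instead.
TRIGGERS = ("quantization", "index", "rerank")


def needs_demos(question: str) -> bool:
    """Return True when the question contains any trigger keyword."""
    lowered = question.lower()
    return any(trigger in lowered for trigger in TRIGGERS)


def build_prompt(question: str) -> str:
    """Build a prompt, adding few-shot demos only for triggered inputs."""
    parts = ["Answer the question concisely."]
    if needs_demos(question):
        for demo_q, demo_a in FEW_SHOT_DEMOS:
            parts.append(f"Question: {demo_q}\nAnswer: {demo_a}")
    parts.append(f"Question: {question}\nAnswer:")
    return "\n\n".join(parts)


if __name__ == "__main__":
    short_prompt = build_prompt("Who created Llama 3?")
    long_prompt = build_prompt("What is binary quantization?")
    # The triggered prompt is longer because it carries the demos.
    print(len(short_prompt), len(long_prompt))
```

The point of the sketch is only the branching: untriggered questions get the short prompt, so the extra tokens are spent on the inputs that seem to need them.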

@Tarun_Mamidi 8 months ago

Is llama really OSS if we don’t know how or what it is trained on?

@tobkin 8 months ago

Two questions:
- Why use gpt-4 instead of gpt-4-turbo for the teleprompter?
- What are you using to make your pointer act like that?

@VincentFulco 8 months ago

Would love to see an interface to groq please!

@charismaowojoameh7681 7 months ago

I tried the implementation, but I keep getting the error "model not found".
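The comment does not say which component raised the error. Assuming it came from Ollama, one possible cause is that the requested model was never pulled, or that its name or tag differs from what the notebook asks for. The sketch below checks a model name against the list a local Ollama server returns from `GET /api/tags`. The default address `http://localhost:11434` and the helper names are assumptions, and the sample payload is illustrative rather than real output.

```python
import json
import urllib.request

# Assumption: Ollama is serving on its default local address.
OLLAMA_TAGS_URL = "http://localhost:11434/api/tags"


def normalize(model: str) -> str:
    """Ollama treats a name without a tag as '<name>:latest'."""
    return model if ":" in model else f"{model}:latest"


def model_is_pulled(tags_payload: dict, model: str) -> bool:
    """Check a /api/tags style payload for an exact name:tag match."""
    wanted = normalize(model)
    names = [entry.get("name", "") for entry in tags_payload.get("models", [])]
    return wanted in names


def fetch_tags(url: str = OLLAMA_TAGS_URL, timeout: float = 5.0) -> dict:
    """Fetch the locally available models from a running Ollama server."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # Illustrative payload shaped like a /api/tags response, not real output.
    sample = {"models": [{"name": "llama3:latest"}, {"name": "mistral:7b"}]}
    print(model_is_pulled(sample, "llama3"))   # matches llama3:latest
    print(model_is_pulled(sample, "mistral"))  # mistral:latest is not in the list
```

With a server running, `model_is_pulled(fetch_tags(), "llama3")` performs the same check against the real list. If it returns False, pulling the model with `ollama pull llama3` before rerunning the notebook may resolve the error.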

@koljanos 8 months ago

Hey, which version of weaviate-client are you using?

@connor-shorten 8 months ago

Hey! I am using v4 and the latest version of dspy-ai. Can you please share any error messages as an issue on Weaviate recipes? It might be easier to help debug there than in YouTube comments.

@koljanos 8 months ago

All right, will check the issues!

@RichardHamnett 7 months ago

The recipe is gone

@connor-shorten 7 months ago

Hey Richard! Sorry, we refactored the recipes! The links are now fixed!

@human_agi 8 months ago

how to install ollama

@420_gunna 8 months ago

this mf cookin

@PeterWilliams97 8 months ago

I ran your notebook and got the following error.
print(RAG()("What is binary quantization?").answer)
AttributeError Traceback (most recent call last)
Cell In[7], line 1
----> 1 print(RAG()("What is binary quantization?").answer)
File ~/code/vector_search/weaviate/recipes/.wenv/lib/python3.11/site-packages/dspy/primitives/program.py:26, in Module.__call__(self, *args, **kwargs)
     25 def __call__(self, *args, **kwargs):
---> 26     return self.forward(*args, **kwargs)
Cell In[6], line 16
     15 def forward(self, question):
---> 16     context = self.retrieve(question).passages
     17     pred = self.generate_answer(context=context, question=question).answer
     18     return dspy.Prediction(context=context, answer=pred, question=question)
File ~/code/vector_search/weaviate/recipes/.wenv/lib/python3.11/site-packages/dspy/retrieve/retrieve.py:30, in Retrieve.__call__(self, *args, **kwargs)
     29 def __call__(self, *args, **kwargs):
---> 30     return self.forward(*args, **kwargs)
File ~/code/vector_search/weaviate/recipes/.wenv/lib/python3.11/site-packages/dspy/retrieve/retrieve.py:39, in Retrieve.forward(self, query_or_queries, k)
     36 # print(queries)
     37 # TODO: Consider removing any quote-like markers that surround the query too.
     38 k = k if k is not None else self.k
---> 39 passages = dsp.retrieveEnsemble(queries, k=k)
     40 return Prediction(passages=passages)
...
     79 .do()
     81 results = results["data"]["Get"][self._weaviate_collection_name]
     82 parsed_results = [result[self._weaviate_collection_text_key] for result in results]
AttributeError: 'WeaviateClient' object has no attribute 'query'

@connor-shorten 8 months ago

Hey Peter! Apologies, we have upgraded the WeaviateRM to use the Weaviate v4 client. Can you please try upgrading dspy with `!pip install dspy-ai --upgrade`?
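Since the suggested fix depends on which package versions are installed, it can help to check them before and after upgrading. The sketch below uses only the standard library. The package names `weaviate-client` and `dspy-ai` come from this thread, while the helper names and the "older than v4" warning are illustrative assumptions, not an official compatibility check.

```python
from importlib import metadata


def major_version(version: str) -> int:
    """Return the leading integer of a version string such as '4.5.4'."""
    return int(version.split(".")[0])


def installed_version(package: str):
    """Return the installed version of `package`, or None if it is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Package names as mentioned in the thread.
    for package in ("weaviate-client", "dspy-ai"):
        print(package, installed_version(package) or "not installed")

    client_version = installed_version("weaviate-client")
    if client_version is not None and major_version(client_version) < 4:
        # Assumption based on the replies here: the recipe expects the v4 client.
        print("weaviate-client is older than v4; the thread recommends v4.")
```

Including these version lines in a bug report makes a client/library mismatch much easier to spot.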

@connor-shorten 8 months ago

Can you please share any error messages as an issue on Weaviate recipes? It might be easier to help debug there than in YouTube comments.

@LyuboslavPetrov 8 months ago

@connorshorten6311 Please update the video with accurate setup instructions. I have been fighting to get this running (DSPy + Weaviate + Ollama) for the past 2-3 hours to no avail. I tried multiple weaviate-client/server combinations, ran it through Docker and standalone, reconfigured it, and upgraded and downgraded dspy-ai. I went through more help pages than I can count. I am tired, but I would still like to play with this set of technologies. Thanks