How Good is Phi-3-Mini for RAG, Routing, Agents

12,523 views

Prompt Engineering

Days ago

Comments: 27
@engineerprompt 3 months ago
Want to learn RAG beyond the basics? Make sure to sign up here: tally.so/r/3y9bb0
@VerdonTrigance 3 months ago
You mentioned phi-3 small, but it's not released yet. Later in the video you download phi-3 mini, which is smaller than small.
@VerdonTrigance 3 months ago
I'm actually waiting for phi-3 small with the 128k context length, so I can talk to a set of different documents such as docx, xlsx, txt, and Python scripts. They are all relevant and I want to put them all in a RAG, but maybe routing will be helpful for that too. But I need a really big context for that. Or I should somehow train it. The only option I know for training on a big document set is to ask another model to generate questions and then ask it to answer those questions. Anyway, any of these would be helpful.
@PseudoProphet 3 months ago
It's not making any mistake, Meta is the real open AI. 😂😂😂
@3choff 3 months ago
Brilliant content! I think it is more interesting to test a model by looking at practical applications rather than asking a series of questions that could be in the training data. You should consider making a series of videos in this format.
@johnkintree763 3 months ago
Excellent demo of Phi-3's RAG abilities. While we seek a 3-billion-parameter language model that runs well on a smartphone with at least 6 GB of RAM, we will also want a speech recognition model and a dynamic graph neural network that can merge with a vector store to provide long-term memory.
@krisvq 3 months ago
Was thinking the same. We expect a lot from a small model.
@tvwithtiffani 3 months ago
I enjoyed your calm, mellow speaking tone. Nice contrast to pretty much all of YT. Subscribed!
@tvwithtiffani 3 months ago
Question: Is there a local model that you would recommend for RAG? I've been building RAG systems since GPT-3 (not 3.5), and I've yet to find a model that comes close to simply understanding what's being asked at that point in the conversation, extracting relevant info from stuffed context, and providing a response. I would even have GPT-3 (pre-ChatGPT) quote the sentence from which it got its answer. My experience so far locally is that all of the moving parts outside of the local model have to be damn near 100% perfect to work correctly, and even then the model will muck it up somehow every now and again, to the point that it's unreliable. Which models do you recommend for this specific use case?
@engineerprompt 3 months ago
I personally like the Zephyr models if you are looking for smaller LLMs. For bigger local LLMs, Llama-3 70B is good (in my use cases), and also Command R+.
@NoidoDev a month ago
19:33 - The model should only decide whether something is a mathematical question, and then the script should decide that it has to use a tool.
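The split suggested in this comment can be sketched roughly as follows: the model answers only a yes/no classification question, and ordinary code decides whether a tool runs. This is a minimal sketch, not the code from the video. The names `is_math_question`, `calculator`, and `answer` are hypothetical, and the classifier here is a placeholder regex standing in for an LLM call.

```python
import ast
import operator
import re
from typing import Callable

# Binary operators the toy calculator tool supports.
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}


def calculator(expression: str) -> float:
    """Evaluate a simple arithmetic expression without using eval()."""
    def _eval(node: ast.AST) -> float:
        if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](_eval(node.left), _eval(node.right))
        raise ValueError("unsupported expression")
    return _eval(ast.parse(expression, mode="eval").body)


def is_math_question(question: str) -> bool:
    """Placeholder for the LLM call (assumption): it returns only True or False."""
    return re.search(r"\d+\s*[-+*/]\s*\d+", question) is not None


def answer(question: str, classify: Callable[[str], bool] = is_math_question) -> str:
    # The script, not the model, decides that the tool must be used.
    if classify(question):
        expression = re.search(r"[\d\s.+\-*/()]+\d", question).group().strip()
        return f"tool:calculator -> {calculator(expression)}"
    return "llm: answer directly (no tool)"
```

To try it with a real model, pass a different `classify` callable that prompts the model for a strict yes/no answer; the rest of the control flow stays in plain code.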
@patrickscheich7532 3 months ago
Nearly perfect! Somehow my agent does not use tools for all questions, including the ones about Meta, but the rest works :)
@outeast1052 3 months ago
Bro, please fix the sound level. It's too quiet. I'm on 100 and can't hear anything, while all other videos are fine on 30.
@borisrusev9474 3 months ago
I don't get the concept of multiple vector stores. How do they differ? Do they store different documents? Use different embedding models? Or maybe the chunking strategies are different?
@engineerprompt 3 months ago
In this case, each store will contain different docs. Imagine you have different knowledge bases for different departments and you want to retrieve info from the relevant department based only on the query.
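A toy illustration of that idea, with one store per department and a router that picks a store from the query alone, might look like the sketch below. Everything in it is an assumption for illustration: the `STORES` and `DESCRIPTIONS` data, the function names, and the bag-of-words similarity, which stands in for both a real embedding model and whatever model-based router the video uses.

```python
from collections import Counter
from math import sqrt


def embed(text: str) -> Counter:
    """Toy bag-of-words 'embedding'; a real system would use an embedding model."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


# One store per department, each holding different documents (hypothetical data).
STORES = {
    "hr": [
        "vacation policy allows 25 days of leave",
        "payroll runs on the last friday",
    ],
    "engineering": [
        "deploy the service with the release script",
        "code review needs two approvals",
    ],
}

# Short descriptions the router compares the query against.
DESCRIPTIONS = {
    "hr": "vacation leave payroll benefits policy",
    "engineering": "deploy code review service release",
}


def route(query: str) -> str:
    """Pick the store whose description is most similar to the query."""
    q = embed(query)
    return max(DESCRIPTIONS, key=lambda name: cosine(q, embed(DESCRIPTIONS[name])))


def retrieve(query: str, k: int = 1) -> tuple[str, list[str]]:
    """Route first, then search only the selected store."""
    name = route(query)
    q = embed(query)
    ranked = sorted(STORES[name], key=lambda doc: cosine(q, embed(doc)), reverse=True)
    return name, ranked[:k]
```

Calling `retrieve("how many days of vacation leave do I get")` searches only the `hr` store, so documents from other departments never compete for the top results.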
@jaysonp9426 3 months ago
Couldn't you just add a different metadata filter though? Is there a computational advantage to multiple vector stores?
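For comparison, the single-store alternative raised in this question can be sketched as one collection whose records carry a department tag that is filtered before ranking. This is only an illustration under stated assumptions: `Record`, `STORE`, `overlap`, and `search` are hypothetical names, and word overlap stands in for vector similarity.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Record:
    text: str
    department: str  # metadata attached to each chunk


# A single store holding every department's documents (hypothetical data).
STORE = [
    Record("vacation policy allows 25 days of leave", "hr"),
    Record("payroll runs on the last friday", "hr"),
    Record("deploy the service with the release script", "engineering"),
]


def overlap(query: str, text: str) -> int:
    """Toy relevance score: number of shared words (stand-in for vector similarity)."""
    return len(set(query.lower().split()) & set(text.lower().split()))


def search(query: str, department: Optional[str] = None, k: int = 1) -> list[str]:
    # Apply the metadata filter first, then rank only the surviving records.
    candidates = [r for r in STORE if department is None or r.department == department]
    ranked = sorted(candidates, key=lambda r: overlap(query, r.text), reverse=True)
    return [r.text for r in ranked[:k]]
```

Functionally this restricts retrieval the same way separate stores do. Whether either approach is cheaper depends on how the chosen vector database implements filtering and indexing, so the computational question has no single answer here.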
@aa-xn5hc 3 months ago
These tests are really great! Please recommend the best LLMs for these purposes as of the time of your tests.
@yotubecreators47 3 months ago
Perfect content, but the camera consumed my attention and I am not able to focus. I hope I can find similar content with a normal static camera view, like in other videos.
@trilogen 3 months ago
When running these, what caliber of computing power are we talking? Any mid-to-high-end laptop, or a mid-to-high-end PC rig with a good graphics card?
@engineerprompt 3 months ago
For this model, you will be able to run it on 6-8 GB of VRAM. Potentially even on a CPU.
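The 6-8 GB figure lines up with a rough weights-only estimate. The sketch below assumes Phi-3-mini's roughly 3.8 billion parameters and counts only the memory for the weights at a given precision; the KV cache, activations, and runtime overhead come on top, so treat the numbers as a lower bound. The function name `weight_memory_gb` is hypothetical.

```python
def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Memory for the weights alone, in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


PHI3_MINI_PARAMS = 3.8e9  # Phi-3-mini has about 3.8 billion parameters

# Compare common precisions: 16-bit, 8-bit, and 4-bit quantized weights.
for bits in (16, 8, 4):
    print(f"{bits}-bit weights: {weight_memory_gb(PHI3_MINI_PARAMS, bits):.1f} GB")
```

At 16 bits the weights alone take about 7.6 GB, while a 4-bit quantized copy takes about 1.9 GB, which is why quantized versions are the usual choice for smaller GPUs or CPU-only runs.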
@kingfunny4821 3 months ago
Can it be trained on document data or not?
@engineerprompt 3 months ago
Yes, you can fine-tune it.
@stanTrX 3 months ago
Your code looks so difficult, mate. But thanks 🎉