Very informative video with excellent production. Looking forward for more content
@mulderbm5 ай бұрын
Thanks for using the the right tools for the purpose. I am looking at a tabular dataset that i want to use as the material for an llm to generate synthetic sample graphs from so instead of extracting it from the Wikipedia page it has to write the page given the base knowledge graph. And I believe an llm is very useful for that.
@MehdiAllahyari5 ай бұрын
Yes for your use case llm is actually the best tool as you want to convert structured data into natural language form.
@rowdyjuneja5 ай бұрын
Thanks for the informative video. Other than LLMS , could you suggest some approach or models to try for relationship extraction?
@TwoSetAI5 ай бұрын
There are traditional ner methods. We will share more in new videos!
@zoroyee2765 ай бұрын
Thanks for your video, I am also looking forward to new video of relationships extraction by traditional ways!
@Ash2Tutorial27 күн бұрын
It would help if the code was also available. Could you post the link to your code shown in your Jupyter Notebook ?
@aGianOstaLgia5 ай бұрын
what was wrong with the previous video? As always, thank you!
@MehdiAllahyari5 ай бұрын
Because the subtitles were distracting, we had to re-upload a new one. Unfortunately, the comments of last video cannot be not displayed for this one!
@TwoSetAI5 ай бұрын
i removed the subtitle. hopefully this is easier to watch! thanks!
@myfolder45615 ай бұрын
Would you recommend using the SLIM local models you introduced earlier in this series for NER, intent classification etc to construct knowledge graphs? Looks like it could be a cost-saver plus it offers structured and consistent inputs for graph construction, although I'm not sure if any of the existing available set of SLIMs is well trained enough for this purpose?
@TwoSetAI4 ай бұрын
@@myfolder4561 That potentially is a good idea. We haven’t tried it ourselves. Let us know if you try this approach!
@TwoSetAI4 ай бұрын
@@myfolder4561 it’s indeed possible you will need to train your own SLIM model for this.
@apulacheyt4 ай бұрын
Very interesting review. Any chances you share the code to try it myself? Thanks in advance. BTW I'm reading your RAG book.
@MehdiAllahyari4 ай бұрын
Awesome! Here's the code: github.com/mallahyari/twosetai/blob/main/02_kg_construction.ipynb
@TwoSetAI2 ай бұрын
@@apulacheyt thank you! Some materials might be outdated due to changes in the libraries . Check out our course for the latest updates!
@artur505 ай бұрын
Mehdi, if you say that LLMs are that excellent at making KGs and you prefer other libraries that are more practical at making KGs, could you say what libraries you mean?
@TwoSetAI5 ай бұрын
Yes we will share more
@MehdiAllahyari5 ай бұрын
There are many depending on the domain. But here are some of them that tend to work very well for many domains: - github.com/urchade/GLiNER - github.com/universal-ner/universal-ner - github.com/kamalkraj/BERT-NER
@artur505 ай бұрын
@@MehdiAllahyari thanx, it looks awesome. I will test it for sure…
@artur505 ай бұрын
Thanks! Will you publish the code or github?
@MehdiAllahyari5 ай бұрын
Here's the code: github.com/mallahyari/twosetai
@artur505 ай бұрын
@@MehdiAllahyarithanx!!!
@TwoSetAI5 ай бұрын
Yes.
@arjungoalset84424 ай бұрын
Can you please share the link of the notebook that you went through in the video
@MehdiAllahyari4 ай бұрын
Sure. Here's the code: github.com/mallahyari/twosetai/blob/main/02_kg_construction.ipynb
@HaseebAshraf-q3q2 ай бұрын
I have a huge company policy document that I want to create a knowledge graph for, how do I define labels for that? or is it better to do it without? If yes can you please guide me how to go about it without defining labels?
@MehdiAllahyari2 ай бұрын
By labels I assume you're talking about entity names. Those are things that you should already know or have some common sense about. So you can start there or manually create a few and use LLMs or some other model to extract/generate additional ones based on them.
@neatpaul4 ай бұрын
What if we have no information about the entities? Suppose it is for an application that takes documents as input, in that case we have no sure idea about what the entities will be. How will it work then?
@karthickdurai21574 ай бұрын
Then you can let a general NER model to parse depending on your use case, or if you do know the domain, like if it's a PII data, Finance dataset, etc. you can run it through a pretrained NER model for that particular domain
@neatpaul4 ай бұрын
@@karthickdurai2157The type of application I'm developing, is intended to work on all types of documents irrespective of the domain.
@dmitrivillevald9274Ай бұрын
Building a good app with a knowledge graph without understanding input data would be challenging. A practical first step would be to generate a summary or extract key topic(s) from the document to understand its content before constructing the graph.
@truliapro71124 ай бұрын
How to use documents which have images like some product manual pdf files. How can we use Grpahrag for this problem?
@MehdiAllahyari2 ай бұрын
if you need to use images as well, you're going to need to use some libraries to identify and extract images. There are few like github.com/ai8hyf/TF-ID or PyMuPDF4LLM.
@sgwbutcher5 ай бұрын
The challenge for me is that LLMs are not consistent within or between documents. In the example, you see "us" and "u.s.". I'm also concerned that Fiat is an Organization but Chrysler is a Company. And in the LLM example of triples, many of the objects are just, well, sentence fragments. The killer feature of KGs is that you can make connections...but the overspecificity would seem to prevent this. For example, I cannot connect Tom Hanks to any other "fourth highest grossing actor"...he's the only one! There seems to be no good way to create a prompt where the LLM generates entities and relationships at a consistent and appropriate level of hyper/hypo-nymy. This is perhaps not surprising given that LLMs don't think, reason, whatever. And therein lies the trap in getting LLMs to lift themselves up by their own bootstraps.
@MehdiAllahyari5 ай бұрын
That's exactly my point in the video too. Many people are hyped/over excited to use LLM for extracting name entities and relations especially when you don't define your schema at first. However, there is no guarantee that you get consistent results. Plus the cost if prohibitive!
@awakenwithoutcoffee5 ай бұрын
I think adding an extra layer of adding metadata (e.g. Parent documents) could solve this issue e.g.: - you can have an LLM embedding with semantic ability to go over each chunk and add metadata related to that chunk so that the LLM can understand the context of each word e.g. "u.s" -> "united states, country".
@jackbauer3225 ай бұрын
spacy-llm can help you do few shots NER and the performs is almost 99% of traditional approach
@karthickdurai21574 ай бұрын
I think spacy-llm also uses a LLM behind the scenes, so it may not be as fast as this