I am very new to this and having trouble to understand it but still trying my best. Wanted to ask will it work for documents with complex layout as well? such as PDFs with multi column tables, images or tables that span across multiple pages, tables that have images inside them. Developing a RAG based pdf query system especially for complex PDFs and i am confused what is the best chunking method for my task.
@SwingingInTheHoodАй бұрын
The issue in your case is extraction. Semantic chunking is basically organizing the content hierarchy. Typically,, this is text. A good PDF to text extractor is LlamaParse. It does a very good job of maintaining table structures. As for images, now you are talking multi-modal vectorization, which is beyond my pay grade at this moment. It is possible, but you will need to investigate which vectorizors support it, and how the images need to be submitted.
@TismoGaming2 ай бұрын
I don’t know if you’re still using this or not but I believe that you were right about the micro usb port of the camera itself is weak and finicky. It won’t surprise me since wyze cut corners by using cheap parts to cut products prices. There are these things that alleviate that connection issue “ Adapter with Power for Host Devices/Fire Stick etc” Side note, with my age, I am having hard time seeing how those microusb connecters line up so whenever I succeed in making a good connection, I trace one side of the cable and connector with a permanent marker with a line and then after I just have to align the lines next time I want to plug in the cable again 😉
@AmadeoDominguez-q5u4 ай бұрын
En Donde las benden nesesito2
@magicman4575 ай бұрын
Hi there, thank you for making this review! It is really inspiring to see your passion and progress for the sport. How have you managed to stay so limber and able to move as you have gotten older? I am 24 now and hope to keep moving well as long as I can. Do you have any advice for someone in their 20s? I hope you enjoy your Ollos and happy training!
@SwingingInTheHood5 ай бұрын
I am a dancer, so I was already used to training my body in movement when I started Parkour. Stretch, don't stop moving and don't overdo it. Make staying healthy a lifestyle -- physically, mentally and spiritually. In 40+ years, your older self will thank you.
@magicman4575 ай бұрын
I really appreciate that response. It’s great to see you enjoying your movement so much, I hope to share in that passion for many years to come.
@luzdelacruz44345 ай бұрын
❤
@myfaaammilly16075 ай бұрын
Hoping the one i bought similar to this one works. Cat was chilling on top of my car like he was at the mall people watching. Pisses me off!!
@jerryturner81265 ай бұрын
Would have been nice to hear the audio output of the device.....nice music though....
@silvahawk5 ай бұрын
is this still working with Wyze latest firmware? I noticed on the github that there's a max firmware written on it which is not the current latest firmware
@SwingingInTheHood5 ай бұрын
The adapters work with the wz_mini_hacks software. You can look at this discussion thread with respect to the latest comments on adapters: github.com/gtxaspec/wz_mini_hacks/discussions/73 Do not know if the software itself is supporting the latest Wyze firmware.
@silvahawk5 ай бұрын
@@SwingingInTheHood oh so you've not updated your cameras recently and just stick to the version that was working?
@SwingingInTheHood5 ай бұрын
@@silvahawk With respect to my POE Wyze cameras, yes. It is a hack, so you never know if new firmware will break it. Until it's too late.
@sabbadinsabbasoftcom6 ай бұрын
I would like you to share the prompt you use to ask the LLM to make up a clear question from a user follow up question.. thanks
@tommaso61876 ай бұрын
amazing video
@sugarbee94757 ай бұрын
Lol
@philsowers7 ай бұрын
I'm still having trouble accessing the lm1200 via a browser, can't ping the ip at a console either after setting a direct connection with an ip in the same subnet.
@herbsu43308 ай бұрын
I want use something like this to stop squirrels at the bird feeder and suet cage. Will it deter birds?
@Jhardy18648 ай бұрын
I just purchased one. This is encouraging. I pray I have the same results. Thanks for sharing!
@naderjanhaoui58310 ай бұрын
If you use an OCR system like the OCR API of Adobe PDF Service, you can easily obtain the semantic schema. Unlike regex, which makes it impossible to detect titles, sections, or other parts of the document, OCR allows you to identify every element in your document, such as tables or lists. This ensures that you have a cleanly parsed document.
@SwingingInTheHood10 ай бұрын
Thanks for the info. As someone who has successfully used OCR and regular expressions for decades, I would hardly say it makes it impossible to detect formatting. Au contraire, that's what it was designed for. However, I find working with PDFs and bookmarks becoming much easier. I would recommend PDF WonderShare Element and Nitro PDF Pro. Both have auto bookmarking features, Nitro's being the best because you can search by font and text.
@MALITHA810 ай бұрын
has any one been able to get the video feed coming from the RJ45 and how have you done it. THis will power the Wyze cam but thats it. The connection is through wi fi
@wadethehawk812811 ай бұрын
CAN SOMEONE PLEASE TELL ME THE SETTINGS FOR BOTH KNOBS IN THIS VIDEO
@mikewilson713211 ай бұрын
i know this a new post will this work for neighbours dog shiting in my yard
@leomarc634 Жыл бұрын
I need for dogs any digestion
@CarlosGomez-THX_1138 Жыл бұрын
Thanks for doing all the research. Much appreciated.
@CarlosGomez-THX_1138 Жыл бұрын
I now know what to do with Radpberry Pi. Thanks!
@illinoisslots4323 Жыл бұрын
thanks for the help! really appreciate it!
@petesfeeder Жыл бұрын
Watching you play really made me want to come join you! I haven't jumped around that beautiful park in 35 years but man was it a good time. Especially when the fountains are on. Be well Much love
@AndreaPorter Жыл бұрын
Does it work for dogs???
@doeplatform5285 Жыл бұрын
360p .. we meet again!
@ozarkmedia Жыл бұрын
You explained very well. I'm planning to get the LM1200 and doing my homework before I do. This really helps.
@vicentedumandan7866 Жыл бұрын
how to order?
@SwingingInTheHood Жыл бұрын
amzn.to/44KS7mn
@zkiyyeller3525 Жыл бұрын
Thank you for this.
@TheRues Жыл бұрын
Do you have contact info that you could post?
@TheRues Жыл бұрын
I’m trying to learn how to “Chicago style step” can you teach me? Do you do lessons? I need to learn how to follow 😭 I’m near Seattle but I’m from Chicago and it’s sad that I can’t step (rhythm challenges)
@benjapereira9373 Жыл бұрын
El angole es fandango tango dos x cuatro y el etíope es el catonga tango dos x dos el maestro Astor Piazzolla creo mucho en el estilo etíope
@gambit633 Жыл бұрын
When you were on bar your face was facing away, maybe why they lost you? Do you know if they are tracking body or face?
@SwingingInTheHood Жыл бұрын
This is a very good question. I seem to recall when I chatted with tech support, that the device is tracking the face.
@FelipeMeres Жыл бұрын
GROBID is the closest I've found to a solution to semantically segment PDFs - it doesn't segment by chapter out of the box but I've been experimenting with different ways to use the TEI xml output as a starting point to different degrees of success so far. If you're not familiar I'd have a look both at GROBID and TEI in general.
@johnday2631 Жыл бұрын
link to code repo?
@SwingingInTheHood Жыл бұрын
Not yet. But I think I will create a Github repo and post the code I have created for my use. I'll add the link here when it is done. Thanks for the suggestion.
@Victor-ww2hx Жыл бұрын
@@SwingingInTheHood still no repo?
@deftcg8 ай бұрын
@@Victor-ww2hx bump
@SwingingInTheHood7 ай бұрын
Still no repo, primarily because the current code is part of the embedding pipeline in my existing system. Trying to pull it out to make it standalone is just too big a task at the moment. However, I am thinking about making an API available: community.openai.com/t/using-gpt-4-api-to-semantically-chunk-documents/715689/100?u=somebodysysop Or, if you're up to the coding challenge yourself, in this discussion we have created a roadmap on developing this process yourself: community.openai.com/t/using-gpt-4-api-to-semantically-chunk-documents/715689/
@pierresayad1667 Жыл бұрын
Way too much talking bro. Get to it and just show the required components and what you did. Too much talking. Losing audience.
@SwingingInTheHood Жыл бұрын
Fortunately for me, I'm not trying to build an audience. I'm just sharing what I know, the best way I know how. I don't script these things because I don't need to because I don't care if I talk too much. And, for the record, I agree with you. But, this is just how I roll right now.
@pierresayad1667 Жыл бұрын
Keep doin’ you! No harm no foul .. just thought I’d provide a suggestion. Thx for the info regarding the Poe splitter.
@SwingingInTheHood Жыл бұрын
@@pierresayad1667 No problem. Actually, I just watched a video I just did where I keep saying "You know?" I know as a viewer that would irritate the heck out of me. So, I will take your suggestion to heart and perhaps try to outline my topics a little better so I don't ramble as much.
@sharannagarajan4089 Жыл бұрын
I’m also looking for a solution where PDF hierarchical schema is maintained for chunking
@SwingingInTheHood Жыл бұрын
Outside of custom regex code, another method I've found is to use pdf bookmarking. If it's not that large of a document, I simply go through and bookmark the individual sections, then use a pdf splitter tool to split the document by section. The tool I've been using is Sejda.com, but there ae a few of them out there.
@naderjanhaoui58310 ай бұрын
You can use a ocr system contact me if you need help
@SwingingInTheHood7 ай бұрын
If you're up to the coding challenge yourself, in this discussion we have created a roadmap on developing this process yourself: community.openai.com/t/using-gpt-4-api-to-semantically-chunk-documents/715689/
@plummzzz Жыл бұрын
Will this work with the v3 pan?
@SwingingInTheHood Жыл бұрын
Unknown. Never tested with that model. You'll have to check the Wyze Mini Hacks Github: github.com/gtxaspec/wz_mini_hacks
@mazenlahham8029 Жыл бұрын
Amazing idea, thanks for sharing ❤
@maziua Жыл бұрын
area covered by device is very less, but an addition to garden for saving plants.......not commercial fields
@DrTentaCulo Жыл бұрын
Does it make any nose that will keep me up at night? The damn neighbors cat keep sleeping on my roof and it’s pissing me off, I’m not about to have my paint scratched because she leaves her back door open at night for her car. It’s either this if it works or coolant 🤷🏽♂️
@videogamesplanet6631 Жыл бұрын
Got the solar version of this and it hurts my brain. Its too damn strong even on the lowest settings. It not only repels cats but humans too. 😂
@myfaaammilly16075 ай бұрын
😂😂😂
@tomm7232 Жыл бұрын
This helped me a lot thank you!
@bluefungi Жыл бұрын
Ayeee I've been there too. 😁
@galdx_ Жыл бұрын
Did some tests here and also noticed a substantial improvement when using the header approach per chunk. I searched for some pdf parsers, but could not find one that recognizes the structure of the document and then parses it. Did you have any luck with it? I believe that this problem might have been solved by someone already.
@SwingingInTheHood Жыл бұрын
A pdf export program that could export documents according to their hierarchal organization would be a dream come true. But, alas, I have yet to find one. I did make a request to ABBYY to look into it. What I have ended up doing is writing code that reads the header I created to chunk the document, then re-organizes all the chunks in hierarchal order. Now, I can import these text files as "book" nodes into Drupal, where they create their own natural "table of contents". And, using my SolrAI module, I vectorize these nodes from within Drupal and now have some pretty organized content that always knows where it is in the hierarchy.
@galdx_ Жыл бұрын
@@SwingingInTheHood yes, it solves the issue, but it is not scalable right? maybe there is an opportunity.
@SwingingInTheHood Жыл бұрын
@@galdx_ Au contraire, Drupal is the most scalable CMS available today. It is the preferred CMS of enterprise organizations. The reason the updates are queued is so that they can be upserted to the vector store in a more manageable manner. If you have hundreds, even thousands of updates going on hourly, the only difference would be that they would need to be queued and batched instead of the one-per system I have now. If this is what you mean.
@OdeeOz Жыл бұрын
Stupid background music, with no Device Audio. 👎👎
@lorla5236 Жыл бұрын
Wowww! I was coming to see if it would repel raccoons & not bother the cats but I clearly see the answer lol This wont fix my situation but I know what to get if I want to scurry away cats. Great video! Broox should sponsor this clip.
@Fahad-ik1zo6 ай бұрын
I believe this works for raccoons and more animals you just need to change the frequency on it
@zigzagfly1635 Жыл бұрын
Good informative and funny
@semilladeagua3123 Жыл бұрын
Best soundtrack ¡¡¡
@HappyVibeRaider Жыл бұрын
So does this makes it wired and not wireless?
@SwingingInTheHood Жыл бұрын
Using wz_mini_hacks and poe adapter, you can make it wired. However, if physically wired, it will still appear in your Wyze app with all Wyze features still available.
@raczyk Жыл бұрын
1. can they track you in the city amongst crowd of people? 2. What the maximum distance it can track?