Run LLM Evals with Pytest and LangSmith
15:52
Run LLM evals with Jest and LangSmith
14:07
Scheduled Tasks in LangGraph
5:40
21 сағат бұрын
Open Source Social Media Agent
25:21
Adding Chat-LangChain To Slack
13:26
Beginner's Guide to Agent Evaluations
40:45
How To Run Open Canvas Locally
8:05
Пікірлер
@gmsharpe
@gmsharpe 2 сағат бұрын
I'm trying run the Deepseek model on Ollama in a Google Colab notebook with Langsmith, but having a hard time figuring out how to configure Langsmith. Any help appreciated. I've been looking at the docs, but no obvious solution stands out. All the examples use OpenAI.
@kundalinimacaroni
@kundalinimacaroni 6 сағат бұрын
can you create a video on how to build this step by step for a non-coder?
@jonatan01i
@jonatan01i 8 сағат бұрын
The most useful and also most interesting part of the deepseek-r1 model is literally what's in between the <think></think> tokens, you wouldn't want to train them away.
@CHRABAKH
@CHRABAKH 6 сағат бұрын
I believe the author of the video has noticed that the <think></think> tags are not useful to the model in the query phase. The <think></think> are useful for understanding the model, but when using the model, it may not be the best strategy especially if the model does not have a strategy for using them in the query phase.
@jonatan01i
@jonatan01i 6 сағат бұрын
@@CHRABAKH absolutely, but training them away would make the model not output the thinking process, making the model more like 4o without thinking.
@jonatan01i
@jonatan01i 6 сағат бұрын
@@CHRABAKH format="json" somehow gets around of using the thinking entirely for example and it shows..
@expensivetechnology9963
@expensivetechnology9963 8 сағат бұрын
#LangChain I feel smarter after watching your content. Thank you for your dense AI stack mentoring. Prior to watching this video I didn’t fully appreciate how ‘in your face’ R1’s performance is until seeing your comparison to o1. Done at a fraction of the training cost.
@patolobos8266
@patolobos8266 10 сағат бұрын
Have you seen Stanford's Storm?
@brendanm4179
@brendanm4179 10 сағат бұрын
Thank you, I know this learns from a users tweets. I think it would be cool to create ai agent/bots/personalities (still learning the difference if any) of people not online or who speak differently online, like fictional characters… interacting with captain jack sparrow for example
@Cenot4ph
@Cenot4ph 12 сағат бұрын
OpenAI needs to change its name already to ClosedAI
@jamieknight326
@jamieknight326 12 сағат бұрын
Thanks for the explainer. This is really accessible and easy to follow. I’ve been experimenting with R1:70b on a local server and it’s remarkable what it can do. The explanation it gives of its ‘process’ is extremely useful for working out how best to refine a prompt. The main downside is that the model has biases (ask it about Taiwan!) which limit the ability to trust the model without being extra careful to verify any information it presents.
@j8888x
@j8888x 10 сағат бұрын
China defeated Japan in World War II and took back Taiwan, which was occupied by Japan in the Treaty of Shimonoseki.
@elviscotena2046
@elviscotena2046 13 сағат бұрын
I would really like a diagram of the connection Deepseek identified. My question was ‘how are the causal factors in obesity interconnected’ would be very grateful for any ideas. By the way, humans are mainly visual so when trying to understand interconnection a diagram is best. Thank you.
@subhankarmukherjee9838
@subhankarmukherjee9838 14 сағат бұрын
Great explanation.
@alexbalistreya
@alexbalistreya Күн бұрын
Hi there, thanks for your video. I understand you are going more into the nuts and bolts as to how these local systems can function. My question is: I really like using Notebook LM for a kind of organizing space for developing my research with a multitude of different sources. I am wondering if it is possible to do something similar without having to upload on Notebook LM's network - in other words, at times I use sensitive data which I would like to similarly work with an LLM and research tool it provides , but offline and maybe just communicating on my own computer. I am not a programmer so I don't know coding but would like to learn to make my own closed network to be able to do some more sensitive data research on a closed network. If you or anyone reading this has any resources, I would love to learn. Thanks.
@DavidTaylor-cz9pz
@DavidTaylor-cz9pz Күн бұрын
I have watched a lot of videos on AI and LLMs, and I am especially eager to understand how Deepseek can do so much with so little in the way of development costs and runtime resources. This is by far the best explanation I have seen, and you have just earned a loyal subscriber. rather than spend your time telling us how SHOCKED the entire industry is, you provide a clear, detailed description of how this LLM achieve its reasoning results, using graphics to pull everything together. I look forward to watching all your videos, but I do have one suggestion. Don’t superimpose your face on the diagrams and text. It makes me crazy not to see the very thing that you’re describing until after you are done with it. This one small change will make your outstanding video even better. Thanks.
@maheshprabhu
@maheshprabhu Сағат бұрын
I don't understand why people put a video of their face on the presentation. We don't need your face, we just need to hear you.
@KnoxTrades
@KnoxTrades Күн бұрын
Good video but man please add chapters
@YOGiiZA
@YOGiiZA Күн бұрын
Good talk, thank you
@muzzletov
@muzzletov Күн бұрын
idiots keep refraining this stupid notion that theres system 1 reasoning and system 2 reasoning without (giving) any proof that this is how the human brain works. kahnemann introduced his "theory" by giving examples. thats all.
@matsiv5707
@matsiv5707 Күн бұрын
Don't ask deep seek what happened the 3rd of June 1989 in Tiananmen square
@aleppax
@aleppax 7 сағат бұрын
Any local instance replies without censorship.
@digitalsmkstudio
@digitalsmkstudio Күн бұрын
Please update the github repo link, I want to access the code but you may move the file to any other location. in addition, during recording, if you zoom the screen, then it would be helpful for those who have weak eye-sight
@ru2979
@ru2979 Күн бұрын
Hi Mr. Lang ☺ Thank you for the video
@coopersnyder4675
@coopersnyder4675 Күн бұрын
we need to have an experiment on groups of people try to learn something from scratch; one group learns from textbooks and classical work and the other uses LLM "summaries". To get a sense of what I'm getting at, is do you want your heart surgeon an alum of using LLM research tools to understand their profession and practice? Maybe it's good at extending your reach, in a sense of like experimental and cutting edge research but without learning solid foundations of established knowledge from primary sources anything gleaned from llm summaries seems like fast junk food yielding superficial learning. This is incredible though, the goal posts keep moving lol
@brulsmurf
@brulsmurf Күн бұрын
It's how you use it I guess. I'm sure LLM can give you practice questions etc like a teacher would.
@SpaceReii
@SpaceReii Күн бұрын
Ngl... He looks like Dr House. Dr House x LLM's 🚨
@aaronschweidler2769
@aaronschweidler2769 2 күн бұрын
I also thought it was awesome for DeepSeek to release the training methodology. This is all super frontier - but from my experience it seems like the distilled models are better than their base model (say Qwen 2.5 or Llama 3) at reasoning specifically but not much else.. that is to say the layer of reasoning data didn't magically unlock the ability to generate something profoundly better than the original models, just better at reasoning-based prompts. Still need to test but the 14b model still has a lot of issues/errors from my early exp.
@iyerasri
@iyerasri 2 күн бұрын
Where is the Python 3.11 interpreter installed when you use the "uvx" command? I want to open this repo in VS Code but need help in figuring out which interpreter to select.
@georgytioro
@georgytioro 2 күн бұрын
In essence, what DeepSeek R1 shown is that openAI is just doing horizontal and vertical scaling of their models together with some tweaks for fine-tuning, nothing special, nothing "ground breaking " (what Sam Altman was announcing for years now - AGI is close!?! ) . Basically, OpenAI was just making holes in the water and made all their users believe that with the next model, they will release AGI 😂
@andrybratun7064
@andrybratun7064 2 күн бұрын
o1, o1-preview does not have structured output yet?
@asgermller3608
@asgermller3608 Күн бұрын
It does
@pedroodelvalle
@pedroodelvalle 2 күн бұрын
Great content! Thanks!!
@carlkim2577
@carlkim2577 2 күн бұрын
Very good explanation. I'm not a programmer, but I can see how and when I would use this.
@peronsh
@peronsh 2 күн бұрын
Do I understand correctly that to build the e.g., Qwen-R1 you replace the DeepSeek-v3 foundation model with any of the Qwen models? And follow the latter preocedure.
@paulmiller591
@paulmiller591 2 күн бұрын
Great timing very topical Thanks!
@jugalsheth-x8w
@jugalsheth-x8w 2 күн бұрын
Excellent
@subashchandra9557
@subashchandra9557 2 күн бұрын
Hey Lance, can you make a video soon about storing memories in a vector DB and reranking etc? I feel like thats the final step right here!
@abdulshariq1330
@abdulshariq1330 2 күн бұрын
exceptional -- really good explanation and a great notebook . thanks for sharing.
@AmolJadhav-x6o
@AmolJadhav-x6o 2 күн бұрын
How can I create graph in the studio. And where can I get the Assistant/Graph ID?
@developer-h6e
@developer-h6e 3 күн бұрын
Hello, I would to use deep researcher for searching the web for scholarship with criteria that I provided, and search the web deeply for the ones that meet that criteria, can it help with that? what do you say.
@Pregidth
@Pregidth 3 күн бұрын
Thanks!
@gullyburns1280
@gullyburns1280 3 күн бұрын
Really great workthrough demo and, as always, nice deep discussion of LangGraph's architecture here.
@ArpitGupta-g4x
@ArpitGupta-g4x 3 күн бұрын
could you please make it in python too
@vireshranjan5367
@vireshranjan5367 3 күн бұрын
he just showed us how to build our own Perplexity in less than 15 mins !
@returncode0000
@returncode0000 3 күн бұрын
So far the best explanation I could found. Great video, thanks!
@eygs493
@eygs493 3 күн бұрын
DeepSeek-R1 is nothing but another chinese junk
@johnsullivan8673
@johnsullivan8673 Күн бұрын
Ask your mom why she didn’t abort you.
@hrmanager6883
@hrmanager6883 3 күн бұрын
Awesome 🙌
@Dr.UldenWascht
@Dr.UldenWascht 3 күн бұрын
I was eagerly anticipating your video on this. Thanks
@IXAKEPI
@IXAKEPI 3 күн бұрын
like a brain having sections for reasoning and communication, those sections need to communicate, but should not overwrite each other while training
@Guilherme-nw3gc
@Guilherme-nw3gc 3 күн бұрын
Amazing
@tee_iam78
@tee_iam78 3 күн бұрын
Excellent video. Thank you.
@zohaibramzan6381
@zohaibramzan6381 3 күн бұрын
how can we make this update_state as part of the graph for api calls? I am waiting for that sort of implementation.
@janmizgajski3297
@janmizgajski3297 3 күн бұрын
Nitpicking here: The way you define precision is slightly off from the canonical way you'd use it in an information retrieval setting with multiple documents - setting it to 0 when only 1 document is irrelevant makes it confusing. What would be more clear to use in this setting would be `Precision@k = (Number of relevant items in top k results) / k` or name this check something like `all_documents_relevant` to indicate that you are basically using a custom defined metric not a canonical way to use precision.
@arfh2759
@arfh2759 3 күн бұрын
🔥
@lucamatteobarbieri2493
@lucamatteobarbieri2493 3 күн бұрын
How about inserting in models reviews some history benchmark? It would be nice to see politically brainwashed llms like deepseek-R1 fail miserably.
@benjpac5
@benjpac5 2 күн бұрын
Lol what
@lucamatteobarbieri2493
@lucamatteobarbieri2493 2 күн бұрын
@benjpac5 There is nothing to laugh about. Censorship is a real problem.
@cristianandrei5462
@cristianandrei5462 Күн бұрын
Unfortunately, you are right, models should be tested on political biases and many will fail, including this one, just ask him about Taiwan status as a country. Another exemple would be google image generator models that would only give images of black people as a result.
@testacals
@testacals 11 сағат бұрын
@@cristianandrei5462 Taiwan is not a country according to UN and USA laws. You are the one who is biased
@testacals
@testacals 11 сағат бұрын
@@cristianandrei5462 Taiwan is not a country according to UN and USA laws. You are the one who is biased
@tspis
@tspis 3 күн бұрын
Excellent content
@kevin41420
@kevin41420 3 күн бұрын
Release the notionnnnnn (please)