Comparing LLMs with LangChain

  Рет қаралды 17,114

Sam Witteveen

Sam Witteveen

Күн бұрын

Пікірлер: 30
@FreestyleTraceur
@FreestyleTraceur Жыл бұрын
Really glad I stumbled upon your videos. Super high quality, informative, and helpful. I'm sure your subscriber count will be ballooning in the coming weeks and months. Great stuff!
@AykutKlc
@AykutKlc Жыл бұрын
Alpaca 30B - 4bit quantized: > What is the opposite of up? Down. > Answer the following question by reasoning step by step. The cafeteria had 23 apples. If they used 20 for lunch, and bought 6 more, how many apple do they have? They now have 18 apples (the original 23 minus the 5 that were eaten at lunch). > Can Geoffrey Hinton have a conversation with George Washington? Give the rationale before answering. No, because they are not in the same time period and thus cannot communicate directly. However, if we had access to some sort of portal or device that could allow them both to be present at once then it would theoretically be possible for Geoffrey Hinton to have a conversation with George Washington. The others were'nt good enough.
@samwitteveenai
@samwitteveenai Жыл бұрын
I haven't tried the 4bit version but the 30B model should have better results than that if finetuned properly.
@micbab-vg2mu
@micbab-vg2mu Жыл бұрын
At the moment because I am a beginner I focus only on the GPT model, but maybe in the future, I will use others. Thank you for the video.
@AntonioSorrentini
@AntonioSorrentini Жыл бұрын
The most powerful, and by far the best of all these, today are Alpaca 7B and Vicuna 13B. Yes, these days innovation runs fast and frenetic in this sector.
@samwitteveenai
@samwitteveenai Жыл бұрын
Checkout Koala 13B it s like Vicuna but trained on more data.
@AntonioSorrentini
@AntonioSorrentini Жыл бұрын
@@samwitteveenai Thank you very much!
@sharifehfadaei1245
@sharifehfadaei1245 Жыл бұрын
Thank you for your wonderful video 👌how can I compare the performance of different finetuned LLMs based on my dataset?
@doctalk
@doctalk Жыл бұрын
It will be great if you compared the alpaca one here as well
@raghadabdulrab7844
@raghadabdulrab7844 10 ай бұрын
What are my options If i want to compare really large LLMs?
@CesarAngeles28
@CesarAngeles28 Жыл бұрын
Hi Sam, Thanks so much for the video. It is quite cool and funny as well. You mention a paper in there, is it a research paper? Could we get ahold of it? Thanks so much again and all the best!
@samwitteveenai
@samwitteveenai Жыл бұрын
Yeah all papers should be on Arxiv if not in the description of the video. That video is from quite a while back, so not sure which paper you are talking about, but take a look on Arxiv.
@hamzakivideo
@hamzakivideo Жыл бұрын
Great video, do you think we can do the same evaluation for qa retrieval from a doc?
@samwitteveenai
@samwitteveenai Жыл бұрын
yeah you should be able to, but it might have to be a bit more manual about how you do it.
@hamzakivideo
@hamzakivideo Жыл бұрын
@@samwitteveenai , I have been working on one, got mpnet embedding and flan as retrieval to work, would be great to collaborate with you on it!
@attilavass6935
@attilavass6935 Жыл бұрын
The creative task should have been done with much higher Temp than 0.1 - I think. I see it's not easy to setup params by task (compare), but I might try this with higher Temps later.
@samwitteveenai
@samwitteveenai Жыл бұрын
yes totally the more randomness (creativity) you want in the higher the temp.
@ChenXibo
@ChenXibo Жыл бұрын
What a funny video you made! I love it.
@samwitteveenai
@samwitteveenai Жыл бұрын
Thanks! 😃
@RedCloudServices
@RedCloudServices Жыл бұрын
Sam thank you these LLM reviews could become Yelp for AI 😆 I have a question can you post a video about computational LLM? For example to provide an actual table of data and then have a conversation or derive insights w the LLM. Langchain have a csv agent but do others?
@samwitteveenai
@samwitteveenai Жыл бұрын
The CSV idea is certainly interesting, I am planning to show fine tuning a custom model for natural language to SQL. Maybe I will try to incorporate the csv or tables analysis in there.
@RedCloudServices
@RedCloudServices Жыл бұрын
@@samwitteveenai this GPT plugin announcement could make so many computational use cases possible. There are langchain text to sql agents but csv is such a simple use case. Imagine uploading an entire budget integrated with a custom LLM corpus. Or an ML plugin with label predictions on-the-fly.
@samwitteveenai
@samwitteveenai Жыл бұрын
I do think there is a lot of discreet applications currently , but I am also very interested in neural plugins eg separate models within a plugin for ChatGPT etc
@toddnedd2138
@toddnedd2138 Жыл бұрын
Very nice information. Giving a LLM a kind of memory is a little bit challenging. Currently I´m trying to fine tune LLaMA for text-to-sql. Then it could access the DB for previous information. The DB schema is not so easy to design, maybe some (artificial) intelligence could help :D
@samwitteveenai
@samwitteveenai Жыл бұрын
Are you using a public dataset?
@toddnedd2138
@toddnedd2138 Жыл бұрын
​@@samwitteveenai Thanks for asking. No, I generate the dataset by myself. To be honest, I let GPT4 do all the heavy lifting. It is also more a spare time project and maybe a simple and naive approach. To give you an example, using your example sentence, a little bit extended: 1. Let the AI create a semantic interpretation of sentence Q: Find the subject an it´s attributes in the following sentence and classify the attributes. The sentence is "Hi, I´m Sam and 42 years old". A: "Sam" is an attribute of the subject that specifies the name of the person being referred to, and "42 years old" is another attribute that specifies the age of the person. 2. Let the AI create a class object, e.g. in C# Q: Could you give me a c# class that models the subject. A: public class Person { public string Name { get; set; } public int Age { get; set; } } 3. Let the AI create the SQL to create the table and store the concrete object CREATE TABLE Persons ( Id INT IDENTITY(1,1) PRIMARY KEY, Name NVARCHAR(50) NOT NULL, Age INT NOT NULL ); INSERT INTO Persons (Name, Age) VALUES ('Sam', 42); The training data point consists then of: {'instruction': 'Convert text to sql', 'input': 'Hi, I´m Sam and 42 years old', 'output': 'CREATE TABLE Persons ( Id INT IDENTITY(1,1) PRIMARY KEY, Name NVARCHAR(50) NOT NULL, Age INT NOT NULL ); INSERT INTO Persons (Name, Age) VALUES ('Sam', 42);' Not sure if and how it will work. There is maybe more context in the instruction and/or input necessary.
@samwitteveenai
@samwitteveenai Жыл бұрын
@@toddnedd2138 Checkout WikiSQL www.kaggle.com/datasets/shahrukhkhan/wikisql you may find this useful.
@toddnedd2138
@toddnedd2138 Жыл бұрын
@@samwitteveenai Thank you for the link, appreciate it very much. 🤩
Using Constitutional AI in LangChain
7:33
Sam Witteveen
Рет қаралды 6 М.
Transformers (how LLMs work) explained visually | DL5
27:14
3Blue1Brown
Рет қаралды 4,3 МЛН
БАБУШКА ШАРИТ #shorts
0:16
Паша Осадчий
Рет қаралды 4,1 МЛН
Почему Катар богатый? #shorts
0:45
Послезавтра
Рет қаралды 2 МЛН
ССЫЛКА НА ИГРУ В КОММЕНТАХ #shorts
0:36
Паша Осадчий
Рет қаралды 8 МЛН
Microsoft's Visual ChatGPT using LangChain
13:59
Sam Witteveen
Рет қаралды 13 М.
How to Improve LLMs with RAG (Overview + Python Code)
21:41
Shaw Talebi
Рет қаралды 91 М.
Local GraphRAG with LLaMa 3.1 - LangChain, Ollama & Neo4j
15:01
Coding Crash Courses
Рет қаралды 37 М.
PydanticAI Agents that Code
15:09
Riza, Inc.
Рет қаралды 2,8 М.
LangChain vs LangGraph: A Tale of Two Frameworks
9:55
IBM Technology
Рет қаралды 65 М.
NEW LangChain Expression Language!!
16:22
Sam Witteveen
Рет қаралды 17 М.
Improve your BabyAGI with LangChain
16:00
Sam Witteveen
Рет қаралды 21 М.
Fine-tuning Large Language Models (LLMs) | w/ Example Code
28:18
Shaw Talebi
Рет қаралды 387 М.
Using LangChain with DuckDuckGO Wikipedia & PythonREPL Tools
13:40
Sam Witteveen
Рет қаралды 25 М.
The Best RAG Technique Yet? Anthropic’s Contextual Retrieval Explained!
16:14
БАБУШКА ШАРИТ #shorts
0:16
Паша Осадчий
Рет қаралды 4,1 МЛН