WizardLM 2 - First Open Model Outperforming GPT-4

Рет қаралды 16,313

Күн бұрын

In this video, we test the first Open LLM that outperforms GPT-4 on MT-Bench. Open LLMs are catching up really fast.
🦾 Discord: / discord
☕ Buy me a Coffee: ko-fi.com/promptengineering
|🔴 Patreon: / promptengineering
💼Consulting: calendly.com/engineerprompt/c...
📧 Business Contact: engineerprompt@gmail.com
Become Member: tinyurl.com/y5h28s6h
💻 Pre-configured localGPT VM: bit.ly/localGPT (use Code: PromptEngineering for 50% off).
Signup for Advanced RAG:
tally.so/r/3y9bb0
LINKS:
How was it trained? / 1
How it performs? : / 1779899325868589372
Github Repo: wizardlm.github.io/
TIMESTAMPS:
[00:00] Ground breaking Open LLM
[00:58] Deep Dive into Model Training and Performance
[01:58] Testing it with LM Studio
[05:08] Exploring the Model's Reasoning and Writing Skills
All Interesting Videos:
Everything LangChain: • LangChain
Everything LLM: • Large Language Models
Everything Midjourney: • MidJourney Tutorials
AI Image Generation: • AI Image Generation Tu...

Пікірлер: 64

@SoonerStoneAI 2 ай бұрын

We aren’t running out of human generated data. We are just running out of the easily internet accessible data.

@DynamicUnreal 2 ай бұрын

True. Humans generate insane levels of data everyday outside of the internet. Companies have to look for ingenious ways to try and capture some of that data.

@borisrusev9474 2 ай бұрын

Command R+ is the first open model to outperform GPT-4-0314 according to the LMSYS Chatbot Arena Leaderboard.

@engineerprompt 2 ай бұрын

Agree, everyone is using different benchmarks, ones that suites the model creators :)

@stratos7755 2 ай бұрын

Remove censorship and it will be good.

@ShimoriUta77 2 ай бұрын

@tawansunflower 2 ай бұрын

Thank you for the informative video! By the way, how did you record this video the zooms and the cursor look super smooth!

@engineerprompt 2 ай бұрын

thanks, I use. screen.studio/

@SeeFoodDie 2 ай бұрын

Wow that Llama3 is here we can ignore all these models for a few days. Until the next best thing is released! The pace is breathtaking.

@engineerprompt 2 ай бұрын

I agree, I wonder if people are actually using every new model or just sticking to their old stack.

@kc-jm3cd 2 ай бұрын

Once I start downloading these I will run everything of quality that comes out looking for mostly storytelling abilities and some general knowledge ai

@BlackMita 2 ай бұрын

Zenzorzhip bad

@legendarystuff6971 2 ай бұрын

First.. you know.. I miss 2015 😢

@PazLeBon 2 ай бұрын

i miss 2005 :/

@Nihilvs 2 ай бұрын

@@PazLeBon I mis 500 BC

@unclecode 2 ай бұрын

Isn't it crazy that u uploaded the video 13hrs ago, and about 5hrs later, Llama3 came out with an impressive claimed benchmark and a 400B version in training? Just 9 days ago Mixtral8x22B, then 3 days ago with WizardLM, and now Llama3! I think the table is changing; now open-source models are pushing proprietary models to improve themselves. Tbh, I think the only thing left for OpenAI to impress the market is to drop AGI :D:D

@engineerprompt 2 ай бұрын

I agree, the pace is just crazy. Hard to keep up. Its OpenAI's turn now :D

@engineerprompt 2 ай бұрын

btw any plans adding function calling to llama3, that would be great.

@unclecode 2 ай бұрын

@@engineerprompt haha u read my mind 😎 working on it since morning, stay tuned , update you soon

@engineerprompt 2 ай бұрын

@@unclecode Awesome, will be waiting for it.

@cucciolo182 2 ай бұрын

The thing is how to pack all those tools so we can sell customs gpts and merge them into websites 😂

@user-qb2jn9zh9i 2 ай бұрын

Unfortunately, I missed it and then couldn't find the part in the video that said which version was being tested. Maybe someone understands - the author managed to download a version that the manufacturer later removed, or will he get access to a new, improved version?

@pedrogorilla483 2 ай бұрын

He didn’t explain it well. What happened was the weights for 7B and 8x22B were uploaded and then deleted. However the license used was Apache 2.0 which allows for copying and reuploading. So people who managed to download the weights before they deleted reuploaded the weights fully legally. Just search on hugging face. Only the 70B is missing which they never uploaded.

@LibertyRecordsFree 2 ай бұрын

MaziyarPanahi/WizardLM-2-8x22B-GGUF WizardLM-2-8x22B.IQ3_XS-00003-of-00005.gguf

@user-qb2jn9zh9i 2 ай бұрын

Thank you for the clarification! We still managed to download and post it! :)

@lancemarchetti8673 2 ай бұрын

*Does anyone know where I can test the Mistral 8x22b online, as I don't have a system that supports local models?*

@engineerprompt 2 ай бұрын

checkout labs.perplexity.ai/ its the base version not the instruct version

@ziad_jkhan 2 ай бұрын

Why not use open Ollama instead of closed LM Studio?

@kylequinn1963 2 ай бұрын

Because LM Studio has a wicked user interface and Ollama barely functions on windows, that's my reason anyway.

@engineerprompt 2 ай бұрын

I tested it on ollama but the model is generating gibberish. Still figuring out what is the issue there.

@ziad_jkhan 2 ай бұрын

@@kylequinn1963 Well, it might also be wicked in the real sense. How can we know without access to the source?

@ziad_jkhan 2 ай бұрын

@@engineerprompt May be report the issue on Github or DIscord. That's why it is open-source after all.

@ziad_jkhan 2 ай бұрын

@@engineerprompt The Github repository accepts bug issues

@ilianos 2 ай бұрын

If I don't want to/can't use this model locally: Does anyone know if it's already hosted somewhere online and available per API?

@engineerprompt 2 ай бұрын

Not this but the instruct fine-tuned version by Mistral AI is available on their platform.

@Gatrehs 2 ай бұрын

You could try checking Infermatic, not sure how their API runs though.

@coreyhughes1456 2 ай бұрын

What are the VRAM requirements to run these models?

@kylequinn1963 2 ай бұрын

Massive. I'm running the Q3 variant on my machine with a 4090 and 128gb of ram and the model itself is around 65gb, referring to the 8x22b model specifically.

@engineerprompt 2 ай бұрын

I am running this on M2 Max 96GB RAM. Can run the Q3 only.

@williamcoleman7869 2 ай бұрын

I am running the Q8 model on a desktop with a 3060 12gb. It takes about 4 seconds to start writing. That's fine with me.

@NavneetRingania_from_Guwahati 2 ай бұрын

Would the price of this hosted be lower than gpt4

@engineerprompt 2 ай бұрын

Self hosting will be cheaper in the long run but in short term it will be more expensive.

@Gatrehs 2 ай бұрын

@@engineerprompt What kinda hardware are you running this on? Edit: Nevermind I saw it further down.

@snuwan 2 ай бұрын

There is a version of it in ollama. Is it different

@engineerprompt 2 ай бұрын

I have tried the latest version of ollama (1.32) and have issues running the 4bit version. 8bit works but is too 🐌

@snuwan 2 ай бұрын

@@engineerprompt I have an NVidia 3090 with 24GB VRAM so might be able to load it. Need to try it with Ollama

@pabloe1802 2 ай бұрын

Its possible to run it using 2 GPU? any tutorial with langchain

@engineerprompt 2 ай бұрын

Yup depending on the vRAM you have in each gpu. you will need about 48GB

@MonkeySimius 2 ай бұрын

As far as the trick question about whether Sally is John's sister and it figuring out its mistake once you pointed it out: You should do another test where you do specify that Sally is John's sister and then gaslight it saying the initial prompt didn't say that. I'm curious how it would respond.

@engineerprompt 2 ай бұрын

Interesting, will try that for sure with this and llama3.

@mohammadhamidi5517 2 ай бұрын

what hardware spec does it need to run ?

@engineerprompt 2 ай бұрын

I am running this on M2 Max 96GB and takes about 50GB

@alexsov 2 ай бұрын

Why not just finetune on benchmark questions?)

@ilianos 2 ай бұрын

I'm just genuinely curious: are you being sarcastic? :) We would need new benchmark questions then. But in my opinion, we need new benchmarks (reguarly) anyways, to prevent false advertising of new models.

@RickySupriyadi 2 ай бұрын

i got these naughty poem inside my notes it was converted notes from my teens, somehow i got them into one of my daily notes no wonder i never ever find those poems. using ollama + obsidian copilot with dolphin model i got that old notes back and then i was calling all my buddies from the 90's then we all having great time they even remember those silly naughty poems.... ah the beauty of uncensored LLM. without censorship all kind information can be used in all kind different ways whenever it's for good nor for bad. censorship in my country already be misused in all kinds different creative corrupt way to get monopoly for the profit of few ~they censor yet they access ~they censor yet they gain strategics ~they censor in favor of their ideology ~they censor in favor their politics (this is fact) uncensored = good will gain, bad also gain. let's us human thrive in information and tech.