WizardLM 2 - First Open Model Outperforming GPT-4

Рет қаралды 16,476

Prompt Engineering

Күн бұрын

Пікірлер: 64

@borisrusev9474 5 ай бұрын

Command R+ is the first open model to outperform GPT-4-0314 according to the LMSYS Chatbot Arena Leaderboard.

@engineerprompt 5 ай бұрын

Agree, everyone is using different benchmarks, ones that suites the model creators :)

@SoonerStoneAI 5 ай бұрын

We aren’t running out of human generated data. We are just running out of the easily internet accessible data.

@DynamicUnreal 5 ай бұрын

True. Humans generate insane levels of data everyday outside of the internet. Companies have to look for ingenious ways to try and capture some of that data.

@stratos7755 5 ай бұрын

Remove censorship and it will be good.

@ShimoriUta77 4 ай бұрын

@unclecode 5 ай бұрын

Isn't it crazy that u uploaded the video 13hrs ago, and about 5hrs later, Llama3 came out with an impressive claimed benchmark and a 400B version in training? Just 9 days ago Mixtral8x22B, then 3 days ago with WizardLM, and now Llama3! I think the table is changing; now open-source models are pushing proprietary models to improve themselves. Tbh, I think the only thing left for OpenAI to impress the market is to drop AGI :D:D

@engineerprompt 5 ай бұрын

I agree, the pace is just crazy. Hard to keep up. Its OpenAI's turn now :D

@engineerprompt 5 ай бұрын

btw any plans adding function calling to llama3, that would be great.

@unclecode 5 ай бұрын

@@engineerprompt haha u read my mind 😎 working on it since morning, stay tuned , update you soon

@engineerprompt 5 ай бұрын

@@unclecode Awesome, will be waiting for it.

@cucciolo182 5 ай бұрын

The thing is how to pack all those tools so we can sell customs gpts and merge them into websites 😂

@ziad_jkhan 5 ай бұрын

Why not use open Ollama instead of closed LM Studio?

@kylequinn1963 5 ай бұрын

Because LM Studio has a wicked user interface and Ollama barely functions on windows, that's my reason anyway.

@engineerprompt 5 ай бұрын

I tested it on ollama but the model is generating gibberish. Still figuring out what is the issue there.

@ziad_jkhan 5 ай бұрын

@@kylequinn1963 Well, it might also be wicked in the real sense. How can we know without access to the source?

@ziad_jkhan 5 ай бұрын

@@engineerprompt May be report the issue on Github or DIscord. That's why it is open-source after all.

@ziad_jkhan 5 ай бұрын

@@engineerprompt The Github repository accepts bug issues

@SeeFoodDie 5 ай бұрын

Wow that Llama3 is here we can ignore all these models for a few days. Until the next best thing is released! The pace is breathtaking.

@engineerprompt 5 ай бұрын

I agree, I wonder if people are actually using every new model or just sticking to their old stack.

@kc-jm3cd 5 ай бұрын

Once I start downloading these I will run everything of quality that comes out looking for mostly storytelling abilities and some general knowledge ai

@MonkeySimius 5 ай бұрын

As far as the trick question about whether Sally is John's sister and it figuring out its mistake once you pointed it out: You should do another test where you do specify that Sally is John's sister and then gaslight it saying the initial prompt didn't say that. I'm curious how it would respond.

@engineerprompt 5 ай бұрын

Interesting, will try that for sure with this and llama3.

@NavneetRingania_from_Guwahati 5 ай бұрын

Would the price of this hosted be lower than gpt4

@engineerprompt 5 ай бұрын

Self hosting will be cheaper in the long run but in short term it will be more expensive.

@Gatrehs 5 ай бұрын

@@engineerprompt What kinda hardware are you running this on? Edit: Nevermind I saw it further down.

@efifragin7455 5 ай бұрын

the current model is not 1106... there is april updated chat4 turbo version

@engineerprompt 5 ай бұрын

Oh, yes, you are right

@tawansunflower 5 ай бұрын

Thank you for the informative video! By the way, how did you record this video the zooms and the cursor look super smooth!

@engineerprompt 5 ай бұрын

thanks, I use. screen.studio/

@ПавелКуликов-м9м 5 ай бұрын

Unfortunately, I missed it and then couldn't find the part in the video that said which version was being tested. Maybe someone understands - the author managed to download a version that the manufacturer later removed, or will he get access to a new, improved version?

@pedrogorilla483 5 ай бұрын

He didn’t explain it well. What happened was the weights for 7B and 8x22B were uploaded and then deleted. However the license used was Apache 2.0 which allows for copying and reuploading. So people who managed to download the weights before they deleted reuploaded the weights fully legally. Just search on hugging face. Only the 70B is missing which they never uploaded.

@LibertyRecordsFree 5 ай бұрын

MaziyarPanahi/WizardLM-2-8x22B-GGUF WizardLM-2-8x22B.IQ3_XS-00003-of-00005.gguf

@ПавелКуликов-м9м 5 ай бұрын

Thank you for the clarification! We still managed to download and post it! :)

@legendarystuff6971 5 ай бұрын

First.. you know.. I miss 2015 😢

@PazLeBon 5 ай бұрын

i miss 2005 :/

@Nihilvs 5 ай бұрын

@@PazLeBon I mis 500 BC

@coreyhughes1456 5 ай бұрын

What are the VRAM requirements to run these models?

@kylequinn1963 5 ай бұрын

Massive. I'm running the Q3 variant on my machine with a 4090 and 128gb of ram and the model itself is around 65gb, referring to the 8x22b model specifically.

@engineerprompt 5 ай бұрын

I am running this on M2 Max 96GB RAM. Can run the Q3 only.

@williamcoleman7869 5 ай бұрын

I am running the Q8 model on a desktop with a 3060 12gb. It takes about 4 seconds to start writing. That's fine with me.

@BlackMita 5 ай бұрын

Zenzorzhip bad

@alexsov 5 ай бұрын

Why not just finetune on benchmark questions?)

@ilianos 5 ай бұрын

I'm just genuinely curious: are you being sarcastic? :) We would need new benchmark questions then. But in my opinion, we need new benchmarks (reguarly) anyways, to prevent false advertising of new models.

@pabloe1802 5 ай бұрын

Its possible to run it using 2 GPU? any tutorial with langchain

@engineerprompt 5 ай бұрын

Yup depending on the vRAM you have in each gpu. you will need about 48GB

@lancemarchetti8673 5 ай бұрын

*Does anyone know where I can test the Mistral 8x22b online, as I don't have a system that supports local models?*

@engineerprompt 5 ай бұрын

checkout labs.perplexity.ai/ its the base version not the instruct version

@RickySupriyadi 5 ай бұрын

i got these naughty poem inside my notes it was converted notes from my teens, somehow i got them into one of my daily notes no wonder i never ever find those poems. using ollama + obsidian copilot with dolphin model i got that old notes back and then i was calling all my buddies from the 90's then we all having great time they even remember those silly naughty poems.... ah the beauty of uncensored LLM. without censorship all kind information can be used in all kind different ways whenever it's for good nor for bad. censorship in my country already be misused in all kinds different creative corrupt way to get monopoly for the profit of few ~they censor yet they access ~they censor yet they gain strategics ~they censor in favor of their ideology ~they censor in favor their politics (this is fact) uncensored = good will gain, bad also gain. let's us human thrive in information and tech.