Let's test QwQ, the new opensource alternative to o1

  Рет қаралды 1,260

Volko Volko

Volko Volko

Күн бұрын

Пікірлер: 11
@UCs6ktlulE5BEeb3vBBOu6DQ
@UCs6ktlulE5BEeb3vBBOu6DQ 9 күн бұрын
btw QwQ can totally do multi-turn. Set it to 32k context and 16k output tokens so its thinking isn't cut before he's done. llama.cpp has much more settings.
@volkovolko
@volkovolko 9 күн бұрын
Oh okay, I didn't knew that. I thought it cannot do multi turn because it's single turn only in the QwQ Space ^^ Thanks a lot for the precision !
@UCs6ktlulE5BEeb3vBBOu6DQ
@UCs6ktlulE5BEeb3vBBOu6DQ 9 күн бұрын
Tetris game is often my coding test and they all struggle with it.
@volkovolko
@volkovolko 9 күн бұрын
Yes, tetris is quite difficult for LLMs. Only Claude 3.5 Sonnet and Qwen2.5 Coder 32B got it right on my tests. Even gpt4o didn't got it in my test (but i think it has more related to luck)
@SoM3KiK
@SoM3KiK 12 күн бұрын
hey! Would it work with a 3060ti and 32gb ram?
@hatnis
@hatnis 11 күн бұрын
I mean, you can't fit the required 24 gb of VRAM on your graphics card, but hey, only one way to find out if it works right.
@SoM3KiK
@SoM3KiK 11 күн бұрын
@@hatnis well, it was free to ask 😅
@volkovolko
@volkovolko 10 күн бұрын
Yes, but you will have to offload a lot in your CPU/RAM. It will run pretty slow but it will work 👍
@volkovolko
@volkovolko 10 күн бұрын
In the video, I ran it in my 24Go of VRAM. I think it is q4_k_m
@Timely-ud4rm
@Timely-ud4rm 10 күн бұрын
I was able to get it working on my new Mac mini base m4 pro chip model. QwQ-32B-Preview-GGUF bartowski repo. IQ3_XS quantization. the only one I could download as this one is 13.71 gb of ram. Note because I am using a Mac mini apples ram is unified so my 24gb of ram is shared between the gpu and cpu. if I spent spent a extra 300$ from the 1.4k I spent for the m4 pro model I could of loaded the max quantization model but I don't really do AI locally as I use online Ai services more. I hope this helps!
Qwen QwQ-32B Tested LOCALLY: An Open Source Model that THINKS
14:26
Ominous Industries
Рет қаралды 2,5 М.
This Video is AI Generated! SORA Review
16:41
Marques Brownlee
Рет қаралды 3,5 МЛН
How to treat Acne💉
00:31
ISSEI / いっせい
Рет қаралды 108 МЛН
coco在求救? #小丑 #天使 #shorts
00:29
好人小丑
Рет қаралды 120 МЛН
She made herself an ear of corn from his marmalade candies🌽🌽🌽
00:38
Valja & Maxim Family
Рет қаралды 18 МЛН
Qwen2.5 Coder 32B vs GPT4o vs Claude 3.5 Sonnet (new)
14:17
Volko Volko
Рет қаралды 5 М.
Google’s Quantum Chip: Did We Just Tap Into Parallel Universes?
9:34
Anthropic MCP + Ollama. No Claude Needed? Check it out!
18:06
What The Func? w/ Ed Zynda
Рет қаралды 8 М.
I Created The Best AI Tool Ever
9:12
ThePrimeTime
Рет қаралды 110 М.
OpenAI O1 Tested: Smarter, But Is It Truly Reliable?
18:26
Prompt Engineering
Рет қаралды 6 М.
ChatGPT o1 Pro - The smartest AI Model I've ever used
7:54
SullyOmar
Рет қаралды 25 М.
AI Is Not Designed for You
8:29
No Boilerplate
Рет қаралды 203 М.
I built a REAL Desktop App with both Tauri and Electron
12:22
Bufferhead
Рет қаралды 73 М.
How to treat Acne💉
00:31
ISSEI / いっせい
Рет қаралды 108 МЛН