The new Llama 3.3 Instruct model in Private LLM

Views: 209

Private LLM

Days ago

Comments: 10
@christiancrow 1 month ago
❤❤❤❤❤ thank you
@PrivateLLM 17 days ago
You are so welcome
@christiancrow 1 month ago
M4 Mac mini? Or Pro?
@PrivateLLM 21 days ago
Either one should be fine, as long as you have 48GB of RAM. This demo was run on an M2 Max Mac Studio, but an M4 Mac Mini should work as well.
@christiancrow 21 days ago
@PrivateLLM 2 grand for a 48 GB system. I would be interested in the base model; I wonder if it could run faster on the newest Llama.
@PrivateLLM 21 days ago
@christiancrow A base model with 16 GB of RAM can easily run Gemma 2 9B, Llama 3.1 8B, and Qwen 2.5 14B. Check out the model list on our website for all models along with their RAM requirements: privatellm.app/en#models
@Lp-ze1tg 1 month ago
Is there a tutorial for Private LLM?
@PrivateLLM 21 days ago
You can check this out: privatellm.app/blog/run-local-gpt-on-ios-complete-guide The article is slightly dated, and we need to revise it with new model recommendations. We will do it soon.
@tak4272 1 month ago
Does Private LLM have an OpenAI-compatible API? If it doesn't, then being somewhat faster at inference than Ollama won't be a significant advantage. Many software applications are compatible with OpenAI's API, so using Ollama offers various benefits. I think without an API, it would just be a chatbot.
@PrivateLLM 21 days ago
@tak4272 We're working on adding an HTTP API. We've always supported extension through macOS Shortcuts, which llama.cpp wrappers lack. Also, Ollama has additional features that we'll never be able to match: slow inference and low-quality RTN-quantized models.