Deploy DeepSeek-R1 Models on Kubernetes with Amazon EKS

Views: 2,561

Containers from the Couch

Days ago

Comments: 11
@CarlosSantanaDev 7 days ago
Great demo folks!
@Alex-p1e7t 7 days ago
I tried this many times following your blog, but the nodes do not come up at all. Because there are no nodes, the pods are always pending. I waited for more than an hour. The deployment and everything else is there, but no nodes, although the node pools are created.
@dm5665 6 days ago
How much do I need to spend for 1 day of using EKS Auto Mode to set up DeepSeek? Can you please add cost information to the repo?
@dm5665 6 days ago
ChatGPT failed to perform the string length calculation below.
User: Please tell me the string length: test10-ncp00195-svc-
Assistant: The length of the string "test10-ncp00195-svc-" is 21 characters.
User: did u include the hyphen
Assistant: Yes, I included the hyphens in the character count. The string "test10-ncp00195-svc-" has 21 characters, including all letters, numbers, and hyphens.
User: i can count as 19
Assistant: You're right! I apologize for the mistake. The string "test10-ncp00195-svc-" has 19 characters when counting letters, numbers, and hyphens correctly. Thanks for pointing that out!
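For what it's worth, the count in the exchange above can be checked directly instead of by eye. Here is a minimal Python sketch; the helper name `describe_length` is made up for illustration. It suggests that neither 21 nor 19 is right: the string has 17 letters and digits plus 3 hyphens, so 20 characters in total.

```python
def describe_length(text: str) -> dict:
    """Break a string's length down into hyphens and alphanumeric characters."""
    return {
        "total": len(text),
        "hyphens": text.count("-"),
        "alphanumeric": sum(ch.isalnum() for ch in text),
    }


sample = "test10-ncp00195-svc-"
result = describe_length(sample)
print(result)  # {'total': 20, 'hyphens': 3, 'alphanumeric': 17}
```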
@vgrin545 7 days ago
What about container rightsizing recommendations?
@ContainersfromtheCouch 7 days ago
We've included some starter recommendations; check the manifests folder in the GitHub repo linked in the description. Auto Mode (Karpenter) handles the sizing automatically based on these manifests.
@vgrin545 7 days ago
@ContainersfromtheCouch Karpenter provides node sizing.
@devilopstalks 7 days ago
@vgrin545 You are right: for the node to be right-sized, the workload's resources have to be profiled correctly. Tools like Kubecost or StormForge provide AI capabilities that do that right-sizing for you. Other than that, it is up to you to do load testing and benchmarking and tweak the parameters to see how much CPU, memory, and GPU cores you want to define. There is no magic solution other than companies that provide it as SaaS, like the ones I mentioned above ☝️
@ContainersfromtheCouch 7 days ago
@vgrin545 You're right. Container sizing settings such as CPU and memory limits are also in that manifest. We don't have automatic vertical right-sizing in the sample.
@vgrin545 7 days ago
@ContainersfromtheCouch You do not need VPA. This model is supposed to provide container rightsizing based on the provided CPU/memory requests and limits. We need to avoid overprovisioning.
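To make the requests-versus-overprovisioning point in this thread concrete, here is a rough Python sketch of the manual approach described above: measure peak usage under load, add a safety margin, and compare the result with what the manifest currently requests. Everything in it is an assumption for illustration: the `Resources` type, the `suggest_requests` helper, the 20% headroom, and all the numbers are hypothetical and are not taken from the sample repo or from any tool mentioned here.

```python
from dataclasses import dataclass


@dataclass
class Resources:
    cpu_millicores: int
    memory_mib: int


def suggest_requests(observed_peak: Resources, headroom_percent: int = 20) -> Resources:
    """Return observed peak usage plus a safety margin, rounded up.

    Integer arithmetic is used so the result does not depend on float rounding.
    """
    factor = 100 + headroom_percent

    def scale(value: int) -> int:
        # Ceiling division: -(-a // b) rounds up for positive integers.
        return -(-value * factor // 100)

    return Resources(scale(observed_peak.cpu_millicores), scale(observed_peak.memory_mib))


# Hypothetical figures: what a manifest requests vs. what load testing observed.
current_requests = Resources(cpu_millicores=4000, memory_mib=16384)
observed_peak = Resources(cpu_millicores=1500, memory_mib=6000)

suggested = suggest_requests(observed_peak)
overprovisioned_cpu = current_requests.cpu_millicores - suggested.cpu_millicores
overprovisioned_memory = current_requests.memory_mib - suggested.memory_mib

print(suggested)
print(f"CPU overprovisioned by {overprovisioned_cpu}m, memory by {overprovisioned_memory}Mi")
```

Because the node autoscaler picks nodes based on the requests in the manifests, any gap like this in the requests carries straight through to larger or more numerous nodes.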