Рет қаралды 7,337
In this video, we will look at the new king of LLM benchmarks, Claude-3 from Anthropics. We will do a few tests of our own and will look at why the reported results may not reflect the true performance of the Claude-3 family.
🦾 Discord: / discord
☕ Buy me a Coffee: ko-fi.com/promptengineering
|🔴 Patreon: / promptengineering
💼Consulting: calendly.com/engineerprompt/c...
📧 Business Contact: engineerprompt@gmail.com
Become Member: tinyurl.com/y5h28s6h
LINKS:
Claude-3 Announcement: www.anthropic.com/claude
Claude Chat: claude.ai/chats
Technical Report: tinyurl.com/yc5y6zwj
Claude-3 vs GPT-4: tinyurl.com/mprdy3rp
Claude-3 API Access: console.anthropic.com/
TIMESTAMPS:
[00:00] Introducing Cloud3 3: The Challenger to GPT-4
[01:41] Benchmarking Cloud3 3 Against GPT-4: The Reality
[03:35] Intended Applications and Price Analysis of Cloud 3 Models
[06:21] Hands-On Tests: Accuracy, Image Understanding, and Coding Abilities
[14:04] Revisiting Benchmarks: A Closer Look at Cloud 3 vs. GPT-4
All Interesting Videos:
Everything LangChain: • LangChain
Everything LLM: • Large Language Models
Everything Midjourney: • MidJourney Tutorials
AI Image Generation: • AI Image Generation Tu...