Is Claude 3 The ChatGPT Killer?

By Shelly Palmer of ShellyPalmer.com

Tuesday, March 5, 2024 1:39 PM EST

Anthropic claims that Claude 3, the company’s most recent AI release, has achieved “near-human” capabilities in various cognitive tasks. It’s a bold claim. Let’s put it in perspective.

Anthropic’s claims for Claude 3 center on its performance across a range of cognitive tasks, including reasoning, expert knowledge, mathematics, and language fluency. The company suggests that the Opus model in particular exhibits near-human levels of comprehension and fluency on complex tasks. It supports this claim with results showing Claude 3 Opus outperforming OpenAI’s GPT-4 (the underlying model that powers ChatGPT) on 10 AI benchmarks, including MMLU (undergraduate-level knowledge), GSM8K (grade school math), HumanEval (coding), and HellaSwag (common knowledge).

Despite these results, it’s important to note that “near-human” performance on specific benchmarks does not mean Claude 3 possesses general intelligence akin to human cognition. The AI research community often uses terms like “know” or “reason” to describe large language models’ capabilities, but use of these words does not imply that these models have consciousness or understanding in the human sense.

This new iteration of the Claude AI model series includes three versions: Claude 3 Haiku, Claude 3 Sonnet, and Claude 3 Opus, each offering different levels of complexity and performance. The most powerful among them, Claude 3 Opus, is available through a subscription service, while Sonnet powers the Claude.ai chatbot accessible for free with an email sign-in.

Claude 3’s advancements are not limited to cognitive tasks. The models demonstrate improved performance in areas like coding, understanding non-English languages, and adhering to brand voice guidelines. They also feature advanced vision capabilities, enabling them to process a wide range of visual formats, including photos, charts, graphs, and technical diagrams. This makes Claude 3 models particularly useful for applications that involve PDFs, flowcharts, or presentation slides.

Anthropic says that it trained Claude 3 on both nonpublic internal and public-facing data, using hardware from Amazon Web Services (AWS) and Google Cloud. The company also claims the models are more accurate and less likely to hallucinate.

That said, you should keep Anthropic’s claims about Claude 3’s “near-human” capabilities in perspective. Outperforming its competitors on AI benchmarks does not equate to human-like consciousness or understanding. When artificial general intelligence (AGI) is achieved, you won’t need to read my daily newsletter to get the news.

Disclosure: This is not a sponsored post. I am the author of this article and it expresses my own opinions. I am not, nor is my company, receiving compensation for it.
