Anthropic launches Claude 3.5 Sonnet, beating GPT-4o
This isn't the biggest model in Anthropic's lab, but it beats GPT-4o and Gemini 1.5 Pro, at least on some benchmarks. Claude 3.5 Sonnet is a mid-tier model and runs twice as fast as Claude 3 Opus, the largest Claude 3 model.
Anthropic has kept the API price unchanged for Claude 3.5 Sonnet, which has a context window of 200K tokens. General users can try it for free on claude.ai, where it supports uploading both images and documents. Keep in mind that free users are subject to rate limits.
Claude 3.5 Sonnet beats GPT-4o on most benchmarks except MMLU and MATH, though the differences are small. On HumanEval, a coding test, Claude 3.5 Sonnet scored 92% while GPT-4o scored 90.2%. On GPQA Diamond, which evaluates graduate-level reasoning, the new Sonnet model scored 59.4% while GPT-4o scored 53.6%.
On MMLU, Claude 3.5 Sonnet scored 88.3% and OpenAI's GPT-4o scored 88.7%. Taken together, these results suggest that Anthropic has developed a highly capable model that outperforms both GPT-4o and Gemini 1.5 Pro on most of the published benchmarks.
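To make the margins easier to compare, here is a minimal Python sketch that works out the gap between the two models on the three benchmarks quoted above. It uses only the scores cited in this article; the `SCORES` dictionary and the `margins` helper are illustrative names, not part of any official tooling.

```python
# Scores (in percent) exactly as quoted in this article.
# No other benchmark figures are assumed.
SCORES = {
    "HumanEval": {"Claude 3.5 Sonnet": 92.0, "GPT-4o": 90.2},
    "GPQA Diamond": {"Claude 3.5 Sonnet": 59.4, "GPT-4o": 53.6},
    "MMLU": {"Claude 3.5 Sonnet": 88.3, "GPT-4o": 88.7},
}


def margins(scores):
    """Return Claude 3.5 Sonnet's lead over GPT-4o per benchmark.

    Values are in percentage points, rounded to one decimal place;
    a negative value means GPT-4o is ahead.
    """
    return {
        name: round(s["Claude 3.5 Sonnet"] - s["GPT-4o"], 1)
        for name, s in scores.items()
    }


if __name__ == "__main__":
    for name, diff in margins(SCORES).items():
        leader = "Claude 3.5 Sonnet" if diff > 0 else "GPT-4o"
        print(f"{name}: {leader} leads by {abs(diff):.1f} points")
```

On these three tests, the largest gap is on GPQA Diamond, while the MMLU result is close to a tie.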
Claude 3.5 Sonnet is also a strong vision model and again outperforms GPT-4o on various visual reasoning tests. It is very good at reading and transcribing text from images that are difficult to read. It also excels at interpreting charts, graphs, and illustrations.
Anthropic also announced a new Artifacts feature for Claude, which works much like OpenAI's Code Interpreter tool. Artifacts shows the code and content that Claude generates in a separate panel next to the conversation. It is not limited to Python and can work with other programming languages as well.
Anthropic says Claude 3.5 Haiku and Claude 3.5 Opus will be available later this year. Overall, I was impressed with the speed and intelligence of Claude 3.5 Sonnet. It looks like users can finally replace GPT-4o with Anthropic's new model for their daily work.