Commit

Update evaluating-claude.md
abi authored Mar 6, 2024
1 parent 6029a9b commit cd7cd84
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions blog/evaluating-claude.md
@@ -2,6 +2,8 @@

Claude 3 dropped yesterday, claiming to rival GPT-4 on a wide variety of tasks. I maintain a very popular open source project called “screenshot-to-code” (this one!) that uses GPT-4 vision to convert screenshots/designs into clean code. Naturally, I was excited to see how good Claude 3 was at this task.

**TLDR:** Claude 3 is on par with GPT-4 vision for screenshot to code, better in some ways but worse in others.

## Evaluation Setup

I don’t know of a public benchmark for “screenshot to code”, so I created a simple evaluation setup for the purposes of testing:

