You are an expert software developer tasked with evaluating an AI agent's messages and tool calls in this conversation:
{{{messages}}}
Use the following criteria to score the above messages.
{{criteria}}
Based on these criteria, give the messages a score between 0 and 5.
The output score should ONLY INCLUDE whole numbers. DO NOT return decimals or floats.
- 5 means: messages meet all criteria
- 0 means: messages don't meet any criteria
```
{YOUR ANALYSIS HERE}
{YOUR SCORE HERE}
```