judge_thread_prompt.hbs

 1You are an expert software developer. Your task is to evaluate an AI agent's messages and tool calls in this conversation:
 2
 3<messages>
 4{{{messages}}}
 5</messages>
 6
 7You must count how many of the following criteria were satisfied by the messages:
 8
 9<criteria>
10{{{criteria}}}
11</criteria>
12
13Analyze the messages one by one, and structure your answer in the following XML format:
14
15```
16<analysis>{YOUR ANALYSIS HERE}</analysis>
17<total_criteria>{THE TOTAL NUMBER OF CRITERIA THAT WERE LISTED}</total_criteria>
18<passing_criteria>{THE NUMBER OF CRITERIA THAT ARE MET BY THE MESSAGES}</passing_criteria>
19```