Explorer

Threads Explorer is a single self-contained HTML file that gives an overview of evaluation runs, while allowing for some interactivity.

When you open a file, it gives you a thread overview, which looks like this:

Turn	Text	Tool	Result
1	[User]:
	Fix the bug: kwargs not passed...
2	I'll help you fix that bug.	list_directory(path="fastmcp")	`fastmcp/src [...]`

3	Let's examine the code.	read_file(path="fastmcp/main.py", [...])	`def run_application(app, \\kwargs): [...]`
4	I found the issue.	edit_file(path="fastmcp/core.py", [...])	`Made edit to fastmcp/core.py`
5	Let's check if there are any errors.	diagnostics()	`No errors found`

Implementation details

src/explorer.html contains the template. You can open this template in a browser as is, and it will show some dummy values. But the main use is to set the threadsData variable with real data, which then will be used instead of the dummy values.

src/explorer.rs takes one or more JSON files as generated by cargo run -p eval, and outputs an HTML file for rendering these threads. Refer dummy data in explorer.html for a sample format.

explorer.md

Explorer

Implementation details