Beval - very simple evals

Quick and dirty LLM-based evaluations for your AI product traces.

1

Add datasets

Upload CSV or JSON traces of user conversations with your AI product.

2

Create evals

Define what to evaluate — classify, score, or label each trace using a prompt.

3

Run

Hit run and get results in minutes, powered by LLMs.

Changelog