aboutsummaryrefslogtreecommitdiffstats
path: root/pkg/aflow/testdata/TestToolMisbehavior.llm.json
Commit message (Collapse)AuthorAgeFilesLines
* pkg/aflow: abstract away LLM temperatureDmitry Vyukov2026-02-021-1/+1
| | | | | | | | | | Introduce abstract "task type" for LLM agents instead of specifying temperature explicitly for each agent. This has 2 advantages: - we don't hardcode it everywhere, and can change centrally as our understanding of the right temperature evolves - we can control other LLM parameters (topn/topk) using task type as well Update #6576
* pkg/aflow: reduce size of golden test filesDmitry Vyukov2026-01-301-436/+0
| | | | Don't memorize repeated request configs.
* pkg/aflow: fix Temperature handlingDmitry Vyukov2026-01-261-5/+5
| | | | | | If LLMAgent.Temperature is assigned an untyped float const (0.5) it will be typed as float64 rather than float32. So recast them. Cap Temperature at model's supported MaxTemperature.
* pkg/aflow: handle empty LLM repliesDmitry Vyukov2026-01-231-2/+313
|
* pkg/aflow: refactor testsDmitry Vyukov2026-01-231-0/+907
Add helper function that executes test workflows, compares results (trajectory, LLM requests) against golden files, and if requested updates these golden files.