aboutsummaryrefslogtreecommitdiffstats
path: root/dashboard/app/ai_test.go
Commit message (Collapse)AuthorAgeFilesLines
* pkg/aflow: make LLM model per-agent rather than per-flowDmitry Vyukov2026-01-201-15/+15
| | | | | | Having LLM model per-agent is even more flexible than per-flow. We can have some more complex tasks during patch generation with the most elaborate model, but also some simpler ones with less elaborate models.
* pkg/aflow: allow to specify model per-flowDmitry Vyukov2026-01-141-21/+15
| | | | | We may want to use a weaker model for some workflows. Allow to use different models for different workflows.
* dashboard/app: add race harmfullness labelDmitry Vyukov2026-01-131-0/+57
| | | | | Add race:harmful/benign label. Set it automatically by confirmed AI jobs.
* dashboard/app: add spanner Jobs.Correct fieldDmitry Vyukov2026-01-131-2/+2
| | | | | | Lay down foundation for spanner DB migrations by adding Jobs.Correct field. This will allow us to test deployment of such changes. The field will be used to record results of manual assessment of AI job results.
* pkg/aflow/flow/assessment: add UAF moderation workflowDmitry Vyukov2026-01-121-0/+1
| | | | | | Add workflow that can be used for moderation of UAF bugs (consistent/actionable reports), such UAF bugs can be upstreammed automatically, even if they happened only once and don't have a reproducer.
* dashboard/app: add support for AI workflowsDmitry Vyukov2026-01-051-0/+173
| | | | | | | | Support for: - polling for AI jobs - handling completion of AI jobs - submitting job trajectory logs - basic visualization for AI jobs
* dashboard/app: make it possible to test code that uses spannerDmitry Vyukov2025-12-311-0/+26
Start spanner emulator for tests. Create isolated per-test instance+database. Test that DDL migration scripts are work.