path: root/pkg/aflow/flow/assessment
Commit message | Author | Age | Files | Lines
* pkg/aflow: make LLM model per-agent rather than per-flow | Dmitry Vyukov | 2026-01-20 | 2 | -2/+2
  Having LLM model per-agent is even more flexible than per-flow. We can have some more complex tasks during patch generation with the most elaborate model, but also some simpler ones with less elaborate models.
* pkg/aflow: allow to specify model per-flow | Dmitry Vyukov | 2026-01-14 | 2 | -0/+2
  We may want to use a weaker model for some workflows. Allow to use different models for different workflows.
* dashboard/app: add race harmfullness label | Dmitry Vyukov | 2026-01-13 | 1 | -7/+1
  Add race:harmful/benign label. Set it automatically by confirmed AI jobs.
* pkg/aflow/flow/assessment: add UAF moderation workflow | Dmitry Vyukov | 2026-01-12 | 2 | -1/+115
  Add workflow that can be used for moderation of UAF bugs (consistent/actionable reports). Such UAF bugs can be upstreamed automatically, even if they happened only once and don't have a reproducer.
* pkg/aflow/flow/assessment: refine KCSAN prompt | Dmitry Vyukov | 2026-01-12 | 2 | -39/+34
  Rephrase the prompt to be only about KCSAN; currently it has some leftovers from a more generic assessment prompt that covered KASAN bugs as well (actionability).
  Also add Confident bool output. We may want to act on both benign/non-benign, so we need to know when the LLM wasn't actually sure either way. This should also be useful for manual verification/statistics. If the LLM is not confident and can admit that, it's much better than giving a wrong answer. But we will likely want to track the percent of non-confident answers.
* pkg/aflow/flow/assessment: add KCSAN bug assessment workflow | Dmitry Vyukov | 2026-01-09 | 2 | -0/+89