syz - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	syz-agent: systematically show verbose error messages	Dmitry Vyukov	7 days	3	-5/+4
\| \| \| \| \| \| \| \| \| \| \| \| \|	Currently we added custom code to kernel build action, and few others to expose verbose errors from executed binaries (notably make). But lots of other binary executions missing this logic, e.g. for git failure we currently see unuseful: failed to run ["git" "fetch" "--force" "--tags" exit status 128 Instead of adding more and more custom code to do the same, remove the custom code and always add verbose output in syz-agent and tools/syz-aflow.
*	pkg/aflow/tool/grepper: disable tests on non-linux	Dmitry Vyukov	7 days	1	-0/+3
\| \| \| \| \| \| \| \| \|	The tests fail on OpenBSD with: expected: "bad expression: fatal: command line, 'bad expression (': Unmatched ( or \\(" actual : "bad expression: fatal: command line, 'bad expression (': parentheses not balanced" Disable the tests on non-linux for now.
*	pkg/aflow: return syzkaller program as output	Alexander Potapenko	7 days	1	-3/+1
\| \| \| \| \|	Requesting to return the program as one of the agent's outputs enforces its structure and prevents LLM from using garbage formatting.
*	pkg/aflow: ensure we don't register MCP tools with duplicate names	Dmitry Vyukov	7 days	5	-4/+40
\| \| \| \| \| \| \|	If we have duplicate names, then only one of the duplicates will be used at random. Add a check that we don't have duplicate names. Currently it's only "crash-reproducer" (both action and a tool). Also ignore "set-results" tool, and all tools created in tests.
*	pkg/aflow: handle more genai errors	Dmitry Vyukov	7 days	2	-6/+41
\| \| \| \|	Fixes #6897
*	pkg/aflow: add Flow.Consts instead of Provide	Dmitry Vyukov	8 days	12	-64/+133
\| \| \| \| \| \| \| \| \|	There is no point in using Provide more than once, and anywhere besides the first action of a flow. So it's not really an action, but more of a flow property. Add Flow.Consts field to handle this case better. Also provide slightly less verbose syntax by using a map instead of a struct, and add tests.
*	pkg/aflow/flow/repro: give agent relevant docs	Dmitry Vyukov	8 days	1	-2/+17
\| \| \| \| \| \| \|	LLM seems to have some knowledge about syzkaller program syntax, but presumably it's still useful to give it all details about syntax. Update #6878
*	pkg/aflow/flow/repro: give agent codesearch tools	Dmitry Vyukov	8 days	1	-5/+12
\| \| \| \| \| \| \| \| \| \|	It's useful to be able to look at the kernel source code when creating a reproducer for a bug. So give the agent codesearch tools. Also slightly refine prompt wording. Update #6878
*	pkg/aflow: instructions for implementing tools in GEMINI.md	Alexander Potapenko	8 days	1	-0/+5
\| \| \| \| \| \| \|	Provide some instructions on how tools should be named, implemented and registered. Update #6878
*	pkg/aflow/flow/repro: add `read-description` to the flow	Alexander Potapenko	8 days	1	-1/+4
\| \| \| \| \| \|	Teach the repro flow about the `read-description` tool. Update #6878
*	pkg/aflow/tool/syzlang: add the `read-description` tool	Alexander Potapenko	8 days	2	-0/+34
\| \| \| \| \| \| \| \| \| \| \|	Adds a tool that allows an agent to read the content of syzlang description files (e.g., `sys.txt`, `socket.txt`). Providing the ability to fetch exact system call definitions helps reasoning models generate correct and compiling programs from crash reports. Update #6878
*	pkg/aflow: add Reproduce tool	Taras Madan	8 days	3	-0/+88
\|
*	pkg/aflow/action/crash: collect test coverage	Dmitry Vyukov	8 days	2	-35/+86
\| \| \| \| \| \| \|	Collect code coverage for test programs. This is likley to be needed for #6878 and seed generation workflow. For now it's not wired into any workflow/tool and is not tested. But this should provide most of the plumbing to wire it up.
*	pkg/aflow: add GEMINI.md	Taras Madan	11 days	1	-0/+64
\|
*	pkg/aflow: add Tools function	Dmitry Vyukov	12 days	4	-5/+22
\| \| \| \| \| \|	When we combine tool sets for agents, there is always a protential problem with aliasing existing slices and introducing subtle bugs. Add Tools function that can append tool/tool sets w/o aliasing problem.
*	pkg/aflow/flow/repro: provide proper syzkaller commit	Dmitry Vyukov	12 days	1	-5/+10
\| \| \| \|	Update #6878
*	pkg/aflow/tool/syzlang: provide list of description files	Dmitry Vyukov	12 days	4	-0/+49
\| \| \| \|	Update #6878
*	pkg/aflow: repro workflow skeleton	Taras Madan	12 days	3	-0/+65
\|
*	pkg/aflow: delete SyzkallerCommit	Taras Madan	14 days	3	-30/+26
\| \| \| \|	It is not used.
*	pkg/aflow: add the repro workflow const	Aleksandr Nogikh	2026-02-26	1	-0/+1
\| \| \| \| \|	There's no workflow implementation, but having the const there will let us implement the dashboard side in parallel.
*	dashboard/app: apply actionable label after AI moderation	Dmitry Vyukov	2026-02-24	2	-7/+7
\| \| \| \| \| \|	This allows auto-upstreamming of actionable bugs. Fixes #6779
*	pkg/aflow: fix handling of optional tool arguments	Dmitry Vyukov	2026-02-19	4	-1/+227
\| \| \| \| \| \| \|	Currently we crash on nil deref, if LLM specifies explicit 'nil' for an optional (pointer) argument. Handle such cases properly. Fixes #6811
*	pkg/aflow/tool/codesearcher: add end-to-end tests	Dmitry Vyukov	2026-02-19	2	-16/+69
\| \| \| \|	Update #6811
*	syz-agent: add MCP server	Dmitry Vyukov	2026-02-18	3	-3/+79
\| \| \| \| \| \|	The MCP server exports all aflow tools (and actions as tools) we have. Fixes #6763
*	pkg/aflow: export Context.Close	Dmitry Vyukov	2026-02-18	3	-5/+5
\| \| \| \|	This will be needed to an MCP server.
*	pkg/aflow: factor out sliding window logic	Dmitry Vyukov	2026-02-10	1	-43/+44
\| \| \| \| \|	Linter started complaining about too high cyclomatic complexity. Split the chat function.
*	pkg/aflow: make it possible for LLMAgent to return only structured outputs	Dmitry Vyukov	2026-02-10	4	-2/+183
\| \| \| \| \|	In some cases there may be not final text reply, only some structured outputs (e.g. some bool). Don't require final reply, if structured outputs are specified.
*	pkg/aflow: fix structured outputs handling	Dmitry Vyukov	2026-02-10	4	-1/+351
\| \| \| \| \| \|	If LLM calls set-results tool to set structured results, and then calls another unrelated tool, currently we lose structured results (overwrite with nil). Don't do that, keep structured results.
*	pkg/aflow: simplify TestSummaryWindow test	Dmitry Vyukov	2026-02-10	1	-19/+14
\| \| \| \|	We don't a separate var for agent, nor the Pipeline for 1 agent.
*	pkg/aflow/tool/grepper: fix grep invocation	Dmitry Vyukov	2026-02-06	2	-8/+21
\| \| \| \| \|	If LLM searches for "->", grep considered it as a flag and failed. Add "--" before the expression to fix such cases.
*	syz-agent: wipe codesearch binary	Dmitry Vyukov	2026-02-06	4	-22/+18
\| \| \| \|	Now it's compiled into the syz-agent binary itself.
*	tools/clang: compile clang tools into the binary	Dmitry Vyukov	2026-02-06	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Compiled clang tools into Go binaries using cgo. This significantly simplifies building and deployment. This also enables unit testing of clang tools. Now raw go test for clang tools will build them, run, and verify output. Each clang tool is still started as a subprocess. I've experimented with running them in-process, but this makes stdout/stderr interception extremly complicated, and it seems that clang tools still use unsynchronized global state, which breaks when invoked multiple times. Subprocesses also make it safer in the face of potential memory leaks, or memory corruptions in clang tools. Fixes #6645
*	pkg/aflow/action/crash: handle boot errors better	Dmitry Vyukov	2026-02-05	2	-7/+34
\| \| \| \|	Provide better errors messages on boot errors.
*	pkg/aflow/flow/assessment: mention adjacent races in the KCSAN prompt	Dmitry Vyukov	2026-02-03	1	-0/+14
\| \| \| \|	Update #6578
*	pkg/aflow/tool/grepper: add the tool	Dmitry Vyukov	2026-02-02	7	-13/+192
\| \| \| \| \| \|	Add a tool that executes git grep with the given expression. It can handle long tail of cases that codesearcher can't handle, while still providing less output than reading whole files.
*	pkg/aflow/action/crash: clang-format the patch diff	Dmitry Vyukov	2026-02-02	1	-1/+37
\| \| \| \|	Fixes #6671
*	pkg/aflow/flow/patching: fix getting list of recent commits	Dmitry Vyukov	2026-02-02	3	-11/+21
\| \| \| \| \|	We need to run git log in the master git repo b/c out KernelSrc/KernelScratchSrc are shallow checkouts that don't have history.
*	pkg/aflow: abstract away LLM temperature	Dmitry Vyukov	2026-02-02	15	-44/+59
\| \| \| \| \| \| \| \| \| \|	Introduce abstract "task type" for LLM agents instead of specifying temperature explicitly for each agent. This has 2 advantages: - we don't hardcode it everywhere, and can change centrally as our understanding of the right temperature evolves - we can control other LLM parameters (topn/topk) using task type as well Update #6576
*	pkg/aflow/flow/patching: use recent commit subjects	Dmitry Vyukov	2026-01-31	8	-11/+163
\| \| \| \| \| \| \|	Give LLM the recent commit subjects when it generates description, so that it can use the same style. Add infrastrcuture to write end-to-end action tests to test it.
*	syz-agent: don't send poll requests w/o workflows	Dmitry Vyukov	2026-01-30	1	-0/+3
\| \| \| \|	This will cause dashboard to log errors.
*	pkg/aflow: add a TODO	Dmitry Vyukov	2026-01-30	1	-0/+4
\|
*	pkg/aflow: fix role in test replies	Dmitry Vyukov	2026-01-30	2	-10/+10
\|
*	pkg/aflow: refactor the LLM summarization test	Dmitry Vyukov	2026-01-30	4	-72/+213
\| \| \| \| \| \| \| \| \|	It's very inconvinient to hardcode exact LLM replies in this test, because it's hard to understand when exactly it will be asked to summarize. It's easy to make a bug in the test, and provide summary reply when it wasn't asked to. Instead support proving full generateContent callback, and just model what an LLM would do -- provide summary only when it's asked to.
*	pkg/aflow: reduce size of golden test files	Dmitry Vyukov	2026-01-30	6	-1263/+12
\| \| \| \|	Don't memorize repeated request configs.
*	pkg/aflow/flow/patching: improve prompts	Dmitry Vyukov	2026-01-30	2	-5/+36
\| \| \| \| \|	More instructions slightly more concrete, and add details about some bug types.
*	pkg/aflow/flow/patching: move Outputs type to ai packages	Dmitry Vyukov	2026-01-30	3	-24/+31
\| \| \| \| \| \|	Move it so that it can be accessed by the dashboard as well. Add kernel branch to output (it's needed for gerrit), provide actual kernel commit hash instead of tag name.
*	pkg/aflow: adding sliding window summary feature	Yulong Zhang	2026-01-30	5	-47/+919
\| \| \| \| \| \| \| \| \|	This adds a flow feature (and creates a new flow using it) called "sliding window summary". It works by asking the AI to always summarize the latest knowledge, and then we toss the old messages if they fall outside the context sliding window.
*	pkg/aflow/action/crash: cache patch testing result	Dmitry Vyukov	2026-01-29	1	-25/+42
\| \| \| \| \| \| \| \| \|	This caching is very handy when testing some dashboard features related to stating jobs, or handling jobs completion, or testing changes in the last steps of patching workflow. Without caching each testing takes 10 mins, with caching the whole workflow completes almost immidiatly .
*	pkg/aflow/flow/patching: find maintainers for patches	Dmitry Vyukov	2026-01-29	2	-0/+49
\| \| \| \|	Provide base kernel repo/commit and recipients (to/cc) for patches.
*	pkg/aflow: add timeout for LLM queries	Dmitry Vyukov	2026-01-28	1	-1/+10
\| \| \| \| \|	Sometimes LLM requests just hang dead for tens of minutes, abort them after 10 minutes and retry.