Commit message log

Currently we stop both the executor binary and the RPC server concurrently
due to the use of errgroup.WithContext. As a result, the executor may SYZFAIL
on the closed network connection before it is killed.
This race leads to a very high percentage (25%) of failed repro attempts
in my local syz-manager runs. When we run syz-execprog with Repeat=false,
the race triggers frequently. It may have something to do with a heavily
instrumented kernel where some operations take longer (e.g. killing
syz-executor and stopping all of its threads).
This should also fix #6091
To be useful, the stat should have a different name depending on the VM
pool name.
This will let us see executor restart statistics per VM pool (relevant
for diff fuzzing).
If we set it too early, it will be filtered out as it is not within the
addresses of the .text segment.
As we figured out in #5805, syz-manager treats random incoming RPC
connections as trusted, and will crash if a non-executor client sends
an invalid packet to it.
To address this issue, we introduce another handshake stage, which
includes a cookie exchange:
- upon connection from an executor, the manager sends it a ConnectHello RPC
message, which contains a random 64-bit cookie;
- the executor calculates a hash of that cookie and includes it in
its ConnectRequest together with the other information;
- before checking the validity of the ConnectRequest, the manager ensures
client sanity (the passed ID didn't change and the hashed cookie has the
expected value).
We deliberately pick a random cookie instead of a magic number: if the
fuzzer somehow learns to send packets to the manager, we don't want it to
crash multiple managers on the same machine.
Dump the whole flatrpc.ConnectRequest to the logs, so that we can
better understand the cause of #5805.
Running it from the VM context causes its cancellation each time the VM
crashes or the connection is aborted.
On context abortion, return a special error.
On the pkg/rpcserver side, recognize and process it.
If an instance crashes during the machine check, that should not normally
abort all RPCServer operations.
Accept context as a function argument.
Split out the code that creates a syz-executor process instance.
It should hopefully let us debug #5674.
Whenever the status is set, also include the reason. It should make it
easier to debug execution and machine check time problems.
Apply necessary changes to pkg/flatrpc and pkg/manager as well.
The context is assumed to be passed into the function doing the actual
processing. Refactor vminfo to follow this approach.
This will help refactor pkg/rpcserver later.
If we get a Hanged != "" response from a non-RequestTypeProgram request,
we used to end up trying to serialize a nil *prog.Prog value.
Add the missing if condition.
We query globs for 2 reasons:
1. Expand glob types in syscall descriptions.
2. Dynamic file probing for automatic descriptions generation.
In both of these contexts we are interested in files
that will be present during test program execution
(rather than during normal unsandboxed execution).
For example, some files may not be accessible to test programs
after pivot root. On the other hand, we create and link
some additional files for the test program that don't
normally exist.
Add a new request type for querying globs in the test
program context.
A few assorted changes to reduce future diffs:
- add rpcserver.RemoteConfig, similar to LocalConfig
(there are too many parameters)
- add CheckGlobs for requesting additional globs from VMs
- pass the whole InfoRequest to the MachineChecked callback
so that it's possible to read the globs information
- add per-mode config checking in the manager
- add a Manager.saveJson helper
After 9fc8fe026baa ("executor: better handling for hanged test
processes"), syz-executor's responses may reference procids outside of
the [0;procs] range.
If procids are no longer dense on the syz-executor side, we cannot rely
on this check in pkg/rpcserver:
```
if avoid == (uint64(1)<<runner.procs)-1 {
	avoid = 0
}
```
Signed-off-by: Andrei Vagin <avagin@google.com>
We have a /modules link in the manager, but it's not exposed anywhere.
Add a stat with this link.
It will enable collecting statistics for several simultaneous RPCServer
objects.
Currently we kill hanged processes and consider the corresponding test finished.
We don't kill/wait for the actual test subprocess (we don't know its pid to kill it,
and waiting would presumably hang). This has 2 problems:
1. If the hanged process causes a "task hung" report, we can't reproduce it,
since the test finished too long ago (the manager thinks it's finished and
discards the request).
2. The test process still consumes per-pid resources.
Explicitly detect and handle such cases:
the manager keeps these hanged tests forever,
and we assign a new proc id for future processes
(don't reuse the hanged one).
./tools/syz-env make generate
It does sometimes happen that the kernel crashes so fast that
syz-manager is not notified that syz-executor has started running
the faulty input.
In cases when the exact program is known from Comm, let's make sure it's
always present in the log of the last executed programs.
Added more test coverage of the package and created an interface for
rpcserver so that it can be used as a dependency (by syz-manager).
Also tried to cover the private method handleConn() with tests, though
it calls handleRunnerConn(), which has separate logic in Handshake()
that we would have to mock within a handleConn() unit test.
This will require refactoring `runners map[int]*Runner` and
runner.go in general into a separate interface that we can mock as
well.
The general idea is to have interfaces for Server (rpc), Runner, etc. and
to mock compound logic like Handshake when unit-testing a separate public
(or private, if it has callable if-else logic) method.
Using actual VM indices for VM identification makes it possible to match
these indices to VMs in the pool, allows using dense arrays to store
information about runners (e.g. in queue.Distributor), and removes string
names as unnecessary additional entities.
Currently we force restarts in rpcserver, but this has 2 problems:
1. It does not know the proc where the requests will land.
2. It does not take into account whether the proc has already restarted
recently for other reasons.
Restart procs in the executor only if they haven't restarted recently.
Also make it deterministic. Given all the other randomness we have,
there does not seem to be a reason to use randomized restarts
and restart after fewer/more runs.
Also restart only after corpus triage.
Corpus triage is slow already, and there does not seem to be enough
benefit in restarting during corpus triage.
Also restart at most 1 proc at a time,
since there is a lot of serial work in the kernel.
Distribute triage requests to different VMs.
These stats will be needed for snapshot mode that does not use rpcserver.
Move them from pkg/rpcserver to pkg/fuzzer/queue.
Go package names should generally be singular form:
https://go.dev/blog/package-names
https://rakyll.org/style-packages
https://groups.google.com/g/golang-nuts/c/buBwLar1gNw
New is a more idiomatic name and is shorter
(lines where stats.Create is used are usually long,
so making them a bit shorter is good).
We are getting too many generated candidates; the fuzzer may not keep up
with them at all (hints jobs keep growing infinitely). If a hint indeed came
from the input without transformation, then we should guess it on the first
attempt (or at least after a few attempts). If it did not come from the input,
or came with a non-trivial transformation, then no number of attempts will
help. So limit the total number of attempts (until the next restart).
For local rpcserver runs, we do not reboot the executor in case of
errors. Moreover, if the error did not lead to the executor process
exit, we may never detect that something went wrong.
Return an error channel from CreateInstance() to be able to act on
connection loop errors.
Explicitly register the instance during local executions and exit from
RunLocal() in case of connection problems.
In some cases, the executor seems to be mysteriously silent when we were
awaiting a reply.
During pkg/runtest tests, give it 1 minute to prepare a reply, then try
to request the current state and abort the connection.
Rely on instance.Pool to perform fuzzing and do bug reproductions.
Extract the reproduction queue logic into a separate testable class.
The pool operates on a low level and assumes that there's one default
activity (=fuzzing) that is performed by the VMs and that there are
also occasional non-default activities that must be performed by some
VMs (=bug reproduction).
It's assumed that the caller would use a context to control waits on
individual requests.
There's no sense in continuing the operation once the Runner has been
stopped.
If no new requests are coming, the loop goroutine may last a long time
since it never actually interacts with the (possibly already closed)
socket.
The object enables a graceful shutdown of machine checks.
For fuzzing, we don't strictly need the kernel directory or the kernel
object file. We just need a disk/kernel image.
Make it explicit which methods of Runner refer to its implementation and
which are supposed to be invoked by its users.