Both sides need to agree on a version (with `std::min`) for anything to
work. Somehow... we've never done this.
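As a reminder, version agreement here just means both peers take the
minimum of the versions they advertise. A minimal sketch with invented
names (`negotiateVersion`, `OUR_MAX_VERSION`), not the actual
serve-protocol code:

    #include <algorithm>
    #include <cstdint>

    // Hypothetical constant; stands in for the highest version we support.
    constexpr uint16_t OUR_MAX_VERSION = 0x206; // "2.6"

    // Both peers must run the same computation on the two advertised
    // versions, otherwise they end up speaking different dialects on the
    // same connection.
    uint16_t negotiateVersion(uint16_t remoteMaxVersion)
    {
        return std::min(OUR_MAX_VERSION, remoteMaxVersion);
    }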
With this commit, the next commit succeeds. Without this commit, the
next commit fails. This is because the next commit exposes serializers
which do different things for proto version 2.7, and we're currently
requesting 2.6.
Opened https://github.com/NixOS/nix/issues/9584 to track this issue.
The point of this branch is to always track Nix master, so we are
proactively ready to upgrade to the next Nix release when it is ready.
Flake lock file updates:
• Updated input 'nix':
'github:NixOS/nix/50f8f1c8bc019a4c0fd098b9ac674b94cfc6af0d' (2023-11-27)
→ 'github:NixOS/nix/c3827ff6348a4d5199eaddf8dbc2ca2e2ef46ec5' (2023-12-07)
• Added input 'nix/libgit2':
'github:libgit2/libgit2/45fd9ed7ae1a9b74b957ef4f337bc3c8b3df01b5' (2023-10-18)
For the record, here is the Nix 2.19 version:
https://github.com/NixOS/nix/blob/2.19-maintenance/src/libstore/serve-protocol.cc,
which is what we would initially use.
It is a more complete version of what Hydra has today, except for one
thing: it unconditionally sets the start/stop times.
I think that is correct, as the other end seems to measure them
unconditionally, but just to be extra careful, I reproduced the old
behavior of falling back on Hydra's own measurements if `startTime` is
0.
The only difference is that the fallback `stopTime` is now measured from
after the entire `BuildResult` is transferred over the wire, but I think
that should be negligible if it is measurable at all. (And remember,
this is a fallback case I already suspect is dead code.)
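A rough sketch of that fallback, with simplified, invented types rather
than the actual Hydra code:

    #include <ctime>

    struct RemoteResult { std::time_t startTime = 0, stopTime = 0; }; // simplified stand-in

    // If the remote side did not report a start time (the case I suspect
    // is dead code), fall back on times measured locally by Hydra. The
    // local stop time is taken after the whole result has been read off
    // the wire.
    void applyTimeFallback(RemoteResult & res,
                           std::time_t localStart, std::time_t localStop)
    {
        if (res.startTime == 0) {
            res.startTime = localStart;
            res.stopTime = localStop;
        }
    }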
The previous implementation was O(N²lg(N)) due to sorting the full list
of runnables by priority once per runnable being scheduled. While not
confirmed, this is suspected to cause performance issues and
bottlenecking with the queue runner when the runnable list gets large
enough.
This commit changes the dispatcher to instead only sort runnables per
priority once per dispatch cycle. This has the drawback of being less
reactive to runnable priority changes: the previous code would react
immediately, while this might end up using "old" priorities until the
next dispatch cycle. However, dispatch cycles are not supposed to take
very long (seconds, not minutes/hours), so this is not expected to have
much, if any, practical impact.
Ideally runnables would be maintained in a sorted data structure instead
of the current approach of copying + sorting in the scheduler. This
would however be a much more invasive change to implement, and might
have to wait until we can confirm where the queue runner bottlenecks
actually lie.
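Roughly, the new behavior amounts to one sort per dispatch cycle; a
sketch with invented names (`Runnable`, `dispatchCycle`), not the actual
queue-runner code:

    #include <algorithm>
    #include <tuple>
    #include <vector>

    struct Runnable { int globalPriority; int priority; /* ... */ };

    void dispatchCycle(std::vector<Runnable> runnables)
    {
        // One O(N log N) sort per cycle, instead of re-sorting the whole
        // list for every runnable scheduled (which made the old code
        // O(N²lg(N))).
        std::sort(runnables.begin(), runnables.end(),
            [](const Runnable & a, const Runnable & b) {
                // Highest priorities first; these values are "frozen"
                // for the rest of the cycle.
                return std::tie(a.globalPriority, a.priority)
                     > std::tie(b.globalPriority, b.priority);
            });

        for (auto & r : runnables) {
            // ... try to assign r to a free build machine ...
            (void) r;
        }
    }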
We were using protocol version 6 but requesting version 4. The only
reason that this worked was because of a broken version check in
'nix-store --serve'. That was fixed in
c2d7456926,
which had the side-effect of breaking hydra-queue-runner.
NOTE: I'm well aware that we have to be careful with this to avoid new
regressions on hydra.nixos.org, so this should only be merged after
extensive testing from more people.
Motivation: I updated Nix in my deployment to 2.9.1 and decided to also
update Hydra in one go (and compile it against the newer Nix). Given
that this also updates the C++ code in `hydra-{queue-runner,eval-jobs}`,
this patch might become useful in the future, though.
On hydra.nixos.org the queue runner had child processes that were
stuck handling an exception:
Thread 1 (Thread 0x7f501f7fe640 (LWP 1413473) "bld~v54h5zkhmb3"):
#0 futex_wait (private=0, expected=2, futex_word=0x7f50c27969b0 <_rtld_local+2480>) at ../sysdeps/nptl/futex-internal.h:146
#1 __lll_lock_wait (futex=0x7f50c27969b0 <_rtld_local+2480>, private=0) at lowlevellock.c:52
#2 0x00007f50c21eaee4 in __GI___pthread_mutex_lock (mutex=0x7f50c27969b0 <_rtld_local+2480>) at ../nptl/pthread_mutex_lock.c:115
#3 0x00007f50c1854bef in __GI___dl_iterate_phdr (callback=0x7f50c190c020 <_Unwind_IteratePhdrCallback>, data=0x7f501f7fb040) at dl-iteratephdr.c:40
#4 0x00007f50c190d2d1 in _Unwind_Find_FDE () from /nix/store/65hafbsx91127farbmyyv4r5ifgjdg43-glibc-2.33-117/lib/libgcc_s.so.1
#5 0x00007f50c19099b3 in uw_frame_state_for () from /nix/store/65hafbsx91127farbmyyv4r5ifgjdg43-glibc-2.33-117/lib/libgcc_s.so.1
#6 0x00007f50c190ab90 in uw_init_context_1 () from /nix/store/65hafbsx91127farbmyyv4r5ifgjdg43-glibc-2.33-117/lib/libgcc_s.so.1
#7 0x00007f50c190b08e in _Unwind_RaiseException () from /nix/store/65hafbsx91127farbmyyv4r5ifgjdg43-glibc-2.33-117/lib/libgcc_s.so.1
#8 0x00007f50c1b02ab7 in __cxa_throw () from /nix/store/dd8swlwhpdhn6bv219562vyxhi8278hs-gcc-10.3.0-lib/lib/libstdc++.so.6
#9 0x00007f50c1d01abe in nix::parseURL (url="root@cb893012.packethost.net") at src/libutil/url.cc:53
#10 0x0000000000484f55 in extraStoreArgs (machine="root@cb893012.packethost.net") at build-remote.cc:35
#11 operator() (__closure=0x7f4fe9fe0420) at build-remote.cc:79
...
Maybe the fork happened while another thread was holding some global
stack unwinding lock
(https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71744). Anyway, since
the hanging child inherits all file descriptors to SSH clients,
shutting down remote builds (via 'child.to = -1' in
State::buildRemote()) doesn't work and 'child.pid.wait()' hangs
forever.
So let's not do any significant work between fork and exec.
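A sketch of the pattern this is aiming for, with an invented helper
(`spawnSsh`) rather than the actual build-remote.cc code: do anything
that can allocate, lock, or throw (like the URL parsing in the backtrace
above) before fork(), and keep the child's work between fork() and
exec() to a bare minimum.

    #include <string>
    #include <vector>
    #include <unistd.h>

    pid_t spawnSsh(const std::string & machine)
    {
        // Parse/validate the machine spec and build the argument list
        // here, in the parent, where throwing (and therefore stack
        // unwinding) is safe.
        std::vector<std::string> args =
            {"ssh", machine, "--", "nix-store", "--serve", "--write"};

        std::vector<char *> argv;
        for (auto & a : args) argv.push_back(a.data());
        argv.push_back(nullptr);

        pid_t pid = fork();
        if (pid == 0) {
            // Child: do as little as possible; no allocation, locking,
            // or throwing between fork and exec.
            execvp(argv[0], argv.data());
            _exit(1); // exec failed; skip atexit handlers and C++ unwinding
        }
        return pid;
    }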