hydra

Author	SHA1	Message	Date
Jörg Thalheim	5b9c22dd18	bump nixpkgs	2025-04-09 11:31:47 -04:00
K900	e15070c6c2	Add metric for builds waiting for download slot (cherry picked from commit f23ec71227911891807706b6b978836e4d80edde)	2025-04-09 11:31:47 -04:00
Jörg Thalheim	37744c7018	don't build hydra twice in a pull request + enable merge queue	2025-04-09 11:31:47 -04:00
Pierre Bourdon	1e3929e75f	queue-runner: switch to pseudorandom ordering of builds processing We don't rely on sequential / monotonic build IDs processing anymore, so randomizing actually has the advantage of mixing builds for different systems together, to avoid only one chunk of builds for a single system getting processed while builders for other systems are starved.	2025-04-09 11:31:47 -04:00
Pierre Bourdon	28da0a705f	queue runner: introduce some parallelism for remote paths lookup Each output for a given step being ingested is looked up in parallel, which should basically multiply the speed of builds ingestion by the average number of outputs per derivation.	2025-04-09 11:31:47 -04:00
Pierre Bourdon	2050b2c324	queue-runner: reduce the time between queue monitor restarts This will induce more DB queries (though these are fairly cheap), but at the benefit of processing bumps within 1m instead of within 10m.	2025-04-09 11:31:47 -04:00
Pierre Bourdon	21d6d805ba	queue-runner: remove id > X from new builds query Running the query with/without it shows that it makes no difference to postgres, since there's an index on finished=0 already. This allows a few simplifications, but also paves the way towards running multiple parallel monitor threads in the future.	2025-04-09 11:31:47 -04:00
Pierre Bourdon	478bb01f7f	queue-runner: add prom metrics to allow detecting internal bottlenecks By looking at the ratio of running vs. waiting for the dispatcher and the queue monitor, we should get better visibility into what hydra is currently bottlenecked on. There are other side effects we can try to measure to get to the same result, but having a simple way doesn't cost us much.	2025-04-09 11:31:47 -04:00
Pierre Bourdon	08bf31b71a	queue-runner: limit parallelism of CPU intensive operations My current theory is that running more parallel xz than available CPU cores is reducing our overall throughput by requiring more scheduling overhead and more cache thrashing.	2025-04-09 11:31:47 -04:00
Pierre Bourdon	641056bd0e	web: Skip System on /machines It is redundant	2025-04-09 11:31:47 -04:00
Jörg Thalheim	29a7ab8009	test/gitea: fix eval	2025-04-09 11:31:47 -04:00
John Ericson	eddc234915	Fix evaluation of NixOS tests, avoid `with`	2025-04-09 11:31:47 -04:00
Maximilian Bosch	80f917d8fa	readIntoSocket: fix with store URIs containing an `&` The third argument to `open()` in `-|` mode is passed to a shell if it's a string. In my case the store URI contains `?secret-key=${signingKey.directory}/secret&compression=zstd` For the `nix store cat` case this means that * until `&` the process will be started in the background. This fails immediately because no path to cat is specified. * `compression=zstd` is a variable assignment * the `$path` argument to `store cat` is attempted to be executed as another command Passing just the list solves the problem. (cherry picked from commit 3ee51dbe589458cc54ff753317bbc6db530bddc0)	2025-04-09 11:31:47 -04:00
git@71rd.net	5cb82812f2	Stream files from store instead of buffering them When an artifact is requested from hydra the output is first copied from the nix store into memory and then sent as a response, delaying the download and taking up significant amounts of memory. As reported in https://github.com/NixOS/hydra/issues/1357 Instead of calling a command and blocking while reading in the entire output, this adds read_into_socket(). The function takes a command, starts a subprocess with that command, and returns a file descriptor attached to its stdout. This file descriptor is then used by Catalyst's response builder to stream the output directly (cherry picked from commit 459aa0a5983a0bd546399c08231468d6e9282f54)	2025-04-09 11:31:47 -04:00
ajs124	17094c8371	lazy-load evaluation errors Closes #1362	2025-04-09 11:31:47 -04:00
Maximilian Bosch	d5fb163618	Only show stepname if it doesn't equal the name of the drv When building e.g. nixpkgs, the "Running builds" view will mostly look like this hello.x86_64-linux (Build of hello-X.Y) exa.x86_64-linux (Build of exa-X.Y) ... This doesn't provide any useful information. Showing the step name only makes sense if it's not a child of the job's derivation. With this patch, that information will only be shown if the drv name (i.e. w/o `/nix/store/` prefix, .drv ext & hash) is not equal to the drv name of the job itself (build.nixname).	2025-04-09 11:31:47 -04:00
Maximilian Bosch	baec2bbb4c	Running builds view: show build step names When using Hydra to build machine configurations, you'll often see "nixosConfigurations.foo" five times, i.e. for each build step being run. This isn't very helpful I think because in such a case, a single build step can also be compiling the Linux kernel. This change also fetches the `drvpath` and `type` from the `buildsteps` relation. We're already joining it, so this doesn't make much difference (confirmed via query logging that this doesn't cause extra SQL queries). Unfortunately build steps don't have a human readable name, so I'm deriving it from the drvpath by stripping away the hash (assuming that it'll never contain a `-` and that `/nix/store/` is used as prefix). I decided against using the Nix bindings for that to avoid too much overhead due to store operations for each build step.	2025-04-09 11:31:47 -04:00
Maximilian Bosch	b55bd25581	Make "timed out" and "log limit exceeded" builds aborted In `73694087a0` I gave builds that failed because of a timeout or exceeded log limit a stop sign and I stand by that reasoning: with that it's possible to distinguish between actual build failures and rather transient things such as timeouts. Back then I considered it a feature that these are shown in a different tab, but I don't think that's a good idea anymore. When using a jobset to e.g. track the regressions from a mass rebuild (like a compiler or gcc update), "Newly failed builds" should exclusively display regressions (and flaky builds of course, not much I can do about that). Also, when a bunch of builds fail in such a jobset because of e.g. a broken connection to a builder that results in a timeout, I want to be able to restart them all w/o rebuilding actual regressions. To make it clear that we not only have "Aborted" builds in the tab, I renamed the label to "Aborted / Timed out".	2025-04-09 11:31:47 -04:00
Pierre Bourdon	1ca17faed4	web: include current step status on /machines	2025-04-09 11:31:47 -04:00
John Ericson	9c022848cf	Fix the build	2025-04-09 11:31:47 -04:00
John Ericson	f58a752419	Fix Nix code Can now at least enter dev shell, but build is still broken.	2025-04-09 11:31:47 -04:00
John Ericson	0769853dec	flake.lock: Update to nix and nix-eval-jobs 2.28 Flake lock file updates: • Updated input 'nix': 'github:NixOS/nix/d0f98c76f962147610489e84c10033ca92e9c532?narHash=sha256-u6RhBWQ1XohTZ4Ub5ml1PTcaxQgtqFNng6Sohy1rojw%3D' (2025-04-07) → 'github:NixOS/nix/a4962f73b5fc874d4b16baef47921daf349addfc?narHash=sha256-r%2BpsCOW77vTSTNbxTVrYHeh6OgB0QukbnyUVDwg8s4I%3D' (2025-04-07) • Updated input 'nix-eval-jobs': 'github:nix-community/nix-eval-jobs/62f9c9e8d00d2ff6ab27a6197ab459a8e0808e59?narHash=sha256-PypQspB7h7EENe4RQQUQj2Ay8J1%2BO49AKNO9JbAU4Ek%3D' (2025-04-07) → 'github:nix-community/nix-eval-jobs/cba718bafe5dc1607c2b6761ecf53c641a6f3b21?narHash=sha256-v5n6t49X7MOpqS9j0FtI6TWOXvxuZMmGsp2OfUK5QfA%3D' (2025-04-07)	2025-04-09 11:31:47 -04:00
John Ericson	21c6afa83b	Fix build (due to C++ API changes)	2025-04-09 11:31:47 -04:00
John Ericson	1022514027	flake.lock: Update to nix and nix-eval-jobs 2.27 Flake lock file updates: • Updated input 'nix': 'github:NixOS/nix/e310c19a1aeb1ce1ed4d41d5ab2d02db596e0918?narHash=sha256-q/RgA4bB7zWai4oPySq9mch7qH14IEeom2P64SXdqHs%3D' (2025-02-18) → 'github:NixOS/nix/d0f98c76f962147610489e84c10033ca92e9c532?narHash=sha256-u6RhBWQ1XohTZ4Ub5ml1PTcaxQgtqFNng6Sohy1rojw%3D' (2025-04-07) • Updated input 'nix-eval-jobs': 'github:nix-community/nix-eval-jobs/f7418fc1fa45b96d37baa95ff3c016dd5be3876b?narHash=sha256-Lo4KFBNcY8tmBuCmEr2XV0IUZtxXHmbXPNLkov/QSU0%3D' (2025-03-26) → 'github:nix-community/nix-eval-jobs/62f9c9e8d00d2ff6ab27a6197ab459a8e0808e59?narHash=sha256-PypQspB7h7EENe4RQQUQj2Ay8J1%2BO49AKNO9JbAU4Ek%3D' (2025-04-07)	2025-04-09 11:31:47 -04:00
Jörg Thalheim	2d4232475c	gitignore hydra-data as created by foreman	2025-04-09 11:31:47 -04:00
Jörg Thalheim	d799742057	fix development workflow after switching to meson-based build	2025-04-09 11:31:47 -04:00
Robin Stumm	485aa93f2d	hydra-eval-jobset: do not wait on n-e-j inside transaction fixes #1429	2025-04-09 11:31:47 -04:00
Josef Kemetmüller	590e8d8511	Fix rendering of metrics with special characters My main motivation here is to get metrics with brackets to work in order to support "pytest" test names: - test_foo.py::test_bar[1] - test_foo.py::test_bar[2] I couldn't find an "HTML escape"-style function that would generate valid html `id` attribute names from random strings, so I went with a hash digest instead.	2025-04-09 11:31:47 -04:00
Maximilian Bosch	90a8a0d94a	Reimplement (named) constituent jobs (+globbing) based on nix-eval-jobs Depends on https://github.com/nix-community/nix-eval-jobs/pull/349 & #1421. Almost equivalent to #1425, but with a small change: when having e.g. an aggregate job with a glob that matches nothing, the jobset evaluation is failed now. This was the intended behavior before (hydra-eval-jobset fails hard if an aggregate is broken), the code-path was never reached however since the aggregate was never marked as broken in this case before.	2025-04-09 11:31:47 -04:00
zowoq	eb17619ee5	flake.lock: Update Flake lock file updates: • Updated input 'nix-eval-jobs': 'github:nix-community/nix-eval-jobs/4b392b284877d203ae262e16af269f702df036bc?narHash=sha256-3wIReAqdTALv39gkWXLMZQvHyBOc3yPkWT2ZsItxedY%3D' (2025-02-14) → 'github:nix-community/nix-eval-jobs/f7418fc1fa45b96d37baa95ff3c016dd5be3876b?narHash=sha256-Lo4KFBNcY8tmBuCmEr2XV0IUZtxXHmbXPNLkov/QSU0%3D' (2025-03-26)	2025-04-09 11:31:47 -04:00
zowoq	ebefdb0a3d	hydraTest: remove outdated postgresql version error: postgresql_12 has been removed since it reached its EOL upstream	2025-04-09 11:31:47 -04:00
Martin Weinelt	55349930f1	Fix race condition in hydra-compress-logs	2025-04-09 11:31:47 -04:00
John Ericson	847a8ae6cd	Revert "Use `LegacySSHStore`" There were some hangs caused by this. Need to fix them, ideally reproducing the issue in a test, before trying this again. This reverts commit `4a4a0f901c`.	2025-04-09 11:31:47 -04:00
ahuston-0	86d0009448	add declarative hydra spec	2025-04-01 15:02:44 -04:00
ahuston-0	a20f37b97f	add gitea refs Signed-off-by: ahuston-0 <aliceghuston@gmail.com> Reviewed-on: https://<censored>/ahuston-0/hydra/pulls/1	2025-03-31 14:52:51 -04:00
ahuston-0	a94f84118c	add Gitea pulls docs entry Signed-off-by: ahuston-0 <aliceghuston@gmail.com>	2025-03-31 14:52:51 -04:00
Faye Chun	99e3ad325c	Merge branch 'NixOS:master' into add-gitea-pulls	2025-03-01 22:04:13 -05:00
John Ericson	18c0d76210	Merge pull request #1444 from NixOS/use-legacy-ssh-store Use `LegacySSHStore`	2025-02-18 14:37:17 -05:00
John Ericson	4a4a0f901c	Use `LegacySSHStore` In https://github.com/NixOS/nix/pull/10748 it is extended with everything we need.	2025-02-18 14:07:42 -05:00
John Ericson	881462bb4e	Merge pull request #1447 from NixOS/newer-2.6 Bump to newer 2.26.* Nix version	2025-02-18 13:00:40 -05:00
John Ericson	af72b694d8	Bump to newer 2.26.* Nix version Needed one more thing before trying out using `LegacySSHStore` directly. Flake lock file updates: • Updated input 'nix': 'github:NixOS/nix/674a87462cb93f605d4fbeef607d3453e7e5a7d8?narHash=sha256-TBoHqnIdVWhsBcL05vO2B1hSl9m//5Mz2NU%2BPMk3h3Y%3D' (2025-02-16) → 'github:NixOS/nix/e310c19a1aeb1ce1ed4d41d5ab2d02db596e0918?narHash=sha256-q/RgA4bB7zWai4oPySq9mch7qH14IEeom2P64SXdqHs%3D' (2025-02-18)	2025-02-18 12:43:31 -05:00
John Ericson	c92342d12f	Merge pull request #1446 from NixOS/newer-2.6 Bump to newer 2.26.* Nix version	2025-02-16 19:10:10 -05:00
John Ericson	df07670a21	Bump to newer 2.26.* Nix version Flake lock file updates: • Updated input 'nix': 'github:NixOS/nix/970942f45836172fda410a638853382952189eb9?narHash=sha256-jGFuyYKJjJZsBRoi7ZcaVKt1OYxusz/ld1HA7VD2w/0%3D' (2025-02-12) → 'github:NixOS/nix/674a87462cb93f605d4fbeef607d3453e7e5a7d8?narHash=sha256-TBoHqnIdVWhsBcL05vO2B1hSl9m//5Mz2NU%2BPMk3h3Y%3D' (2025-02-16)	2025-02-16 18:44:32 -05:00
John Ericson	51944a5fa5	Merge pull request #1443 from NixOS/nix-2.26 Nix 2.26	2025-02-13 22:13:32 -05:00
John Ericson	341b2f1309	Update build system to depend on Nix 2.26	2025-02-13 21:54:35 -05:00
John Ericson	4dc0f11379	Update flake.nix for Nix 2.26 Flake lock file updates: • Removed input 'libgit2' • Updated input 'nix': 'github:NixOS/nix/d652513e4519ed4eb48c92f8670e5a71c7793fc3?narHash=sha256-mIpJgIwPS4o4xYhN1B%2B/fHESEXoxpu6nVoZTzZ0MfTg%3D' (2025-02-12) → 'github:NixOS/nix/970942f45836172fda410a638853382952189eb9?narHash=sha256-jGFuyYKJjJZsBRoi7ZcaVKt1OYxusz/ld1HA7VD2w/0%3D' (2025-02-12) • Removed input 'nix/libgit2' • Updated input 'nix-eval-jobs': 'github:nix-community/nix-eval-jobs/6d4fd5a93d7bc953ffa4dcd6d53ad7056a71eff7?narHash=sha256-1dZLPw%2BnlFQzzswfyTxW%2B8VF1AJ4ZvoYvLTjlHiz1SA%3D' (2025-02-13) → 'github:nix-community/nix-eval-jobs/4b392b284877d203ae262e16af269f702df036bc?narHash=sha256-3wIReAqdTALv39gkWXLMZQvHyBOc3yPkWT2ZsItxedY%3D' (2025-02-14) • Updated input 'nixpkgs': 'github:NixOS/nixpkgs/dbebdd67a6006bb145d98c8debf9140ac7e651d0?narHash=sha256-Xc9lEtentPCEtxc/F1e6jIZsd4MPDYv4Kugl9WtXlz0%3D' (2024-09-18) → 'github:NixOS/nixpkgs/97a719c9f0a07923c957cf51b20b329f9fb9d43f?narHash=sha256-1o1qR0KYozYGRrnqytSpAhVBYLNBHX%2BLv6I39zGRzKM%3D' (2025-02-13)	2025-02-13 21:54:31 -05:00
John Ericson	ea09952b7e	Merge pull request #1442 from NixOS/clean-up-flake-lockfile Clean up flake lockfile stuff	2025-02-13 20:52:40 -05:00
John Ericson	81d21979ef	Clean up flake lockfile stuff The `flake = false;` for `nix-eval-jobs` didn't fully take before. Flake lock file updates: • Removed input 'nix-eval-jobs/flake-parts' • Removed input 'nix-eval-jobs/flake-parts/nixpkgs-lib' • Removed input 'nix-eval-jobs/nix-github-actions' • Removed input 'nix-eval-jobs/nixpkgs' • Removed input 'nix-eval-jobs/treefmt-nix' • Removed input 'nix-eval-jobs/treefmt-nix/nixpkgs'	2025-02-13 20:23:08 -05:00
John Ericson	0ed9a82912	Merge pull request #1441 from NixOS/nix-2.25 Nix 2.25	2025-02-13 19:53:07 -05:00
John Ericson	80241fc8be	Make code change necessary for building with Nix 2.25	2025-02-13 19:10:09 -05:00
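
Commits baec2bbb4c and d5fb163618 above describe deriving a human-readable step name from a derivation path by dropping the `/nix/store/` prefix, the hash, and the `.drv` extension, and showing it only when it differs from the job's own derivation name. The following is a minimal Python sketch of that idea, not Hydra's actual code (which is Perl); the function names `step_name` and `step_label` and the sample paths are hypothetical, and it relies on the same assumptions the commit states (the hash contains no `-` and the prefix is `/nix/store/`).

```python
from typing import Optional

STORE_PREFIX = "/nix/store/"  # assumption taken from the commit message


def step_name(drv_path: str) -> str:
    """Return the derivation name without store prefix, hash, and .drv extension."""
    base = drv_path.removeprefix(STORE_PREFIX)
    # The hash is assumed to contain no '-', so it ends at the first '-'.
    _hash, _sep, name = base.partition("-")
    return name.removesuffix(".drv")


def step_label(drv_path: str, build_nixname: str) -> Optional[str]:
    """Return the step name only if it differs from the job's own drv name."""
    name = step_name(drv_path)
    return name if name != build_nixname else None


# A step that builds the job's own derivation adds no information, so no label.
own = step_label("/nix/store/abc123-hello-2.12.drv", "hello-2.12")
# A dependency step (here a hypothetical kernel build) gets a label.
dep = step_label("/nix/store/def456-linux-6.6.drv", "nixos-system-foo")
```

String slicing is used instead of a store query because the commit explicitly avoids the Nix bindings to keep per-step overhead low.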


4361 Commits
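
Commit 590e8d8511 in the listing above mentions using a hash digest to build valid HTML `id` attribute values for metric names that contain special characters, such as pytest names with brackets. A rough Python illustration of that approach follows; the commit does not say which digest is used, so SHA-256, the `metric-` prefix, and the function name `metric_element_id` are all assumptions made for this sketch.

```python
import hashlib


def metric_element_id(metric_name: str) -> str:
    """Map an arbitrary metric name to a string that is safe as an HTML id."""
    # A hex digest contains only [0-9a-f], so brackets, colons, and dots
    # in the metric name can never leak into the attribute value.
    digest = hashlib.sha256(metric_name.encode("utf-8")).hexdigest()
    # Hypothetical prefix so the id starts with a letter.
    return "metric-" + digest


id_one = metric_element_id("test_foo.py::test_bar[1]")
id_two = metric_element_id("test_foo.py::test_bar[2]")
```

The mapping is deterministic, so the same metric name always yields the same element id, while names that differ only in a bracketed parameter still get distinct ids.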