We should look into how to resolve this, but I tried a few things and nothing really worked.
Let's mark it as skipped for now until someone comes along to improve it.
Only log issues/failures when something's actually up.
It has irked me for a long time that running the tests produced so much
output; this change seems to silence it.
It does hide some warnings, but I think it makes the output
so much more readable that it's worth the tradeoff.
This helps when running jobs highly in parallel; sometimes they would not
produce any output for a while, and setting this timeout higher appears to help.
Not completely sure if this is the right place to do it, but it works fine for me.
We've seen many failures on ofborg; a lot of them ultimately appear to come down to
a timeout being hit, resulting in something like this:
Failure executing slapadd -F /<path>/slap.d -b dc=example -l /<path>/load.ldif.
Hopefully this resolves it for most cases.
I've done some endurance testing and this helps a lot.
Some other commands also regularly time out under high load:
- hydra-init
- hydra-create-user
- nix-store --delete
This should address most issues with tests randomly failing.
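For illustration, the idea is simply to give these commands a much more generous timeout; here is a minimal Python sketch of that pattern (`run_with_timeout` is a made-up name for this sketch, not anything from the test suite):
```
import subprocess

def run_with_timeout(cmd, timeout=600):
    # On an idle machine these commands return quickly; under heavy
    # parallel test load they can legitimately take minutes, so allow
    # a generous timeout instead of failing the test run early.
    return subprocess.run(cmd, check=True, timeout=timeout)

# e.g. run_with_timeout(["nix-store", "--delete", "/nix/store/<some-path>"])
```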
Used the following script for endurance testing:
```
import os
import subprocess

run_counter = 0
fail_counter = 0
while True:
    try:
        run_counter += 1
        print(f"Starting run {run_counter}")
        # Run the test suite with a high job count to provoke races and timeouts.
        env = os.environ
        env["YATH_JOB_COUNT"] = "20"
        result = subprocess.run(["perl", "t/test.pl"], env=env)
        if result.returncode != 0:
            fail_counter += 1
        print(f"Finished run {run_counter}, total fail count: {fail_counter}")
    except KeyboardInterrupt:
        print(f"Finished {run_counter} runs with {fail_counter} fails")
        break
```
In case someone else wants to do it on their system :).
Note that YATH_JOB_COUNT may need to be adjusted based on your core count.
I only have 4 cores (8 threads), so for others higher numbers might
yield better results in flushing out unstable tests.
Nixpkgs only contains a `hydra_unstable` package, not `hydra`, so
adjust the default accordingly, and then override it to our package in
the separate module that does that.
This fixes:
> Caught exception in Hydra::Controller::Root->realisations "Undefined subroutine &Hydra::Controller::Root::queryRawRealisation called at /nix/store/v842xb35ph8ka1yi1xanjhk4xh1pn5nm-hydra-2024-04-22/libexec/hydra/lib/Hydra/Controller/Root.pm line 371."
When content-addressed derivations are built on the Hydra server,
one may run into an issue where some builds suddenly don't load anymore.
This seems to be caused by outPaths that are NULL (which is
allowed for ca-derivations). Filter them out to prevent querying the
database for them, which is not supported by the database abstraction
layer that's currently in use.
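Conceptually the fix is tiny; here's a hedged sketch of the idea in Python against a throwaway SQLite table (the real code is Hydra's Perl, and none of the names below come from it):
```
import sqlite3

# Sketch only: drop NULL outPaths (allowed for ca-derivations) before
# they end up inside an "outPath IN (...)" clause that the DB
# abstraction layer refuses to translate.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE Builds (id INTEGER PRIMARY KEY, outPath TEXT)")
db.executemany("INSERT INTO Builds (outPath) VALUES (?)",
               [("/nix/store/aaa-foo",), (None,)])

out_paths = ["/nix/store/aaa-foo", None]           # None ~ NULL outPath
present = [p for p in out_paths if p is not None]  # the actual filtering

placeholders = ", ".join("?" for _ in present)
rows = db.execute(
    f"SELECT id, outPath FROM Builds WHERE outPath IN ({placeholders})",
    present,
).fetchall()
print(rows)  # only the build with a real outPath comes back
```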
On my instance this appears to resolve the issue.
I feel like I might be doing this at the wrong abstraction layer, but on
the other hand, it seems to resolve the issue and it doesn't really look
like it will hurt anything.
The test added in a previous commit uncovers this issue, and this commit
resolves it, so I'm happy with this patch for now.
The issue I was seeing on my server:
hydra-server[2549]: [error] Couldn't render template "undef error - DBIx::Class::SQLMaker::ClassicExtensions::puke(): Fatal: NULL-within-IN not implemented: The upcoming SQL::Abstract::Classic 2.0 will emit the logically correct SQL instead of raising this exception. at /nix/store/<hash>-hydra-unstable-2024-03-08_nix_2_20/libexec/hydra/lib/Hydra/Helper/Nix.pm line 190
See also short discussion here: https://github.com/NixOS/nixpkgs/pull/297392#issuecomment-2035366263
Closes #1336
When restarting PostgreSQL, the connections are still reused in
`hydra-queue-runner`, causing errors like this:
main thread: Lost connection to the database server.
queue monitor: Lost connection to the database server.
and no more builds being processed.
`hydra-evaluator` doesn't have that issue since it crashes right away.
We could let it retry indefinitely as well (see below), but I don't
want to change too much.
If the DB is still unreachable 10s later, the process will stop with a
non-zero exit code because of a missing DB connection. This however
isn't such a big deal because it will be immediately restarted
afterwards. With the current configuration, Hydra will never give up,
but will restart (and retry) indefinitely. That seems reasonable to me,
i.e. retrying DB connections in a long-running process. If this doesn't
work out, the monitoring should fire anyway because the queue fills up,
but I'm open to discussing that.
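To make that behaviour concrete, here is a rough Python sketch (not Hydra's actual C++ code, just the shape of what's described above): wait a bit, retry once, and otherwise exit non-zero and let the service manager restart the process, which amounts to retrying forever.
```
import sys
import time

def get_db_connection(connect, retry_delay=10):
    # Sketch of the described behaviour: one retry after a short delay,
    # then exit non-zero and rely on the unit's restart policy, so the
    # process as a whole keeps retrying indefinitely.
    try:
        return connect()
    except ConnectionError:
        time.sleep(retry_delay)
    try:
        return connect()
    except ConnectionError:
        sys.exit(1)
```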
Please note that this isn't reproducible with the DB and the queue
runner on the same machine when using `services.hydra-dev`, because of
the `Requires=` dependency `hydra-queue-runner.service` ->
`hydra-init.service` -> `postgresql.service` that causes the queue
runner to be restarted on `systemctl restart postgresql`.
Internally, Hydra uses Nix's pool data structure: it basically has N
slots (here, DB connections), and whenever a new one is requested, an idle
slot is provided or a new one is created (when all N slots are active, it
waits until one becomes free). The issue in the code here is that whenever
an error is encountered the slot is released, but the same broken
connection will be reused the next time. By using `Pool::Handle::markBad`,
Nix will drop a broken slot. This is now done when a
`pqxx::broken_connection` is caught.
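For anyone unfamiliar with Nix's `Pool`, here is a rough, hedged Python model of the behaviour described above (not the actual C++ API, just its shape):
```
import queue

class Pool:
    """Toy model of Nix's Pool: hand out an idle slot if there is one,
    create a new one otherwise, and (the fix) discard slots that were
    marked bad instead of handing the broken connection out again."""

    def __init__(self, create):
        self._create = create
        self._idle = queue.Queue()

    def get(self):
        try:
            return self._idle.get_nowait()
        except queue.Empty:
            return self._create()

    def put(self, slot, bad=False):
        # Before the fix, a connection that had just thrown
        # pqxx::broken_connection went back into the idle queue and was
        # reused; marking it bad simply drops it here.
        if not bad:
            self._idle.put(slot)
```
(The real pool also caps the number of live slots and blocks when all of them are busy; that part is omitted here.)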