hydra

Author	SHA1	Message	Date
Eelco Dolstra	2946899504	Turn hydra-notify into a daemon It now receives notifications about started/finished builds/steps via PostgreSQL. This gets rid of the (substantial) overhead of starting hydra-notify for every event. It also allows other programs (even on other machines) to listen to Hydra notifications.	2019-08-13 18:18:21 +02:00
Eelco Dolstra	8d26144121	Fix building against nix master	2018-10-30 14:41:21 +01:00
Eelco Dolstra	b04dc6c76e	Fix root creation when the root already exists but is owned by another user	2017-10-19 12:28:38 +02:00
Eelco Dolstra	45b138373b	hydra-queue-runner: Write GC roots for outputs paths We lost this behaviour somewhere. So build outputs could be GC'ed when running the collector with --option gc-keep-outputs false.	2017-10-12 18:55:38 +02:00
Eelco Dolstra	7c976d2aec	hydra-queue-runner: Make build notification more reliable Previously, when hydra-queue-runner was restarted, any pending "build finished" notifications were lost. Now hydra-queue-runner marks finished but unnotified builds in the database and uses that to run pending notifications at startup.	2017-07-26 15:17:51 +02:00
Will Dietz	719df63190	queue-monitor: never move lastBuildId forward without processing jobs.	2017-07-25 20:05:37 -05:00
Eelco Dolstra	dc5e0b120a	Fix a race that can cause hydra-queue-runner to ignore newly added builds As @dtzWill discovered, with the concurrent hydra-evaluator, there can be multiple active transactions adding builds to the database. As a result, builds can become visible in a non-monotonically increasing order, breaking the queue monitor's assumption that build IDs only go up. The fix is to have hydra-eval-jobset provide the lowest build ID it just added in the builds_added notification, and have the queue monitor check from there. Fixes #496.	2017-07-21 14:34:48 +02:00
Eelco Dolstra	66ae66024e	Sync with latest Nix	2017-07-17 11:38:58 +02:00
Eelco Dolstra	5810042a3b	Periodically clear Store's path info cache Otherwise the queue runner can consider paths as valid that have been garbage-collected since the first time it queried them.	2017-04-06 17:20:23 +02:00
Eelco Dolstra	8771f7f913	Merge pull request #382 from shlevy/cached-build-notifications Send BuildFinished notifications on cached build results.	2017-03-29 18:52:20 +02:00
Eelco Dolstra	8bb36e79bd	Support testing build determinism Builds can now specify the attribute "isDeterministic = true" to tell Hydra to build with build-repeat > 0. If there is a mismatch between rounds, the step / build fails with a suitable status. Maybe this should be a meta attribute, but that makes it invisible to hydra-queue-runner, and it seems reasonable to make a claim of mandatory determinism part of the derivation (since e.g. enabling this flag should trigger a rebuild).	2016-12-06 17:46:06 +01:00
Eelco Dolstra	7863d2e1da	Step cancellation: Don't use pthread_cancel() This was a bad idea because pthread_cancel() is unsalvageable broken in C++. Destructors are not allowed to throw exceptions (especially in C++11), but pthread_cancel() can cause a __cxxabiv1::__forced_unwind exception inside any destructor that invokes a cancellation point. (This exception can be caught but must be rethrown.) So let's just kill the builder process instead.	2016-11-07 19:38:24 +01:00
Eelco Dolstra	b3169ce438	Kill active build steps when builds are cancelled We now kill active build steps when there are no more referring builds. This is useful e.g. for preventing cancelled multi-hour TPC-H benchmark runs from hogging build machines.	2016-10-31 14:58:29 +01:00
Eelco Dolstra	ee2e9f5335	Update to reflect BinaryCacheStore changes BinaryCacheStore no longer implements buildPaths() and ensurePath(), so we need to use copyPath() / copyClosure().	2016-10-07 20:23:05 +02:00
Shea Levy	5962367ffc	Send BuildFinished notifications on cached build results. Fixes #342	2016-08-17 06:40:12 -04:00
Eelco Dolstra	177bf25d64	Queue monitor: Bail out earlier if a step has failed previously Currently, the hydra.nixos.org queue contains 1000s of Darwin builds that all depend on a stdenv-darwin that previously failed. However, before, first createStep() would construct a dependency graph for each build, then getQueuedBuilds() would discover that one of the steps had failed previously and discard all those steps. Since the graph construction involves a lot of uncached calls to isValidPath(), this took several seconds per build. Now createStep() detects the previous failure right away and bails out.	2016-04-15 14:32:16 +02:00
Eelco Dolstra	d6f188a01a	Typo	2016-04-13 16:45:40 +02:00
Eelco Dolstra	f3f661bac1	Reuse build products / metrics stored in the database Previously, if the queue monitor thread encounters a build that Hydra has previously built, it downloaded the output paths from the binary cache, just to determine the build products and metrics. This is very inefficient. In particular, when doing something like merging nixpkgs:staging into nixpkgs:master, the queue monitor thread will be locked up for a long time fetching files from S3, causing the build farm to be mostly idle. Of course this is entirely unnecessary, since the build products/metrics are already in the Hydra database. So now we just look up a previous build with the same output path, and copy the products/metrics.	2016-04-13 16:30:52 +02:00
Eelco Dolstra	00c78440b1	Disambiguate "marking build as succeeded" message	2016-04-13 16:30:52 +02:00
Eelco Dolstra	80ff78b1b6	Unify build and step status codes Also remove the obsolete status code 5 from the database.	2016-03-09 15:30:43 +01:00
Eelco Dolstra	718fef29ef	Keep track of time required to load builds	2016-03-08 13:09:29 +01:00
Eelco Dolstra	45b237453a	hydra-queue-runner: Recycle finishedDrvs This should prevent the queue monitor thread from looking up the same derivations over and over again.	2016-03-08 11:52:13 +01:00
Eelco Dolstra	2ab8e9a1e0	hydra-queue-runner: Fix handling of missing derivations This barfed with 'queue monitor: ERROR: column "errormsg" of relation "builds" does not exist' due to the removal of the errorMsg column.	2016-03-07 19:05:24 +01:00
Eelco Dolstra	7cd08c7c46	Warn if PostgreSQL appears stalled	2016-02-29 15:10:30 +01:00
Eelco Dolstra	02190b0fef	Support hydra-build-products on binary cache stores	2016-02-26 14:45:03 +01:00
Eelco Dolstra	8321a3eb27	Sync with Nix	2016-02-24 14:04:31 +01:00
Eelco Dolstra	88a05763cc	Pool local store connections	2016-02-20 00:04:08 +01:00
Eelco Dolstra	2d0dd7fb49	hydra-queue-runner: Write directly to a binary cache	2016-02-15 21:10:29 +01:00
Eelco Dolstra	92d8b59361	Process Nix API changes	2016-02-11 15:59:47 +01:00
Eelco Dolstra	c087472c71	Remove superfluous "has" function	2015-11-02 14:29:12 +01:00
Eelco Dolstra	4d1816b152	Remove obsolete Builds columns and provide accurate "Running builds" This removes the "busy", "locker" and "logfile" columns, which are no longer used by the queue runner. The "Running builds" page now only shows builds that have an active build step.	2015-10-27 15:37:17 +01:00
Eelco Dolstra	53c80d9526	getQueuedBuilds(): Periodically stop to handle priority bumps Previously, priority bumps could take a long time to get noticed if getQueuedBuilds() was busy processing zillions of queue additions. (This was made worse by the reintroduction of substitute checking.)	2015-10-22 17:00:46 +02:00
Eelco Dolstra	71bf7e02d5	Use nix::willBuildLocally()	2015-10-21 15:44:29 +02:00
Eelco Dolstra	82504fe010	hydra-queue-runner: Use substitutes This allows Hydra to use binaries from available binary caches. It makes the queue monitor thread quite a bit slower, so if you don't want to use binary caches, it's better to add "--option build-use-substitutes false" to the hydra-queue-runner invocation. Fixed #243.	2015-10-05 14:57:44 +02:00
Eelco Dolstra	f8141fdc98	Set propagatedFrom for cached failed build steps	2015-09-11 15:55:26 +02:00
Eelco Dolstra	ee9bf7ace7	Account steps with preferLocalBuild as a separate system type They will show up in machineTypes as (e.g.) x86_64-linux:local instead of x86_64-linux. This is to prevent the Hydra provisioner from creating machines for steps that are supposed to be executed locally.	2015-09-02 13:42:25 +02:00
Eelco Dolstra	99bfc37764	Don't abort steps that have an unsupported system type This is necessary because the required system type can become available later (e.g. by being provisioned by the auto-scaler). However, in the future, we may want to fail steps if they have been unsupported for more than a certain amount of time.	2015-08-17 15:10:41 +02:00
Eelco Dolstra	ea1eb2e3fb	Keep track of requiredSystemFeatures in the machine stats For example, steps that require the "kvm" feature may require a different kind of machine to be provisioned. This can also be used to require performance-sensitive tests to run on a particular kind of machine, e.g., by setting requiredSystemFeatures to something like "ec2-i2.8xlarge".	2015-08-17 14:37:57 +02:00
Eelco Dolstra	d4759c1da2	hydra-queue-runner: Detect changes to the scheduling shares	2015-08-12 13:17:56 +02:00
Eelco Dolstra	576dc0c120	For completeness, re-implement meta.schedulingPriority	2015-08-12 12:05:43 +02:00
Eelco Dolstra	b7965df928	Load the queue in order of global priority	2015-08-11 02:14:34 +02:00
Eelco Dolstra	97f11baa8d	Revive jobset scheduling (I.e. taking the jobset scheduling share into account.)	2015-08-11 01:31:56 +02:00
Eelco Dolstra	eb13007fe6	Allow build to be bumped to the front of the queue via the web interface Builds now have a "Bump up" action. This will cause the queue runner to prioritise the steps of the build above all other steps.	2015-08-10 16:19:47 +02:00
Eelco Dolstra	27182c7c1d	Start steps in order of ascending build ID	2015-08-10 16:19:47 +02:00
Eelco Dolstra	4d26546d3c	Add support for tracking custom metrics Builds can now emit metrics that Hydra will store in its database and render as time series via flot charts. Typical applications are to keep track of performance indicators, coverage percentages, artifact sizes, and so on. For example, a coverage build can emit the coverage percentage as follows: echo "lineCoverage $pct %" > $out/nix-support/hydra-metrics Graphs of all metrics for a job can be seen at http://.../job/<project>/<jobset>/<job>#tabs-charts Specific metrics are also visible at http://.../job/<project>/<jobset>/<job>/metric/<metric> The latter URL also allows getting the data in JSON format (e.g. via "curl -H 'Accept: application/json'").	2015-07-31 00:57:30 +02:00
Eelco Dolstra	7e026d35f7	Split hydra-queue-runner.cc more	2015-07-21 15:14:17 +02:00

46 Commits