If a job is unresponsive, the worker has most likely crashed or been
shut down and the in-progress job has been lost.
Instead of failing these jobs, requeue them up to two times. Once a job
is lost a third time, it fails. This avoids infinite loops.
This is implemented by extending FinishJob to RequeueOrFinishJob. It
takes a maximum number of requeues as an argument; if that is 0, it
behaves exactly as FinishJob used to.
If the maximum number of requeues has not yet been reached, the running
job is returned to the pending state to be picked up again.
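A minimal sketch of the idea, using stand-in types rather than the real
jobqueue internals:

```go
package jobqueue

import (
	"encoding/json"
	"fmt"
	"time"

	"github.com/google/uuid"
)

// Minimal stand-ins for illustration; the real jobqueue types differ.
type job struct {
	retries   uint64
	startedAt time.Time
	result    json.RawMessage
}

type queue struct {
	jobs    map[uuid.UUID]*job
	pending []uuid.UUID
}

// RequeueOrFinishJob returns a running job to the pending state unless
// it has already been requeued maxRequeues times, in which case it is
// finished. With maxRequeues == 0 it behaves exactly like FinishJob.
func (q *queue) RequeueOrFinishJob(id uuid.UUID, maxRequeues uint64, result json.RawMessage) error {
	j, ok := q.jobs[id]
	if !ok {
		return fmt.Errorf("job %s does not exist", id)
	}
	if j.retries < maxRequeues {
		j.retries++
		j.startedAt = time.Time{} // no longer running
		q.pending = append(q.pending, id)
		return nil
	}
	j.result = result // finish the job for good
	return nil
}
```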
Add support for embedding container images via the Cloud API. For this,
the container-resolve job was plumbed into the Cloud API's handler and
the API specification was updated with a new `containers` section that
mimics the blueprint section of the same name.
The ellipsis (variadic) operator was used as a hack to avoid having to
pass any details as an argument, but it makes it less obvious what the
end object will actually look like. It also makes it impossible to pass
an array as details without getting a nested array, as illustrated
below.
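A small, runnable illustration of the nesting problem, with a
hypothetical variadic signature:

```go
package main

import "fmt"

// Hypothetical variadic signature of the kind being removed.
func newDetails(details ...interface{}) []interface{} {
	return details
}

func main() {
	d := []string{"a", "b"}
	// details becomes []interface{}{[]string{"a", "b"}}: the slice is
	// wrapped as a single variadic element, i.e. a nested array.
	fmt.Println(newDetails(d))
	// Spreading with newDetails(d...) does not compile either, because
	// []string cannot be passed directly as []interface{}.
}
```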
Fixes #2874
Since the `jobStatus` functions return a `JobInfo`
struct that contains the `JobStatus`, it makes sense
to rename the functions for the sake of consistency.
The osbuild job type currently contains the
architecture as a suffix. Since the arch
is now supplied as a label, the
`arch` suffix can be removed.
The number of return values from the `jobStatus`
function was growing and getting out of hand. Not
all return values were being used in all cases,
so returning a single struct with the information
and status of a job makes more sense. Each caller
can then use the resulting fields as needed.
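Roughly, the shape of the consolidated return value; the field names
are assumptions, not necessarily the exact ones in the worker package:

```go
package worker

import (
	"time"

	"github.com/google/uuid"
)

// jobInfo bundles everything jobStatus used to return separately.
type jobInfo struct {
	JobType  string
	Channel  string
	Deps     []uuid.UUID
	Queued   time.Time
	Started  time.Time
	Finished time.Time
	Canceled bool
}

// Before: queued, started, finished, canceled, deps, err := s.jobStatus(id)
// After:  info, err := s.jobInfo(id); callers pick the fields they need.
```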
Add the architecture label to build jobs,
which enables filtering and monitoring
build jobs by architecture. Build job results
contain the `arch` field in the results struct;
when a value is present it is passed to the
metrics, otherwise an empty string is used.
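As a sketch of how the label might be wired into the metrics (the
metric name is hypothetical; the client_golang API is real):

```go
package metrics

import "github.com/prometheus/client_golang/prometheus"

// Hypothetical metric name; the point is the new "arch" label.
var dequeuedJobs = prometheus.NewCounterVec(prometheus.CounterOpts{
	Name: "composer_dequeued_jobs_total",
	Help: "Total number of dequeued build jobs.",
}, []string{"type", "arch"})

func init() {
	prometheus.MustRegister(dequeuedJobs)
}

// recordDequeue passes the arch from the job result when present,
// otherwise an empty string.
func recordDequeue(jobType, arch string) {
	dequeuedJobs.WithLabelValues(jobType, arch).Inc()
}
```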
Remove a duplicate call to the `DequeueJobMetrics`
function in the worker server. This duplicate call
resulted in negative numbers for pending jobs in
the Prometheus metrics.
Enhance the `koji-finalize` job implementation to be able to cope with
multiple upload targets being specified for an `OSBuildJob`.
Implement a convenience method `OSBuildJobResult.TargetResultsByName()`
for filtering the target results attached to the job result by their
name. Cover the method with a unit test. Lastly, use this method in
the `koji-finalize` job to find the appropriate Koji upload target
results.
This is a preparation for enabling cloud uploads for Koji composes.
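A sketch of the convenience method with minimal stand-in types (the
real types live in the worker and target packages):

```go
package worker

// Minimal stand-ins for the real target result types.
type TargetResult struct {
	Name string
}

type OSBuildJobResult struct {
	TargetResults []*TargetResult
}

// TargetResultsByName filters the target results attached to the job
// result by their name.
func (r *OSBuildJobResult) TargetResultsByName(name string) []*TargetResult {
	var matches []*TargetResult
	for _, tr := range r.TargetResults {
		if tr.Name == name {
			matches = append(matches, tr)
		}
	}
	return matches
}
```

`koji-finalize` can then select the results of the Koji upload target by
passing that target's name.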
Enhance the `koji-finalize` job implementation to use a deferred
function to ensure that the job status is always reported back to the
composer. In addition, if the `JobError` is set, also fail the Koji job.
Previously, composer and Koji were not updated in some corner cases
where the job failed.
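The pattern, reduced to a runnable sketch with illustrative names:

```go
package main

import "fmt"

type jobError struct{ Reason string }

type kojiFinalizeResult struct {
	JobError *jobError
}

// run always reports the result back to composer on exit, and fails the
// Koji build whenever a JobError was recorded, no matter which path the
// job took.
func run() {
	var result kojiFinalizeResult
	defer func() {
		if result.JobError != nil {
			fmt.Println("failing Koji build:", result.JobError.Reason)
		}
		fmt.Println("reporting result to composer")
	}()

	// ... job work; any failure just records the error:
	result.JobError = &jobError{Reason: "upload failed"}
}

func main() { run() }
```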
This is the first step to support embedding container images. Here we
add the `containers []container.Spec` argument to supply images with
resolved container specifications. For now, all distros will return an
error if a container is actually supplied, since none of them currently
support embedding containers (see the sketch below). NB: no APIs or
tools will actually resolve containers yet.
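A reduced sketch of the interim behavior; the type is a stand-in for
`container.Spec` and the function name is abbreviated:

```go
package distro

import "fmt"

// Stand-in for container.Spec: a resolved container image reference.
type ContainerSpec struct {
	Source string
	Digest string
}

// manifest sketches the interim behavior: every distro rejects the new
// argument until it actually supports embedding containers.
func manifest(containers []ContainerSpec) ([]byte, error) {
	if len(containers) > 0 {
		return nil, fmt.Errorf("embedding containers is not supported by this distro")
	}
	// ... build the manifest as before ...
	return []byte(`{}`), nil
}
```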
The Koji API removed by the previous commit was the last user of the
osbuild-koji job. Let's remove the job since nothing uses it. This also
removes all of the compatibility code in the Cloud API; see concerns
below:
Compatibility concerns:
- the internal deployment was moved to a completely different composer
instance, thus there are no old jobs
- Fedora deployment is still unused in prod, thus we don't care about keeping
backward compatibility of the old jobs
Signed-off-by: Ondřej Budai <ondrej@budai.cz>
The test_distro Manifest, which is used in tests across multiple
packages, was using the old structure. Update it to the v2 structure and
adapt all tests.
The osbuild export is specific to the upload target and different
targets may require using a different export. While osbuild-composer
still does not support multiple exports for osbuild jobs, this prepares
the ground for such support in the future.
The backward compatibility with older implementations of the composer
and workers is kept on the JSON (un)marshaling level, where the JSON
message is always a superset of the old and new ways of providing the
exports to the osbuild job.
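A sketch of the superset idea with hypothetical JSON field names: the
wire format carries both the legacy job-wide exports and the new
per-target export, so old and new peers each find what they expect.

```go
package worker

import "encoding/json"

// Hypothetical wire format: a superset of the old and new shapes.
type targetJSON struct {
	Name          string `json:"name"`
	OsbuildExport string `json:"osbuild_export,omitempty"` // new, per-target
}

type osbuildJobJSON struct {
	Exports []string     `json:"export_stages,omitempty"` // legacy, job-wide
	Targets []targetJSON `json:"targets,omitempty"`
}

// marshalJob duplicates the per-target exports into the legacy field so
// an older worker still finds the export where it expects it.
func marshalJob(targets []targetJSON) ([]byte, error) {
	j := osbuildJobJSON{Targets: targets}
	for _, t := range targets {
		j.Exports = append(j.Exports, t.OsbuildExport)
	}
	return json.Marshal(j)
}
```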
Backward compatibility code handling the conversion of VMDK images to
the stream-optimized sub-format has been kept in the implementation
since PR#2529 [1] was merged on May 4th, 2022. Since that change, no API
implementation submits jobs that would hit this conversion code, because
VMDK images are already produced in the desired sub-format.
On-premise deployments are expected to use the same composer and worker
versions, and there are no composer / worker instances in production
that are not running the modified code.
Delete the backward compatibility code.
[1] https://github.com/osbuild/osbuild-composer/pull/2529
Add a new worker client error type `ErrorTargetError` representing that
at least one of the job's targets failed. The actual target errors are
added to the job error details.
Add a new `OSBuildJobResult.TargetErrors()` method for gathering a slice
of target errors contained within an `OSBuildJobResult` instance. Cover
the method with a unit test.
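A sketch with minimal stand-in types; the real method operates on the
worker's target result and client error types:

```go
package worker

// Minimal stand-ins for the real types.
type TargetError struct {
	Reason string
}

type TargetResult struct {
	Name        string
	TargetError *TargetError
}

type OSBuildJobResult struct {
	TargetResults []*TargetResult
}

// TargetErrors gathers the errors of all failed targets attached to the
// job result.
func (r *OSBuildJobResult) TargetErrors() []*TargetError {
	var errs []*TargetError
	for _, tr := range r.TargetResults {
		if tr.TargetError != nil {
			errs = append(errs, tr.TargetError)
		}
	}
	return errs
}
```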
`TargetErrors` has not been used since PR#2192 [1] and there is no
longer a need to keep backward compatibility, because there are no
composer / worker instances in production that are not running the
modified code.
In addition, delete the unit tests covering this legacy error handling.
[1] https://github.com/osbuild/osbuild-composer/pull/2192
Previously, we just used an empty struct when the heartbeat failed. This
is fine for the osbuild job because it's treated as failed when
result.OSBuildResult == false, which is the default value.
koji-finalize works differently though: it's in a failed state if there's
a job error or kojiError != "". So when a failed heartbeat set the struct
to be empty, this was treated as success because there was no error.
Let's fix this by introducing a new error for the situation where we
don't get a heartbeat in time for a specific job.
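Roughly what the fix amounts to, with illustrative names: the server now
records an explicit job error instead of an empty struct when the
heartbeat times out.

```go
package worker

// Illustrative error shape; the real code uses the worker client error
// types.
type jobError struct {
	ID     string
	Reason string
}

type kojiFinalizeResult struct {
	KojiError string
	JobError  *jobError
}

// resultForMissedHeartbeat is what gets stored instead of an empty
// struct, so koji-finalize no longer reads the result as a success.
func resultForMissedHeartbeat() kojiFinalizeResult {
	return kojiFinalizeResult{
		JobError: &jobError{
			ID:     "ErrorJobMissingHeartbeat", // hypothetical error ID
			Reason: "worker stopped responding for this job",
		},
	}
}
```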
The worker server API handler `UploadJobArtifact()` was previously
silently discarding artifacts uploaded by the worker if the server was
configured to not accept artifacts.
Change the behavior to return HTTP error "Bad Request" (`400`) to the
worker, in case it tries to upload an artifact to a server that is
configured to not accept any artifacts.
Add a new unit test covering the new behavior and adjust existing unit
tests that relied on the artifact being silently discarded.
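Sketch of the changed handler behavior (echo-style; the handler and
server shapes are simplified assumptions):

```go
package server

import (
	"net/http"

	"github.com/labstack/echo/v4"
)

type apiHandlers struct {
	artifactsEnabled bool
}

// UploadJobArtifact now rejects uploads instead of silently discarding
// them when the server does not accept artifacts.
func (h *apiHandlers) UploadJobArtifact(ctx echo.Context) error {
	if !h.artifactsEnabled {
		return echo.NewHTTPError(http.StatusBadRequest,
			"this server does not accept job artifacts")
	}
	// ... store the artifact as before ...
	return ctx.NoContent(http.StatusOK)
}
```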
Add an `EnsureJobChannel()` middleware method, intended for
`compose/<id>` endpoints. Its purpose is to ensure that the tenant
channel set in the request `echo.Context` matches the tenant channel
associated with the compose. In case of a mismatch, `404` is returned.
Add a `JobChannel()` method to the worker server implementation for
requesting the channel associated with a job.
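A sketch of the middleware, assuming a hypothetical channel lookup and
that an earlier middleware stored the tenant channel on the context:

```go
package server

import (
	"net/http"

	"github.com/google/uuid"
	"github.com/labstack/echo/v4"
)

type jobChannelFunc func(uuid.UUID) (string, error)

// ensureJobChannel returns 404 whenever the tenant channel on the
// request context does not match the channel of the requested compose.
func ensureJobChannel(jobChannel jobChannelFunc) echo.MiddlewareFunc {
	return func(next echo.HandlerFunc) echo.HandlerFunc {
		return func(ctx echo.Context) error {
			jobID, err := uuid.Parse(ctx.Param("id"))
			if err != nil {
				return echo.NewHTTPError(http.StatusNotFound)
			}
			channel, err := jobChannel(jobID)
			if err != nil || ctx.Get("channel") != channel {
				return echo.NewHTTPError(http.StatusNotFound)
			}
			return next(ctx)
		}
	}
}
```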
Switch to using `osbuild` job type with `koji` upload target for Koji
build jobs, instead of using `osbuild-koji` job type.
Modify unit tests accordingly.
Add a new `JobDependencyChainErrors()` method for gathering a stack
trace of job errors from the job's dependencies which caused it to fail.
The `JobDependencyChainErrors()` implementation intentionally uses the
job-type specific `...Status()` methods, because each of them checks the
job's result in a slightly different way and sets result.JobError to a
specific value. For this reason, it would not be practical to introduce
a generic `JobStatus()` method and get rid of the `switch` block,
because in reality, the new method would have to implement an equivalent
`switch` block as well.
Add a unit test covering the method's functionality.
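The gathering itself, reduced to a recursive sketch over stand-in types:

```go
package worker

// Stand-in for a job with its result error and dependencies.
type depJob struct {
	Err  string // "" means the job succeeded
	Deps []*depJob
}

// dependencyChainErrors collects the error of a failed job followed by
// the errors of its failed dependencies, depth first: a "stack trace"
// of what caused the failure.
func dependencyChainErrors(j *depJob) []string {
	if j.Err == "" {
		return nil
	}
	errs := []string{j.Err}
	for _, dep := range j.Deps {
		errs = append(errs, dependencyChainErrors(dep)...)
	}
	return errs
}
```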
Ensure that none of the job's dependencies failed. This covers the case
when there is more than one job dependency, which will be the case for
Koji composes.
Previously, the `OSBuild` job assumed that it could have only a single
job dependency, which could only be the `ManifestJobByID` job. This
won't work for the Koji use case, because the Koji OSBuild job also
depends on the Koji-init job.
Extend the `worker.OSBuildJob` structure with a new field, which holds
the index of the `ManifestJobByIDResult` in the job's dynamic arguments
slice. This value is consulted when there is more than one dependency
of the `OSBuild` job.
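Sketch of the new field; the field name and JSON tag are assumptions:

```go
package worker

// With more than one dependency, the job records where in its dynamic
// arguments the manifest result sits instead of assuming it is the
// only one.
type OSBuildJob struct {
	// ...existing fields...

	// Index of the ManifestJobByIDResult in the job's dynamic
	// arguments slice; only consulted when the job has more than one
	// dependency.
	ManifestDynArgsIdx *int `json:"manifest_dyn_args_idx,omitempty"`
}
```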
Define supported job type names as constants and use them in all places,
instead of string literals.
There are multiple benefits to this approach. Using constants removes
the room for typos in string literals. One can use IDE autocompletion
for job types. Constants also make it easier to find all references and
thus all places that handle a specific job type.
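For illustration, the constant block might look like this; the names
follow the job types mentioned in this series, but the exact spellings
are assumptions:

```go
package worker

// Supported job types, replacing string literals across the codebase.
const (
	JobTypeOSBuild      = "osbuild"
	JobTypeDepsolve     = "depsolve"
	JobTypeManifestByID = "manifest-id-only"
	JobTypeKojiInit     = "koji-init"
	JobTypeKojiFinalize = "koji-finalize"
)

// Callers switch on the constants instead of literals:
//   case JobTypeOSBuild: ...
```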
Change the definition of the `EnqueueOSBuildAsDependency()` function to
accept a slice of job IDs on which the OSBuild job depends. Previously,
the manifest job ID was the only possible dependency. This change is
needed in order to enqueue OSBuild jobs for Koji, which depend on two
jobs.
It is generally useful to have this information in the
`OSBuildJobResult`. This information is currently part of the
`OSBuildKojiJobResult`. Instead of moving it to the new
`KojiTargetResultOptions`, let's move it to the `OSBuildJobResult`
structure and set it for all jobs.
Include a tenant label in all Prometheus metrics. Modify the
`jobStatus` function in the worker accordingly to return the channel
so it can be passed to Prometheus.
Test the conversion of the new and old DepsolveJob given the custom
marshaller.
The deserialised old format is not exactly the same as it would have
been before, but it is functionally equivalent, with the added benefit
of supporting depsolve jobs where we don't want base repositories to be
used by all depsolves.
Add a custom marshaller for DepsolveJob that serialises the struct into
a format compatible with both the new and old formats. The format on the
wire is a superset of both and can be deserialised into either while
retaining all information.
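A sketch of the superset format with hypothetical field names:
per-package-set repositories (new) are emitted alongside the global
repository list (old), so either side can deserialise the part it
understands.

```go
package worker

import "encoding/json"

type repoConfig struct {
	BaseURL string `json:"baseurl"`
}

type packageSet struct {
	Include      []string     `json:"include"`
	Repositories []repoConfig `json:"repositories,omitempty"` // new format
}

// depsolveJobJSON is the wire format: a superset of old and new shapes.
type depsolveJobJSON struct {
	PackageSets map[string][]packageSet `json:"grouped_package_sets"` // new
	Repos       []repoConfig            `json:"repos,omitempty"`      // old, global
}

// marshalDepsolveJob writes the shared repositories both per package set
// (for new readers) and globally (for old readers).
func marshalDepsolveJob(sets map[string][]packageSet, shared []repoConfig) ([]byte, error) {
	out := depsolveJobJSON{PackageSets: map[string][]packageSet{}, Repos: shared}
	for name, chain := range sets {
		for _, ps := range chain {
			// Copy before appending to avoid sharing the backing array.
			ps.Repositories = append(append([]repoConfig{}, ps.Repositories...), shared...)
			out.PackageSets[name] = append(out.PackageSets[name], ps)
		}
	}
	return json.Marshal(out)
}
```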
Move package set chain collation to the distro package and add
repositories to the package sets while returning the package sets from
their source, i.e., the ImageType.PackageSets() method.
This also removes the concept of "base repositories". There are no
longer repositories that are added implicitly to all package sets;
instead, each package set needs to specify *all* the repositories it
will be depsolved against.
This paves the way for the requirement we have for building RHEL 7
images with a RHEL 8 build root. The build root package set has to be
depsolved against RHEL 8 repositories without any "base repos" included.
This is now possible since package sets and repositories are explicitly
associated from the start and there is no implicit global repository
set.
The change requires adding a list of PackageSet names to the core
rpmmd.RepoConfig. In the cloud API, repositories that are limited to
specific package sets already contain the correct package set names and
these are now copied to the internal RepoConfig when converting types in
genRepoConfig().
The user-specified repositories are only associated with the payload
package sets, as before.
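The explicit association, sketched on a reduced RepoConfig; only the
new field is taken from the text above, the rest is abbreviated and the
repository and set names in the usage comment are invented:

```go
package rpmmd

// RepoConfig, reduced to the relevant parts for this change.
type RepoConfig struct {
	Name    string
	BaseURL string

	// PackageSets lists the names of the package sets this repository
	// is depsolved against. No repository is implicitly "base" any
	// more: a repo that does not list a package set is not used for it.
	PackageSets []string
}

// The RHEL 7 image / RHEL 8 build root case then becomes expressible:
//
//	rhel8 := RepoConfig{Name: "rhel8-baseos", PackageSets: []string{"build"}}
//	rhel7 := RepoConfig{Name: "rhel7-server", PackageSets: []string{"os"}}
```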
We are interested in the time from when a job could first be dequeued
until it actually is, but a job cannot be dequeued while it has
dependencies that are not yet finished.
Change the logic to measure the time since the last dependency finished
rather than since the job was queued.
The purpose of this metric is to have an alert fire in case we have too
few workers processing jobs.
The VMDK image is already produced as stream-optimized. Therefore, stop
setting the `StreamOptimized` option in the `OSBuildJob` structure in
both the Weldr and Cloud APIs.
Keep the handling of the option in the worker for backward
compatibility, in case an older instance of the Composer server is used
which does not produce VMDK manifests as stream-optimized. In that case,
the worker needs to convert the image.
In the internal deployment, we want to talk to composer over an
http/https proxy. This commit adds a new composer.proxy field to the
worker config that causes the worker to connect to composer and the
OAuth server using the specified proxy.
NB: The proxy is not supported when connecting to composer via Unix
sockets.
For testing this, I added a small HTTP proxy implementation; please
don't use it in production, it's just good enough for tests.
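Roughly what the new setting does under the hood, sketched with the
standard library; the config key follows the text above, while the proxy
URL and function name are invented for illustration:

```go
package worker

import (
	"net/http"
	"net/url"
)

// Worker config (TOML):
//
//	[composer]
//	proxy = "http://proxy.internal:3128"

// newComposerClient builds the HTTP client the worker uses to talk to
// composer and the OAuth server through the configured proxy.
func newComposerClient(proxy string) (*http.Client, error) {
	proxyURL, err := url.Parse(proxy)
	if err != nil {
		return nil, err
	}
	return &http.Client{
		Transport: &http.Transport{Proxy: http.ProxyURL(proxyURL)},
	}, nil
}
```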
Signed-off-by: Ondřej Budai <ondrej@budai.cz>