debian-forge-composer

Author	SHA1	Message	Date
Michael Vogt	e8a0e8ff49	weldr: update depsolve calls in weldr API Update the weldr API to work with the new depsolve API. Update tests to match (adding repo_id). Co-authored-by: Achilleas Koutsou <achilleas@koutsou.net>	2025-01-29 18:03:11 +01:00
Achilleas Koutsou	dab836de19	weldr/test: fix test run name	2025-01-29 18:03:11 +01:00
Michael Vogt	a6ba0785b0	cloudapi: fix manifestSource.Serialize()	2025-01-29 18:03:11 +01:00
Sanne Raymaekers	4e803af8cd	cloudapi: get rid of localSave check in local target The local target shouldn't require any specific configuration and should just be available always.	2025-01-24 15:26:15 +01:00
Sanne Raymaekers	7bfcac30dd	cloudapi: support worker server target artifact retrieval In order to get the artifact location from the cloudapi, add a helper function in the worker server.	2025-01-24 15:26:15 +01:00
Brian C. Lane	df16f7fc63	v2_test: Add testing for cloudapi /depsolve/blueprint Test the depsolve using a mocked response, test for mismatched distributions, and for unsupported architectures.	2025-01-23 11:39:53 -08:00
Brian C. Lane	f377c5e3eb	v2_test: Add a test-distro-1 repository This also adds an actual repository json file for the test-disro. Without this the repo.ListDistros() function doesn't return any actual distros. Related: RHEL-60125	2025-01-23 11:39:53 -08:00
Brian C. Lane	02d0b8ec01	cloudapi: Request depsolve from osbuild-worker and return the response to the client. This uses the worker to depsolve the requested packages. The result is returned to the client as a list of packages using the same PackageMetadata schema as the ComposeStatus response. It will also time out after 5 minutes and return an error, using the same timeout constant as depsolving during manifest generation. Related: RHEL-60125	2025-01-23 11:39:53 -08:00
Brian C. Lane	e06e62ca03	cloudapi: Add /depsolve/blueprint route This will allow depsolving blueprints and returning package metadata for the dependencies. Related: RHEL-60125	2025-01-23 11:39:53 -08:00
Brian C. Lane	4f3c93ef1e	cloudapi: Make sigmd5 in PackageMetadata optional In order to reuse PackageMetadata with DepsolveResponse and not include unused fields this changes the sigmd5 entry to an optional field. This doesn't effect the use of PackageMetadata in the Compose response since it is always set, and it allows it to be omitted in the response for depsolving. Also adds a basic test for stagesToPackageMetadata Related: RHEL-60125	2025-01-23 11:39:53 -08:00
Brian C. Lane	08dc5f3041	cloudapi: Move GetCustomizationsFromBlueprintRequest This function only depends on the Blueprint (cloudapi request type, not the internal/blueprint) so move it to a function on that so that it can be reused by other users of the cloudapi Blueprint. Related: RHEL-60125	2025-01-23 11:39:53 -08:00
Sanne Raymaekers	425581fcc1	cloudapi/v2: support local upload target The target validation rework broke the local upload target, which is needed for cockpit-image-builder.	2025-01-22 13:54:40 +01:00
Brian C. Lane	73101d2ff2	Fix non-constant log strings Newer versions of the go compiler (1.24 in this case) fail when running go test during a mock rebuild of the srpm created by 'make srpm' on Fedora 42. Even though we currently don't support go1.24, fix these so they don't become an issue when we do.	2025-01-21 16:51:20 -08:00
Michael Vogt	af0543d27c	many: update images Manifest() API for PR#1107 This updates composer to use the updated API in images around the seed handling for manifests, see images PR#1107 for details. Note that this has no semantic changes yet. We could now simplfy some things because images will auto-seed but that is for a followup.	2025-01-20 09:50:49 +01:00
Lukas Zapletal	d531f62488	blueprint: add cacert customization	2025-01-10 10:26:54 +01:00
Florian Schüller	65b7ee65b2	osbuild-service-maintenance: implement removal of launch templates Launch templates of instances that are terminated should be removed. HMS-3632	2024-12-10 11:43:51 +01:00
Florian Schüller	a96ea533c0	osbuild-service-maintenance: implement removal of security groups Security groups of instances that are terminated should be removed. HMS-3632	2024-12-10 11:43:51 +01:00
Florian Schüller	7ebe266d3c	osbuild-service-maintenance: implement removal on invalid parent Add a safeguard to ensure secure instances without valid parent instances are terminated, as they are unnecessary to retain. Typically, the parent does not exist if the secure instance is older than 2 hours, but this check provides additional validation. HMS-3632	2024-12-10 11:43:51 +01:00
Tomáš Hozza	1f590aa232	Weldr/ComposeRequest: OSTree options `nil` if not set Previously, the `OSTree` property in the Weldr API `ComposeRequest` struct was not a pointer to the `ostree.ImageOptions` type. As a result, it was initialized to an empty struct, even if not set in the client API call. As a result, the `OSTree` property in the `distro.ImageOptions` was always not `nil`, when initializing the osbuild manifest. However, after a change in `osbuild/images` [0], providing OSTree options for non-OSTree image types is no longer considered valid. This caused a failure to submit a new compose for any non-OSTree image type. Change the `OSTree` property in Weldr `ComposeRequest` to be a pointer and mark it as optional. [0] https://github.com/osbuild/images/pull/1071 Signed-off-by: Tomáš Hozza <thozza@redhat.com>	2024-12-09 09:46:54 +01:00
Sanne Raymaekers	f6feb7675b	cloud/awscloud: use any instance create fleet returns Even in case of errors, as long as create fleet returns an instance, attempt to use it. In some cases AWS returns `InsufficientInstanceCapacity` but still creates an instance: ``` msg="Won't retry CreateFleet with OnDemand instance, retry: false, errors: InsufficientInstanceCapacity: There is no Spot capacity available that matches your request.; Already launched instance ([i-...]), aborting create fleet" msg="doCreateFleetRetry: returning retry: false, msg: [InsufficientInstanceCapacity: There is no Spot capacity available that matches your request. Already launched instance ([i-...]), aborting create fleet]" msg="doCreateFleetRetry: cancelling retry, instance already exists: [i-...]" msg="doCreateFleetRetry: setting retry to true" msg="Checking to retry fleet create on error InsufficientInstanceCapacity (msg: There is no Spot capacity available that matches your request.)" ```	2024-12-03 14:00:12 +01:00
Lukas Zapletal	4b55bc2825	cloudapi: carry ostree MTLS secret over	2024-12-03 13:59:45 +01:00
Sanne Raymaekers	779053d910	cloud/awscloud: give secure instances a name That way you can just enter the parent instance id into the search bar and get both the worker and its executor.	2024-12-03 11:56:52 +01:00
Sanne Raymaekers	38b799f162	cloud/awscloud: exclude really old instance types RHEL 10 (nightly) builds fail on stage with "Fatal glibc error: CPU does not support x86-64-v3", this is most likely due to very old instance types not supporting a specific instruction set.	2024-11-29 15:42:27 +01:00
Ondřej Budai	64ff0e3dad	awscloud: add very verbose logging to createFleet creation We still see this error sometimes: Unable to start secure instance: Unable to create fleet: InsufficientInstanceCapacity: There is no Spot capacity available that matches your request This is awkward because the message mentions that there is no spot capacity, even though the current code should retry on InsufficientInstanceCapacity. I also confirmed this by searching for the retries log messages: there are none in the logs. We need a bigger hammer. Let's log everything that happens in the createFleet method in order to have better understanding why the retry logic isn't triggered. We should probably move most of the newly added logs to the debug level, but let's delay that until we have more insight into what's happening.	2024-11-26 16:12:09 +01:00
Sanne Raymaekers	54ffc08814	awscloud/secure-instance: pass on fleet information on error By surfacing the output even in case of an error, the fleet ID and instance ID can be extracted if present. Thus the instance can be terminated before its dependencies are deleted.	2024-11-26 12:52:12 +01:00
Sanne Raymaekers	7a166cd356	awscloud/secure-instance: log error code comparisons We're seeing some behaviour where create fleet is not retried and subsequently the SI cleanup fails due to the security group already being tied to an existing instance. There is no error that an instance was launched anyway.	2024-11-26 12:52:12 +01:00
Florian Schüller	446e8448e3	awscloud/secure-instance: retry for 10 minutes retry for 10 x 60sec. and don't log retries twice	2024-11-22 12:19:32 +01:00
Florian Schüller	4ec8894244	awscloud/secure-instance: retry on error in terminated waiter terminated waiter sometimes responded with "waiter state transitioned to Failure" where we want to retry waiting for the termination	2024-11-22 12:19:32 +01:00
Sanne Raymaekers	8fd36225be	cloudapi/v2: support HyperV generation in Azure upload options	2024-11-21 11:22:20 +01:00
Sanne Raymaekers	fb3e1b0701	internal/upload/azure: support different hyper v generations When registering an image, users should be able to choose their hyper V gen, as gen1 is quite outdated by now.	2024-11-21 11:22:20 +01:00
Sanne Raymaekers	d2f50a4224	internal/target: add Azure image HyperV generation	2024-11-21 11:22:20 +01:00
Florian Schüller	b5c71cd7e2	awscloud/secure-instance: enrich logging with secure instance id we'll log as direct URL to the console for easier tracing	2024-11-19 17:26:23 +01:00
Florian Schüller	992f876da0	cloudapi/v2/server: rephrase error message	2024-11-19 13:55:38 +01:00
Florian Schüller	02778b5361	cloudapi/v2/server: assure order of fail-calls by avoiding map but rather using a slice the order of SetFailed is maintained	2024-11-19 13:55:38 +01:00
Florian Schüller	ca3f0a190f	internal/jobqueue/jobqueuetest/jobqueuetest: fix DB tests I got confused as the jobqueue interface is asymmetric. It expects an object and returns a json.RawMessage and when handing over to postgres this is abstracted away by postgres	2024-11-19 13:55:38 +01:00
Florian Schüller	2f4d7d3140	internal/cloudapi/v2/server: remove osbuild job explicitly set "failed" osbuild job is a dependency of the resolve and manifest jobs so leaving the state and it will fail as a depency is also fine	2024-11-19 13:55:38 +01:00
Florian Schüller	d3e3474fb7	internal/worker/server: return an error on depsolve timeout HMS-2989 Fixes the special case that if no worker is available and we generate an internal timeout and cancel the depsolve including all followup jobs, no error was propagated.	2024-11-19 13:55:38 +01:00
Sanne Raymaekers	2eb3c9f44c	worker/server: add tests for job heartbeats	2024-11-07 17:18:48 +01:00
Sanne Raymaekers	14bd8d38ca	worker/server: add basic tests for Pending / Running job metrics	2024-11-07 17:18:48 +01:00
Sanne Raymaekers	a971f9340b	worker/server: update metrics on requeue When requeuing a job the next worker requesting the job would decrement pending counter, but the pending counter only ever got incremented once, when the job was first enqueued. Thus make sure to increment the pending counter when a job is requeued.	2024-11-07 17:18:48 +01:00
Sanne Raymaekers	056b3c5ea6	jobqueue: return if a job was requeued or not	2024-11-07 17:18:48 +01:00
Lukas Zapletal	64f479092d	osbuild-worker: use the new ostree resolver API	2024-11-07 16:17:56 +01:00
Florian Schüller	ece16307c6	jobqueuetest: avoid warning and provide a valid JSON Not needed for the test but just generates a useless warning	2024-11-06 15:16:42 +01:00
Florian Schüller	00d3f07d08	Makefile: implement `make db-tests` enables the option to run the DB tests locally that are executed in the github actions	2024-11-06 15:16:42 +01:00
Sanne Raymaekers	aeba9d5a68	cloud/awscloud: don't specify max spot price The current spot price could be limiting the available instance pool significantly. ARM instances specifically are experiencing a lot of capacity errors.	2024-10-30 15:41:09 +01:00
Sanne Raymaekers	4afcd8c3fd	cloud/awscloud: fix another nilpointer in maintenance functions	2024-10-25 17:46:49 +02:00
Sanne Raymaekers	4f90a757dc	cloud/awscloud: fix retrying to create secure instances Set the correct target capacity specification type, just setting the spot options to nil doesn't result in an on demand instance.	2024-10-24 20:25:48 +02:00
Sanne Raymaekers	6ccfc7f818	cloud/awscloud: fix nil pointer dereference in maintenance fns The maintenance pod is crashing when describing the images by tag, most likely something else is failing.	2024-10-24 12:05:42 +02:00
Sanne Raymaekers	d5912259a0	cloud/awscloud: rework create fleet retry logic The current path sometimes launches two instances, which is problematic because the rest of the secure instance code expects exactly one instance. A security group could be attached to both instances, and would block the worker from launching any more SIs, as it tries to delete the old security group first, which is still held by one of the surplus SIs which didn't get terminated. Only retry if: - on "UnfulfillableCapacity" or "InsufficientInstanceCapacity" error codes; - there wasn't an instance launched anyway. If either of these checks fail, do not try to launch another one, and just fail the job.	2024-10-24 10:29:26 +02:00
Sanne Raymaekers	1c7a276d6f	cloud/aws: add maintenance functions for secure instance cleanup	2024-10-23 10:32:57 +02:00

1 2 3 4 5 ...

3245 commits