debian-forge

Author	SHA1	Message	Date
Achilleas Koutsou	3a717e170a	sources: add org.osbuild.skopeo-index source A new source module that can download a multi-image manifest list from a container registry. This module is very similar to the skopeo source, but instead downloads a manifest list with `--multi-arch=index-only`. The checksum of the source object must be the digest of the manifest list that will be stored and the manifest that is downloaded must be a manifest-list.	2023-03-31 14:57:26 +02:00
Achilleas Koutsou	ce29a4af73	sources/skopeo: change local container format Change the local storage format for containers to the `dir` format. The `dir` format will be used to retain signatures and manifests. The remove-signatures option is removed since the storage format now supports them. The final move (os.rename()) at the end of the fetch_one() method now creates the checksum directory if it doesn't exist and moves the child archive into it, adding to any existing archives that might exist in other formats (from a previous version downloading a `docker-archive`). Dropped the .tar suffix from the symlink in the skopeo stage since it's not necessary and the target of the link might be a directory now. The parent class exists() method checks if there is a file in the sources cache that matches the checksum. For containers, this used to be a file called container-image.tar under a directory that matches the checksum, so for containers it always returned False. Added an override for the skopeo source that checks for the new directory archive.	2023-03-31 14:57:26 +02:00
Achilleas Koutsou	be76d6f355	sources/skopeo: fix comment typo	2023-03-31 14:57:26 +02:00
Sanne Raymaekers	ae563ff896	sources/ostree: fix quotation marks in mTLS remote options Example of broken repo config: ``` ... "tls-client-key-path=/etc/pki/consumer/key.pem" "tls-client-cert-path=/etc/pki/consumer/cert.pem" ```	2023-01-13 11:35:43 +01:00
Sanne Raymaekers	925ca9b41e	sources/ostree: set contenturl when pulling from remote If a contenturl is specified, the url is used only for metadata. This is useful when the actual content is hosted separately.	2022-10-14 12:04:54 +02:00
Sanne Raymaekers	cc9d05c201	sources/ostree: fix mTLS certs remote option These options take the format --set "KEY_VALUE". Also fix the string formatting.	2022-10-14 12:04:54 +02:00
Sanne Raymaekers	fcaad0462a	sources/ostree: pull from remote using rhsm mTLS certs The consumer certs are used to uniquely identify a system against candlepin. These consumer certs can be used to identify the system when pulling from RH controlled ostree repositories.	2022-10-11 16:49:45 +02:00
Simon de Vlieger	ea6085fae6	osbuild: run isort on all files	2022-09-12 13:32:51 +02:00
Achilleas Koutsou	f699720dbd	sources/curl: quote URL paths before downloading Some package versions [1] can contain carets and other characters that curl doesn't like. These need to be URL encoded. Interestingly, the documented way of replacing components in a parsed URL from urllib in Python is by calling the (seemingly private) `_replace()` method [2]. [1] https://docs.fedoraproject.org/en-US/packaging-guidelines/Versioning/#_snapshots [2] https://docs.python.org/3/library/urllib.parse.html#url-parsing	2022-08-31 22:28:54 +01:00
Christian Kellner	51315a985a	stages/skopeo: use extra intermediate download dir Instead of downloading the image directly to the temporary directory and then moving that temporary directory into the cache use one more intermediate directory and move that into the cache. The reason is that on Python 3.6 removing the temporary directory itself will make Python crash like this: Python 3.6.8 (default, Sep 9 2021, 07:49:02) [GCC 8.5.0 20210514 (Red Hat 8.5.0-3)] on linux Type "help", "copyright", "credits" or "license" for more information. >>> import tempfile >>> with tempfile.TemporaryDirectory(prefix="tmp-download-") as tmpdir: ... import os ... os.rename(tmpdir, "/tmp/foo") Traceback (most recent call last): File "<stdin>", line 3, in <module> File "/usr/lib64/python3.6/tempfile.py", line 809, in __exit__ self.cleanup() File "/usr/lib64/python3.6/tempfile.py", line 813, in cleanup _shutil.rmtree(self.name) File "/usr/lib64/python3.6/shutil.py", line 477, in rmtree onerror(os.lstat, path, sys.exc_info()) File "/usr/lib64/python3.6/shutil.py", line 475, in rmtree orig_st = os.lstat(path) FileNotFoundError: [Errno 2] No such file or directory: '/tmp/tmp-download-adl86mwa'	2022-07-19 19:52:25 +02:00
Christian Kellner	4647140808	source/skopeo: use subprocess.check_output Use `subprocess.check_output` instead of `run(..., capture_output=True)` since the latter only got added in Python 3.7 and our codebase needs to be compatible with 3.6 due to RHEL 8.x.	2022-07-13 20:06:42 +02:00
Simon de Vlieger	3fd864e5a9	osbuild: fix optional-types Optional types were provided in places but were not always correct. Add mypy checking and fix those that fail(ed).	2022-07-13 17:31:37 +02:00
Achilleas Koutsou	c8073b5836	sources: support calling curl with --insecure Add support for the `--insecure` curl flag, which makes curl skip the verification step when making secure connections (e.g., https://). This allows osbuild to download files from servers configured with SSL/TLS but whose certificate cannot be validated. This is supported for configuring repository sources in osbuild-composer.	2022-06-14 22:13:39 +02:00
Simon de Vlieger	7b0e1fe5fd	sources: curl max_workers 2 * num_cpus This changes the curl source to use the number of cpus times two for its thread count. A conservative number but a commonly used default.	2022-05-24 19:45:23 +02:00
Thomas Lavocat	ac2a194cd4	sources: check if ostree object exists in cache The generic ways of checking if an object is in the cache does not apply for ostree as the internal structure of a repo is quite specific. Thus we need to use the ostree executable to ask it to explore its repo for us.	2022-05-11 04:32:42 -05:00
Thomas Lavocat	1de74ce2c9	sources: generalizing download method Before, the download method was defined in the inherited class of each program. With the same kind of workflow redefined every time. This contribution aims at making the workflow more clear and to generalize what can be in the SourceService class. The download worklow is as follow: Setup -> Filter -> Prepare -> Download The setup mainly step sets up caches. Where the download data will be stored in the end. The filter step is used to discard some of the items to download based on some criterion. By default, it is used to verify if an item is already in the cache using the item's checksum. The Prepare step goes from each element and let the overloading step the ability to alter each item before downloading it. This is used mainly for the curl command which for rhel must generate the subscriptions. Then the download step will call fetch_one for each item. Here the download can be performed sequentially or in parallel depending on the number of workers selected.	2022-05-11 04:32:42 -05:00
Thomas Lavocat	0953cf64e0	sources: provide an unverified tmpdir Some downloading program need a global unverified tmpdir to work within before storing the definitive data. Provide this in the workflow directly.	2022-05-11 04:32:42 -05:00
Thomas Lavocat	128845da3c	sources: tidy the download method Only the "items to download" need to be passed as parameters. The rest is unpacked as attributes during the Setup step of the workflow.	2022-05-11 04:32:42 -05:00
Thomas Lavocat	92fe237f24	sources: introduce per-source content_type Introduce a new class member `content_type` that specifies what type of items the source will store in the cache. Use that to generalize the setup step, which is shared across all sources.	2022-05-11 04:32:42 -05:00
Thomas Lavocat	34cd9ef9f0	sources: generalize cache generation Introduce a `setup` step in the workflow that is responsible of generating the cache folder. This is then used in each download method.	2022-05-11 04:32:42 -05:00
Tom Gundersen	e175529f7c	sources/curl: don't limit total download time Some RPMs might be very large, and limiting the total download time might lead to failed build even in cases where downloading is making progress. Instead, set a minimum download speed (1kbps). If the minimum is not surpassed for 30 seconds in a row, the download fails and is retried. This follows the logic employed by DNF. Adjust the number of retries to 10 and the connection timeout to 30, in order to match what DNF does. One difference is that DNF does 10 retries across all downloads, whereas we do it per download, this could be changed in a follow-up. Old: - a download taking more than 5 minutes is unconditionally aborted New: - slow but working downloads will never be aborted - downloads will be stalled for at most five minutes in total before being aborted - time spent making progress does not count towards the five minutes Signed-off-by: Tom Gundersen <teg@jklm.no>	2022-03-16 14:48:03 +01:00
Alexander Larsson	46a228df38	Add support for installing containers in images This adds a stage called org.osbuild.skopeo that installs docker and oci archive files into the container storage of the tree being constructed. The source can either be a file from another pipeline, for example one created with the existing org.osbuild.oci-archive stage, or it can be using the new org.osbuild.skopeo source and org.osbuild.containers input, which will download an image from a registry and install that. There is an optional option in the install stage that lets you configure a custom storage location, which allows the use of the additionalimagestores option in the container storage.conf to use a read-only image stores (instead of /var/lib/container). Note: skopeo fails to start if /etc/containers/policy.json is not available, so we bind mount it from the build tree to the buildroot if available.	2022-02-10 14:43:17 +01:00
Christian Kellner	c902a7a754	sources: port to host services Port sources to also use the host services infrastructure that is used by inputs, devices and mounts. Sources are a bit different from the other services that they don't run for the duration of the stage but are run before anything is built. By using the same infrastructure we re-use the process management and inter process communcation. Additionally, this will forward all messages from sources to the existing monitoring framework. Adapt all existing sources and tests.	2021-09-22 00:00:20 +02:00
Alexander Larsson	072b75d78e	org.osbuild.curl: Don't load secrets if not needed This moves the check for already downloaded files earlier so that if all files are already downloaded we don't need to load the secrets. This is faster, but also it allows a pre-seeded object store to run the manifest on a system (like a VM) that isn't subscribed.	2021-09-22 00:00:20 +02:00
Christian Kellner	7576191c2d	sources/inline: fix schema The top-level node "items" was not defined and the required property "encoding" was wrongly called "method".	2021-06-30 12:06:30 +02:00
Martin Sehnoutka	ee3760e1ba	sources/curl: Implement new way of getting RHSM secrets The previous version covered too few use cases, more specifically a single subscription. That is of course not the case for many hosts, so osbuild needs to understand subscriptions. When running org.osbuild.curl source, read the /etc/yum.repos.d/redhat.repo file and load the system subscriptions from there. While processing each url, guess which subscription is tied to the url and use the CA certificate, client certificate, and client key associated with this subscription. It must be done this way because the depsolving and fetching of RPMs may be performed on different hosts and the subscription credentials are different in such case. More detailed description of why this approach was chosen is available in osbuild-composer git: https://github.com/osbuild/osbuild-composer/pull/1405	2021-06-04 18:23:05 +01:00
Christian Kellner	2025184325	sources: introduce org.osbuild.inline Add a new source for transporting binary data within the source entry itself. The data is ascii encoded in the `data` property of the inline source item, with the encoding that is used being specified in the `encoding` property.	2021-05-12 14:26:16 +02:00
Christian Kellner	3ebfc6f657	sources/curl: use util.checksum.verify_file Now that there is a common utility function to verify the checksum of a file, use that. Also fix the json schema entry for the property to have to correct minium and maximum digest length, given the supported algorithm, which is 32 (md5) and 128 (sha512) characters.	2021-05-12 14:26:16 +02:00
Christian Kellner	a05a8aaed6	sources/ostree: remove export functionality Since the `sources.SourcesServer` has been removed, nothing is using the export functionality anymore. Inputs are now used to make content in the store available to stages. Remove all the export logic from org.osbuild.ostree.	2021-04-29 12:58:01 +02:00
Christian Kellner	518940cfe0	sources/curls: refactor downloading code Now that the `export` functionality is gone, the download code can be simplified, since we are not downloading a subset of the urls, but all of them.	2021-04-29 12:58:01 +02:00
Christian Kellner	5c19360cbe	sources/curl: remove export functionality Since the `sources.SourcesServer` has been removed, nothing is using the export functionality anymore. Inputs are now used to make content in the store available to stages. Remove all the export logic from org.osbuild.curl.	2021-04-29 12:58:01 +02:00
Christian Kellner	2dcc1d9cee	sources/ostree: capture ostree output Instead of using stderr for the ostree subprocess command capture its output so that in the case of an error we get properly return the error output. With the old behavior all the `ostree` command output would land in the journal of the worker.	2021-03-12 18:49:41 +01:00
Christian Kellner	b609bb81dd	source/ostree: fix download only case Source, for compatability reasons, have two modes: download only and download and export. The difference is the arguments that are passed to the source: For download only, the `output` param is empty. In this case also `checksums` can be empty and if so it means everything, i.e. the commits, should be fetched. The latter was not properly handled so far. Adjust the logic, which now closely mimics that of the `org.osbuild.curl` source to fix this case. Also catch exceptions invoking `ostree` and properly return them via the json error messaging.	2021-03-12 18:49:41 +01:00
Christian Kellner	81c8374d3e	sources: rename org.osbuild.{files -> curl} The `org.osbuild.files` source provides files, but might in the future not be the only one that does. Therefore rename it to match the internal tool that is being used to fetch the files. This is done for most other osbuild modules that target tools. The format v1 loader is adapted to make this change transparent for users of the v1 format, so we are backwards compatible. Change the MPP depsolve preprocessor so that for format v2 based manifest `org.osbuild.curl` source is used. Also rename the corresponding source test. Adapt the format v2 mod test to use the curl source.	2021-02-12 19:27:08 +01:00
Christian Kellner	fa9c288988	sources: source itself controls cache sub-dir Instead of supplying the full cache dir, i.e. the directory in the store where the source will place the fetched resources, to the source, only supply the root folder of the cache and let the source itself create the desired sub-directory. This allows the source to determine what type of resource it provides. This makes the final directory independent of the name of the source: a `org.osbuild.curl` source can place file-like resource in the `org.osbuild.files` sub-directory. Then the `org.osbuild.files` input can be used to get those from the cache directory.	2021-02-12 19:27:08 +01:00
Christian Kellner	1d5d1fd44a	sources/ostree: support format version 2 In format version 2, the source specific keys for the sources, here "urls", is replaced by a generic `items` key, common to all sources. Express that in the schema.	2021-02-12 15:55:43 +01:00
Christian Kellner	429f833434	sources/files: add format version 2 support In format version 2, the source specific keys for the sources, here "urls", is replaced by a generic `items` key, common to all sources. Express that in the schema.	2021-02-12 15:55:43 +01:00
Christian Kellner	931eac23c3	sources: introduce source items All sources fetch various types of `items`, the specific nature of which is dependent on the source type, but they are all identifyable by a opaque identifier. In order for osbuild to check that all the inputs that a stage needs are are indeed contained in the manifest description, osbuild must learn what ids are fetched by what source. This is done by standarzing the common "items" part, i.e. the "id" -> "options for that id" mapping that is common to all sources. For the version 1 of the format, extract the files and ostree the item information from the respective options. Adapt the sources (files, ostree) so that they use the new items information, but also fall back to the old style; the latter is needed since the sources tests still uses the SourceServer.	2021-02-10 15:44:24 +01:00
Christian Kellner	ee9df25a02	sources/ostree: ability to only pull commits Split the internal logic into two parts: 1) fetching the commit into the internal cache repo and then 2) exporting that commit, i.e. a local pull from the cache repo to the output directory. If no `output` directory was specified, only fetch the commit, do not attempt to export it. NB: this commit changes at what point the gpg verification is done. Previously the check was on export. Now, we are checking the signature on import only. The export step will be replaced by an ostree `Input` that will have the ability to verify commits a second time.	2021-02-06 12:04:30 +01:00
Christian Kellner	127be09ba8	sources/files: ability to only download files Split the internal logic of the stage in two parts: 1) downloading files to the internal cache and 2) exporting the downloaded files from said cache to the output directory. Additionally, ff no such `output` directory was specified, i.e. it is empty or `None`, only download files but do not attempt to export them.	2021-02-06 12:04:30 +01:00
Christian Kellner	ee1d860755	sources: drop dnf stage This source has been declared obsolete some time ago and is not support anymore. We wont support it in the upcoming new manifest format, therefore drop it now.	2021-01-22 17:17:54 +01:00
Christian Kellner	cbcb335b3e	osbuild: fix spelling mistakes found by codespell Run codespell on the source ('codespell -f -L msdos -S coverity -S rpmbuild -S samples') and fix all uncovered mistakes.	2020-10-06 14:41:00 +02:00
Ondřej Budai	7b0db90c76	sources/files: do not pass floats to --max-time curl uses strtod from the C standard library to convert the --max-time's value from string to double. However, this is what strtod expects: nonempty sequence of decimal digits optionally containing decimal-point character (as determined by the current C locale) Yeah, unfortunately, the decimal-point character is determined by the current C locale. For example, Czech and German locale uses a comma as the decimal-point character. For reasons I don't fully understand, Python thinks it's running on en_US locale, even though LC_NUMERIC is set to cs_CZ, so it uses a full stop as the decimal-point character when converting float to string. However, as written before, curl fails to parse this because it expects comma. The fix I chose is simple: Use math.ceil, so only an integer can be passed to curl. Why ceil? Because --max-time == 0 sounds fishy. math.ceil should return an integer (and it does in Python 3.8) but the documentation is not 100% clear on this topic, so let's be paranoid and also convert it to int after the ceiling.	2020-06-25 21:25:17 +02:00
Tom Gundersen	82f4d1cc96	sources/files: reduce the concurrent curl processes We appear to be throttled by some mirrors if we are too eager. Back off. Signed-off-by: Tom Gundersen <teg@jklm.no>	2020-06-10 14:42:10 +02:00
Tom Gundersen	cf8216aea9	sources/files: don't spam stderr with error messages Silence the errors, but include instead the error code in the returned error message. Signed-off-by: Tom Gundersen <teg@jklm.no>	2020-06-07 22:08:34 +02:00
Tom Gundersen	d8e0469516	sources/files: don't retry curl on the same URL The retry logic was meant to work around issues where a round-robin redirect of mirrors gave us random mirrors of varying quality. This was not used in practice, rather fixed mirrors were always used (either hard-coded as basurl, or resolved from metalink). The retry logic meant that when we did hit very slow mirrors we would time-out and retry, potentially failing altogether, even though the data was coming. Each retry would not help, as the mirror was anyway the same. As a result our CI gave us avoidable false negative test results some of the time. The proper solution to this is to gain support for librepo and metalinks to adopt the same retry logic that dnf uses. For now, improve on the retry logic by retrying until a max total time, rather than an increasing timeout on each try. Up the given timeouts to be one minute to connect and five minutes to complete the download. This avoids hanging forever if the mirror is truly broken, but still gives more time to finish the download than each iteration in the old code did. There are no new tests for this, as before this change the tests mostly passed, and after it they will hopefully still mostly pass (but more often). Signed-off-by: Tom Gundersen <teg@jklm.no>	2020-06-07 22:08:34 +02:00
Christian Kellner	f967bf7164	sources/dnf: add documentation and schema Since the dnf stage is not used anymore only a placeholder schema and documentation is added.	2020-06-02 09:50:14 +02:00
Christian Kellner	42ef470740	sources/files: add documentation and schema Add a brief documentation text and its JSON schema so that osbuild can verify options org.osbuild.files source entries.	2020-06-02 09:50:14 +02:00
Christian Kellner	66d1dc1206	sources/ostree: add documentation and schema Add a brief documentation text and its JSON schema so that osbuild can verify options org.osbuild.ostree source entries.	2020-06-02 09:50:14 +02:00
David Rheinsberg	faaa6c1a6b	modules: fix format-strings without interpolation Fix all occurrences of format-strings without any interpolation. pylint warns about those (and for some reason did not do so for our modules). A followup will fix the pylint tests, so make sure all the warnings are resolved.	2020-05-29 11:07:44 +02:00

1 2

67 commits