debian-forge

Author	SHA1	Message	Date
David Rheinsberg	2c18a54e4d	util/fscache: avoid RENAME_NOREPLACE on commit We used to commit cache-entries with a rename+RENAME_NOREPLACE. This, however, is not available on NFS. Change the code to use `os.rename()` and rely on the _documented_ kernel behavior that non-empty target directories cannot be replaced. Signed-off-by: David Rheinsberg <david.rheinsberg@gmail.com>	2022-12-20 16:56:32 +01:00
David Rheinsberg	e6b77ac7df	util/fscache: avoid RENAME_NOREPLACE in _atomic_file() The `RENAME_NOREPLACE` option is not available on NFS. Avoid using it in _atomic_file() to allow NFS backed storage. If the caller allows replacing the destination entry, we simply use the original `os.rename()` system call. This will unconditionally replace the destination on all file-systems. If the caller requests `no-replace`, we cannot use `os.rename()`. Instead, we use `os.link()` to create a new hard-link on the destination. This will always fail if the destination already exists. We then rely on the cleanup-path to unlink the original temporary entry. This will require adjustments in future maintenance tasks on the cache, since they need to be aware that entries can be hardlinked temporarily. However, we already consider `uuid-*` entries in the object-store to be temporary and unaccounted for similar reasons, so this doesn't even break our cache-maintenance ideas. Signed-off-by: David Rheinsberg <david.rheinsberg@gmail.com>	2022-12-20 16:56:32 +01:00
David Rheinsberg	8a9efa89fc	util/fscache: provide store_tree() helper Add a helper that copies an entire directory tree including all metadata into the cache. Use it in the ObjectStore to commit entries. Unlike FsCache.store() this does not require entering the context from the call-site. Instead, all data is directly passed to the cache and the operation is under full control of the cache. The ObjectStore is adjusted to make use of this. This requires exposing the root-path (rather than the tree-path) to be accessible for individual objects, hence a `path`-@property is added alongside the `tree`-@property. Note that `__fspath__` still refers to the tree-path, since this is the only path really required for outside access other than from the object-manager itself. Signed-off-by: David Rheinsberg <david.rheinsberg@gmail.com>	2022-12-20 16:56:32 +01:00
David Rheinsberg	50f8f6ac47	util/fscache: simplify get(*, None) The default value for `get()` is `None`, so no reason to specify it explicitly. Simplify the respective calls in FsCache. Signed-off-by: David Rheinsberg <david.rheinsberg@gmail.com>	2022-12-20 16:56:32 +01:00
Christian Kellner	3cc26444c9	monitor: show pipeline's source epoch If set, print the source epoch of the pipeline.	2022-12-15 13:10:35 +00:00
Christian Kellner	6c0183da21	pipeline: set source_epoch for tree This will lead to all mtimes that are newer than the creation time of `tree` being clamped to `source_epoch`, if that was specified for the pipeline. Specifically it means that all files that were created during the build will be clamped to it. This should make builds more reproducible.	2022-12-15 13:10:35 +00:00
Christian Kellner	15dc8b7a00	objectstore: clamp mtime on commit When we commit objects to the store and there is a `source_epoch` set on the `Object`, clamp the mtime. This is needed because it is possible that the object corresponds to the last stage of a pipeline[1] and it could later directly be exported without going through `finalize` again. Also we are doing in on object itself and not the cloned path so that resuming and checkpointing will behave identical. [1] not even necessarily the pipeline we are currently building.	2022-12-15 13:10:35 +00:00
Christian Kellner	76197c70c4	objectstore: support `source_epoch` for `Object` Add a new `source_epoch` attribute that if set, will lead to all mtimes that are newer or equal to the creation date being clamped to the specified `source_epoch` time when the object is finalized.	2022-12-15 13:10:35 +00:00
Christian Kellner	b3c53e7275	objectstore: record creation time in `Object` When an new Object is created, save the creation time in a new metadata entry called `info`. A new property called `created` is added to inspect the creation date.	2022-12-15 13:10:35 +00:00
Christian Kellner	39d38d33fd	util/path: new clamp mtime function New utility function to clamp all mtimes of a given path to a certain timestamp. Clamp here means that any timestamp later than the specified upper bound will be set to the upper bound.	2022-12-15 13:10:35 +00:00
David Rheinsberg	ef20b40faa	util/fscache: introduce versioning Add a new field to the cache-information called `version`, which is a simple integer that is incremented on any backward-incompatible change. The cache-implementation is modified to avoid any access to the cache except for `<cache>/staging/`. This means, changes to the staging area must be backwards compatible at all cost. Furthermore, it means we can always successfully run osbuild even on possibly incompatible caches, because we can always just ignore the cache and fully rely on the staging area being accessible. The `load()` method will always return cache-misses. The `store()` method simply discards the entry instead of storing it. Note that `store()` needs to provide a context to the caller, hence this implementation simply creates another staging-context to provide to the caller and then discard. This is non-optimal, but keeps the API simple and avoids raising an exception to the caller (but this can be changed if it turns out to be problematic or unwanted). Lastly, the `cache.info` field behaves as usual, since this is also the field used to read the cache-version. However, this file is never written to improve resiliency and allow blacklisting buggy versions from the past. Signed-off-by: David Rheinsberg <david.rheinsberg@gmail.com>	2022-12-15 08:55:39 +01:00
Sanne Raymaekers	cb989f79b1	util: fix typo in get_consumer_secrets Consumer key and cert paths were swapped.	2022-12-09 21:46:43 +01:00
Christian Kellner	ae0680da11	osbuid: integrate FsCache into ObjectStore Integrate the recently added file system cache `FsCache` into our object store `ObjectStore`. NB: This changes the semantics of it: previously a call to `ObjectStore.commit` resulted in the object being in the cache (i/o errors aside). But `FsCache.store`, which is now the backing store for objects, will only commit objects if there is enough space left. Thus we cannot rely that objects are present for reading after a call to `FsCache.store`. To cope with this we now always copy the object into the cache, even for cases where we previously moved it: for the case where commit is called with `object_id` matching `Object.id`, which is the case for when `commit` is called for last stage in the pipeline. We could keep this optimization but then we would have to special case it and not call `commit` for these cases but only after we exported all objects; or in other words, after we are sure we will never read from any committed object again. The extra complexity seems not worth it for the little gain of the optimization. Convert all the tests for the new semantic and also remove a lot of them that make no sense under this new paradigm. Add a new command line option `--cache-max-size` which will set the maximum size of the cache, if specified.	2022-12-09 12:03:40 +01:00
Christian Kellner	1e0e1fa2c2	util: add helper to parse size strings Code is based on `common.DataSizeToUint64` in Composer, with a modification to allow `unlimited` so that the result is compatible with `fscache.MaximumSizeType`. [1] `f4aed3e6e2/internal/common/helpers.go (L46)`	2022-12-09 12:03:40 +01:00
Christian Kellner	809c9e7828	pipeline,api: write metadata directly Instead of transmitting stage metadata over a socket and then writing it via `Object.meta.write`, use the latter and bind mount the corresponding file into the stage so it can directly be written to from the stage. Change `api.metadata` to do so, which means that this change is transparent for the stages.	2022-12-09 12:03:40 +01:00
Christian Kellner	8b638562d1	pipeline: remove metadata from BuildResult The metadata field became unused with the previous commit, which uses `Object.meta` to read the metadata.	2022-12-09 12:03:40 +01:00
Christian Kellner	4b94769f6b	format: read metadata from object not result Now that metadata is stored and can be accessed via `Object.meta`, read it from the built or stored objects when serializing the result in the `format.output` functions.	2022-12-09 12:03:40 +01:00
Christian Kellner	1205de0abb	objectstore: integrate metadata object Integrate the new `Metadata` object as `meta` property on `Object`. Use it to actually store metadata after a successful stage run. A new class `PathAdapter` is introduce which is in turned used to expose the base path of `Object` as `os.PathLike` so it can be passed as path to `Metadata`. The advantage is that any changes to the base path in `Object` will automatically be picked up by `Metadata`; the prominent, and currently only, case where this is happening in `Object` is `store_tree`.	2022-12-09 12:03:40 +01:00
Christian Kellner	fec9dcea97	objectstore: implement a new metadata class Implement a new class, nested inside `Object`, to read and write metadata. It is indexed by a key and individual pieces of meta- data are stored in separate files. Empty files are not created.	2022-12-09 12:03:40 +01:00
Christian Kellner	baa547b5e9	fmt/v2: extract metadata gathering into function Extract the piece of code that gathers the metadata from the result struct into its own (nested) method. It is easier to read but also prepares for a future change where we read the metadata from the store instead of the result dict.	2022-12-09 12:03:40 +01:00
Christian Kellner	8f40faf3d5	cli: move result reporting into store context Move the reporting of results into the try-cache and ObjectStore context. This prepares to use the store during the `fmt.output` call and possible reporting of store cache usages.	2022-12-09 12:03:40 +01:00
Christian Kellner	917c5bb2f5	objectstore: store object data within subfolder Instead of storing the (tree) data directly at the root of the object specific directory, move it into a `data/tree` subfolder. This prepares for two things: 1) the `tree` folder will allow us to add another folder next to it to store metadata. 2) storing both, `tree` and the future metadata folder in a common subfolder, prepares for the future integration with the new caching layer (`FsCache`).	2022-12-09 12:03:40 +01:00
schutzbot	cf4608fe43	Post release version bump [skip ci]	2022-12-07 08:15:17 +00:00
David Rheinsberg	4df05b8509	util: add file system cache This commit introduces a new utility module called `fscache`. It implements a cache module that stores data on the file system. It supports parallel access and protects data with file-system locks. It provides three basic functions: FsCache.load("<name>"): Loads the cache entry with the specified name, acquires a read-lock and yields control to the caller to use the entry. Once control returns, the entry is unlocked again. If the entry cannot be found, a cache miss is signalled via FsCache.MissError. FsCache.store("<name>"): Creates a new anonymous cache entry and yields control to the caller to fill in. Once control returns, the entry is renamed to the specified name, thus committing it to the object store. FsCache.stage(): Create a new anonymous staging entry and yield control to the caller. Once control returns, the entry is completely discarded. This is primarily used to create a working directory for osbuild pipeline operations. The entries are volatile and automatic cleanup is provided. To commit a staging entry, you would eventually use FsCache.store() and rename the entire data directory into the non-volatile entry. If the staging area and store are on different file-systems, or if the data is to be retained for further operations, then the data directory needs to be copied. Additionally, the cache maintains a size limit and discards any entries if the limit is exceeded. Future extensions will implement cache pruning if a configured watermark is reached, based on last-recently-used logics. Many more cache extensions are possible. This module introduces a first draft of the most basic cache and hopefully lays ground for a new cache infrastructure. Lastly, note that this only introduces the utility helper. Further work is required to hook it up with osbuild/objectstore.py.	2022-12-06 09:48:38 +01:00
David Rheinsberg	efe4ad4b92	linux: add Libc accessor with renameat2(2) Add a new utility that wraps ctypes.CDLL() for the self-embedded libc.so. Initially, it only exposes renameat2(2), but more can be added when needed in the future. The Libc class is very similar to the existing LibCap class, with a similar instantiation logic with singleton access. In the future, the Libc class will allow access to other system calls and libc.so functionality, when needed.	2022-12-06 09:48:38 +01:00
David Rheinsberg	ebbedd1e89	linux: add proc_boot_id() A new helper for the util.linux module which exposes the linux boot-id. For security reasons, the boot-id is never exposed directly, but instead only exposed through an application-id combined with the boot-id via HMAC-SHA256. Note that a raw kernel boot-id is always considered confidential, since we never want an outside entity to deduce any information when they see a boot-id used in protocol A and one in protocol B. It should not be possible to tell whether both are from the same user and boot or not. Hence, both should use their own boot-id namespace.	2022-12-06 09:48:38 +01:00
David Rheinsberg	aefaf21411	linux: add accessor for fcntl file locking ops This adds a new accessor-function for the file-locking operations through `fcntl(2)`. In particular, it adds the new function `fcntl_flock()`, which wraps the `F_OFD_SETLK` command on `fcntl(2)`. There were a few design considerations: * The name `fcntl_flock` comes from the `struct flock` structure that is the argument type of all file-locking syscalls. Furthermore, it mirrors what the `fcntl` module already provides as a wrapper for the classic file-locking syscall. * The wrapper only exposes very limited access to the file-locking commands. There already is `fcntl.fcntl()` and `fcntl.fcntl_flock()` in the standard library, which expose the classic file-locks. However, those are implemented in C, which gives much more freedom and access to architecture dependent types and functions. We do not have that freedom (see the in-code comments for the things to consider when exposing more fcntl-locking features). Hence, this only exposes a very limited set of functionality, exactly the parts we need in the objectstore rework. * We cannot use `fcntl.fcntl_flock()` from the standard library, because we really want the `OFD` version. OFD stands for `open-file-description`. These locks were introduced in 2014 to the linux kernel and mirror what the non-OFD locks do, but bind the locks to the file-description, rather than to a process. Therefore, closing a file-description will release all held locks on that file-description. This is so much more convenient to work with, and much less error-prone than the old-style locks. Hence, we really want these, even if it means that we have to introduce this new helper. * There is an open bug to add this to the python standard library: https://bugs.python.org/issue22367 This is unresolved since 2014. The implementation of the `fcntl_flock()` helper is straighforward and should be easy to understand. However, the reasoning behind the design decisions are not. Hence, the code contains a rather elaborate comment explaining why it is done this way. Lastly, this adds a small, but I think sufficient unit-test suite which makes sure the API works as expected. It does not test for full functionality of the underlying locking features, but that is not the job of a wrapping layer, I think. But more tests can always be added.	2022-12-06 09:48:38 +01:00
Thomas Lavocat	5112f72cbf	mounts: use the options object for mountopts This modification will allow a user to ask to mount the system as read only for instance. Which would be super useful for image-info who is progressively using more of OSbuild internals to mount partitions.	2022-11-30 14:21:10 +01:00
schutzbot	5f1e2cf5aa	Post release version bump [skip ci]	2022-11-23 08:16:54 +00:00
Christian Kellner	8abed7677f	devices: new helper to ensure a device node exists Add a new `ensure_device_node` helper function that will create the specified device not, if it does not yet exist.	2022-11-22 18:28:38 +01:00
Christian Kellner	f8ca0cf4bc	objectstore: direct path i/o for Object The `Object.{read,write}` methods were introduced to implement copy on write support. Calling `write` would trigger the copy, if the object had a `base`. Additionally, a level of indirection was introduced via bind mounts, which allowed to hide the actual path of the object in the store and make sure that `read` really returned a read-only path. Support for copy-on-write was recently removed[1], and thus the need for the `read` and `write` methods. We lose the benefits of the indirection, but they are not really needed: the path to the object is not really hidden since one can always use the `resolve_ref` method to obtain the actual store object path. The read only property of build trees is ensured via read only bind mounts in the build root. Instead of using `read` and `write`, `Object` now gained a new `tree` property that is the path to the objects tree and also is implementing `__fspath__` and so behaves like an `os.PathLike` object and can thus transparently be used in many places, like e.g. `os.path.join` or `pathlib.Path`. [1] `5346025031`	2022-11-21 17:26:53 +01:00
Christian Kellner	881b2bb5c9	mounts: convert paths to str during serialization When creating the JSON data, call `os.fspath` on all paths, like `root` and `devices.tree` to ensure they are strings; this allows for tree to be an object that conforms to `os.PathLike`.	2022-11-21 17:26:53 +01:00
Christian Kellner	0a29694593	devices: convert tree to str during serialization When creating the JSON data, call `os.fspath` on `tree` to ensure it is a string; this allows for tree to be an object that conforms to `os.PathLike`.	2022-11-21 17:26:53 +01:00
Christian Kellner	74e1dea1f7	objectstore: remove context manager from Object As `ObjectStore.object` is currently not used via a context manager anywhere in the source, remove the code.	2022-11-16 11:09:44 +01:00
Christian Kellner	28b8252a04	objectstore: implicit clone based on object ids If the object's id does not match with the one supplied for the commit, we create a clone. Otherwise we store the tree. The code path is arranged in a way that we always go through `Object.store_tree` so we always call `Object.finalize` as a prepration for the future, where we might actually do something meaningful in the finalizer, like reset the *times or count the tree size.	2022-11-16 11:09:44 +01:00
Christian Kellner	5346025031	objectstore: remove copy on write from object Remove copy-on-write support from `objectstore.Object`. The main reason for introducing copy-on-write was to save an additional copy in the non DAG-pipeline model[1]. With the introduction of the latter and the explicit `--export` option, we can achieve the same result without the complexity of copy-on-write semantics. [1] See commit `39213b7`, part of 3b7c87d5..42a365d1 changeset.	2022-11-16 11:09:44 +01:00
Christian Kellner	3e8d2c21dc	pipeline: opt out of copy-on-write for objects When committing an object to the store, clone it if the current stage is not the latests stage, i.e. `todo` has still entries. This is the second step of the removal of copy-on-write support in `Object`.	2022-11-16 11:09:44 +01:00
Christian Kellner	daa2e1c3bb	objectstore: option to clone object on commit Add a new `clone` parameter to the `commit` method on `ObjectStore` that when used will clone the object to the store instead of using the `store_tree` method which moves the object and resets it. This is the first step of removing copy-on-write support from `Object`.	2022-11-16 11:09:44 +01:00
Christian Kellner	1762048c1f	objectstore: add clone method for object Right now this is basically a clone(!) of `export` but this will change in the future when we change the layout of how objects are stored.	2022-11-16 11:09:44 +01:00
Christian Kellner	c3c06a1ebd	objectstore: small comment fix Just fix a typo, and start the comment with a capital letter.	2022-11-16 11:09:44 +01:00
Christian Kellner	ecb24a8eb7	util: add module to parse PE32+ files Add an new module with utility functions to inspect PE32+ files, mainly listing the sections and their addresses and sizes. Include a simple test to check that we can successfully parse the EFI stub contained in systemd (systemd-udev package).	2022-11-14 20:10:59 +01:00
schutzbot	70bed2e984	Post release version bump [skip ci]	2022-11-09 08:19:42 +00:00
schutzbot	845f58631c	Post release version bump [skip ci]	2022-10-26 08:18:16 +00:00
schutzbot	6523fd8d41	Post release version bump [skip ci]	2022-10-12 08:19:37 +00:00
Sanne Raymaekers	fcaad0462a	sources/ostree: pull from remote using rhsm mTLS certs The consumer certs are used to uniquely identify a system against candlepin. These consumer certs can be used to identify the system when pulling from RH controlled ostree repositories.	2022-10-11 16:49:45 +02:00
Christian Kellner	1c81d1e966	monitor/log: show build root and runner In the `LogMonitor`, print the build root and its runner for each pipelie. That could help identifying issues with runners.	2022-10-11 12:49:16 +02:00
Christian Kellner	5bdc8d030c	osbuild: auto-detect best available runner Use the new `Index.detect_runner` method that will give us the best available runner for a requested one. To do so a new `pipeline.Runner` class is introduced that stores the `meta.RunnerInfo` class for the specific runner and the original name that was requested. In the manifest loading and describing functions of the formats, use `Index.detect_runner` to get the `RunnerInfo` for a requested runner and then wrap it in a `pipeline.Runner` object, which is then passed to the `Manifest.add_pipeline` method. See also commit "meta: ability to auto-detect runner". Adjust all test.	2022-10-11 12:49:16 +02:00
Christian Kellner	77e717f4f4	buildroot: explicitly bind mount runner Instead of relying on the assumption that the specific runner will be in `/run/osbuild/lib/runners/` we now bind-mount the runner at a specific well known path and execute it from there.	2022-10-11 12:49:16 +02:00
Christian Kellner	7a8b3d3c6f	meta: ensure we have an absolute path for Index Ensure that we always have an absolute path so that all paths in `ModuleInfo` and `RunnerInfo` are absolute as well.	2022-10-11 12:49:16 +02:00
Christian Kellner	e923f559ab	meta: cache host runner The host runner does not change and we can thus avoid looking it up over and over again.	2022-10-11 12:49:16 +02:00

... 3 4 5 6 7 ...

823 commits