Gianluca Zuccarelli
5aae10c951
templates/dashboards: update worker queries
...
The workers now use a new metric to record all
http requests. This commit updates the worker dashboard
to use the new `image_builder_worker_request_count`
query.
2023-01-09 16:52:16 +01:00
Gianluca Zuccarelli
50237e3797
templates/dashboards: update composer queries
...
osbuild-composer now uses a new metric to record all
http requests. This commit updates the composer dashboard
to use the new `image_builder_composer_request_count`
query.
2023-01-09 16:52:16 +01:00
Sanne Raymaekers
b5d1c8866a
templates/dashboards: Bump worker dashboard version
2022-09-14 19:43:47 +02:00
Sanne Raymaekers
db978c32bd
templates/dashboards: Fix tenant name to org id mapping
...
The crc stage tenant and fedora stage tenant were mixed up.
2022-09-14 19:43:47 +02:00
Sanne Raymaekers
cb38a92a39
templates/dashboards: Expand job wait duration panels
2022-09-14 19:43:47 +02:00
Gianluca Zuccarelli
1fb6a574cb
templates: filter worker dashboard on arch
...
Add the ability to filter the build job
types by architecture using the `arch`
dropdown.
2022-08-03 13:38:52 +02:00
Sanne Raymaekers
14208d872b
templates/dashboards: Add brew tenants
...
Also:
- Gives tenants a nice display name.
- Makes "All" the default
2022-08-01 21:45:06 +01:00
Sanne Raymaekers
9347a30775
templates/dashboards: Drop arch from osbuild jobtype
...
This changed in #2845 , and the dashboards stopped working properly as
they were looking for `osbuild+:arch`.
Keep the glob however, to also capture older metrics. The glob can be
removed after 1 month, as that's how long metrics are stored.
2022-08-01 13:37:28 +02:00
Chloe Kaubisch
86971ca312
templates: update dashboards to include tenant
...
Add a tenant variable to the composer dashboard, with the option
to select multiple tenants. Add tenant filter to queries accordingly.
link to dashboard: https://grafana.stage.devshift.net/d/image-builder-worker-with-tenant/image-builder-worker?orgId=1
2022-07-18 18:55:13 +02:00
Sanne Raymaekers
edcc0866b3
templates/dashboards: Bump dashboard versions
...
[skip ci]
2022-05-17 19:06:25 +02:00
Sanne Raymaekers
01e2caf95e
templates/dashboards: Set default timerange to 28 days
...
All our SLOs apply to a 28d period. The default state of the board
should reflect that.
2022-05-17 19:06:25 +02:00
Sanne Raymaekers
be6f6f04b8
templates/dashboards: Rename composer latency titles
...
These measure latency across all requests, not just compose requests.
2022-05-17 19:06:25 +02:00
Sanne Raymaekers
c4d529be5c
templates/dashboards: Add thresholds to duration/latency graphs
...
Show the threshold where we have an SLO target.
2022-05-17 19:06:25 +02:00
Sanne Raymaekers
2da910d3e4
templates/dashboards: Bump duration/latency gauges to 95p
...
This reflects the SLO target of 95%.
2022-05-17 19:06:25 +02:00
Sanne Raymaekers
4eb4894c3a
templates/dashboards: Reverse order in duration/latency graphs
...
In these graphs p99 isn't very important. If 1% of jobs are slow that's
fine. The p50 and p95 slices are the important ones, so reorder and
recolor the duration graphs to reflect this.
2022-05-17 19:06:25 +02:00
Sanne Raymaekers
060d3ae85d
templates/dashboards: Bump worker latency slo variable to 0.95
...
This reflects the actual SLO target of 95%.
2022-05-17 19:06:25 +02:00
Sanne Raymaekers
16491149fc
templates/dashboards: Reduce the interval
...
The interval dictates the granularity of the graphs. As the interval
decreases, spikes and dips become more pronounced. 28 days as an
interval doesn't actually show much, reduce this to 6h by default which
is a happy medium.
2022-05-17 19:06:25 +02:00
Sanne Raymaekers
8a51b5db39
templates/dashboards: Remove max from compose req success budget
...
Values over 100% are useful as those actually impact the error budget.
2022-05-17 19:06:25 +02:00
Sanne Raymaekers
eded793788
templates/dashboards: Remove max from build error rate budget
...
Values over 100% are useful as those actually impact the error budget.
2022-05-17 19:06:25 +02:00
Sanne Raymaekers
c1a44b6813
templates/dashboards: Bump grafana schema version
...
This makes the following diffs smaller.
2022-05-17 19:06:25 +02:00
Gianluca Zuccarelli
19e2fb7fb5
template: composer dashboard queries
...
Tidy up the queries for the composer dashboard
and making them more readable in grafana. Additionally
add some fallback values for when empty query results
are returned from prometheus.
2022-03-14 16:11:05 +01:00
Gianluca Zuccarelli
1f2fd8cb76
templates: worker depsolve error display
...
Fix the display of the depsolve error rate
panel. The panel had an incorrect min value of
3 (or 300%).
2022-03-14 16:11:05 +01:00
Gianluca Zuccarelli
8e8d99336f
templates/worker: fix depsolve error rate
...
The depsolve error rate had the incorrect query
and was returning the error rate for the build
jobs. This has now been fixed.
2022-02-22 19:55:14 +00:00
Gianluca Zuccarelli
e8d7519c7d
templates/dashboard: worker metric queries
...
The prometheus queries have been updated with
the correct namepsace for the job metrics
Additionally, this commit fixes some of the
queries to add fallback values when the
query results are returned empty.
2022-02-09 14:09:50 +01:00
Gianluca Zuccarelli
dbf396db2b
templates/dashboards: worker error metrics
...
Update the grafana dashboard for the workers
to show information on the success rate for
osbuild and depsolve jobs.
2022-02-07 20:40:37 +01:00
sanne
8a8ed14319
templates/dashboards: Fixed grafana uids
...
This way we get a nice URL `.../d/image-builder-(composer|worker)`.
2022-01-19 12:27:33 +01:00
Gianluca Zuccarelli
10f34de88b
templates: add worker dashboard
...
Add an initial dashboard for the job metrics.
For now, the dashboard includes graphs and
burn rates for osbuild job duration and depsolve
job duration
2021-12-15 08:52:52 +00:00
Alex Njaastad
0731857d6c
fix uid
2021-12-03 18:38:50 +00:00
Alex Njaastad
595a6fea70
fix version, error-budget interval
2021-12-03 18:38:50 +00:00
Alex Njaastad
a389dae79d
fix slo numbers
2021-12-03 18:38:50 +00:00
Alex Njaastad
72109bb775
more dashboard fixes
2021-12-03 18:38:50 +00:00
Alex Njaastad
79caf7b536
add more panels
2021-12-03 18:38:50 +00:00
Alex Njaastad
3cf41cddcd
fix interval variable
2021-12-03 18:38:50 +00:00
Alex Njaastad
50bcdf7bc4
dashboard updates
2021-12-03 18:38:50 +00:00
Gianluca Zuccarelli
3443fb8771
templates: update dashboard metrics
...
Update the composer dashboard to make use of the
namespaced metrics.
2021-11-19 22:48:25 +01:00
Gianluca Zuccarelli
47c41a0b8d
templates: add latency metrics to dashboard
...
Update the grafana dashboard to with metrics
for latency requests, including error budget
burn for compose latency.
2021-11-02 00:23:57 +00:00
Gianluca Zuccarelli
57250f5496
templates: update dashboard config map
...
Minor fix for a capitalisation of `image-builder`
in the grafana configmap
2021-10-28 22:17:45 +01:00
Gianluca Zuccarelli
22aed692f1
templates: add grafana dashboard
...
Add initial grafana dashboard with
reporting on compose success rate,
error budget and the number of
total composes.
2021-10-28 21:17:55 +01:00