SRE Team Update
High level summary
The SRE team continues work on Cardano environment improvements and general maintenance.
Some notable recent changes, updates or improvements include:
-
Preparation for
10.7.0pre-release is underway and SRE is working on integrations forkes-agentanddmq-nodefor the release binaries, node nixos service and OCI containers as appropriate. CI tests for Consensus db-tooling (ie: db-analyser, db-truncater, db-synthesizer) are being added to a nixos test run on Hydra to ensure bundled node version and db-tools version maintain compatibility. -
Iterative deployments of
10.7.0pre-release candidates to select pre-release environments are on-going with issues being reported back to developers. -
Darwin CI build machine updates are underway along with some optimizations and fixes to reduce flaky Darwin platform bugs and noisy alerts as well as a refactor to reduce code complexity. A number of these improvements will appear in the next SRE biweekly update.
-
Loki logging has been added to more of our cardano-parts environments (ie: cardano-playground and cardano-mainnet). Custom Loki dashboards are also being prepared to improve the Loki experience and will appear in the next cardano-parts PR.
Repository Work -- Merged
Cardano-monitoring
- Adds Loki to
playground,mainnetandnetworkteammonitoring servers - Raises
max_outstanding_per_tenantto accommodate large dashboards w/o errors
- Adjusts Loki log retention to a per-environment setting
Devx-ci
- Increases nofile soft/hard limit to avoid failures on higher nofile requirement builds like virtiofs virtualized images
Repository Work In Progress -- PRs and Branches
- Cardano-mainnet: https://github.com/input-output-hk/cardano-mainnet/pull/42
- Cardano-node: https://github.com/IntersectMBO/cardano-node/pull/6410
- Cardano-parts: https://github.com/input-output-hk/cardano-parts/pull/79
- Cardano-playground: https://github.com/input-output-hk/cardano-playground/pull/55
- Devx-ci: https://github.com/input-output-hk/devx-ci/pull/143
