SRE Team Update
High level summary
The SRE team continues work on Cardano environment improvements and general maintenance.
Some notable recent changes, updates or improvements include:
-
Cardano-parts and cardano-playground were updated with cardano-node
10.6.2, cardano-node pre-release10.7.0, nixpkgs25.11, ZFS AMI support, new Loki log dashboards, and extensive monitoring improvements including per-machine absent metrics alerting and mempool timeout alerts. -
The dijkstra network was fully respun with updated secrets, configs, and a Van Rossem PV11 cost model governance action prepared.
-
CloudFormation stack hardening was applied: dedicated S3 server access logs bucket, TLS-only bucket policies, DynamoDB deletion protection with PITR, and KMS encryption.
-
Ouroboros-network-ops was brought up to a recent cardano-parts release with new resource tagging for CloudFormation and OpenTofu resources.
Repository Work -- Merged
Cardano-airgap
- Adds
midnight-clito the air-gapped signing toolset
Cardano-mainnet
- Deploys all nodes to
10.6.2, and all dbsyncs to13.6.0.7 - Upgrades nixpkgs to
25.11and nix to2.33-maint - Adds bootstrap OpenTofu environment and ZFS AMI NixOS module support
- Adds Loki log shipping with four new log dashboards; removes superseded node-exporter Loki dashboard
- Adds per-machine
machine_metrics_absentalert, tx mempool timeout alerts, and tightened blockHeight threshold - Hardens CloudFormation stack with TLS-only policies, DynamoDB deletion protection and PITR, and KMS encryption
- Rotates the mainnet pool KES keys
- See the PR description for additional details
Cardano-parts
- Bumps cardano-node release to
10.6.2, pre-release to10.7.0, cardano-db-sync release to13.6.0.7, pre-release to13.7.0.1, and other component updates - Bumps nixpkgs to
25.11and nix to2.33-maintwith required compatibility fixes - Introduces ZFS AMI support via a new
ami.nixnixosModule withtank/{root,nix,home,state}dataset layout and new bootstrap OpenTofu environment - Removes the deprecated Grafana Agent (EOL 2025-11-01), migrating fully to Grafana Alloy with Loki log shipping support
- Adds four new Loki log dashboards:
cardano-node-logs.json,cardano-node-logs-json.json,systemd-logs.json, andsystemd-logs-json.json - Adds per-machine
machine_metrics_absentalert with multi-offset detection; adds tx mempool timeout alerts; tightens blockHeight unchanged alert from 10 to 7 minutes - Hardens CloudFormation stack: dedicated S3 server access logs bucket, TLS-only bucket policies, DynamoDB deletion protection with PITR, and KMS encryption
- Adds Van Rossem PV11 cost model JSON to template cost-models
- Restructures cardano-node.json dashboard with mempool timeout panels, instance filtering, and restart/version-change annotations
- Re-adds sanchonet support to process-compose stacks and template scripts
- See the PR description for additional details
Cardano-playground
- Sets cardano-node release to
10.6.2, pre-release to10.7.0, cardano-db-sync to13.6.0.7, pre-release to13.7.0.1 - Upgrades nixpkgs to
25.11and nix to2.33-maint - Adds bootstrap OpenTofu environment and ZFS AMI NixOS module support
- Adds Loki log shipping with four new log dashboards; removes superseded node-exporter Loki dashboard
- Adds per-machine
machine_metrics_absentalert, tx mempool timeout alerts, and tightened blockHeight threshold - Creates dijkstra respin with new secrets, updated network configs, and Van Rossem PV11 cost model governance action
- Converts
preview3-bp-c-1andmainnet1-rel-a-3to LSM storage backend - Hardens CloudFormation stack with TLS-only policies, DynamoDB deletion protection and PITR, and KMS encryption
- Large colmena cleanup: group-based import system, removes metrics-scraper module
- Re-integrates sanchonet via upstream iohk-nix
- See the PR description for additional details
Ouroboros-network-ops
- Bumps cardano-parts from
v2025-06-24to post-v2025-08-14 - Adds new resource tags to CloudFormation and OpenTofu resources:
owner,project,costCenter - Updates pre-existing
organizationandenvironmenttags - Applies breaking change updates from cardano-parts release
Devx-ci
- Bumps nix in linux and darwin hosts and guests to resolve: GHSA-g3g9-5vj6-r3gj / CVE-2026-39860
- Also bumps the darwin guest bootstrap nixpkgs version in apply.sh from 25.05 to 25.11
