Skip to content

Conversation

@seunghun1ee
Copy link
Member

Adds four playbooks used for migrating Vault to OpenBao.

The version of stackhpc.hashicorp collection needs to be bumped after stackhpc/ansible-collection-hashicorp#85 is merged and released.

But as SKC's contents are ready, marked as ready.

@seunghun1ee seunghun1ee self-assigned this Jan 20, 2026
@seunghun1ee seunghun1ee requested a review from a team as a code owner January 20, 2026 16:21
@seunghun1ee seunghun1ee added documentation Improvements or additions to documentation ansible Ansible playbooks Epoxy labels Jan 20, 2026
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

The pull request introduces a comprehensive set of Ansible playbooks and updated documentation to support the migration from Hashicorp Vault to OpenBao. The new playbooks are well-structured to manage the migration process across seed and overcloud environments, including configuration updates. However, a critical issue exists in the migration playbooks where the stackhpc_ca_secret_store variable is used dynamically to include secret store keys. This can lead to incorrect key retrieval if the variable is already set to 'openbao' during a Vault migration, causing the migration to fail. Additionally, there are minor documentation formatting issues and some file permissions that could be more restrictive for sensitive configuration files.

@seunghun1ee
Copy link
Member Author

Linters are failing because missing playbook is not released from stackhpc.hashicorp collection yet

@seunghun1ee seunghun1ee requested a review from bbezak January 26, 2026 13:39
@seunghun1ee seunghun1ee requested a review from mnasiadka January 29, 2026 13:15
Previously rockylinux/rockylinux:9 was used.
As soon as a new release was tagged in the docker hub it would be used
to fix multiple vulnerabilities
to see which sources are downloaded before docker build
to accomodate temporary errors from ark (was getting a 500 error)
elelaysh and others added 30 commits February 12, 2026 10:47
CVE-2025-68428 is still present in opensearch-dashboards 2.19.4
because jspdf is still in version 3.0.1
This makes it easier to see the changes generated by the
package-update-kayobe.yml workflow [1].

[1] https://github.com/stackhpc/stackhpc-release-train/actions/workflows/package-update-kayobe.yml
Documentation builds on Read the Docs have been failing following the
release of myst-parser 5.0.0. Cap requirement to use myst-parser 4.x
instead, which is not compatible with Sphinx 9.
Needed updating to rebase on rocky 9.7
03:00.0 3D controller [0302]: NVIDIA Corporation GH100 [H200 NVL] [10de:233b] (rev a1)
The new kernel version includes a fix for CVE-2025-68285 [1] in libceph.

[1] https://nvd.nist.gov/vuln/detail/CVE-2025-68285
We often see CI failures where the reboot playbook is successful, but
the growroot playbook invoked immediately after it fails with a
`Connection timed out` error.

Add a small 5 seconds extra delay after the reboot to ensure hosts have
finished booting successfully.

The delay can be customised with the `post_reboot_delay_s` variable.
This brings updated versions of OpenSSL to address multiple
vulnerabilities, including CVE-2025-15467.
Basic support for Dell SONiC switches for Arcus. Release note only since
images were rebuilt for another reason.

Patch: stackhpc/networking-generic-switch#136
This prevents the need to maintain the documentation in two separate
places.
Fix colored output on errors. It is quite easy to miss the failures otherwise.

Co-authored-by: pescobar.it@gmail.com
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ansible Ansible playbooks documentation Improvements or additions to documentation Epoxy

Projects

None yet

Development

Successfully merging this pull request may close these issues.