Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve Pulsar AWS deployment (terraform + ansible) #415

Open
wants to merge 5 commits into
base: master
Choose a base branch
from

Conversation

melhindi
Copy link

@melhindi melhindi commented Jul 3, 2024

This PR enhances the deployment of Pulsar on AWS using Terraform and Ansible, incorporating several improvements and optimizations:

  1. Prevent redundant benchmark code uploads: Optimized benchmark code deployment by splitting and using creates to avoid unnecessary uploads.

  2. Dynamic discovery of storage devices: Replaced hardcoded disk labels with Ansible-based discovery, ensuring reliable deployment across AWS instances where device labels may change post-reboot.

  3. Replace terraform-inventory with Ansible Provider and support for spot instances
    Upgraded to Ansible Provider and Terraform Ansible-Collection, retiring outdated terraform-inventory plugin.
    Introduced support for deploying instances as AWS spot instances via a new configurable variable (default = False).

These changes collectively enhance deployment reliability and introduce flexibility in instance provisioning for Pulsar and Bookkeeper deployments on AWS.

Co-authored-by: toziegler

Replace the outdated terrafrom-inventory plugin with the Ansible
Provider and the Terraform Ansible-Collection
Add a new variable to deploy instances as AWS spot instances
Co-authored-by: toziegler
When deploying AWS instances, with every re-boot the label of NVMe
devices changes. E.g., `/dev/nvme1n1`, `/dev/nvme0n1`
Hence, hard-coding the disk labels leads to failed deployments when,
e.g., `/dev/nvme1n1` is used as boot disk.
By discovering storage devices with ansible we can dynamically select
available storage devices for pulsar/bookkeeper
Co-authored-by: toziegler
When network connectivity is slow, uploading the benchmark code to nodes
via ansible takes long and is not required if the step has been
performed before. By splitting the copy and rename step and using
`creates` we can prevent multiple (slow) uploads.
Co-authored-by: toziegler

For long running benchmarks on-demand instances is a more meaningful
default value
Copy link

stale bot commented Sep 2, 2024

This issue has been automatically marked as stale because it has not had recent activity. It will be closed after 5 days if no further activity occurs. Thank you for your contributions.

@stale stale bot added the stale label Sep 2, 2024
@melhindi
Copy link
Author

melhindi commented Sep 2, 2024

Hi @merlimat,
could you pls. have a look at this PR and let me know if I need to change anything to get it merged?
Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant