Elasticsearch Ansible Setup #369

thepsalmist · 2025-03-10T11:48:55Z

This PR introduces Ansible playbooks and roles for deploying ad managing an Elasticsearch cluster.
This builds off from the discussion on #366 and the draft at #367

…default role

…se case --disable x-pack default settings

Introduce secrets managememt using ansible-vault

DavidTheProgrammer

Looks good, I've left a few comments on the general structure and I have a couple of questions about the role we're using, is it something we've hand-rolled or it's available on Ansible Galaxy? It seems to be environment dependent with the vars files and that is setting off some flags for me as roles should not be environment dependent. Is there any reason we can't use a 3rd party installation modules for ES 8+ Like this one: https://galaxy.ansible.com/ui/standalone/roles/geerlingguy/elasticsearch/install/ ?

DavidTheProgrammer · 2025-03-14T12:09:06Z

conf/ansible/inventories/group_vars/vault.yml

This directory ideally should only contain variables shared by specific groups and since we don't have a group named "vault", I'd recommend we move this one level up to the inventory level. The rationale is that this is the vault used for this particular inventory.

DavidTheProgrammer · 2025-03-14T12:14:32Z

conf/ansible/inventories/hosts.yml

I have found the alternative directory layout located in the Ansible best practices that seperates the inventories based on environment to work quite well in the past. Reduces filters needed and allows us to actually use all group for things like agent installation that needs to happen on all hosts while only affecting specific environments.

https://docs.ansible.com/ansible/2.8/user_guide/playbooks_best_practices.html#alternative-directory-layout

DavidTheProgrammer · 2025-03-14T12:15:33Z

conf/ansible/playbooks/es-install.yml

+  hosts: elasticsearch
+  vars_files:
+    - ../inventories/group_vars/vault.yml
+    - ../roles/elasticsearch/vars/{{ env | default('production') }}.yml


Can we default to staging? I think that's a safer default.

DavidTheProgrammer · 2025-03-14T12:18:35Z

conf/ansible/playbooks/es-install.yml

+    - ../roles/elasticsearch/vars/{{ env | default('production') }}.yml
+  become: true
+
+  roles:


Use include_role instead of the traditional roles: section

Following this article we should prefer include_roles instead of the roles section. I think it's okay.

https://www.ansiblejunky.com/blog/ansible-101-standards/#playbooks

DavidTheProgrammer · 2025-03-14T12:22:07Z

conf/ansible/playbooks/es-uninstall.yml

Can we move these to a standalone role and include it in this playbook instead of having the tasks written directly? This also comes from the recommendations in the Ansible 101 article.

https://www.ansiblejunky.com/blog/ansible-101-standards/#playbooks

philbudne · 2025-03-14T15:51:48Z

I started the ansible files. Regarding selection of a role to install ES: I surveyed the available roles, and I'm pretty sure the geerlingguy module was one I glanced at. I didn't keep notes on why I rejected it (the fact that the docs don't show ES 8.x support, and it requires a separate module for Java install probably didn't weigh in it's favor). I ended up choosing between two forks of the Elastic role that had been adapted for ES8, selecting the more lightly modified one, and forked it to make one or two changes necessary to get it to run, keeping it as a separate (mediacloud) repo because (a) it had it's own licence, and (b) I didn't want to get too deep into modifiying it. Some of my thoughts / principles: To keep the work compact, and avoid spreading stuff out into too many files: Keeping the node names ONLY in the YAML inventory file, with per-group settings in the same file, allowing testing by selecting an alternate inventory group. I regard ansible as a necessary evil (hateful in many ways, tho the best/only game in town). Past experience has shown me that data organization is key (being a data driven solution), and that it's easy to make a twisted mess that's hard to untangle (and noone wants to touch because it's so easy to break). Like in many things, I'm not a purist or true believer. My experience is that it's likely that any ansible file written a few years ago won't run as-it today, and I have no reason to believe the same won't be true of today's files in five years: I tried to avoid things that caused overt warnings, but I view "best practices" and "future proofing" as at best unnecessary, or at worst harmful to clarity and simplicity (making something that gets the job done, and future people can understand and adapt quickly) as opposed to maximally reusable by others. Trying to be a "hero" was an ABSOLUTE non-goal. I quickly realized that my install of statsd-agent.py SHOULD have been written as a sequence of tasks, rather than a slab of shell commands, AND that if I made it a role, it could be reused to install other tools in the same repo that could be used to rebuild other servers (ie; tarbell, the web server), *BUT* I didn't do either of those things; it simply isn't worth the investment of time (I find coding ANYTHING for ansible takes 3-5 times longer than it would to write a shell script, debugging to be painful, and documentation to be barely adequate). My concern with Xavier's work so far is that it increased the number of files and duplication of data, and that I might be the only one qualified to review it, and thus would have to wrap my head around it or accept it as-is. We DO NOT have the luxury of time, to have this be a multi-week incremental crawl towards a solution. In my mind, something that works today is good enough, and is (at least) superior to written documentation, because future people will at least know that it worked, and was used at some point in time. The one thing I would view as least desireable is any need for human intervention in the install/configuration process.

philbudne and others added 30 commits February 23, 2025 00:03

First draft of anisble scripting to install ES

552ddcb

cleanup for ansible-lint, pre-commit

e491bc2

Don't remove ansible-elasticsearch directory if modified

4872c23

update message for ansible-elasticsearch

167743d

Supply network.host per-host, allow port overrides

2cc456c

add ansible.cfg to default interpreter_python to /usr/bin/python3

2537b91

es-test-vars.sh: remove attempt to set interpreter_python

9858bc4

temp: change docker host

12979c2

temp: change docker host

7bcc93c

temp: change docker host

e74f74a

temp: change docker host

08f0ecd

temp: change docker host

0206d68

Refactor es-ansible deployment

5d28307

Fix es_test_vars script

4091d68

Update --set network host

275f78a

Cleanup es-vars

1b60317

Update --add secrets.yml file

9b88730

Update --modify test-vars for production

31e969b

Fix --rename production hosts group to elasticsearch to conform with …

caf092a

…default role

Fix --rename production hosts group to elasticsearch to conform with …

430429a

…default role

Update --add all other 5 data nodes

1200cee

Fix --xpack.security features

aa5ed8c

Fix --x-pack security settings

ebc1d0f

Fix --set default for transport.ssl and http.ssl

ffd6df8

Fix -- transport and http x-pack settings

79bc22c

Fix -- transport and http x-pack settings

7819b28

Update --refactor elasticsearch ansible role, customize to specific u…

6720326

…se case --disable x-pack default settings

Fix --discovery and cluster formation settings

790ca65

Fix --seed-hosts, discovery should default to provided transport.port

8523bde

Set ES node name to ansible_host

35a5cf4

thepsalmist added 4 commits March 11, 2025 14:01

Set ES node name to ansible_host

6dec4ff

Update -- refactor and cleanup to ansible best practices

d1c5aa3

Set correct host group

7b1d86b

revert working deb repo

dbfb078

thepsalmist self-assigned this Mar 11, 2025

thepsalmist added infrastructure elasticsearch labels Mar 11, 2025

thepsalmist linked an issue Mar 11, 2025 that may be closed by this pull request

Investigate elasticsearch configuration using ansible #366

Open

thepsalmist mentioned this pull request Mar 11, 2025

Enhance Ansible Playbook with Additional Elasticsearch Setup Tasks #370

Open

3 tasks

thepsalmist added 2 commits March 11, 2025 18:25

Update README.md

61e6b1d

Introduce secrets managememt using ansible-vault

Updates to README.md

7232a7f

thepsalmist marked this pull request as ready for review March 11, 2025 15:43

thepsalmist requested review from pgulley, m453h, philbudne and kilemensi March 11, 2025 15:43

thepsalmist changed the title ~~Draft ES Ansible~~ Elasticsearch Ansible Setup Mar 11, 2025

ansible rekey

ee9c399

DavidTheProgrammer reviewed Mar 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Elasticsearch Ansible Setup #369

Elasticsearch Ansible Setup #369

thepsalmist commented Mar 10, 2025 •

edited

Loading

DavidTheProgrammer left a comment

DavidTheProgrammer Mar 14, 2025

DavidTheProgrammer Mar 14, 2025

DavidTheProgrammer Mar 14, 2025

DavidTheProgrammer Mar 14, 2025

DavidTheProgrammer Mar 14, 2025

philbudne commented Mar 14, 2025 via email

Elasticsearch Ansible Setup #369

Are you sure you want to change the base?

Elasticsearch Ansible Setup #369

Conversation

thepsalmist commented Mar 10, 2025 • edited Loading

DavidTheProgrammer left a comment

Choose a reason for hiding this comment

DavidTheProgrammer Mar 14, 2025

Choose a reason for hiding this comment

DavidTheProgrammer Mar 14, 2025

Choose a reason for hiding this comment

DavidTheProgrammer Mar 14, 2025

Choose a reason for hiding this comment

DavidTheProgrammer Mar 14, 2025

Choose a reason for hiding this comment

DavidTheProgrammer Mar 14, 2025

Choose a reason for hiding this comment

philbudne commented Mar 14, 2025 via email

thepsalmist commented Mar 10, 2025 •

edited

Loading