Commit Graph

2900 Commits

Author SHA1 Message Date
Digimer
dfc0c2c492
Merge pull request #326 from ClusterLabs/anvil-tools-dev
* Fixed a bug where, when DRBD->gather_data() calls 'drbdadm dump-xml…
2023-05-03 00:40:14 -04:00
digimer
26fa3c7e32 Fixed a bug where Get->available_resources() was missing LVM/storage group data in some cases.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-05-02 16:28:05 -04:00
digimer
510db70253 Another attempt to resolve the stoage group race condition. This moves the check for auto-assembly to scan-lvm. It only works for the first assemble, after that the user can/should use anvil-manage-storage-groups.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-05-02 00:07:40 -04:00
digimer
e483840ceb Second attempt to fix the storage group race condition. This time, we only let node 1 assemble storage groups.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-05-01 20:29:20 -04:00
digimer
d64044c7d1 Test fix for storage group race condition.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-05-01 13:48:27 -04:00
digimer
1bba56a5b1 Hard coded anvil-provision-server to log level 2 while chasing a race condition is storage groups.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-05-01 10:54:51 -04:00
digimer
9a58f4d1ff * This is a small commit to increase logging while chasing down a race condition issue with assembling storage groups.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-30 19:47:58 -04:00
digimer
895f1ec262 This fixes a race condition when multiple servers are provisioned at (nearly) the same time.
* In DRBD->get_next_resource(), implemented a "hold" system where the DRBD minor and TCP port(s) returned are marked as being held for one minute. So subsequent calls won't use the same numbers.
* In anvil-daemon, added a check in run_jobs() where only one instance of a given job command will be started per 2-second loop. This should help reduce the chance of simultaneous race confitions in general.
* Removed from anvil-provision-server and most other tools the call to Job->get_job_uuid(). If the program is called without the job_uuid, don't try to find it. This allows a human (or script) to make repeated calls to a program without one of those calls running a pending job instead.

Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-28 00:19:53 -04:00
digimer
e7537b0ca3 * Fixed a bug where, when DRBD->gather_data() calls 'drbdadm dump-xml' and the output includes usage data, it breaks XML parsing.
* Fixed a bug in Get->available_resources() where DELETED servers were being counted in the used resources math.

Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-25 13:12:13 -04:00
digimer-bot
cc32d5b606
Merge pull request #320 from ClusterLabs/anvil-tools-dev
Anvil tools dev
2023-04-19 17:48:51 -04:00
digimer
c11be1ad1a Added a skip to ignore dot files when looking at new files.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-19 12:36:05 -04:00
digimer
dc7b909bfc More logging to debug storage group race condition
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-18 19:14:59 -04:00
digimer
bd575c6a7d Bumped logging for storage group management.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-18 19:02:51 -04:00
digimer
0874ad571a Updated anvil-safe-start to not give up on starting corosync/pacemaker if it fails on the first try.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-18 14:33:58 -04:00
digimer
8ba613952c Typo fix.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-18 12:32:52 -04:00
digimer
83a527f4fa * Removed enabling anvil-safe-start out of the RPM and into anvil-join-anvil.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-18 11:18:42 -04:00
digimer
89eae7098e NOTE: This updates the reserved RAM to 8 GiB from 4 GiB!
* Adds support for 'anvil_resources:🐏:reserved' that can be set to a number of MiB to override the default 8192.
* Adds support for 'anvil::<anvil_uuid>::resources:🐏:reserved' to allow for per-Anvil! node override on the reserved RAM default, and over the 'anvil_resources:🐏:reserved' option.

Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-17 20:43:28 -04:00
digimer
f086c1be39 Fixed a bug where the total RAM was shown instead of the free RAM.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-14 13:02:50 -04:00
digimer
fdf49c696f Updated anvil-report-usage to ignore deleted servers. Also added a check to ensure hosts are loaded if not.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-14 12:23:21 -04:00
digimer
c956f75406 Enabled anvil-safe-start in '%post node'.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-14 10:18:12 -04:00
digimer
025c2a6f54 * Updated Email->get_next_server() to ignore DELETED mail servers, and it now loads mail servers if not yet in memory.
This resolves issue #306.

Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-13 00:26:32 -04:00
digimer
fb70836126 This moves the call of anvil-safe-start out of scancore and into a new, dedicated systemd unit that runs on boot only.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-12 22:26:15 -04:00
Digimer
6bce292969
Merge pull request #319 from ClusterLabs/anvil-tools-dev
Anvil tools dev
2023-04-11 23:31:29 -04:00
digimer
83aa4e6a5f Updated scan-cluster to check for FAILED resources (servers) and, if found, attempt to recover it.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-11 16:32:31 -04:00
digimer
1afa7ce09e * Created Cluster->recover_server() that uses crm_resource to try to recover a server that has entered a FAILED state.
* Updated (not not yet completed) scan-cluster's check_resources() function to check if a FAILED server is ready to try to recover.

Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-10 23:04:15 -04:00
digimer
f9689a7106 Updated ocf:alteeve:server to look for /tmp/<resource>.fail' and, if that file exists, exits with rc:1. This is done to allow for testing.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-10 17:40:46 -04:00
digimer
9bf0f50084 Added a check to see if the server's UUID exists and looping if not to prevent unitialized variable warnings.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-09 23:38:39 -04:00
Digimer
660f38ac16
Merge branch 'main' into anvil-tools-dev 2023-04-05 16:11:01 -04:00
digimer
cf73d8ed36 * Updated System->configure_ipmi() to auto-configure DR hosts once they've been assigned a BCN IP address.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-04-05 15:04:39 -04:00
Tsu-ba-me
567abff9de fix(striker-ui): add manage UPS tab 2023-04-05 15:04:39 -04:00
Tsu-ba-me
759cd6f58a fix(striker-ui): add form validation and message in ManageUpsPanel 2023-04-05 15:04:39 -04:00
Tsu-ba-me
2f84f52090 fix(striker-ui): passthrough input validation in EditUpsInputGroup 2023-04-05 15:04:39 -04:00
Tsu-ba-me
aa5aad4689 fix(striker-ui): add input validation to AddUpsInputGroup 2023-04-05 15:04:39 -04:00
Tsu-ba-me
d3894081f6 fix(striker-ui): add input tests to CommonUpsInputGroup 2023-04-05 15:04:39 -04:00
Tsu-ba-me
afdd376759 fix(striker-ui): correct validity test on first render in InputWithRef 2023-04-05 15:04:39 -04:00
Tsu-ba-me
0c1ec5a88a fix(striker-ui): expose blur and focus event handler slots in SelectWithLabel 2023-04-05 15:04:39 -04:00
Tsu-ba-me
36f9938767 fix(striker-ui): organize types in useFormUtils hook 2023-04-05 15:04:39 -04:00
Tsu-ba-me
26881c0436 fix(striker-ui): expose isRequired in build test batch functions 2023-04-05 15:04:39 -04:00
Tsu-ba-me
737850f9d0 fix(striker-ui): add hook useFormUtils 2023-04-05 15:04:39 -04:00
Tsu-ba-me
442427cf63 fix(striker-ui): add arbitrary slot before action area in ConfirmDialog 2023-04-05 15:04:39 -04:00
Tsu-ba-me
4400bf6645 fix(striker-ui): make buildMapToMessageSetter() handle array ids 2023-04-05 15:04:39 -04:00
Tsu-ba-me
301b279000 fix(striker-ui): show UPS type as required 2023-04-05 15:04:39 -04:00
Tsu-ba-me
1345e026cc fix(striker-ui): expose required in SelectWithLabel 2023-04-05 15:04:39 -04:00
Tsu-ba-me
051dde3e80 fix(striker-ui): add Link to UPS type options 2023-04-05 15:04:39 -04:00
Tsu-ba-me
d8efd72c21 fix(striker-ui-api): extract link from UPS type description 2023-04-05 15:04:39 -04:00
Tsu-ba-me
b94e459e0a fix(striker-ui): connect add and edit forms with ManageUpsPanel 2023-04-05 15:04:39 -04:00
Tsu-ba-me
5e2f7b4974 fix(striker-ui): allow UPS type selection in EditUpsInputGroup 2023-04-05 15:04:39 -04:00
Tsu-ba-me
f97463f2c4 fix(striker-ui): export input id and include previous value in AddUpsInputGroup 2023-04-05 15:04:39 -04:00
Tsu-ba-me
130d70078d fix(striker-ui): export input id in CommonUpsInputGroup 2023-04-05 15:04:39 -04:00
Tsu-ba-me
c85cea0094 fix(striker-ui): pass id to input element in SelectWithLabel 2023-04-05 15:04:39 -04:00