Commit Graph

1088 Commits

Author SHA1 Message Date
Tsu-ba-me
1a0e3612dc fix(tools): use integer check from validate module in manage vnc pipe 2024-06-12 12:36:55 -04:00
Tsu-ba-me
a1f429201d fix(tools): use uuid check from validate module in manage vnc pipe 2024-06-12 12:36:55 -04:00
Madison Kelly
420445d875 Altered nmcli sleeps and bumped logging to debug DB connection issue.
Signed-off-by: Madison Kelly <mkelly@alteeve.com>
2024-06-10 15:04:54 -04:00
Madison Kelly
6a0c9f27df Updated anvil-join-anvil to auto-create the anvil-configure-host job
* This will, when all NICs can be accounted for, allow the reconfiguring
  of the network from stored database data after a node has been
  reinstalled during an 'anvil-join-anvil --rejoin' run.

Signed-off-by: Madison Kelly <mkelly@alteeve.com>
2024-06-09 21:49:22 -04:00
Madison Kelly
94dacd08d8 Created Database->get_variables().
* Updated (NOT COMPLETE!) System->check_if_configured to take the new
  'thorough' parameter to see if the network is no longer configured.
  When used, the method attempts to detect if a host has been
  rebuilt and, thus, no longer configured.
* Started work on having 'anvil-join-anvil --rejoin' try to see if the
  network needs to be reconfigured prior to rejoining the cluster.

Signed-off-by: Madison Kelly <mkelly@alteeve.com>
2024-06-08 14:18:28 -04:00
digimer
8c1c0597da Updated anvil-daemon to run anvil-configure-host in the foreground.
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-05-30 14:49:02 -04:00
digimer
25a0454dce Better handling of lost DB connections.
* Added a sync call to Tools->nice_exit() to ensure logs are flushed.
* Updated Database->quote() to be in an eval block to better handle
  cases where the DB handle is lost.
* Added an hourly check to anvil-daemon and moved the memory in use
  check to run only once per hour.

Signed-off-by: digimer <mkelly@alteeve.ca>
2024-05-29 20:41:12 -04:00
digimer
b86493fff4 More logging to debug apparent hang
* Added an explicit 'sync' call when writing to logs. TO BE REMOVED!
* Disabled anvil-monitor-daemons and anvil-monitor-performance in case
  this is somehow trigging program exits.
* Converted prints to Log->entry calls in anvil-change-password
* Added PID state info logging for running jobs.

Signed-off-by: digimer <mkelly@alteeve.ca>
2024-05-29 13:40:57 -04:00
digimer
4766ceff70 Added logging to debug network config issue.
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-05-29 00:35:27 -04:00
digimer
8dc3a8262f Updated pod on requiring 'new' for manifest_uuid when creating new
manifests.

Signed-off-by: digimer <mkelly@alteeve.ca>
2024-05-28 22:22:47 -04:00
digimer
566887462e Fixed parameter names being sent to Striker->generate_manifest().
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-05-28 20:23:52 -04:00
digimer
a3ac5cf7f8 Fixed a bug that prevented install manifests from being saved.
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-05-28 14:27:16 -04:00
digimer
368673eac2 Added a flag for when NM is changed and, if set, NM is restarted.
* Also bumped nmcli sleeps to 5s.

Signed-off-by: digimer <mkelly@alteeve.ca>
2024-05-27 00:07:18 -04:00
digimer
acf30229ef Added code to restart NetworkManager if needed
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-05-25 22:13:44 -04:00
digimer
ab33c716cb Created a specific check that there's a hosts entry for each DB
* This is meant to deal with a case where, when a DB is added to
  anvil.conf but that new entry is not yet in hosts, the program crashes
  because of a duplicate key when calling insert_or_update_hosts for all
  DBs.

Signed-off-by: digimer <mkelly@alteeve.ca>
2024-05-25 20:19:26 -04:00
digimer
3d50f45984 Added a 1 second delay to nmcli calls
* Also fixed a bug Database->get_storage_group_data() to add a missing
  column to adding members.

Signed-off-by: digimer <mkelly@alteeve.ca>
2024-05-24 14:20:41 -04:00
digimer
033052f449 Shortened the time to reboot when no DBs come back after net reconfig
* Also updated to directly call a reboot.

Signed-off-by: digimer <mkelly@alteeve.ca>
2024-05-17 20:12:04 -04:00
digimer
8e53993f67 Shortened the anvil-daemon job start up delay.
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-05-15 23:00:31 -04:00
digimer
6d121dc0c0 Mapped each interface name in match.interface-name to a UUID lookup.
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-05-08 00:17:16 -04:00
digimer
7925a3f42c * Added more man pages.
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-05-07 22:34:18 -04:00
digimer
1152c50f3a Added pcsd config, and -y support.
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-04-16 00:43:58 -04:00
digimer
3e63b726d3 Added node 2 joining an Anvil! node if not started by node 1.
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-04-14 01:36:28 -04:00
digimer
e00dec7cba Added loading existing corosync/authkey from peer during rebuild.
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-04-13 17:46:19 -04:00
digimer
bd2e4c46ae Updated Network->load_ips() to use the device_name when available.
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-04-13 16:55:18 -04:00
digimer
7ecd0a4d70 Starting work on rejoining a replacement subnode to an Anvil! node
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-04-13 01:35:20 -04:00
digimer
44aa0fb8d9 Bumped logging to debug periodic strike init resync failure
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-04-10 10:39:17 -04:00
digimer
76e66e6fa6 Added anvil.conf to log collection.
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-04-10 10:39:17 -04:00
digimer
60759cd9aa No longer fail is fence method exists already when trying to create it.
* Also fixed a string insertion variable typo

Signed-off-by: digimer <mkelly@alteeve.ca>
2024-04-10 10:39:17 -04:00
Tsu-ba-me
5d086f5e79 fix(tools): log websockify output 2024-04-09 15:13:54 -04:00
Tsu-ba-me
468a6d18ee fix(tools): get all logs in audit/ 2024-04-08 14:19:57 -04:00
Tsu-ba-me
11fd4098c4 fix(tools): collect /var/log/audit/audit.log for debug 2024-04-08 14:19:57 -04:00
digimer
937d8ed2ef Added collecting corosync to log collection
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-04-05 12:45:14 -04:00
digimer
cca4a7ea1b Bumped logging.
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-04-04 14:53:48 -04:00
digimer
a28bb083fa Commented out a stray debug print
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-04-03 16:41:48 -04:00
Fabio M. Di Nitto
f1584c770d striker-collect-debug: disable screenshot collection by default
Closes: #620

Signed-off-by: Fabio M. Di Nitto <fabbione@fabbione.net>
2024-03-30 21:51:34 +01:00
Fabio M. Di Nitto
510af7e4b1 striker-collect-debug: remove tempdir when done
Signed-off-by: Fabio M. Di Nitto <fabbione@fabbione.net>
2024-03-30 21:51:34 +01:00
Fabio M. Di Nitto
b64cf7052a striker-collect-debug: align output format to other parts of the tool
Signed-off-by: Fabio M. Di Nitto <fabbione@fabbione.net>
2024-03-30 21:51:34 +01:00
Fabio M. Di Nitto
3b018e56c6 striker-collect-debug: create output dir only when necessary
Signed-off-by: Fabio M. Di Nitto <fabbione@fabbione.net>
2024-03-30 21:51:34 +01:00
Fabio M. Di Nitto
da67156ee1 striker-collect-debug: cleaner messages
Signed-off-by: Fabio M. Di Nitto <fabbione@fabbione.net>
2024-03-30 21:51:34 +01:00
Fabio M. Di Nitto
635f38b489 anvil-safe-stop: don´t use locked version of pcs
Add pcs_direct tool path and use it for anvil-safe-stop

Closes: #623

Signed-off-by: Fabio M. Di Nitto <fabbione@fabbione.net>
2024-03-30 07:06:10 +01:00
Digimer
4fd12879c3
Merge branch 'main' into daemon-management 2024-03-28 02:32:24 -04:00
Digimer
e63ebf7ba8
Merge branch 'main' into libvirt_fixes 2024-03-27 18:47:39 -04:00
digimer
371aaae6a8 Updated to set location constraints to minimize migrations
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-03-27 13:32:16 -04:00
digimer
21c8084b2f Updated to support Sys::Virt::Domain generating PNG screenshots
* This should work with older versions still generating PPM screenshots.

Signed-off-by: digimer <mkelly@alteeve.ca>
2024-03-26 18:56:07 -04:00
digimer
15a79d7bac Added '--all' to anvil-manage-daemons to enable optional daemons.
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-03-26 13:34:59 -04:00
digimer
b74900c2fc Beginning to repurpose anvil-manage-server for server resync
Signed-off-by: digimer <mkelly@alteeve.ca>
2024-03-25 22:41:34 -04:00
Fabio M. Di Nitto
50ad874909 striker-collect-debug: fix collection of cib.xml
Closes: #534

Signed-off-by: Fabio M. Di Nitto <fabbione@fabbione.net>
2024-03-24 14:39:38 +01:00
Tsu-ba-me
f506ec4ac8 fix(tools): allow operations (currently set) on hash children in execute mode of access module 2024-03-21 17:08:44 -04:00
Tsu-ba-me
b5264131c4 fix(tools): allow reference to children of in execute operation of access module 2024-03-21 17:08:44 -04:00
digimer
2d92f339c2 Fixed a bug related to changing the hostname during a manifest run
* The original hostname would be used to form the cluster, even though
  the hostname was updated.

Signed-off-by: digimer <mkelly@alteeve.ca>
2024-03-18 23:28:42 -04:00