digimer
77bae80534
Added default values for MTU and DNS if not set in a manifest.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-09-28 17:47:53 -04:00
digimer
b8fb3d62e3
Added checks for the screenshot directory before collection.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-09-28 16:30:36 -04:00
digimer
663a1e0527
Quieted screenshot logging in anvil-daemon.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-09-28 16:27:40 -04:00
digimer
fcbace6713
Updated anvil-join-anvil to hold if either node is still running anvil-configure-host
...
* Fixed a minor bug and added logging of maintenance_mode calls in anvil-configure-host.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-09-28 16:01:32 -04:00
digimer
e480337239
Fixed wait loop for subnodes
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-09-28 11:47:48 -04:00
digimer
582a8b292c
Added more job updates to anvil-manage-power.
...
* This is a test to see if the job waiting for the uptime to be 300s,
leaving the job_progress as 0, was causing the job to be repeatedly
called.
* This is related to issue #479
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-09-28 01:19:52 -04:00
digimer
5b286f2696
Added missing switch reading in anvil-manage-power.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-09-28 00:24:29 -04:00
digimer
ef042eef25
Cleaned up logging while waiting for subnodes.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-09-28 00:15:14 -04:00
digimer
5d5270486e
Added a wait loop when forming node clusters.
...
* This adds a check where anvil-join-anvil waits until both subnodes are
marked as configured and not in maintenance mode.
* Should address issue #479 (maybe, this shouldn't trigger reboots, but
it was certainly a race condition found while investigating).
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-09-27 22:38:07 -04:00
digimer
d58521ceca
Added screenshot capture to striker-collect-debug.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-09-27 11:46:56 -04:00
Digimer
745081b649
Merge branch 'main' into patch-screenshot
2023-09-22 23:28:12 -04:00
digimer
b1562e0301
* Added the new screenshot tool.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-09-22 17:16:19 -04:00
digimer
c039c58128
* This commit moves taking screenshots of hosted servers onto the strikers using the Sys::Virt module. This was needed because the screenshots were being taken by scan-server, and that was causing it to take a long time to run. It should never have been handled by the scan agent anyway. This update requires a WebUI fix to use the new screenshot tool. This tool also adds holding multiple screenshots to allow users to "scrub" through screenshots up to 10 hours in the past.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-09-22 17:15:09 -04:00
digimer
8925dabb9d
* Updated anvil-shutdown-server to take the new '--immediate' switch which forces a server to shut down immediately (akin to pulling the power on a traditional machine). This is needed to allow a user to recover a crash or hung server.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-09-21 18:56:12 -04:00
digimer
3c9086d1f3
Fixed bugs related to running jobs.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-09-07 20:06:00 -04:00
digimer
e8a84e1c97
Added job handling to anvil-manage-server-storage (needs more testing though).
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-09-07 15:37:31 -04:00
digimer
2f429d2bc7
Fixed bugs related to adding drives and extending drives to servers.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-09-05 22:53:52 -04:00
digimer
e895e1f264
* Finished writting the anvil-manage-server-storage.
...
* Fixed handling --eject and --insert to work without a device target specified when only one exists, or to find the file path when only the file name is given.
* Updated anvil-manage-server-storage to show files when processing an optical devices without a file being passed.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-09-05 16:53:08 -04:00
digimer
17078347ee
Reworked anvil-manage-server-storage to use the translation system.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-09-03 16:02:47 -04:00
Tsu-ba-me
adcf5ec818
fix(tools): enhance logging in access module
2023-08-21 20:57:31 -04:00
Tsu-ba-me
7845f8c170
fix(tools): correct key check in access_chain of access module
...
The old check evaluates the expression before determining whether the
resulting value is defined. However, when the expression refers to a
subroutine, it gets executed; if the subroutine doesn't protect against
missing parameters, it'll cause executions with bad input, i.e., the
Striker->generate_manifest subroutine without parameters.
The new check uses can(), which correctly determines whether the key
"exists" on the blessed object; although it doesn't strictly mean
"exists", but it does the job.
2023-08-21 20:50:24 -04:00
Tsu-ba-me
ee259f5a3a
fix(tools): reduce UUIDv4 check strictness in access module
2023-08-21 18:59:32 -04:00
digimer
d255adc7b4
* Updated anvil-daemon to set the mode of /mnt/shared/* to 0777 during creation and to check that that mode is set for existing sub-directories. This resolves issue #443 .
...
* Cleaned up anvil-manage-dr.8 hyphen escapes.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-08-17 22:14:40 -04:00
digimer
4c118cf3b4
* Created empty man page files all remaining tools.
...
* Started fixing existing man pages, and writting needed ones.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-08-11 13:55:51 -04:00
digimer
7bf7c3747c
* Added anvil-file-details.8, but it's not completely written yet. Converted anvil-file-details to use the new switch loading method.
...
* Fixed the RC in ocf:alteeve:server to exit with 0 on notify calls, resolves issue #392 .
* Fixed typo references in issue #390 .
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-08-10 16:21:20 -04:00
digimer
1667d1f132
* Wrote the man page for anvil-download-file, and updated it to use the new switch method. Changed it's default download directory from /mnt/shared/temp to /mnt/shared/incoming.
...
* Added the new man pages to the Makefile.am file.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-08-10 16:19:05 -04:00
digimer
d39b0af67b
* Added support for migration networks, and wrote the man page for anvil-cycle-vm-nics.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-08-10 16:17:49 -04:00
digimer
a81a110261
* Remove forced log level and secure logging. This addresses issue #386
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-08-09 18:20:14 -04:00
digimer
a0cb791f47
This contains fixes needed for beta from additional testing.
...
* Updated the pcs wrapper to flock anything but status calls.
* Updated scan-apc-pdu to purge regardless of the host it's called on any host.
* Fixed a bug striker-purge-target that wouldn't purge anvil nodes in various cases.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-08-09 18:07:03 -04:00
digimer
cc71df686b
Added a pcs wrapper to serialize pcs constraint calls.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-08-08 14:32:33 -04:00
Fabio M. Di Nitto
824e3e07e3
virsh: add wrapper to serialize calls to virsh list
...
avoid storm of virsh list that overloads libvirtd API causing
unnecessary timeouts during pcmk monitoring operations.
Resolves: https://github.com/ClusterLabs/anvil/issues/395
Signed-off-by: Fabio M. Di Nitto <fabbione@fabbione.net>
2023-08-07 08:35:08 +02:00
digimer
6ee2ad75db
* Updated anvil-delete-server to actively check for and delete any drbd-fenced attributes left over in the CIB after a server is deleted. This addresses issue #374 .
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-25 21:45:34 -04:00
digimer
ed480cf1cb
* Fixed a double-$ bug in Remote->_check_known_hosts_for_target()
...
* Updated striker-update-cluster to take '--timeout' and a number of seconds, or 'Xm' or 'Xh' for minutes or hourse, respectively. Also updated to show the remaining time while waiting, and added waiting timeout to the rest of the while loops that prior had no time limit. This addresses issue #383 and issue #382 .
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-25 19:13:41 -04:00
digimer
0471fb90ea
* Upped the logging in these three tools to help diagnose run errors. To be removed before tagging beta
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-25 13:07:38 -04:00
digimer
88e8978305
* Fixed a bug where getting the job_uuid after a no-db run wouldn't actually update the job progress.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-25 12:43:28 -04:00
digimer
be290bf561
This commit fixes a bug where the drbd kernel module build was being killed mid-compile, leaving DBRD unusable.
...
* Created System->wait_on_dnf() which was plucked from anvil-daemon, and now also called in scancore and anvil-safe-start.
* Updated scancore and anvil-safe-start to check on start that DRBD's kernel module is available (and build if not).
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-24 22:32:41 -04:00
digimer
d68adb5b4e
* Updated anvil-manage-power to not reboot if anvil-version-changes is running (which, if it's taking time, is generating new kmods).
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-24 20:44:40 -04:00
digimer
8b3d472b9c
Updated striker-update-cluster to set primary_host_uuid to node 1 if not returned from Cluster->get_primary_host_uuid.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-24 19:00:19 -04:00
digimer
556e91238d
* Updated Network->find_access() to clear the data from previous scans, which fixes a bug where checking multiple hosts could return stale data for the previous host.
...
* Updated anvil-manage-server-storage, striker-collect-debug, and striker-update-cluster to be able to find a connection on an interface when none were found on preferred networks.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-24 15:43:54 -04:00
digimer
9a5e617a2d
* Test fix for the issue #379
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-24 00:22:43 -04:00
digimer
f57ab1a78c
* Updated anvil-daemon to not hold jobs at startup is the host isn't configured yet.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-23 23:34:39 -04:00
digimer
66c82e5e22
* Fixed a bug in anvil-update-system where updating a single package with --reboot wouldn't request a reboot. Finished reworking it so that a check is made to see if the kernel or DRBD kmod will be updated and, if so, removes the kmod-drbd RPMs prior to doing the update (as opposed to the sloppier check-on-error method).
...
* Fixed a bug in System->reboot_needed() where the cache file path had a typo in the hash key.
* Updated anvil-daemon to use the full path to dnf when determining if a dnf process was running.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-23 21:43:26 -04:00
digimer
e278de4b5a
The main change in this commit deals with anvil-daemon startup. During OS updates, it would pick up the queued update job and run it while the other --no-db one was still running. This could become an issue for other tasks in the future, so updated anvil-daemon to not run any jobs for the first minute after startup. Also updated it to see if an OS update is underway (given how it can start mid-RPM update, before packages like kmod-drbd are ready to build). While doing this, implemented caching of daily tasks (like agine out data, archiving data, network scans, etc) to only run once per day, period. As it was before, they would always run on anvil-daemon startup, then wait 24 hours.
...
Note that work has started it reworking anvil-update-system, but it is incomplete (and broken) in this commit.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-23 21:43:26 -04:00
Tsu-ba-me
714ccdb5b6
chore(tools): log start/stop pipe errors in manage vnc pipe
2023-07-23 21:43:26 -04:00
Tsu-ba-me
1f13a416b9
chore(tools): remove unused anvil-manage-tunnel; missing from d2a61da
2023-07-23 21:43:26 -04:00
Tsu-ba-me
9163cf4513
chore: remove anvil-manage-tunnel
2023-07-23 21:43:26 -04:00
Tsu-ba-me
47f7c71e95
fix(tools): stop ws with source port when target port is unavailable
2023-07-23 21:43:26 -04:00
Tsu-ba-me
99dc4ba6ba
fix(tools): handle possible remnant websockify daemon wrapper
2023-07-23 21:43:26 -04:00
Tsu-ba-me
ba335cc411
fix(tools): match server vncinfo variable name
2023-07-23 21:43:26 -04:00
Tsu-ba-me
25f5c38ade
chore: rename striker-manage-vnc-pipes->anvil-manage-vnc-pipe
2023-07-23 21:43:26 -04:00
Tsu-ba-me
c29041d2f7
fix(tools): start websockify as daemon, re-find its pid and ports
2023-07-23 21:43:26 -04:00
Tsu-ba-me
cb98d28eb0
fix(tools): add target host to vnc info variable
2023-07-23 21:43:26 -04:00
Tsu-ba-me
0b91ee0314
fix(tools): remove all tunnel-related tasks
2023-07-23 21:43:26 -04:00
Tsu-ba-me
084394c66f
fix(tools): hoist find server vnc port
2023-07-23 21:43:26 -04:00
Tsu-ba-me
ee091d4e7b
fix(tools): format output of existing tunnel
2023-07-23 21:43:26 -04:00
Tsu-ba-me
d42f202609
fix(tools): flip ports based on forward type
2023-07-23 21:43:26 -04:00
Tsu-ba-me
0210323730
fix(tools): pass tunnel list to start, stop tunnel
2023-07-23 21:43:26 -04:00
Tsu-ba-me
19f6cefd8d
fix(tools): isolate prepare tunnel parents
2023-07-23 21:43:26 -04:00
Tsu-ba-me
9f8a153fe0
fix(tools): replace tilda with home path
2023-07-23 21:43:26 -04:00
Tsu-ba-me
f8e65416c4
fix(tools): correct receiving find tunnels output
2023-07-23 21:43:26 -04:00
Tsu-ba-me
1854cf4872
fix(tools): enable search full command in find tunnel parent processes
2023-07-23 21:43:26 -04:00
Tsu-ba-me
02f89b24b1
fix(tools): pass debug level to find, start, stop tunnel in start pipe
2023-07-23 21:43:26 -04:00
Tsu-ba-me
6dbec289a1
fix(tools): treat empty tunnel list as no tunnels
2023-07-23 21:43:26 -04:00
Tsu-ba-me
3debdb846d
fix(tools): start background processes with system call
2023-07-23 21:43:26 -04:00
Tsu-ba-me
b695414c86
fix(tools): improve debug hashes
2023-07-23 21:43:26 -04:00
Tsu-ba-me
8c37b49132
fix(tools): remove extra space in find tunnel parent sed
2023-07-23 21:43:26 -04:00
Tsu-ba-me
55bc44fc6e
fix(tools): correct typos in manage vnc pipes
2023-07-23 21:43:26 -04:00
Tsu-ba-me
bedbf576ab
fix(tools): correct loop over tunnel list
2023-07-23 21:43:26 -04:00
Tsu-ba-me
4de0b675f1
fix(tools): don't find when tunnel list doesn't exist
2023-07-23 21:43:26 -04:00
Tsu-ba-me
2eb96f9d10
fix(tools): reuse existing tunnels in manage vnc pipes
2023-07-23 21:43:26 -04:00
Tsu-ba-me
834e1a568a
fix(tools): improve debug log of start processes in manage vnc pipes
2023-07-23 21:43:26 -04:00
Tsu-ba-me
7baa52e37d
fix(tools): ignore mismatches when find websockify, tunnel parent processes
2023-07-23 21:43:26 -04:00
Tsu-ba-me
41abd4f9e4
fix(tools): correct websockify command in manage vnc pipes
2023-07-23 21:43:26 -04:00
Tsu-ba-me
75b6ab94df
fix(tools): correct reversed set, delete operations in manage vnc pipes
2023-07-23 21:43:26 -04:00
Tsu-ba-me
6906551851
fix(tools): remove repeated UUIDv4 test in manage vnc pipes
2023-07-23 21:43:26 -04:00
Tsu-ba-me
6ec2dea741
fix(tools): correct renamed call variable in manage vnc pipes
2023-07-23 21:43:26 -04:00
Tsu-ba-me
b1e7b0e244
fix(tools): correct input to keys in manage vnc pipes
2023-07-23 21:43:26 -04:00
Tsu-ba-me
9431d89b61
fix(tools): correct brackets of set_ws_process in manage vnc pipes
2023-07-23 21:43:26 -04:00
Tsu-ba-me
393782cf83
fix(tools): log inputs in manage vnc pipes
2023-07-23 21:43:26 -04:00
Tsu-ba-me
ff7fe8b3a3
fix(tools): add missing grep path in manage vnc pipes
2023-07-23 21:43:26 -04:00
Tsu-ba-me
d192356c5a
fix(tools): remove unused database connection in manage tunnel
2023-07-23 21:43:26 -04:00
Tsu-ba-me
afade80f39
fix(tools): manage all VNC pipe components on subnodes/dr
2023-07-23 21:43:26 -04:00
Tsu-ba-me
ba467ccfa7
fix(tools): manage forward list of parent connection in manage tunnel
2023-07-23 21:43:26 -04:00
Tsu-ba-me
db06747513
fix(tools): make target optional when using external parent in manage tunnel
2023-07-23 21:43:26 -04:00
Tsu-ba-me
d29dac4fa9
fix(tools): return code after port forward fails in manage tunnel
2023-07-23 21:43:25 -04:00
Tsu-ba-me
711cb5b696
refactor: rename striker-open-ssh-tunnel->anvil-manage-tunnel
2023-07-23 21:43:25 -04:00
Tsu-ba-me
40e94cda46
fix(tools): enable open parent connection, child tunnel in open ssh tunnel
2023-07-23 21:43:25 -04:00
Tsu-ba-me
f2d3b06a10
fix(tools): remove all remote calls in manage vnc pipes
2023-07-23 21:43:25 -04:00
Tsu-ba-me
6c776e5a6a
fix(tools): enable remote forward in open ssh tunnel
2023-07-23 21:43:25 -04:00
digimer
b24b81c17c
Removed outer double-quotes from Anvil! node description in XML usage reporting. Related to issue #321 .
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-23 11:02:45 -04:00
digimer
942c4c94bf
Escaped double-quotes in Anvil! node descriptions when reporting usage as XML format. Should resolve issue #321 .
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-23 11:00:59 -04:00
digimer
9b90647cc0
Fixed a bug where the XML output was not valid.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-22 22:28:25 -04:00
digimer
d07933a31c
* Updated anvil-report-usage to accept the new '--machine' which reports the usage information in XML format.
...
* Added the anvil-report-usage.8 man page
* Updated anvil-update-system to enable scancore when the OS update is complete.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-22 22:21:48 -04:00
digimer
01b714f3b3
Fixed typo from issue #369 .
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-22 20:09:13 -04:00
digimer
b0c54b6dae
* Updated anvil-update-system to check if another instance of anvil-update-system is running and, if so, exit.
...
* Removed the new tasks from anvil-special-operations.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-22 20:03:39 -04:00
digimer
7bd76c10dc
Major thing in this commit is reworking striker-update-cluster to work without expecting anvil-daemon to be running on target machines. Similarly, they had to be able to work when the Striker DBs were not available. This is to account for cases where the Striker dashboards have updated, and the schema has changed, preventing the not-yet-updated DR hosts and subnodes from being able to use the DB. To do this, anvil-safe-stop, anvil-update-system, and anvil-shutdown-server had to be updated to use the new --no-db switch, which tells then to run without the database being available.
...
* Updated Server->shutdown_virsh() to work without a database connection.
* Updated System->reboot_needed() to store/read from a cache file when the database is not available.
* Updated anvil-safe-start to remove the old --enable/disable/status switches, now that we use anvil-safe-start.service systemd unit.
* Reworked anvil-safe-stop to work without a database connection, and to work on DR hosts.
* Updated anvil-special-operations to add new tasks, but it's likely these new tasks aren't needed and will be removed very shortly.
* Added/updated multiple man pages.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-22 18:09:01 -04:00
digimer
541381e317
* Finished getting anvil-manage-server-storage to add new volumes to running servers.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-18 13:31:52 -04:00
digimer
afaf129733
* Updated anvil-manage-server-storage to connect the new drive to the VM. Still need to update the on-disk and in-DB definitions though.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-17 21:33:46 -04:00
digimer
de86cf88fe
* Updated anvil-manage-server-storage to now handle a new volume stuck in 'Negotiating', and to do the initial sync when there are three connected peers.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-17 21:04:36 -04:00
digimer
9bc78860a6
* Updated anvil-update-system to detect kmod-drbd upgrade problems and fix them.
...
* Updated striker-update-cluster and anvil-update-system to take '--reboot' to request a reboot if any packages are updated.
Signed-off-by: digimer <mkelly@alteeve.ca>
2023-07-16 20:45:47 -04:00