56 Commits (b8fb3d62e3271422d4cc40310a95f4bfa6dae2f4)

Author SHA1 Message Date
digimer fcbace6713 Updated anvil-join-anvil to hold if either node is still running anvil-configure-host 1 year ago
digimer 5d5270486e Added a wait loop when forming node clusters. 1 year ago
digimer f57ab1a78c * Updated anvil-daemon to not hold jobs at startup if the host isn't configured yet. 1 year ago
digimer fea10e5bb1 * Prefixed all 'virsh' calls with 'setsid --wait' to help prevent future hangs if the call happens without a shell. 2 years ago
digimer 147f31aeeb * Added a loop around the call to 'anvil-change-password', as there appears to be an unknown condition where, during setup, this is called but never actually runs. 2 years ago
digimer 64bb5ab8e1 * Updated striker to only complain about unconfigured networks on nodes, not DR hosts. 2 years ago
digimer b8b4352117 * Added support for Migration Network configs in old striker and anvil-configure-host 2 years ago
Tsu-ba-me a7b80b2e36 fix(tools): parse switches in anvil-configure-host 2 years ago
Digimer 3b721b849c * Fixed a bug in anvil-configure-host where if the same MAC address was assigned to two interfaces, it would cause an endless reboot loop. 2 years ago
Digimer 55dd28e7f1 * Added the anvil-configure-host man page. 2 years ago
Digimer 171ea74000 * There is a fix in this commit to resolve a race condition where, when reconfiguring the network, the request to set a job to reboot would fail because the connections to all Strikers could be lost, causing Database->_test_access() to error out, blocking the reboot. When restarted, the network would not be changed, so no reboot would be requested, leaving the machine in an inaccessible state. 2 years ago
Digimer 763821a21d Fixed a variable substitution bug. 3 years ago
Digimer 87a2454a09 Moved anvil-configure-host reboot logging to use log_0687 to help grep for reboot causes. 3 years ago
Digimer 920d38d250 Moved anvil-configure-host reboot logging to use log_0687 to help grep for reboot causes. 3 years ago
Digimer 04f7571097 * Fixed a typo causing anvil-manage-power to not compile. 4 years ago
Digimer 023f43eda9 * In the never-ending attempt to resolve the build consistency issues, this commit enables extra debugging logging and, hopefully, implements a fix in anvil-daemon where a job could be started repeatedly. 4 years ago
Digimer 5a343d6d75 * WIP; Started work on Cluster->check_server_constraints() that will track when a server's location constraint needs to be updated when the old preferred node is lost. 4 years ago
Digimer 76689aa245 * I've decided that live reconfiguring of NetworkManager interfaces is too unreliable. This commit disables all attempts to reconfigure the network while it's up, and simply reboots on changes. 4 years ago
Digimer 629c2b8e8c * Moved up when the reboot happens, when it's needed, avoiding a network reload when a reboot is going to happen anyway. 4 years ago
Digimer bbee77d265 * Re-enabled reboot 4 years ago
Digimer 08a958ec60 * Finished updating Network->check_network() to check/heal bridges. 4 years ago
Digimer 6a8a192cfd * Added an explicit delete call when network changes. 4 years ago
Digimer bd24c1c5bb * I _might_ have fixed the network configuration issue in anvil-configure-host... Updated it so that if 'nmcli' doesn't report a valid device name, it looks for it in the ifcfg-X file, and uses 'X' if not found there. 4 years ago
Digimer c7c6c8dee5 * Reworked the attempt to repair the network in anvil-daemon to not touch the network until the machine has been running for at least two minutes. 4 years ago
Digimer 11b1900e1b Note: Continuing to resolve the build issues with network startup. Expect breakage. 4 years ago
Digimer a1b06e4355 * Continuing to try to get the network to reliably start during configuration... 4 years ago
Digimer 0dd92a08c5 * Small change to variable name to help make logs clearer. 4 years ago
Digimer 0b6a9e37fa * Added scan_lvm_pv_sector_size to the scan_lvm_pvs table in the scan-lvm. This will be used later for growing a requested disk size for the DRBD metadata. 4 years ago
Digimer 80bdac8e34 * Updated the pacemaker server config to drop the stop timeout to 5 minutes and the migration timeout to 10 minutes. This will avoid blocking the entire cluster when a stop or migrate operation times out. Will update scan-server to clean these up when they happen. 4 years ago
Digimer 24ec17f8f7 * Added a new parameter called 'sensitive' to Database->connect() that returns after connecting, before any ancillary checks are done, minimizing connect time. 4 years ago
Digimer 41cd1e0319 * Several bugs fixed and enhancements; 4 years ago
Digimer 3ed857bacd * Bumped logging for striker-auto-initialize-all debugging. 4 years ago
Digimer 5536e8ff47 * Updated Cluster->assemble_storage_groups() and Cluster->anvil_name_from_uuid() and ->available_resources() to try to detect the anvil_uuid if not passed in. 4 years ago
Digimer 5e9e7e4dde * Removed debug logging from tools. 4 years ago
Digimer 54496cbeb0 * Added a check to Database->get_ip_addresses() to check if a hash is set before using it, to help avoid uninitialized variable messages. 4 years ago
Digimer 3733220b50 * Updated Log->entry() to prefix log lines with the short 'job-uuid', when the log entry is coming from a program running as a job. This is meant to make it easier to break up what log lines belong to what jobs, if multiple jobs are running at the same time (ie: when initializing multiple nodes / dr hosts in parallel). 4 years ago
Digimer 1a520b03d5 * Cleaned up a lot of logging in anvil-daemon and tools it calls. 4 years ago
Digimer 0f7267eae1 * Moved the '_host_name', '_short_host_name', and '_domain_name' private methods in Tools.pm over to Get.pm (removing the leading '_' in the method names). 4 years ago
Madison Kelly 30f2b3fa8e * Switched all hash 'local' keys to be the host's short user name. Untested, likely bugs to be fixed in the next commit. 4 years ago
Digimer 1fa63d2ea3 * Added 'anvil_uuid' as a set parameter in Database->get_hosts(). 4 years ago
Digimer de43ea3ac1 * Renamed all Validate->is_X to Validate->X. Also created Validate->ipv6() to validate IPv6 addresses using Data::Validate::IP (and added it as a requirement to the .spec base RPM). 5 years ago
digimer 42c9099033 * Renamed all Validate->is_X to Validate->X. Also created Validate->ipv6() to validate IPv6 addresses using Data::Validate::IP (and added it as a requirement to the .spec base RPM). 5 years ago
Digimer 453f5c6223 * Fixed a bug where $anvil->nice_exit() was being passed 'exit' instead of 'exit_code' as a parameter. 5 years ago
Digimer 1e89ef55f3 * Updated Network->get_ips() to (again) record the MAC addresses and to create a MAC to interface name lookup hash. This was (accidentally?) removed back when the ->get_ips() was changed to store the data in a host-specific hash. 5 years ago
Digimer 726a4374d1 * Renamed the database table 'host_keys' to 'ssh_keys' to better represent what it stores. 5 years ago
Digimer 530fb31478 * Updated Jobs->get_job_details() to use the --job-uuid switch or, failing that, look for an incomplete job on this host with the same command as the calling program. 5 years ago
Digimer a16e86dc7c * Finished tools/anvil-configure-host! More notably, it can now update the network without a reboot, and it cleanly handles losing and reconnecting to dashboards. 5 years ago
Digimer 414bf81f25 * Fixed a bug where the BOOTPROTO="x" was set to dhcp for a bond interface that is connected to a bridge and should have been set to 'none'. 5 years ago
Digimer e9ee74553a * Fixed a comment bug in Network->reconfigure_network() indicating that MAC lookup was using the wrong hash key. 5 years ago
Digimer 9bcad1c4bf * Did more work on adapting tools/anvil-configure-host to work on nodes and DR hosts. 5 years ago